Integrates visual information at several levels of detail by combining features extracted from different layers (depth) of the vision encoder, ranging from low-level (pixel) features to high-level (global semantic) features.
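As a rough illustration of combining features from different encoder depths, the sketch below concatenates per-token features from two layers along the channel axis. The choice of channel-wise concatenation, the function name `fuse_by_channel_concat`, and the toy feature values are all assumptions made for illustration; the note itself does not specify how the features are combined.

```python
from typing import List

Matrix = List[List[float]]  # shape: [num_tokens][channels]


def fuse_by_channel_concat(layer_features: List[Matrix]) -> Matrix:
    """Concatenate per-token features from several layers along the channel axis.

    Assumption: every layer yields the same number of tokens, so token i in one
    layer corresponds to token i in every other layer.
    """
    if not layer_features:
        raise ValueError("need at least one layer of features")
    num_tokens = len(layer_features[0])
    if any(len(feats) != num_tokens for feats in layer_features):
        raise ValueError("all layers must provide the same number of tokens")

    fused: Matrix = []
    for token_idx in range(num_tokens):
        row: List[float] = []
        for feats in layer_features:
            # Append this layer's channels for the current token.
            row.extend(feats[token_idx])
        fused.append(row)
    return fused


# Hypothetical toy features: 2 tokens, 2 channels per layer.
low_level = [[1.0, 2.0], [3.0, 4.0]]   # stands in for an early layer (pixel-like detail)
high_level = [[0.5, 0.5], [0.1, 0.9]]  # stands in for a late layer (global semantics)

fused = fuse_by_channel_concat([low_level, high_level])
print(fused)
```

Each fused token then carries both the early-layer and late-layer channels. In practice a learned projection would typically map the wider fused vector to the size a downstream model expects; that step is omitted here.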
Depth-Breadth Fusion
Creator: Seonglae Cho
Created: 2024 Dec 6 23:43
Editor: Seonglae Cho
Edited: 2024 Dec 6 23:46
Refs