
Vision Transformer with graphical segmentation

Creator
Seonglae Cho
Created
2024 Nov 18 19:38
Edited
2024 Nov 18 19:40
  1. Image segmentation
  2. Build a nearest-feature graph from the image
  3. Graphical positional embedding
  4. Vision transformer inference
Intuition: human vision is pixel-based, yet humans do not convolve; they attend selectively.
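The four steps above can be sketched end to end with NumPy. This is a minimal illustration, not the note's actual method. Several pieces are assumptions made only to keep the example runnable: a fixed grid stands in for a real segmentation model, mean color serves as the per-segment feature, the "graphical positional embedding" is taken to be Laplacian eigenvectors of the nearest-feature graph, and "inference" is a single self-attention layer with random weights. All function names (`grid_segments`, `knn_graph`, `run_sketch`, and so on) are hypothetical.

```python
import numpy as np

def grid_segments(image, cell):
    """Step 1 (stand-in): label each pixel by the grid cell it falls in."""
    h, w = image.shape[:2]
    n_cols = (w + cell - 1) // cell
    rows = np.arange(h) // cell
    cols = np.arange(w) // cell
    return rows[:, None] * n_cols + cols[None, :]

def segment_features(image, labels):
    """Mean color of every segment, shape (num_segments, channels)."""
    n = int(labels.max()) + 1
    flat = image.reshape(-1, image.shape[-1])
    sums = np.zeros((n, flat.shape[1]))
    np.add.at(sums, labels.ravel(), flat)  # accumulate pixels per segment
    counts = np.bincount(labels.ravel(), minlength=n)
    return sums / counts[:, None]

def knn_graph(features, k):
    """Step 2: symmetric k-nearest-neighbor adjacency in feature space."""
    dist = np.linalg.norm(features[:, None] - features[None, :], axis=-1)
    np.fill_diagonal(dist, np.inf)  # a node is never its own neighbor
    nearest = np.argsort(dist, axis=1)[:, :k]
    adj = np.zeros((len(features), len(features)))
    adj[np.arange(len(features))[:, None], nearest] = 1.0
    return np.maximum(adj, adj.T)  # make edges undirected

def laplacian_positional_embedding(adj, dim):
    """Step 3 (assumed form): low-frequency eigenvectors of L = D - A."""
    lap = np.diag(adj.sum(axis=1)) - adj
    _, vecs = np.linalg.eigh(lap)  # eigenvalues come back in ascending order
    # Drop the first eigenvector, which is constant when the graph is connected.
    return vecs[:, 1:dim + 1]

def self_attention(tokens, rng):
    """Step 4 (toy): one self-attention layer with random, untrained weights."""
    d = tokens.shape[1]
    wq, wk, wv = (rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(3))
    q, k, v = tokens @ wq, tokens @ wk, tokens @ wv
    scores = q @ k.T / np.sqrt(d)
    scores -= scores.max(axis=1, keepdims=True)  # numerically stable softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ v, weights

def run_sketch(seed=0):
    rng = np.random.default_rng(seed)
    image = rng.random((32, 32, 3))          # random stand-in image
    labels = grid_segments(image, cell=8)    # 4 x 4 = 16 segments
    feats = segment_features(image, labels)  # (16, 3)
    adj = knn_graph(feats, k=3)
    pos = laplacian_positional_embedding(adj, dim=4)  # (16, 4)
    tokens = np.concatenate([feats, pos], axis=1)     # (16, 7)
    out, weights = self_attention(tokens, rng)
    return labels, adj, tokens, out, weights

if __name__ == "__main__":
    labels, adj, tokens, out, weights = run_sketch()
    print(tokens.shape, out.shape)
```

Note that no convolution appears anywhere: each segment becomes one token, its position is defined by the graph rather than by a pixel grid, and attention decides which segments interact, which is one way to read the selective-attention intuition above.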
