CLIP

Creator
Seonglae Cho
Created
2022 Apr 20 2:46
Editor
Edited
2025 Mar 24 23:1
Refs

Contrastive Language-Image Pre-training

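CLIP connects text and images. Its text encoder and image encoder are trained jointly with Contrastive Learning on paired text and images, so that matching pairs land close together in a shared embedding space. CLIP maps input text to an Embedding vector that can be compared directly with image embeddings. This enables zero-shot classification from plain text labels, which greatly reduces the Image Labeling requirement.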
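The contrastive objective and the zero-shot use of text embeddings can be sketched in a few lines. This is a minimal toy illustration, not CLIP's actual implementation: the embeddings are hand-written 3-dimensional vectors standing in for encoder outputs, the function names are hypothetical, and the fixed `scale` value is an assumption standing in for CLIP's learned temperature.

```python
import math

def normalize(v):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def log_softmax(xs):
    """Numerically stable log-softmax over a list of logits."""
    m = max(xs)
    lse = m + math.log(sum(math.exp(x - m) for x in xs))
    return [x - lse for x in xs]

def similarity_logits(image_embs, text_embs, scale=100.0):
    """Scaled cosine similarity between every image and every text."""
    imgs = [normalize(v) for v in image_embs]
    txts = [normalize(v) for v in text_embs]
    return [[scale * sum(a * b for a, b in zip(i, t)) for t in txts] for i in imgs]

def contrastive_loss(image_embs, text_embs, scale=100.0):
    """Symmetric cross-entropy: the i-th image should match the i-th text."""
    logits = similarity_logits(image_embs, text_embs, scale)
    n = len(logits)
    # Image -> text direction: softmax over each row.
    loss_i2t = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    # Text -> image direction: softmax over each column.
    cols = [[logits[i][j] for i in range(n)] for j in range(n)]
    loss_t2i = -sum(log_softmax(cols[j])[j] for j in range(n)) / n
    return (loss_i2t + loss_t2i) / 2

def zero_shot_classify(image_emb, text_embs, labels, scale=100.0):
    """Pick the label whose text embedding is most similar to the image."""
    log_probs = log_softmax(similarity_logits([image_emb], text_embs, scale)[0])
    probs = [math.exp(lp) for lp in log_probs]
    best = max(range(len(labels)), key=lambda k: probs[k])
    return labels[best], probs

# Hypothetical embeddings; a real system would get these from the encoders.
labels = ["cat", "dog"]
text_embs = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.1]]   # e.g. "a photo of a cat" / "a photo of a dog"
image_emb = [0.9, 0.2, 0.1]                      # toy image embedding near the "cat" text
label, probs = zero_shot_classify(image_emb, text_embs, labels)

# Matching pairs give a lower loss than swapped pairs.
aligned = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

Because classification only needs one text embedding per candidate label, new classes can be added by writing new prompts rather than by collecting labeled images.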
CLIP Usages
 
 
 
 
 

Enhanced usages for Image Segmentation and Object Detection

Minderer et al., Simple Open-Vocabulary Object Detection with Vision Transformers, 2022
Lüddecke and Ecker, Image Segmentation Using Text and Image Prompts, 2022
 
 
 
 

Recommendations