Computer Vision needs classification, segmentation and object detection through prompts. A single model that can do all of this, just like GPT for NLP. Are Open-Vocabulary object detectors like ViLD [https://t.co/vKBZizGVef] or OV-DETR [https://t.co/tvM9
Computer Vision – ECCV 2022
Springer Nature Switzerland