Oscar and VinVL
-
Updated
Aug 28, 2023 - Python
Oscar and VinVL
Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.
Python dictionary storing object tags for MS-COCO images. Data from 3 different sources (COCO ground truths, VG classifier and Microsoft's VinVL) are availible.
A simplified visual backbone for feature extraction, bounding boxes, and object detection using VinVL.
Use MDSANet for Image Captions Vietnamese
Vision-language model for generating Arabic image captions using Bidirectional Transformers (BiT) and advanced feature fusion.
Add a description, image, and links to the vinvl topic page so that developers can more easily learn about it.
To associate your repository with the vinvl topic, visit your repo's landing page and select "manage topics."