Comments (3)
Also, the image loading process does not seem to respect the path defined in lavis/configs/default.yaml. The provided annotation JSON file for Visual Genome uses image locations under '/export/xxxx', which causes errors when loading images.
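One possible local workaround, until the annotations are fixed upstream, is to rewrite the path prefix in a copy of the annotation file. The sketch below is not part of LAVIS. It assumes the annotation file is a JSON list of dicts whose image path is stored under an `image` key. The function name `remap_image_paths` and its parameters are hypothetical, and the prefix to replace must be read from your own copy of the file.

```python
from pathlib import PurePosixPath


def remap_image_paths(annotations, old_prefix, new_root, image_key="image"):
    """Return copies of the entries with old_prefix swapped for new_root.

    Assumption: each entry is a dict holding its image path under image_key.
    Entries whose path does not start with old_prefix are copied unchanged.
    """
    remapped = []
    for entry in annotations:
        entry = dict(entry)  # shallow copy so the input list is not mutated
        path = entry[image_key]
        if path.startswith(old_prefix):
            relative = path[len(old_prefix):].lstrip("/")
            entry[image_key] = str(PurePosixPath(new_root) / relative)
        remapped.append(entry)
    return remapped
```

To use it, load the annotation file with `json.load`, pass the resulting list through `remap_image_paths`, write the result to a new file with `json.dump`, and point the dataset configuration at that new file.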
from lavis.
Hi @happywu , thanks for raising the issue.
We have now updated the SBU annotation URL; see the commit for details.
For Conceptual Captions, a portion of the data is usually missing after downloading, so our annotations will not be universally applicable to users' local copies. You may instead use the following scripts to generate your own annotations, then update the configuration files with the new annotation path.
lavis/datasets/download_scripts/DownloadConceptualCaptions/create_annotation_3m.ipynb
lavis/datasets/download_scripts/DownloadConceptualCaptions/create_annotation_12m.ipynb
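The idea behind regenerating annotations can be illustrated with a minimal sketch: keep only the entries whose image is actually present in the local copy. This is not the code from the notebooks above. It assumes a list of dicts with a relative image path under an `image` key, and the name `filter_existing` and its parameters are hypothetical.

```python
import os


def filter_existing(annotations, image_root, image_key="image", exists=os.path.isfile):
    """Keep only annotation entries whose image file is present locally.

    Assumption: entry[image_key] is a path relative to image_root.
    The exists predicate is injectable so the check can be swapped or tested.
    """
    return [
        entry
        for entry in annotations
        if exists(os.path.join(image_root, entry[image_key]))
    ]
```

The filtered list can then be written out with `json.dump`, and the annotation path in the dataset configuration updated to point to the new file.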
Let us know how it works.
Thanks.
Thanks for updating the URL and for the guidance on generating CC annotations!