Comments (6)
Hi, @1024er,
Thanks for raising this. This definitely needs fixing. I'll work on this this week.
Thanks.
from lavis.
Hi, @1024er,
Thanks for raising this. This definitely needs fixing. I'll work on this this week.
Thanks.
Has it been fixed ? thank you ~
from lavis.
Hi @1024er ,
It seems the annotations of SBU captions are not properly addressing the image directory structure in the zip.
I have now updated the downloading script to directly fetch images from urls. Though I wouldn't be surprised if some urls deprecate as they will.
Let me know how it works.
Thanks.
from lavis.
Hi, I tried the new annotation file, but I still found a lot of images were missing. I am wondering if there is a script to generate an annotations file based on available images.
from lavis.
Hi, I tried the new annotation file, but I still found a lot of images were missing. I am wondering if there is a script to generate an annotations file based on available images.
I've encountered the same issue. Would you mind providing the processed images via google drive? @dxli94
from lavis.
Hi, I tried the new annotation file, but I still found a lot of images were missing. I am wondering if there is a script to generate an annotations file based on available images.
I've encountered the same issue. Would you mind providing the processed images via google drive? @dxli94
I'll give a processing script for masking the non-valid records of sbu captions.
import tqdm
import os
nonvalid_records=[]
valid_records=[]
with open('sbu_captions/annotations/sbu.json', "r") as f:
dset=json.load(f)
def check_file_exists(filename,path):
exist=os.path.exists(os.path.join(path,filename))
return exist
for ann in tqdm.tqdm(dset):
exist=check_file_exists(ann['image'],'sbu_captions/images')
if exist:
valid_records.append(ann)
else:
nonvalid_records.append(ann)
print('not valid records',len(nonvalid_records),'valid records',len(valid_records))
print("saving valid")
with open('sbu_captions/annotations/sbu_valid.json', "w") as f:
dset=json.dump(valid_records,f)
print("saving nonvalid")
with open('sbu_captions/annotations/sbu_nonvalid.json', "w") as f:
dset=json.dump(nonvalid_records,f)
from lavis.
Related Issues (20)
- how to deal with โMissing keys โ HOT 1
- huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: ''.
- How should I use blip2 for vqa task training? HOT 3
- XInstructBLIP demo text generation
- salesforce-lavis 1.0.2 requires transformers<4.27,>=4.25.0, but you have transformers 4.40.0 which is incompatible.
- Use BLIP-2 for Image Captioning HOT 1
- safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge HOT 2
- Loss calculation across GPUs using all_gather_with_grad function
- what should be samples["text_output"] during finetuning HOT 3
- AttributeError: 'NoneType' object has no attribute 'from_pretrained' HOT 1
- Blip2-caption only generate "a photo of"
- About Text Preprocessing of InstructBLIP
- How are Learned Queries generated? What line of code is implemented?
- CUstom dataset Inference
- How is `Total Params` calculated?
- cache_version Value Error
- How can I change cache_root?
- generated output error HOT 2
- A question for BLIP2 Q-Former HOT 1
- BLIP-2 paper finetune replicate low performance: BLEU_4 score is 0.15 for finetuning COCO_caption HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lavis.