Comments (2)
Hi, thanks for your interest in TextDiffuser. Does this issue exist in few cases? Sometimes the image owner may replace the image, resulting in the mismatch of downloaded image.
from unilm.
Hi, thanks for your interest in TextDiffuser. Does this issue exist in few cases? Sometimes the image owner may replace the image, resulting in the mismatch of downloaded image.
Finally, I have to use the index you provided in “laion-ocr-index-url.txt” and write a new script to move and rename the image, so that can match the annotaions.
Another question is could I resize the image directly in the "train.py/inference.py", such as
image = Image.open(image_path).convert("RGB")
image = image.resize((512,512))
from unilm.
Related Issues (20)
- Kosmos-2.5 - Image-to-markdown generation for images outside the sample-set provided is almost entirely garbled - output markdown is completely unusable. HOT 1
- Kosmos-2.5 - Python version 3.10.x (and any other confirmed working versions) should be mentioned as a requirement to deploy & infer the Kosmos-2.5 model
- Kosmos-2 batch modality and processing speed
- how to resume training for layoutlmv3-publaynet
- TextDiffuser - SD2.1 code and weights
- kosmos-2.5 | trying to connect to `openaipublic.blob.core.windows.net`
- layoutlmv3 pretrain
- Compile LayoutLMv3 using Neuron SDK for AWS Inferentia (inf1)
- Beit-3 , clarification on classification task and associated pretrained weights
- Maybe some bugs of YOCO HOT 5
- using beit-3 to extract text and images into feature vectors (numpy type)
- Error when Installing Requirements
- Error doing Textdiffuser2 inpainting demo HOT 1
- Unable to download the models and test dataset of trocr HOT 2
- layoutlmv3 chinese 模型训练和推理
- Possible TrOCR License Conflict
- [BEATs] How to handle different length of audio files?
- How can I have dit document layout analysis checkpoints?
- a problem by textdiffuser dataset
- Textdiffuser-2 : runwayml/stable-diffusion-v1-5 model unavailable HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from unilm.