Comments (4)
Thanks for your note.
I've also spotted this before. But the preprocessing is just following the Karpathy's step, in which he just applied the nltk tokenizer.
Hope this helps.
from adaptive.
But it's actually skewing the evaluation results because a different tokenizer creates a different vocabulary. For accurate results, both the training and the validation/test captions need to be tokenized in the same way.
from adaptive.
Yeah, you are totally right. That's what most of the recent papers are doing, as they were following Karpathy's standard.
from adaptive.
@ KeepingItClassy
Could you share your test code? Code can be achieved on the screen images and captions. Thanks very much.
from adaptive.
Related Issues (19)
- How to make a test? HOT 4
- When test has a problem,please help HOT 1
- When I try to run train.py the attribute error occurs HOT 4
- train.py
- No module named 'bleu_scorer' for image captioning
- when testing HOT 1
- is batch_size_t work?
- adaptive.py dimension error HOT 9
- how to find accuracy for train.py ?
- bug,help
- I got CIDer 0.82 only ,could you please help me about how I can imporve the score?thanks
- is this project completely implement the result of the paper? HOT 3
- bug
- pre-trained model? HOT 7
- How to count len( data_loader )? HOT 1
- Training on MSCOCO and testing on Flickr HOT 4
- Encoder encodes the same image differently HOT 10
- Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from adaptive.