GithubHelp home page GithubHelp logo

Comments (10)

dandelin avatar dandelin commented on August 23, 2024 1

@JACKHAHA363
Sure, you can grab it here https://www.dropbox.com/s/lcqmbx587szaox3/vilt_100k_wwm_pretrain.ckpt?dl=0 (will be expired someday)

from vilt.

dandelin avatar dandelin commented on August 23, 2024

@JACKHAHA363

The fine-tuning results can be unstable due to augmentations. Also, we have only trained the IR/TR fine-tuning models for a single time.
You may increase the training epochs (greater than 10 epochs, maybe 20 epochs?) to get more stable and better results.

from vilt.

JACKHAHA363 avatar JACKHAHA363 commented on August 23, 2024

I tried longer epochs but that end up overfitting with increasing val loss. Would you mind providing the checkpoint for 100k steps also?

from vilt.

JACKHAHA363 avatar JACKHAHA363 commented on August 23, 2024

thanks @dandelin!

from vilt.

yangxiaofeng avatar yangxiaofeng commented on August 23, 2024

Are you able to solve this issue? @JACKHAHA363 I have similar issues on both flicker and coco retrieval.

from vilt.

byougert avatar byougert commented on August 23, 2024

Hi, bro.
I found ir/tr evaluation result on flickr is still unstable even using official finetuned checkpoint. Sometimes I got 63.94(ir)/83.6(tr), sometimes it changed to 64.3(ir)/83.7(tr). How do you think it? @dandelin @JACKHAHA363

from vilt.

byougert avatar byougert commented on August 23, 2024

from vilt.

dandelin avatar dandelin commented on August 23, 2024

Hi @byougert

Oops, you got the mail. I deleted the comment right after posted it as I noticed I put shuffle=False in DistributedSampler(image_dset, shuffle=False).

Though after quick investigation, I found the true reason.
It was the precision=16, set in https://github.com/dandelin/ViLT/blob/master/run.py#L51.
After setting precision=32 during evaluation I was able to get stable result.

I guess the score from rank_output is very cluttered so they need larger precision.
Thanks for the report and I will revise the EVAL.md. :)

from vilt.

byougert avatar byougert commented on August 23, 2024

Hi, bro.
Yes, i received your message in my mail but couldn't find the reply in github. hhhh....
Thanks for your reply and nice work.

from vilt.

byougert avatar byougert commented on August 23, 2024

Hi, @dandelin
I'm sorry to say that the result seems still puzzled. Last night, when I changed precision to 32 during evaluation, two similar but NOT SAME results appeared, which showed one was 0.6480(ir)/0.8370(tr) but the other was 0.6460(ir)/0.8370(tr).
Acatlly, seed is exactly fixed to 0. I have no idea what causes the differece. Y_Y

from vilt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.