GithubHelp home page GithubHelp logo

Model size conflit about electra HOT 3 CLOSED

google-research avatar google-research commented on July 2, 2024
Model size conflit

from electra.

Comments (3)

heartcored98 avatar heartcored98 commented on July 2, 2024 1

Hi, I really like ELECTRA model. but I am also wondering the reason you chosen to upload the generator model with the hidden size of 256 in case of small model as the official model. Because Figure 3 in your paper shows that generator hidden size with 64 was the best model when the discriminator hidden size was 256.

Is generator of hidden size 256 best for the small model?

from electra.

clarkkev avatar clarkkev commented on July 2, 2024

Yes, as we note in the README the generator size is 1 for the ELECTRA-Small model. We will update the paper with this information too.

from electra.

lucadiliello avatar lucadiliello commented on July 2, 2024

Which generator_size did you use in the paper when reporting results on electra small?

from electra.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.