GithubHelp home page GithubHelp logo

good job!i have a question about acon HOT 12 OPEN

nmaac avatar nmaac commented on July 28, 2024
good job!i have a question

from acon.

Comments (12)

nmaac avatar nmaac commented on July 28, 2024 1

@jinfagang It depends on hardware platform, normally 10%-20% latency increment.

from acon.

nmaac avatar nmaac commented on July 28, 2024

没错。p1, p2 和 conv 权重都是可学习参数。

from acon.

xuhao-anhe avatar xuhao-anhe commented on July 28, 2024

你好,我想咨询一下meta-acon里面的r具体作用是啥,可以随便改变默认值吗

from acon.

nmaac avatar nmaac commented on July 28, 2024

r是channel放缩系数,可以根据你的需要调整,对准确率影响不大。这是个很常用的减少参数量的技巧,早在2016年的HyperNetworks就已经是常规做法。

这里可以引用原文的解释来回答:“a one-layered hypernetwork would have Nz × Nin × fsize × Nout × fsize learnable
parameters which is usually much bigger than a two-layered hypernetwork does.”

from acon.

lucasjinreal avatar lucasjinreal commented on July 28, 2024

@nmaac Did u tested how many speed drop if using MetaAcon compare with normal activation without learnable params? I saw there is no such comparsion in paper but seems will introduce latency increase

from acon.

lucasjinreal avatar lucasjinreal commented on July 28, 2024

@nmaac Oh....

from acon.

nmaac avatar nmaac commented on July 28, 2024

@jinfagang But ACON is a good choice which has the same speed with Swish, and they have the same speed with ReLU if using hard-sigmoid to implement :)

from acon.

lucasjinreal avatar lucasjinreal commented on July 28, 2024

@nmaac You mean ACON-ABC?

from acon.

nmaac avatar nmaac commented on July 28, 2024

@jinfagang I suggest ACON-C which improves the performance with a negligible overhead and shows a good accuracy-speed tradeoff.

from acon.

yxNONG avatar yxNONG commented on July 28, 2024

@nmaac I had a question about beta in MetaACON, in paper it mention when beta goes infinite, the loss function will change to the max(x1, x2). However, in MetaACON, beta is generaed by sigmoid function which means that the range of beta is (0, 1).
Is there any reason for this choice?

from acon.

nmaac avatar nmaac commented on July 28, 2024

@yxNONG MetaAcon uses a small network to generate beta, in this work we try some network examples which show sigmoid has good performance. More choices and designs of this small network is not the focus of this work but is a promising future direction.

from acon.

yxNONG avatar yxNONG commented on July 28, 2024

@nmaac got it, i will try the ReLU and identities which is much more make sense to me, thanks for your reply!

from acon.

Related Issues (16)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.