GithubHelp home page GithubHelp logo

Comments (3)

yujiali avatar yujiali commented on September 2, 2024

We have more detailed description about the data we used in the paper.

For the control-flow graphs, we compile source code, e.g. a function using different compilers / compiler options to get a set of different binaries. The binaries corresponding to the same function are labeled as positive (similar), and binaries associated with different functions are labeled as negative (not similar). For COIL-DEL objects within the same class are labeled as similar, otherwise not similar.

In general, if you have a similarity value, instead of a binary label, you have a few different options (these are beyond the scope of our paper, and are purely suggestions): (1) use the similarity value to get binary labels, for example by applying a threshold on it to get pairs; or (2) if you want to do triplet training, then if the similarity value between A and B are larger than A and C then (A,B) is positive in the triplet, and (A, C) is negative; (3) potentially even better, you could change the loss to make this more of a regression problem to fully utilize the similarity values you have.

from deepmind-research.

diegolascasas avatar diegolascasas commented on September 2, 2024

Closing this issue, feel free to reopen it if you have further questions.

from deepmind-research.

RZZBlackMagic avatar RZZBlackMagic commented on September 2, 2024

@psp3dcg hello, I would like learn GMN, Is there a public data set for this project? If not, Could you tell me the format of the dataset?
Thanks
Best wishes

from deepmind-research.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.