GithubHelp home page GithubHelp logo

Comments (5)

FreyrS avatar FreyrS commented on August 23, 2024 1

Hi @ngszyba,

What @rmwu is absolutely correct.
The first 16 values are the "input features" consisting of 10 mean and gaussian curvatures at different scales + 6 learned chemical features, then there come 16 values which are the point descriptors after the convolution ("embedding"), then there are interface predictions (1 value) and interface labels (1 value) if there are any

from dmasif.

rmwu avatar rmwu commented on August 23, 2024

Not sure if the authors are still replying but you can find the information that it saves in data_iteration.py line 94

coloring = torch.cat([inputs, embedding, predictions, labels], axis=1)

Where inputs is computed from line 358 of model.py and embedding is the output from their model, computed at line 413 of the same file. If I recall correctly, embedding is only dim=16 or so.

from dmasif.

ngszyba avatar ngszyba commented on August 23, 2024

Hi, @rmwu

Thanks so much for such a quick answer and pointing towards data_iteration.py and model.py.
This is already a great hint. I assume that the columns could be deciphered in this section of the model.py (line 481) :

"P1": P1,
"P2": P2,
"R_values": R_values,
"conv_time": conv_time,
"memory_usage": memory_usage,

However, I still find it pretty difficult to get whatever is packed into P1 and P2.
It would be great to get a feature list with a brief description e.g. is time in seconds?, etc.

cheers,

from dmasif.

jeanfeydy avatar jeanfeydy commented on August 23, 2024

Hi @ngszyba, @rmwu,

Thanks for your interest in our work!

Just a quick word to say that we are still replying but:

  • Unfortunately, it’s hard to be as reactive as we would like to be (e.g. I’m now working mostly on public health problems, quite far away from proteins, and am discovering the joys of paperwork as a “tenured/senior” researcher…).
  • I’m only sharp on low-level problems related to e.g. KeOps or the convolutional layers.
  • @FreyrS is by far the sharpest person on the global structure of the dMaSIF repository, but he may be very busy at the moment - plus his PhD is coming to an end soon.

I have lost track of the structure of our files, but an important thing to note is that our dMaSIF model produces trained ”neural” features that do not have a clear physical meaning. P1 and P2 just contain “one vector of trained features per surface point”, where our feature extractor has been optimized with respect to a certain task (see our three papers on this: Nature methods, CVPR and MLDD). The lack of interpretability is unavoidable here, even though we’re trying to mitigate this as much as possible with our architectures.

I hope that @FreyrS will be able to confirm that predfeatures_emb1.npy does indeed contain the output of our convolutional layers :-)
Best regards,
Jean

from dmasif.

ngszyba avatar ngszyba commented on August 23, 2024

Hi @jeanfeydy!

Thanks for your input!
I understand that interpretation of these features in an "absolute" manner is not advisable. In our project we are focusing on side-by-side comparisons of the surface features and projecting it on other protein properties like enzymatic activity and we wanted to give dMaSIF a shot.

Would be great if @FreyrS can come back with the column/features list at a later time.

thank you guys,

cheers!

from dmasif.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.