I'm comparing results between the pretrained m3gnet in this repo and in the original m

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Are discrepancies between old m3gnet repo expected? about matgl HOT 8 CLOSED

matthewkuner commented on June 24, 2024

Are discrepancies between old m3gnet repo expected?

from matgl.

Comments (8)

shyuep commented on June 24, 2024

How different are the atomic positions? While we adopted mostly the same training protocols, the pre-trained M3GNet in this repo is not an exact replica of the previous M3GNet-TF. I would expect the differences in atomic positions to not be large. Energy errors within the MAE of the potentials (30-40 meV/atom) are not surprising.

from matgl.

shyuep commented on June 24, 2024

I should add that there is no easy way to port model weights directly over from TF to DGL/Pytorch. So that's why we had to retrain. In any case, this is a baseline model (just to make sure we are reproducing the broad error characteristics of the TF version) and we will provide improved models as we go along.

from matgl.

matthewkuner commented on June 24, 2024

How different are the atomic positions?
Not significantly different in the structures I've tested, but enough that I could tell it wasn't just noise. The energy is the more noteworthy difference imo

from matgl.

shyuep commented on June 24, 2024

Yeah for the atomic positions, we usually get to within 1% of the DFT. So I would expect the deviation in atomic positions to be less significant (but not below noise level). There are definite uncertainties in the energies. Better for some systems (e.g., oxides) but worse for others.

from matgl.

shyuep commented on June 24, 2024

I have redone the cubic crystal test (see examples) with the new matgl implementation. The error characteristics are largely similar to the old m3gnet. We did discover some minor data issues and the new M3gnet is fitted with further filtered data (e.g., some problematic structures with very large forces were removed). So again, not an exact replica of the TF M3GNet but basically similar performance-wise. I will close this issue but feel free to reopen if you discover any serious issues with the new implementation.

from matgl.

shyuep commented on June 24, 2024

@kenko911 Can provide further details on the additional filtering done. Pls write it in the README.

from matgl.

ThePauliPrinciple commented on June 24, 2024

Do I understand correctly that the new architecture of the model is also slightly different (the number of parameters seems to be different). Can any details be given about this? It might be relevant.

from matgl.

shyuep commented on June 24, 2024

The differences are relatively minor. The embedding sizes etc. are all the same. The only slight difference is in the length of the bond expansion I believe. Otherwise, the activation, optimizers, etc. are all the same.

It is not possible to exactly replicate the old model given we are moving to an entirely different code base. But this is pretty close. Our focus is on improving the models going forward and this model is just a baseline.

from matgl.

Recommend Projects

Are discrepancies between old m3gnet repo expected? about matgl HOT 8 CLOSED

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs