comp6's Introduction

COmprehensive Machine-learning Potential (COMP6) Benchmark Suite

This repository contains the COMP6 benchmark for evaluating the extensibility of machine-learning based molecular potentials.

If you use the COMP6 benchmark please cite this paper:

Active learning-based (ANI-1x):

Justin S. Smith, Ben Nebgen, Nicholas Lubbers, Olexandr Isayev, Adrian E. Roitberg. Less is more: sampling chemical space with active learning. The Journal of Chemical Physics 148, 241733 (2018). https://aip.scitation.org/doi/abs/10.1063/1.5023802

Usage

Please read the README.md in the repository linked below for instructions on how to extract the COMP6 HDF5 (extension *.h5) files: https://github.com/isayev/ANI1_dataset

The following paper contains a description of the file format: https://www.nature.com/articles/sdata2017193

COMP6 Benchmark Results:

Each row reports the errors (MAE/RMSE) over the entire benchmark for the single ML potential named in column 1. Please read Section IID of https://aip.scitation.org/doi/abs/10.1063/1.5023802 for a detailed description of the error metrics.
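For readers unfamiliar with the error key, here is a minimal Python sketch of the generic MAE and RMSE definitions. The function names and sample values are illustrative assumptions, not taken from the benchmark; Section IID of the paper remains the authoritative description of how the reported errors are computed.

```python
import math


def mae(predicted, reference):
    """Mean absolute error between two equal-length, non-empty sequences."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(reference)


def rmse(predicted, reference):
    """Root-mean-square error between two equal-length, non-empty sequences."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    mean_sq = sum((p - r) ** 2 for p, r in zip(predicted, reference)) / len(reference)
    return math.sqrt(mean_sq)


# Made-up energies in kcal/mol, for illustration only (not benchmark data).
reference = [0.0, 1.0, 2.0, 3.0]
predicted = [0.5, 1.0, 1.0, 5.0]

print(f"MAE/RMSE: {mae(predicted, reference):.3f}/{rmse(predicted, reference):.3f}")
```

RMSE weights large deviations more heavily than MAE, so a wide gap between the two numbers in a table row suggests a few large outliers.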

Please contact Justin S. Smith at [email protected] if you'd like to add your results from the COMP6 benchmark.

Complete COMP6 benchmark results:

Potential   Energy      Relative Energy   Force
ANI-1x¹     1.93/3.37   1.85/2.95         3.09/5.29
ANI-1¹      5.01/16.9   3.01/6.97         3.70/7.13

Units: kcal/mol and kcal/mol/Å (errors are NOT per atom)
Error key: MAE/RMSE

Related work

ANAKIN-ME ML Potential Method:

Justin S. Smith, Olexandr Isayev, Adrian E. Roitberg. ANI-1: An extensible neural network potential with DFT accuracy at force field computational cost. Chemical Science, 2017, DOI: 10.1039/C6SC05720A

Original ANI-1 data:

Justin S. Smith, Olexandr Isayev, Adrian E. Roitberg. ANI-1, A data set of 20 million calculated off-equilibrium conformations for organic molecules. Scientific Data, 4, Article number: 170193, DOI: 10.1038/sdata.2017.193 https://www.nature.com/articles/sdata2017193

Active learning and transfer learning-based (ANI-1ccx):

Justin S. Smith, Benjamin T. Nebgen, Roman Zubatyuk, Nicholas Lubbers, Christian Devereux, Kipton Barros, Sergei Tretiak, Olexandr Isayev, Adrian Roitberg. Outsmarting Quantum Chemistry Through Transfer Learning. ChemRxiv, 2018, DOI: https://doi.org/10.26434/chemrxiv.6744440.v1

comp6's Issues

Units

Apologies if this information is available somewhere and I missed it. What are the units of various quantities in this data set?

My best guesses are:

  • coordinates: Bohr
  • energy: Hartree
  • force: Hartree/Bohr
  • charge/density: a.u.
  • dipole: debye
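If the guesses above are right (they are unconfirmed in this thread), comparing dataset values with the benchmark table requires converting atomic units to kcal/mol and Å. The sketch below uses the standard CODATA conversion factors; the function names are hypothetical and not part of any COMP6 tooling.

```python
# Standard conversion factors (CODATA 2018 values).
HARTREE_TO_KCAL_PER_MOL = 627.5094740631
BOHR_TO_ANGSTROM = 0.529177210903


def energy_to_kcal_per_mol(energy_hartree):
    """Convert an energy from Hartree to kcal/mol."""
    return energy_hartree * HARTREE_TO_KCAL_PER_MOL


def length_to_angstrom(length_bohr):
    """Convert a length from Bohr to Angstrom."""
    return length_bohr * BOHR_TO_ANGSTROM


def force_to_kcal_per_mol_per_angstrom(force_hartree_per_bohr):
    """Convert a force component from Hartree/Bohr to kcal/mol/Angstrom."""
    return force_hartree_per_bohr * HARTREE_TO_KCAL_PER_MOL / BOHR_TO_ANGSTROM


# Assumes the inputs really are in atomic units, as guessed in the list above.
print(energy_to_kcal_per_mol(1.0))
print(force_to_kcal_per_mol_per_angstrom(1.0))
```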

The sign of forces is incorrect

I found that the sign of the forces in the dataset is incorrect. Take the first frame of the ANI-MD dataset as an example.
Below are the forces given by Gaussian (unit: Hartree/Bohr):
[image: forces reported by Gaussian]
and here are the forces in the dataset (extracted using example_data_sampler.py, unit: Hartree/Angstrom):
[image: forces stored in the dataset]
After accounting for the conversion from Bohr to Angstrom (1 Bohr ≈ 0.529 Angstrom), the magnitudes match but the signs are opposite, so the sign in the dataset appears to be wrong.
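The comparison described in this report can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the numbers are made up (the original values were posted as images and are not reproduced here), and the units are assumed to be those quoted in the report.

```python
BOHR_TO_ANGSTROM = 0.529177210903  # standard conversion factor


def to_hartree_per_angstrom(forces_hartree_per_bohr):
    """Convert force components from Hartree/Bohr to Hartree/Angstrom."""
    return [f / BOHR_TO_ANGSTROM for f in forces_hartree_per_bohr]


def sign_relation(reference_ha_per_bohr, dataset_ha_per_angstrom, rel_tol=1e-3):
    """Return 'same', 'opposite', or 'inconsistent' after unit conversion."""
    if len(reference_ha_per_bohr) != len(dataset_ha_per_angstrom):
        raise ValueError("force arrays must have the same length")
    converted = to_hartree_per_angstrom(reference_ha_per_bohr)
    # Tolerance scaled to the largest converted component (1.0 if all are zero).
    scale = max((abs(c) for c in converted), default=0.0) or 1.0
    tol = rel_tol * scale
    pairs = list(zip(converted, dataset_ha_per_angstrom))
    if all(abs(d - c) <= tol for c, d in pairs):
        return "same"
    if all(abs(d + c) <= tol for c, d in pairs):
        return "opposite"
    return "inconsistent"


# Made-up force components, NOT the values from the issue.
reference = [0.010, -0.020, 0.005]                        # Hartree/Bohr
dataset = [-f / BOHR_TO_ANGSTROM for f in reference]      # Hartree/Angstrom, flipped
print(sign_relation(reference, dataset))
```

A result of "opposite" would be consistent with the dataset storing energy gradients rather than forces, but that interpretation is not confirmed anywhere in this thread.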
