GithubHelp home page GithubHelp logo

rpw199912j / mit_classification_demographics Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 2.79 MB

A survey analysis to evaluate how well domain experts can classify materials with metal-insulator transition.

License: MIT License

Jupyter Notebook 98.16% R 1.84%

mit_classification_demographics's Introduction

MIT Classification Demographics

This repository is to accompany the supporting demographics analysis in the paper "Database, Features, and Machine Learning Model to Identify Thermally Driven Metal−Insulator Transition Compounds".

Check out our paper on Chemistry of Materials:

Georgescu, A. B.; Ren, P.; Toland, A. R.; Zhang, S.; Miller, K. D.; Apley, D. W.; Olivetti, E. A.; Wagner, N.; Rondinelli, J. M. Database, Features, and Machine Learning Model to Identify Thermally Driven Metal−Insulator Transition Compounds. Chem. Mater. 2021. DOI: 10.1021/acs.chemmater.1c00905.

The code for the XGBoost classifiers for metal-insulator transition compounds can be found at this repository.

Research Question

This project involves the use of a survey sent out to around 200 people (e.g. materials scientists, physicists, chemists) to see how well domain experts will do when classifying the Metal-Insulator Transition (MIT) compounds and to examine whether or not MIT classification is a trivial task for human experts.

Workflow

Data procurement

The survey contained 18 chemical compounds, including 6 metals, 7 insulators and 5 MIT compounds. For each compound, the respondents were asked to perform 3 tasks:

  1. classify the compound as either a metal or an insulator
  2. determine if the compound exhibits MIT behavior
  3. identify chemical and structural descriptors used in determining the conductivity class of the materials.

A list of 11 descriptors were provided in the survey and respondents were asked to add in descriptors they used that were not already provided.

Please refer to this section for the complete list of compounds and descriptors used in the survey.

Data cleaning

After the raw survey data was imported, a data cleaning process was carried out to tidy the dataset and to prepare it for later post-processing. This step used several packages from the Tidyverse collection as implemented in the R programming language.

The data cleaning script used can be found here. Even better, you can immediately start a RStudio interface right in your web browser without installing any dependecies by clicking on the launch binder icon below.

Binder

Data analysis

After the data cleaning process was complete, the processed datasets were used to analyze the classification accuracy by different demographic groups. The descriptor usage was also analyzed to see what physical and chemical descriptors were used when people were classifying MIT materials.

The Jupyter notebook used for analysis can be found here. Just like before, you can also launch an interactive JupyterLab notebook right in your web browser by clicking on the launch binder icon below.

Binder

Supporting Information

Compounds

Metals Insulators MITs
LaRuO3 LaFeO3 CaFeO3
LaNiO3 MoO3 Ca2RuO4
ReO3 MnO NbO2
MoO2 Sr2TiO4 Ti2O3
TiO Cr2O3 BaVS3
SrCrO3 KVO3
Ag2BiO3

Descriptors

Descriptors provided
Stoichiometry
Crystal structure (e.g. perovskite, rock salt, rutile)
Average metal-oxygen bond distance
Total number of valence electrons
d electron count
Mass density
Mean electronegativity of elements in formula
Polarizability of the compound
Standard deviation of average ionic radius of elements
Crystal field splitting energy
Electronic correlation

mit_classification_demographics's People

Contributors

rpw199912j avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.