GithubHelp home page GithubHelp logo

altamiracorp / awesome-xai Goto Github PK

View Code? Open in Web Editor NEW
122.0 8.0 12.0 225 KB

Awesome Explainable AI (XAI) and Interpretable ML Papers and Resources

License: Other

awesome awesome-list xai explanations saliency-methods saliency-maps

awesome-xai's Introduction

Awesome XAI Awesome

A curated list of XAI and Interpretable ML papers, methods, critiques, and resources.

Explainable AI (XAI) is a branch of machine learning research which seeks to make various machine learning techniques more understandable.

Contents

Papers

Landmarks

These are some of our favorite papers. They are helpful to understand the field and critical aspects of it. We believe this papers are worth reading in their entirety.

  • Explanation in Artificial Intelligence: Insights from the Social Sciences - This paper provides an introduction to the social science research into explanations. The author provides 4 major findings: (1) explanations are constrastive, (2) explanations are selected, (3) probabilities probably don't matter, (4) explanations are social. These fit into the general theme that explanations are -contextual-.
  • Sanity Checks for Saliency Maps - An important read for anyone using saliency maps. This paper proposes two experiments to determine whether saliency maps are useful: (1) model parameter randomization test compares maps from trained and untrained models, (2) data randomization test compares maps from models trained on the original dataset and models trained on the same dataset with randomized labels. They find that "some widely deployed saliency methods are independent of both the data the model was trained on, and the model parameters".

Surveys

Evaluations

XAI Methods

  • Ada-SISE - Adaptive semantice inpute sampling for explanation.
  • ALE - Accumulated local effects plot.
  • ALIME - Autoencoder Based Approach for Local Interpretability.
  • Anchors - High-Precision Model-Agnostic Explanations.
  • Auditing - Auditing black-box models.
  • BayLIME - Bayesian local interpretable model-agnostic explanations.
  • Break Down - Break down plots for additive attributions.
  • CAM - Class activation mapping.
  • CDT - Confident interpretation of Bayesian decision tree ensembles.
  • CICE - Centered ICE plot.
  • CMM - Combined multiple models metalearner.
  • Conj Rules - Using sampling and queries to extract rules from trained neural networks.
  • CP - Contribution propogation.
  • DecText - Extracting decision trees from trained neural networks.
  • DeepLIFT - Deep label-specific feature learning for image annotation.
  • DTD - Deep Taylor decomposition.
  • ExplainD - Explanations of evidence in additive classifiers.
  • FIRM - Feature importance ranking measure.
  • Fong, et. al. - Meaninful perturbations model.
  • G-REX - Rule extraction using genetic algorithms.
  • Gibbons, et. al. - Explain random forest using decision tree.
  • GoldenEye - Exploring classifiers by randomization.
  • GPD - Gaussian process decisions.
  • GPDT - Genetic program to evolve decision trees.
  • GradCAM - Gradient-weighted Class Activation Mapping.
  • GradCAM++ - Generalized gradient-based visual explanations.
  • Hara, et. al. - Making tree ensembles interpretable.
  • ICE - Individual conditional expectation plots.
  • IG - Integrated gradients.
  • inTrees - Interpreting tree ensembles with inTrees.
  • IOFP - Iterative orthoganol feature projection.
  • IP - Information plane visualization.
  • KL-LIME - Kullback-Leibler Projections based LIME.
  • Krishnan, et. al. - Extracting decision trees from trained neural networks.
  • Lei, et. al. - Rationalizing neural predictions with generator and encoder.
  • LIME - Local Interpretable Model-Agnostic Explanations.
  • LOCO - Leave-one covariate out.
  • LORE - Local rule-based explanations.
  • Lou, et. al. - Accurate intelligibile models with pairwise interactions.
  • LRP - Layer-wise relevance propogation.
  • MCR - Model class reliance.
  • MES - Model explanation system.
  • MFI - Feature importance measure for non-linear algorithms.
  • NID - Neural interpretation diagram.
  • OptiLIME - Optimized LIME.
  • PALM - Partition aware local model.
  • PDA - Prediction Difference Analysis: Visualize deep neural network decisions.
  • PDP - Partial dependence plots.
  • POIMs - Positional oligomer importance matrices for understanding SVM signal detectors.
  • ProfWeight - Transfer information from deep network to simpler model.
  • Prospector - Interactive partial dependence diagnostics.
  • QII - Quantitative input influence.
  • REFNE - Extracting symbolic rules from trained neural network ensembles.
  • RETAIN - Reverse time attention model.
  • RISE - Randomized input sampling for explanation.
  • RxREN - Reverse engineering neural networks for rule extraction.
  • SHAP - A unified approach to interpretting model predictions.
  • SIDU - Similarity, difference, and uniqueness input perturbation.
  • Simonynan, et. al - Visualizing CNN classes.
  • Singh, et. al - Programs as black-box explanations.
  • STA - Interpreting models via Single Tree Approximation.
  • Strumbelj, et. al. - Explanation of individual classifications using game theory.
  • SVM+P - Rule extraction from support vector machines.
  • TCAV - Testing with concept activation vectors.
  • Tolomei, et. al. - Interpretable predictions of tree-ensembles via actionable feature tweaking.
  • Tree Metrics - Making sense of a forest of trees.
  • TreeSHAP - Consistent feature attribute for tree ensembles.
  • TreeView - Feature-space partitioning.
  • TREPAN - Extracting tree-structured representations of trained networks.
  • TSP - Tree space prototypes.
  • VBP - Visual back-propagation.
  • VEC - Variable effect characteristic curve.
  • VIN - Variable interaction network.
  • X-TREPAN - Adapted etraction of comprehensible decision tree in ANNs.
  • Xu, et. al. - Show, attend, tell attention model.

Interpretable Models

Critiques

  • Attention is not Explanation - Authors perform a series of NLP experiments which argue attention does not provide meaningful explanations. They also demosntrate that different attentions can generate similar model outputs.
  • Attention is not --not-- Explanation - This is a rebutal to the above paper. Authors argue that multiple explanations can be valid and that the and that attention can produce a valid explanation, if not -the- valid explanation.
  • Do Not Trust Additive Explanations - Authors argue that addditive explanations (e.g. LIME, SHAP, Break Down) fail to take feature ineractions into account and are thus unreliable.
  • Please Stop Permuting Features An Explanation and Alternatives - Authors demonstrate why permuting features is misleading, especially where there is strong feature dependence. They offer several previously described alternatives.
  • Stop Explaining Black Box Machine Learning Models for High States Decisions and Use Interpretable Models Instead - Authors present a number of issues with explainable ML and challenges to interpretable ML: (1) constructing optimal logical models, (2) constructing optimal sparse scoring systems, (3) defining interpretability and creating methods for specific methods. They also offer an argument for why interpretable models might exist in many different domains.
  • The (Un)reliability of Saliency Methods - Authors demonstrate how saliency methods vary attribution when adding a constant shift to the input data. They argue that methods should fulfill input invariance, that a saliency method mirror the sensistivity of the model with respect to transformations of the input.

Repositories

  • EthicalML/xai - A toolkit for XAI which is focused exclusively on tabular data. It implements a variety of data and model evaluation techniques.
  • MAIF/shapash - SHAP and LIME-based front-end explainer.
  • PAIR-code/what-if-tool - A tool for Tensorboard or Notebooks which allows investigating model performance and fairness.
  • slundberg/shap - A Python module for using Shapley Additive Explanations.

Videos

Follow

Their views aren't necessarily our views. ๐Ÿ˜‰

Who else should we be following!?

Contributing

Contributions of any kind welcome, just follow the guidelines!

Contributors

Thanks goes to these contributors!

awesome-xai's People

Contributors

dgrahn avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-xai's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.