GithubHelp home page GithubHelp logo

ghost-dl's Introduction


about :: code :: license :: contribute :: issues :: contact

       


On the Value of Oversampling for Deep Learning in Software Defect Prediction

This repository is the reference implementation for the paper in the title above.

GHOST (Goal-oriented Hyperparameter Optimization for Scalable Training) is a paradigm for fast training and tuning of deep learners for software engineering. We use DODGE for hyperparameter optimziation; as such, the code is based off the DODGE repository. We use our lab's internal package, raise-utils, for standardized, high-quality code.

Usage

  • Use pip install -r requirements.txt to install all the python3 dependencies.
  • Run python3 steps.py

Within steps.py, you will see a dictionary of options that can be turned on or off. This was used to perform the ablation study. You can choose how you wish to run GHOST using this dictionary as well as the helper functions below. To run the Scott-Knott tests, we used our package's implementation (see the docs here).

Paper

Our paper was accepted to IEEE Transactions on Software Engineering 2021. You can view it here.

Cite this

@article{yedida2021value,
  title={On the Value of Oversampling for Deep Learning in Software Defect Prediction},
  author={Yedida, Rahul and Menzies, Tim},
  journal={IEEE Transactions on Software Engineering},
  year={2021},
  publisher={IEEE}
}

ghost-dl's People

Contributors

anonymousbooboo avatar timm avatar yrahul3910 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.