GithubHelp home page GithubHelp logo

yourant / ml-malware-classifier Goto Github PK

View Code? Open in Web Editor NEW

This project forked from riccorl/ml-malware-classifier

0.0 0.0 0.0 3.18 MB

Machine Learning Android Malware Classifier

License: MIT License

Python 58.43% Jupyter Notebook 41.57%

ml-malware-classifier's Introduction

Code style: black

ml-malware-classifier

Reference

Daniel Arp, Michael Spreitzenbarth, Malte Huebner, Hugo Gascon, and Konrad Rieck 
"Drebin: Efficient and Explainable Detection of Android Malware in Your Pocket", 
21th Annual Network and Distributed System Security Symposium (NDSS), February 2014
  • The original paper can be found here.
  • The original dataset can be found here.

Usage

The code is inside code folder. Use main.py to run the script.

usage: main.py [-h] [-d DATA] [--type TYPE] [-s S] method
Android Malware Classificator

positional arguments:  method       mnb=MultinomialNB, bnb=BernoulliNB, sgdc=SGDClassifier, 
                                    lsvc=LinearSVC, svm=SVM, rf=RandomForest

optional arguments:
  -h, --help   show this help message and exit
  -d DATA      path to the dataset folder
  --type TYPE
  -s S         Feature subset

The dataset is inside data folder. There are two subfolders, small_drebin, which contains a very little portion of the original dataset, and medium_drebin, which contains roughly 5500 files. By default the script uses the medium folder. To use custom data, put it inside data folder. It should have this format:

- data
  |__ custom_data
      |__ feature_vectors
          |__ file_1
          |__ file_2
          ...
          |__ file_n
      |__ sha256_family.csv  

ml-malware-classifier's People

Contributors

riccorl avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.