GithubHelp home page GithubHelp logo

info's Introduction

Fast and accurate interpretation of workload classification

These are codes for "Fast and accurate interpretation of workload classification", published in Plos One 2023.

Dataset

We use a real-world workload dataset for classification. The detailed information about dataset is summarized in the table below.

dataset # of classes input size # of train # of test of known
SEC-seq 40 1735 586,885 293,444
Memtest86-seq 31 1599 433,334 216,696

We include the preprocessed data of Memtest86-seq dataset in this repository. Due to the size limit, we upload final_data folder on google drive. You can download the data from [link]. final_data folder contains the entire feature vector for workload classification. sequences folder includes n-gram sequences selected as CMD features. The detailed structures of these directories are as follows:

current directory
├── final_data
│   ├── 7-grams
│   ├── 11-grams
│   ├── 15-grams
│   ├── bank_access_counts
│   ├── data_split_ids
│   ├── test_ngram_zeros.npy
│   └── row_col_address_access_counts
└── sequences
    ├── 7-grams.top25_osr.pkl
    ├── 11-grams.top25_osr.pkl
    └── 15-grams.top25_osr.pkl

Model

We provide a pre-trained MLP model for workload classification in models directory.

  • model_mlp.pt: a trained 2-layer MLP model

Code Description

All codes in this repository are implemented based on Python 3.7. This repository contains the code for INFO, INterpretable model For wOrkload classification.

  • The information of codes implemented in this repository.
    • main.py: train and evaluate an interpretable model.
    • cluster.py: cluster features for workload classification to generate super features for interpretation.
    • train.py: train and evaluate a workload classification model.
    • model.py: implement MLP for classification.
    • dataloader.py: load workload sequence data and extract feature vectors for classification.

To interpret the classification results, you have to type the following command.

python main.py

You can train a workload classification model through the command below.

python train.py

These commands run the codes in the default setting.

Citation

@article{shim2023fast,
  title={Fast and accurate interpretation of workload classification model},
  author={Shim, Sooyeon and Kim, Doyeon and Jang, Jun-Gi and Chae, Suhyun and Lee, Jeeyong and Kang, U},
  journal={Plos one},
  volume={18},
  number={3},
  pages={e0282595},
  year={2023},
  publisher={Public Library of Science San Francisco, CA USA}
}

info's People

Contributors

syshim77 avatar snudatalab avatar

Stargazers

Jaemin Yoo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.