vlegoy / rcatt

A Python app to predict ATT&CK tactics and techniques from cyber threat reports

License: MIT License

Python 78.63% HTML 21.37%
cyber-threat-intelligence attack ttp multi-label-classification

rcatt's Issues

data source

Where did you get the dataset used to train the model?

Is it only the labeled data from
https://github.com/mitre/cti ?
I see there are 1700 records in total, which I suppose is not enough to train a machine learning model.

Or have you thought about collecting threat reports from other sites and labeling them by hand?

BUG: Sklearn is now deprecated.

When I run rcATT_gui.py, it starts fine. When I open the website, enter some dummy text and press "Predict", the whole thing crashes and displays "ModuleNotFoundError: No module named 'sklearn.feature_selection.univariate_selection'".
Is there anything that can be done?
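
For context, a likely cause (an assumption, since nothing in this issue confirms it): scikit-learn 0.22 renamed many modules to underscore-prefixed private names, so `sklearn.feature_selection.univariate_selection` became `sklearn.feature_selection._univariate_selection`, and the old import path was dropped in later releases. A model saved with an older scikit-learn stores the old path and can no longer be loaded once that path is gone. The sketch below only checks whether the legacy path is still importable in the current environment; `module_available` is a hypothetical helper, not part of rcATT.

```python
import importlib

# Module path quoted in the error message above.
LEGACY_PATH = "sklearn.feature_selection.univariate_selection"


def module_available(name):
    """Return True if `name` can be imported in this environment."""
    try:
        importlib.import_module(name)
    except ImportError:  # ModuleNotFoundError is a subclass of ImportError
        return False
    return True


if __name__ == "__main__":
    if module_available(LEGACY_PATH):
        print("Legacy module path is importable; saved models should load.")
    else:
        print("Legacy module path is missing; saved models that reference it will fail to load.")
```

If the check reports the path as missing, the usual options are to install a scikit-learn release old enough to still provide it, or to retrain and re-save the models with the installed version.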

Added data CSV file not used in training

I noticed that when running the train command, the "added data" CSV file classification_tools/data/training_data_added.csv is not actually appended to the "original" training dataset:

train_data_df.append(train_data_added, ignore_index = True)

Note that the Pandas DataFrame.append function returns a new DataFrame and does not modify its input, so the corrected version would be:

	train_data_df = train_data_df.append(train_data_added, ignore_index = True)

I'd be happy to submit this as a merge request, if that'd work.
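
As a side note, `DataFrame.append` was deprecated in pandas 1.4 and removed in pandas 2.0, so on newer pandas versions the same fix is usually written with `pd.concat`. The following is a minimal sketch of that variant; the column name and the sample rows are made up for illustration and do not reflect the actual rcATT CSV layout.

```python
import pandas as pd

# Hypothetical stand-ins for the original and the "added" training data.
train_data_df = pd.DataFrame({"Text": ["report a", "report b"]})
train_data_added = pd.DataFrame({"Text": ["report c"]})

# pd.concat returns a new DataFrame; the inputs are left unchanged,
# so the result has to be assigned to a name to be kept.
combined = pd.concat([train_data_df, train_data_added], ignore_index=True)

print(len(train_data_df))  # original frame still has its own rows only
print(len(combined))       # combined frame has rows from both inputs
```

Like `append`, `pd.concat` does not modify its inputs, so forgetting the assignment would reproduce the bug described above.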

Missing requirements

Here is a basic requirements.txt, in case you want to add it for easier installation via pip.

colorama == 0.4.3
flask == 1.1.2
joblib == 0.14.1
nltk == 3.4.5
numpy == 1.17.4
pandas == 0.25.3
scikit-learn <= 0.22
stix2 == 1.2.1
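
To see whether an existing environment already matches these pins, a small check along the following lines may help. It is only a sketch: `installed_version` and `find_mismatches` are hypothetical helpers, it needs Python 3.8+ for `importlib.metadata`, and it covers only the exact `==` pins (the scikit-learn range pin is left out).

```python
from importlib import metadata

# Exact pins copied from the list above.
PINS = {
    "colorama": "0.4.3",
    "flask": "1.1.2",
    "joblib": "0.14.1",
    "nltk": "3.4.5",
    "numpy": "1.17.4",
    "pandas": "0.25.3",
    "stix2": "1.2.1",
}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {name: (wanted, found)} for every pin that is not met exactly."""
    mismatches = {}
    for name, wanted in pins.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(PINS).items():
        print(f"{name}: wanted {wanted}, found {found}")
```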
