cyber-security-data-research's Introduction

🛡️ API Security: Access Behavior and Anomaly Detection with Boosting Algorithms (AdaBoost, Gradient Boosting, and XGBoost)

Welcome to the Cybersecurity Machine Learning project repository! This project delves into the world of cybersecurity, employing various machine learning algorithms to analyze and interpret data related to security behaviors and patterns.

📊 Overview

The primary goal of this project is to leverage machine learning techniques to scrutinize cybersecurity datasets, focusing on features like inter_api_access_duration(sec), api_access_uniqueness, sequence_length(count), vsession_duration(min), ip_type, behavior, behavior_type, num_sessions, num_users, num_unique_apis, and more.

The dataset offers a view into how microservice-based applications are accessed through their Application Programming Interfaces (APIs). APIs are the linchpin here: they mediate the interactions between applications and programmatic functions.

Here, I examine the vulnerabilities in this seemingly robust structure. Attackers often exploit these APIs by manipulating the underlying business logic. The dataset highlights the differences in user behavior, separating normal routines from the subtle yet impactful maneuvers of attackers.

With hundreds of APIs called in intricate sequences, variability arises from several sources: browser refreshes, network errors, or session inconsistencies. Together these form access graphs, which reflect user behavior over time. Analyzing these graphs reveals attack patterns and anomalies.
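As a rough illustration of the access-graph idea, the sketch below counts transitions between consecutive API calls in one session and derives two per-session features. The session data and the endpoint names are hypothetical, and the definition of `api_access_uniqueness` (distinct APIs divided by total calls) is an assumption, not something this README specifies.

```python
from collections import Counter


def build_access_graph(calls):
    """Count directed transitions between consecutive API calls in one session."""
    return Counter(zip(calls, calls[1:]))


def session_features(calls):
    """Derive two simple per-session features from an ordered list of API names."""
    n = len(calls)
    return {
        "sequence_length": n,
        # Assumed definition: distinct APIs divided by total calls.
        "api_access_uniqueness": len(set(calls)) / n if n else 0.0,
    }


# Hypothetical session: a browser refresh repeats /cart before checkout.
session = ["/login", "/cart", "/cart", "/checkout"]
graph = build_access_graph(session)
features = session_features(session)
```

Here the refresh shows up as a self-loop on `/cart` in `graph`, which is the kind of variability the paragraph above describes.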

📈 Analysis Results

Model Performance 🤖

AdaBoost Classifier 🚀

  • Best Hyperparameters: {'learning_rate': 0.2, 'n_estimators': 200}
  • Accuracy: 83%
  • ROC-AUC: 0.93
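To show where `learning_rate` and `n_estimators` enter the algorithm, here is a minimal, dependency-free sketch of classic binary AdaBoost with decision stumps. It is not the code used in this repository (which presumably relies on a library implementation, and libraries differ in detail); the toy data is hypothetical.

```python
import math


def stump_predict(stump, row):
    """A stump votes `sign` when the feature is <= threshold, else `-sign`."""
    feature, threshold, sign = stump
    return sign if row[feature] <= threshold else -sign


def fit_stump(X, y, w):
    """Exhaustively pick the stump with the lowest weighted error."""
    best_err, best_stump = float("inf"), None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for sign in (1, -1):
                stump = (feature, threshold, sign)
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(stump, row) != yi)
                if err < best_err:
                    best_err, best_stump = err, stump
    return best_err, best_stump


def adaboost_fit(X, y, n_estimators=200, learning_rate=0.2):
    """Fit discrete AdaBoost on labels in {-1, +1}; returns [(alpha, stump), ...]."""
    w = [1.0 / len(X)] * len(X)
    ensemble = []
    for _ in range(n_estimators):  # n_estimators = number of boosting rounds
        err, stump = fit_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the logarithm finite
        # learning_rate shrinks each stump's vote.
        alpha = learning_rate * 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        # Up-weight misclassified samples, down-weight correct ones, renormalise.
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def adaboost_predict(ensemble, row):
    """Sign of the weighted vote over all stumps."""
    score = sum(alpha * stump_predict(stump, row) for alpha, stump in ensemble)
    return 1 if score > 0 else -1


# Hypothetical 1-D data that no single stump can separate.
X = [[1], [2], [3], [4], [5], [6]]
y = [1, 1, -1, -1, 1, 1]
ensemble = adaboost_fit(X, y)
predictions = [adaboost_predict(ensemble, row) for row in X]
```

On this toy data the combined vote reproduces all six labels, even though the best single stump misclassifies two of them.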

Gradient Boosting Classifier 🔥

  • Best Hyperparameters: {'learning_rate': 0.1, 'max_depth': 5, 'min_samples_split': 4, 'n_estimators': 200}
  • Accuracy: 86%
  • ROC-AUC: 0.94

XGBoost Classifier 🌟

  • Best Hyperparameters: {'learning_rate': 0.1, 'max_depth': 7, 'min_child_weight': 5, 'n_estimators': 200}
  • Accuracy: 86%
  • ROC-AUC: 0.95
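The "Best Hyperparameters" entries above suggest an exhaustive search over a parameter grid, although this summary does not say which search procedure was used. The sketch below shows the general shape of such a search. The grid values and `toy_score` are hypothetical stand-ins; in practice the score would be a cross-validated metric of a fitted model.

```python
from itertools import product


def grid_search(param_grid, score_fn):
    """Evaluate every parameter combination and return the best one with its score."""
    names = list(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Hypothetical grid; the repository's actual search space is not given here.
param_grid = {
    "learning_rate": [0.05, 0.1, 0.2],
    "max_depth": [3, 5, 7],
    "n_estimators": [100, 200],
}


def toy_score(params):
    """Stand-in for a cross-validated metric; peaks at an arbitrary, made-up optimum."""
    return (-abs(params["learning_rate"] - 0.1)
            - abs(params["max_depth"] - 7) / 10
            - abs(params["n_estimators"] - 200) / 1000)


best_params, best_score = grid_search(param_grid, toy_score)
```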

All three models reach comparable accuracy. XGBoost has the highest ROC-AUC (0.95) and ties Gradient Boosting on accuracy (86%), while AdaBoost trails on both metrics.

Statistical Analysis 📊

  • Chi-Square Test 🧮

    • ip_type: p-value = 6.58e-50
    • behavior_type: p-value = 0.0
    • source: p-value = 0.0
    • type_ip: p-value = 6.58e-50
    • type_behaviour: p-value = 0.0
    • source_type: p-value = 0.0
    • A reported p-value of 0.0 means the value is too small to represent in floating point, not that it is exactly zero.
  • T-Test

    • inter_api_access_duration(sec) vs vsession_duration(min): p-value = 1.07e-07
  • Linear Regression 📈

    • R-squared: 0.153
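For readers unfamiliar with the chi-square test of independence, the following is a minimal sketch for a 2x2 contingency table, using only the standard library. The tables are made-up counts, not values from this dataset, and the sketch applies no continuity correction, so its numbers may differ from those of a statistics library that does.

```python
import math


def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value for a 2x2 contingency table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    stat = 0.0
    for i, observed_row in enumerate(table):
        for j, observed in enumerate(observed_row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # With 1 degree of freedom the survival function is erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical counts: a moderate association and a very strong one.
moderate_stat, moderate_p = chi_square_2x2([[10, 20], [20, 10]])
strong_stat, strong_p = chi_square_2x2([[4000, 1000], [1000, 4000]])
```

For the second table the statistic is so large that the p-value underflows, which is one way a test can end up reporting `0.0`.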

📈 ROC Curve Results

  • AdaBoost: 0.93
  • Gradient Boosting: 0.94
  • XGBoost: 0.95
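The values above are areas under the ROC curve. As a minimal sketch of how such a number is obtained, the code below sweeps a decision threshold over predicted scores, records the resulting (false positive rate, true positive rate) points, and integrates them with the trapezoidal rule. The labels and scores are a small made-up example, not model output from this project.

```python
def roc_curve_points(y_true, scores):
    """Return (fpr, tpr) points obtained by lowering the threshold over the scores."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Consume every sample tied at this score before emitting a point.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Trapezoidal area under a curve given as ordered (x, y) points."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Made-up labels (1 = positive class) and classifier scores.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
points = roc_curve_points(y_true, scores)
area = auc(points)
```

An area of 1.0 corresponds to a ranking that places every positive above every negative, and 0.5 to a ranking no better than chance.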

🚀 Getting Started

  1. Clone the repository to your local machine.
  2. Install the dependencies listed in requirements.txt (for example, with pip install -r requirements.txt).
  3. Explore the notebooks and Python scripts for analysis and model implementation.
  4. For more detailed insights, refer to individual files.

📌 Note

The provided information serves as a summary. For a comprehensive understanding, refer to specific notebooks and analysis files available within the repository.

Contributors

sakib4535
