GithubHelp home page GithubHelp logo

joko19 / analyzing-student-performance-using-data-mining-techniques Goto Github PK

View Code? Open in Web Editor NEW

This project forked from raghavm23/analyzing-student-performance-using-data-mining-techniques

0.0 1.0 0.0 1.7 MB

Through this research, we are able to model a student’s final grade in a particular subject and link it directly to certain relevant features that influence the outcome. We use the C5.0 decision tree technique to model the data.

License: GNU General Public License v2.0

R 100.00%

analyzing-student-performance-using-data-mining-techniques's Introduction

Analysing Student Performance Through Data Mining Model

With constant innovation in the field of technology and data science, it is not surprising that education institutions are interested in understanding the performances’ of their students. The best indicator to measure a student’s performance is through their grades, but institutions are more interested in the factors that affect these scores. These institutions are looking to develop tools to enhance the quality of education and ensure a high success rate amongst students, to be facilitated by business intelligence and data mining techniques.

Through this project we will assess the right technique to be applied to our problem dataset and optimize this model. In this research we are able to model a student’s final grade in a particular subject and link it directly to certain relevant features that influence the outcome. We use the C5.0 decision tree technique to model the data.

Dataset

The dataset is a public dataset available on the UCI Machine Learning repository (Archive.ics.uci.edu., 2018). The dataset contains 1043 instances of student data for the two courses – Mathematics and Portuguese. Our target variable to be analyzed is the categorical variable final grade – G3.

Data Preparation

  1. Data Types
  2. Correlation amongst features
  3. Rank of Important Features (Random Forest)
  4. Recursive Feature Elimination Algorithm
  5. C5.0 Decision Tree Model

Conclusion

Decision tree model developed using the C5.0 algorithm was effective in correctly classifying 80.84% instances while only considering 5 features of the dataset. This intelligence will help the school to take measures in order to ensure that the atrisk students get attention and are able to cope up.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.