GithubHelp home page GithubHelp logo

kindlygentleman / ds-da-analysis-of-bandung-ppdb-2022 Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 5.19 MB

This repository contains Jupyter Notebooks analyzing Bandung PPDB 2022, focusing on zonation and student grades with data cleaning and visualization. It provides insight on the enrollment process in different schools in Bandung.

Jupyter Notebook 100.00%
bandung data-analysis data-science data-visualization ppdb-2022

ds-da-analysis-of-bandung-ppdb-2022's Introduction

DS-DA-Analysis-of-Bandung-PPDB-2022

This repository contains a data analysis of the PPDB (Penerimaan Peserta Didik Baru) in Bandung for the year 2022. The data is used to analyze the quotas, scores, and zoning of schools in Bandung, as well as the socio-economic index of each district.

Libraries Used

  1. numpy
  2. pandas
  3. matplotlib
  4. folium

Data

The data used in this analysis is sourced from several CSV files:

dataset/data_kuota.csv: contains information on the quotas for each school
dataset/data_rapor.csv: contains information on the scores of each student
dataset/data_zonasi.csv: contains information on the zoning of each student
dataset/data_koordinat.csv: contains information on the coordinates of each school
dataset/ik-berdasarkan-aspek-dan-kecamatan-2018.csv: contains information on the socio-economic index of each district

Data Cleansing

The data is cleaned and preprocessed before analysis. This includes:

  1. Replacing missing values with np.NaN
  2. Changing the data types of certain columns
  3. Renaming columns for consistency

Analysis

The cleaned data is then used to perform various analyses, such as:

  1. Analyzing the distribution of quotas among schools
  2. Examining the relationship between scores and school quotas
  3. Mapping the locations of schools and the socio-economic index of each district

Note

The code above is just the data preparation and data cleansing, there is no analysis being done here. The analysis would typically happen after this step and would involve using the data to answer specific questions or test hypotheses.

Analysis

To see the analysis, please visit the link below: https://drive.google.com/file/d/18Xhse8jhQA-Pc5om4MDbadIiHT7heFy_/view

ds-da-analysis-of-bandung-ppdb-2022's People

Contributors

kindlygentleman avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.