The crypto_currancy from davidhyongae2

Overview of the analysis

pandas is used to create a DataFrame of values on which Principal Component Analysis (PCA) is performed.

Resources

Data and tools: cryptocurrency data file, Python 3, pandas, Jupyter Notebook

Results

Deliverable 1

All cryptocurrencies that are not being traded are removed

The IsTrading column is dropped

All the rows that have at least one null value are removed

All the rows that do not have coins being mined are removed

The CoinName column is dropped

A new DataFrame is created that stores all cryptocurrency names from the CoinName column and retains the index from the crypto_df DataFrame

The get_dummies() method is used to create variables for the text features, which are then stored in a new DataFrame, X

The features from the X DataFrame have been standardized using the StandardScaler fit_transform() function
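The preprocessing steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the sample rows, the ticker-style index, and the choice to encode only the Algorithm and ProofType columns are assumptions made so the example is self-contained.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical sample rows; the real notebook loads the cryptocurrency data file.
crypto_df = pd.DataFrame(
    {
        "CoinName": ["Alpha", "Beta", "Gamma", "Delta", "Epsilon", "Zeta"],
        "Algorithm": ["SHA-256", "Scrypt", "X11", "Scrypt", "SHA-256", "X11"],
        "IsTrading": [True, True, False, True, True, True],
        "ProofType": ["PoW", "PoS", "PoW", "PoW", "PoW/PoS", "PoW"],
        "TotalCoinsMined": [1.0e6, 5.0e5, 2.0e6, 0.0, None, 3.0e6],
        "TotalCoinSupply": [2.1e7, 1.0e6, 5.0e6, 3.0e5, 4.2e1, 9.0e6],
    },
    index=["ALP", "BET", "GAM", "DEL", "EPS", "ZET"],
)

# Keep only coins that are being traded, then drop the IsTrading column.
crypto_df = crypto_df[crypto_df["IsTrading"]]
crypto_df = crypto_df.drop(columns="IsTrading")

# Remove rows with at least one null value.
crypto_df = crypto_df.dropna()

# Remove rows where no coins have been mined.
crypto_df = crypto_df[crypto_df["TotalCoinsMined"] > 0]

# Store the coin names (same index as crypto_df), then drop the column.
coin_names_df = crypto_df[["CoinName"]]
crypto_df = crypto_df.drop(columns="CoinName")

# Create dummy variables for the text features (assumed to be these two columns).
X = pd.get_dummies(crypto_df, columns=["Algorithm", "ProofType"], dtype=float)

# Standardize the features to zero mean and unit variance.
X_scaled = StandardScaler().fit_transform(X)
```

Keeping the names in a separate DataFrame with the original index makes it possible to join them back onto the results after clustering.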

Deliverable 2 Requirements

The PCA algorithm reduces the dimensions of the X DataFrame down to three principal components

The pcs_df DataFrame is created and has the following three columns, PC 1, PC 2, and PC 3, and has the index from the crypto_df DataFrame
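As a rough sketch of this step, the snippet below reduces a scaled feature matrix to three principal components with scikit-learn and wraps the result in pcs_df. The random input matrix and the COIN0..COIN9 index are stand-ins (assumptions) for the scaled X data and the crypto_df index.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA

# Stand-in for the standardized X data: 10 hypothetical coins, 6 features.
rng = np.random.default_rng(0)
index = [f"COIN{i}" for i in range(10)]  # hypothetical crypto_df index
X_scaled = rng.normal(size=(10, 6))

# Reduce the feature matrix to three principal components.
pca = PCA(n_components=3)
X_pca = pca.fit_transform(X_scaled)

# Keep the original index so the components can be joined back later.
pcs_df = pd.DataFrame(X_pca, columns=["PC 1", "PC 2", "PC 3"], index=index)
```

After fitting, `pca.explained_variance_ratio_` shows how much of the variance each component retains.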

Deliverable 3 Requirements

The K-means algorithm is used to cluster the cryptocurrencies using the PCA data, where the following steps have been completed:

An elbow curve is created using hvPlot to find the best value for K

Predictions are made on the K clusters of the cryptocurrencies’ data

A new DataFrame is created with the same index as the crypto_df DataFrame and has the following columns: Algorithm, ProofType, TotalCoinsMined, TotalCoinSupply, PC 1, PC 2, PC 3, CoinName, and Class
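The clustering steps above might look roughly like the sketch below. The synthetic three-blob PCA data, the placeholder column values, the range of K values tried, and the choice of K = 3 are all assumptions for illustration; the hvPlot call is shown only as a comment because it needs `import hvplot.pandas` and a notebook to render.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans

# Synthetic stand-in for pcs_df: three well-separated groups of 8 coins each.
rng = np.random.default_rng(1)
centers = np.array([[0.0, 0.0, 0.0], [5.0, 5.0, 5.0], [-5.0, 5.0, 0.0]])
points = np.vstack([c + rng.normal(scale=0.3, size=(8, 3)) for c in centers])
index = [f"COIN{i}" for i in range(len(points))]  # hypothetical index
pcs_df = pd.DataFrame(points, columns=["PC 1", "PC 2", "PC 3"], index=index)

# Placeholder versions of crypto_df and the coin-name DataFrame.
crypto_df = pd.DataFrame(
    {
        "Algorithm": ["SHA-256"] * len(index),
        "ProofType": ["PoW"] * len(index),
        "TotalCoinsMined": np.linspace(1e5, 1e7, len(index)),
        "TotalCoinSupply": np.linspace(1e6, 1e8, len(index)),
    },
    index=index,
)
coin_names_df = pd.DataFrame({"CoinName": [f"Coin {i}" for i in range(len(index))]}, index=index)

# Elbow curve data: inertia for each candidate K.
k_values = list(range(1, 8))
inertia = []
for k in k_values:
    km = KMeans(n_clusters=k, n_init=10, random_state=0)
    km.fit(pcs_df)
    inertia.append(km.inertia_)
elbow_df = pd.DataFrame({"k": k_values, "inertia": inertia})
# In a notebook with hvPlot:
# elbow_df.hvplot.line(x="k", y="inertia", xticks=k_values, title="Elbow Curve")

# Fit the final model and predict a cluster for each coin (K = 3 assumed here).
model = KMeans(n_clusters=3, n_init=10, random_state=0)
predictions = model.fit_predict(pcs_df)

# Combine the original features, the components, the names, and the class labels.
clustered_df = pd.concat([crypto_df, pcs_df, coin_names_df], axis=1)
clustered_df["Class"] = predictions
```

The best K is read off the elbow curve as the point where adding more clusters stops reducing inertia by much.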

Deliverable 4 Requirements

The clusters are plotted using a 3D scatter plot, and each data point shows the CoinName and Algorithm on hover

A table with tradable cryptocurrencies is created using the hvplot.table() function

The total number of tradable cryptocurrencies is printed

A DataFrame is created that contains the clustered_df DataFrame index, the scaled data, and the CoinName and Class columns

An hvPlot scatter plot is created where the X-axis is "TotalCoinsMined", the Y-axis is "TotalCoinSupply", the data is grouped by "Class", and the CoinName is shown when you hover over a data point
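A minimal sketch of the scaling and plotting preparation is shown below. It assumes MinMaxScaler is the scaler used (the text only says "scaled data") and uses a small hypothetical clustered_df; the hvPlot calls appear as comments because they require `import hvplot.pandas` and a notebook to display.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Hypothetical clustered_df with only the columns this step needs.
clustered_df = pd.DataFrame(
    {
        "TotalCoinSupply": [2.1e7, 1.0e6, 9.0e6, 4.0e8],
        "TotalCoinsMined": [1.0e6, 5.0e5, 3.0e6, 2.0e8],
        "CoinName": ["Alpha", "Beta", "Zeta", "Omega"],
        "Class": [0, 0, 1, 2],
    },
    index=["ALP", "BET", "ZET", "OMG"],
)

# Print the total number of tradable cryptocurrencies.
print(f"There are {len(clustered_df)} tradable cryptocurrencies.")

# Scale both supply columns to the [0, 1] range (MinMaxScaler is an assumption).
value_cols = ["TotalCoinSupply", "TotalCoinsMined"]
scaled = MinMaxScaler().fit_transform(clustered_df[value_cols])

# Rebuild a DataFrame on the clustered_df index and add the name and class columns.
plot_df = pd.DataFrame(scaled, columns=value_cols, index=clustered_df.index)
plot_df["CoinName"] = clustered_df["CoinName"]
plot_df["Class"] = clustered_df["Class"]

# In a notebook with hvPlot:
# clustered_df.hvplot.table(columns=["CoinName", "TotalCoinSupply", "TotalCoinsMined", "Class"])
# plot_df.hvplot.scatter(x="TotalCoinsMined", y="TotalCoinSupply",
#                        by="Class", hover_cols=["CoinName"])
```

Scaling both axes to the same range keeps a few coins with very large supplies from compressing the rest of the points into one corner of the plot.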
