GithubHelp home page GithubHelp logo

crypto_currancy's Introduction

Overview of the analysis

  • Usage of pandas to create a data frame of values to perform Principle Component Analysis of the dataset.

Resources

  • Program language: crytocurrancy datafile,Python3, pandas, Jupyter notebook

Results

Deliverable 1

All cryptocurrencies that are not being traded are removed
Figure 1
The IsTrading column is dropped
Figure 2
All the rows that have at least one null value are removed
Figure 3
All the rows that do not have coins being mined are removed
Figure 4
The CoinName column is dropped
Figure 5

A new DataFrame is created that stores all cryptocurrency names from the CoinName column and retains the index from the crypto_df DataFrame
Figure 6

The get_dummies() method is used to create variables for the text features, which are then stored in a new DataFrame, X
Figure 7

The features from the X DataFrame have been standardized using the StandardScaler fit_transform() function
Figure 8

Deliverable 2 Requirements

The PCA algorithm reduces the dimensions of the X DataFrame down to three principal components
Figure 9

The pcs_df DataFrame is created and has the following three columns, PC 1, PC 2, and PC 3, and has the index from the crypto_df DataFrame
Figure 10

Deliverable 3 Requirements

The K-means algorithm is used to cluster the cryptocurrencies using the PCA data, where the following steps have been completed: An elbow curve is created using hvPlot to find the best value for K
Figure 11

Predictions are made on the K clusters of the cryptocurrencies’ data
Figure 12

A new DataFrame is created with the same index as the crypto_df DataFrame and has the following columns: Algorithm, ProofType, TotalCoinsMined, TotalCoinSupply, PC 1, PC 2, PC 3, CoinName, and Class
Figure 13

Deliverable 4 Requirements

The clusters are plotted using a 3D scatter plot, and each data point shows the CoinName and Algorithm on hover

Figure 14

A table with tradable cryptocurrencies is created using the hvplot.table() function

Figure 15
The total number of tradable cryptocurrencies is printed
Figure 16

A DataFrame is created that contains the clustered_df DataFrame index, the scaled data, and the CoinName and Class columns
Figure 17

A hvplot scatter plot is created where the X-axis is "TotalCoinsMined", the Y-axis is "TotalCoinSupply", the data is ordered by "Class", and it shows the CoinName when you hover over the data point
Figure 18

crypto_currancy's People

Contributors

davidhyongae2 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.