GithubHelp home page GithubHelp logo

Fei Han's Projects

bar-chart-race icon bar-chart-race

Bar chart race is an elegant animation that depicts the progress of multiple categories over time. We can create them in Python.

cnn-for-digits-recognition icon cnn-for-digits-recognition

This is a CNN tutorial for beginners about a digits recognition model trained on the MNIST dataset. I built two models with TensorFlow/Keras and PyTorch/Skorch respectively.

comparison-between-randomforestclassifier-and-balancedrandomforestclassifier icon comparison-between-randomforestclassifier-and-balancedrandomforestclassifier

Imbalanced data commonly exist in real world, especially in anamoly-detection tasks. Handling imbalanced data is important to the tasks, otherwise the predictions are biased towards the majority class. BalancedRandomForestClassifier can deal with the imbalanced data without knowing any novel techniques like SMOTE.

data-crawler-for-imdb-top-250-movies icon data-crawler-for-imdb-top-250-movies

Scrape movie titles, release year, director, cast, rating, users rated, and href from https://www.imdb.com/chart/top/?ref_=nv_mv_250 using Python and Beautiful Soup.

eda-plots-for-regression icon eda-plots-for-regression

This notebook demonstrates the charts I usually plot for exploratory data analysis for regression tasks.

estimate-the-area-of-a-region-using-a-monte-carlo-simulation icon estimate-the-area-of-a-region-using-a-monte-carlo-simulation

Monte Carlo simulation is a computational technique that uses random sampling and statistical methods to estimate the behavior of complex systems or solve problems. It is particularly useful when dealing with problems that involve a high degree of randomness or complexity.

fine-tune-bert-for-sentiment-analysis icon fine-tune-bert-for-sentiment-analysis

BERT is an NLP model developed by Google Research in 2018, after its inception it has achieved state-of-the-art accuracy on several NLP tasks. This notebook demonstrates fine tuning BERT for sentiment analysis.

high-throughput-file-search-engine icon high-throughput-file-search-engine

This is a "happy wife, happy life" project. My wife's work involves repetitive and tiresome file searches on her hard drive. To bring more joy and efficiency into her work life, I've developed an innovative solution. By utilizing its intuitive interface, my wife can swiftly locate the files she needs without the hassle of manual searching.

histogram-of-an-image-and-its-heatmap icon histogram-of-an-image-and-its-heatmap

A histogram of an image provides valuable insights into the distribution of pixel intensities within that image. This notebook gives a brief about how to plot the histogram. Furtherly, we can replot the picture with a heatmap based on its pixel intensities.

impute-missing-data-with-xgboost icon impute-missing-data-with-xgboost

When signaficant amount of data in highly-important features are missing, what can we do? Impute the missing data with mean or median? In this Juyter notebook, I demonstrate embedding a XGBoost model to do the data imputation in the data transformer.

inference-of-a-picture-with-resnet-models icon inference-of-a-picture-with-resnet-models

ResNet models are lightweight computer vision pre-trained models. This notebook demostrates how to infer the object in a picture with ResNet18, ResNet34, ResNet50, ResNet101, and ResNet251.

linear-regression-and-its-regularizations icon linear-regression-and-its-regularizations

Linear regression model is widely used in industry for regression tasks as it is straightforward and easy to interpret. To capature non-linear patterns in data, polynomial features need to be added. However, high-degree polynomial features lead to overfitting. To solve the problem, regularizations can be added to the loss function.

matrix-factorization-with-svd-nmf-and-gradient-descent icon matrix-factorization-with-svd-nmf-and-gradient-descent

Matrix factorization is a class of collaborative filtering algorithms used in recommender systems. Matrix factorization algorithms work by decomposing the user-item interaction matrix into the product of two lower dimensionality rectangular matrices.

monte-carlo-integration icon monte-carlo-integration

Monte Carlo integration is particularly useful when dealing with high-dimensional integrals or integrals over complex, irregularly shaped domains where traditional methods may be impractical. It's widely used in various fields, including physics, finance, and engineering, for solving problems involving numerical integration.

neural-network-models-without-using-wrappers icon neural-network-models-without-using-wrappers

Keras and Starch provide us wrappers which simplify building neural network models. However, the wrappers sacrifice the flexibility of the models. In some scenarios like early stopping and batch reading, building pristine neural network models is still very useful.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.