This repository contains resources and cheat sheets for anyone learning or practicing data science.
- Cheat Sheets - contains cheat sheets for Python, data analysis, machine learning, Git and more.
- Python - contains different Python, Pandas and NumPy guides, tips and tricks.
- Machine Learning - contains machine learning guides: supervised learning (regression, classification, tree-based models), unsupervised learning (clustering), feature selection, model evaluation, etc.
- Data Visualization - contains data visualization guides: Pandas plotting, Matplotlib, Seaborn, Bokeh.
- Natural Language Processing - contains natural language processing resources: NLTK, SKLearn NLP and more.
- Statistics - contains mostly theoretical reading to deepen your understanding of statistics.
- Mathematics - contains resources for math topics that are relevant for data scientists.
- Datasets - contains links to interesting datasets.
- Development Environment - contains development environment resources: Command Line, Git, Jupyter Notebook, etc.