GithubHelp home page GithubHelp logo

batch2-blu04's Introduction

BLU04 - Advanced data wrangling

What is in this BLU?

This is the first BLU in the Data Wrangling specialization. In this specialization you'll learn how to get data from multiple data sources and how to transform raw data into clean enough data so that you can begin your data science workflow.

In this BLU, we'll start by learning some Jupyter notebooks magics, that can be very useful. Then, we'll learn about file formats and how to read them efficiently into pandas DataFrames. Finally, we'll practice some data cleaning, using string methods and other tools.

How to use this repo

  1. Install all the needed dependencies, specified in the requirements.txt file
  • Install via pip
pip install -r requirements.txt
  • Install via conda
conda install --file requirements.txt
  • Install via docker
sh run-jupyter-notebook.sh
  1. Go through the Learning Notebooks (they are in the Learning Notebooks folder)
  2. Do the Exercise notebook, and submit it on the portal as usual

"I need help understanding something"

You can and should ask for help, be it about Learning Notebooks, Exercises, or anything else. Please checkout the How to Ask for Help, and remember not to share code when asking for help about the exercises!

"I think I've found a bug"

This repo is completely open source and is continuously improving over time. When you spot a mistake, please check whether it has been detected in the issues. If it hasn't, please open an issue, explaining in details where it is (e.g. in what notebook, and on what line), and how to reproduce the error. If it is an easy fix, feel free to make a pull request.

batch2-blu04's People

Contributors

hcastilho avatar hershaw avatar

Watchers

Sofia avatar  avatar James Cloos avatar Orlando Costa avatar João Tiago Ascensão avatar Pedro Fonseca avatar  avatar Inês Mendes avatar maria.dominguez avatar Juliane Silva avatar  avatar

batch2-blu04's Issues

Resolved: Problems installing packages

I cannot install the packages. Any channel I should add?
Windows 10.
Thanks

conda install --file C:\Users\e337652\Documents\GitHub\batch2-BLU04\requirements.txt

Solving environment: failed

PackagesNotFoundError: The following packages are not available from current channels:

  - appnope
  - jupyter-core
  - prompt-toolkit
  - ptyprocess
  - ipython-genutils
  - line-profiler
  - jupyter-client
  - jupyter-console

Current channels:

  - https://conda.anaconda.org/conda-forge/win-64
  - https://conda.anaconda.org/conda-forge/noarch
  - https://repo.continuum.io/pkgs/main/win-64
  - https://repo.continuum.io/pkgs/main/noarch
  - https://repo.continuum.io/pkgs/free/win-64
  - https://repo.continuum.io/pkgs/free/noarch
  - https://repo.continuum.io/pkgs/r/win-64
  - https://repo.continuum.io/pkgs/r/noarch
  - https://repo.continuum.io/pkgs/pro/win-64
  - https://repo.continuum.io/pkgs/pro/noarch
  - https://repo.continuum.io/pkgs/msys2/win-64
  - https://repo.continuum.io/pkgs/msys2/noarch

conda install --yes --file requirements.txt Failed

Solving environment: failed

PackagesNotFoundError: The following packages are not available from current channels:

  • line-profiler==2.1.2
  • jupyter-console==5.2.0
  • ptyprocess==0.5.2
  • appnope==0.1.0
  • ipython-genutils==0.2.0
  • jupyter-core==4.4.0
  • prompt-toolkit==1.0.15
  • jupyter-client==5.2.3

Exercises Notebook Q1

When assigning a value to variable num_lines, just copy and paste the number of lines that you found with the ! command.
No need to apply any logic here.

Remember that nbgrader may not be running the same operative system as you are, and thus it won't understand the shell command you're running locally to determine the number of lines.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.