GithubHelp home page GithubHelp logo

reddyprasade / pandas-practice Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 2.0 5.11 MB

Pandas

License: GNU General Public License v3.0

Jupyter Notebook 100.00%
pandas flexible data-analysis missing-data missing-values labeling pandas-profiling data-science daat

pandas-practice's Introduction

What is Pandas?


pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way towards this goal.

Main Features

Here are just a few of the things that pandas does well:

  • Easy handling of [missing data][missing-data] (represented as NaN) in floating point as well as non-floating point data
  • Size mutability: columns can be [inserted and deleted][insertion-deletion] from DataFrame and higher dimensional objects
  • Automatic and explicit [data alignment][alignment]: objects can be explicitly aligned to a set of labels, or the user can simply ignore the labels and let Series, DataFrame, etc. automatically align the data for you in computations
  • Powerful, flexible [group by][groupby] functionality to perform split-apply-combine operations on data sets, for both aggregating and transforming data
  • Make it [easy to convert][conversion] ragged, differently-indexed data in other Python and NumPy data structures into DataFrame objects
  • Intelligent label-based [slicing][slicing], [fancy indexing][fancy-indexing], and [subsetting][subsetting] of large data sets
  • Intuitive [merging][merging] and [joining][joining] data sets
  • Flexible [reshaping][reshape] and [pivoting][pivot-table] of data sets
  • [Hierarchical][mi] labeling of axes (possible to have multiple labels per tick)
  • Robust IO tools for loading data from [flat files][flat-files] (CSV and delimited), [Excel files][excel], [databases][db], and saving/loading data from the ultrafast [HDF5 format][hdfstore]
  • [Time series][timeseries]-specific functionality: date range generation and frequency conversion, moving window statistics, date shifting and lagging.

# conda
conda install pandas
# or PyPI
pip install pandas

Dependencies

pandas-practice's People

Contributors

reddyprasade avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.