ibis-project / ibis-tutorial
Ibis tutorial repository
We should gather some data about tutorial attendees the next time we give the tutorial. This would be useful both for steering the tutorial towards topics that interest users and for better understanding our potential user base.
Questions like:
We might make use of a realtime polling service (post a link, can see live results as they come in). Here's one option, but there are certainly others.
Table.rename docstring
postgres for notebook 1 instead of duckdb to better differentiate client & backend architecture.
duckdb is to_* methods, maybe want to diminish
_ in notebook 2 feels a bit off. It works, but
ibis.read_* methods as well as create_table maybe.
%load for solutions, instead make use of html summary / details tags to do dropdowns in a markdown box.
ibis.Table
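The suggestion above to replace %load with HTML summary/details dropdowns could look roughly like the sketch below. It is only an illustration: the helper name solution_dropdown, the "Solution" label, and the sample answer t.filter(t.a < 3) are assumptions, not anything taken from the tutorial notebooks.

```python
from html import escape


def solution_dropdown(code: str, label: str = "Solution") -> str:
    """Wrap a solution snippet in a collapsible <details> block.

    The returned string can be pasted into a markdown cell; the solution
    stays hidden until the reader clicks the summary line.
    """
    # Escape the code so characters such as < and & render literally.
    body = escape(code.strip("\n"))
    return (
        "<details>\n"
        f"<summary>{escape(label)}</summary>\n"
        f"<pre><code>{body}</code></pre>\n"
        "</details>"
    )


# Hypothetical exercise answer, used only to show the output shape.
snippet = solution_dropdown("t.filter(t.a < 3)")
print(snippet)
```

Unlike %load, this keeps the solution inside the rendered notebook and does not replace the attendee's own cell contents.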
Some common themes that have come up during previous presentations that we could address directly at the beginning:
nbytes to check because the limiting factor is your hard-drive (or the hard-drive of the system you are connecting to)
More TODOS:
Set up codespace with recent-ish PyPI data in a postgres instance inside the codespace
We can maybe plan to allow users to use either codespace or their local for first few notebooks (with limited support from us)
Codespace required for PyPI exercises b/c postgres instance required
Intro QMD for how to navigate the codespace with links to the notebooks
Stop installing the entirety of the dev tools; mostly just ibis + duckdb, postgres, polars, altair
Update to work with 9.0
Get textblob with recursive CTE for .sql example
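For the recursive CTE item, a minimal sketch of what such a query does may help when picking the example. The code below uses SQLite from the standard library as a stand-in for the Postgres instance, and the deps table and its rows are made up for illustration; it is not the textblob example the TODO refers to.

```python
import sqlite3

# In-memory SQLite database standing in for the codespace Postgres instance.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE deps (package TEXT, requires TEXT);
    INSERT INTO deps VALUES
        ('app', 'lib_a'),
        ('lib_a', 'lib_b'),
        ('lib_b', 'lib_c'),
        ('other', 'lib_z');
    """
)

# Recursive CTE: start from the direct requirements of one package, then
# repeatedly join back to deps to collect transitive requirements.
query = """
WITH RECURSIVE closure(package, depth) AS (
    SELECT requires, 1 FROM deps WHERE package = ?
    UNION
    SELECT d.requires, c.depth + 1
    FROM deps AS d
    JOIN closure AS c ON d.package = c.package
)
SELECT package, depth FROM closure ORDER BY depth
"""

rows = conn.execute(query, ("app",)).fetchall()
print(rows)
```

A query of this shape is awkward to express through a dataframe-style API, which is presumably why it makes a good candidate for the raw .sql example.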
__array__ and __dataframe__ protocols
motivated by chaining longer expressions, also peek at internals
intermediate variable vs. chained expression: same op tree
huzzah
leads into extra syntactic sugar of selectors
Good q's from PyData NYC 2023
move UDFs to end or out entirely, or just sprinkle in PyPI exercises as necessary
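The note above that an intermediate variable and a chained expression give the same op tree can be illustrated with a toy model. The sketch below is plain Python, not Ibis: the Node class and its filter/select methods are hypothetical stand-ins that only mimic how a deferred expression API builds a tree of operations.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    """One operation in a toy expression tree (not an Ibis class)."""

    op: str
    arg: str
    parent: Optional["Node"] = None

    def filter(self, predicate: str) -> "Node":
        # Each method call returns a new node that points at its input.
        return Node("filter", predicate, self)

    def select(self, column: str) -> "Node":
        return Node("select", column, self)


table = Node("table", "t")

# Style 1: name the intermediate step.
filtered = table.filter("a > 0")
stepwise = filtered.select("a")

# Style 2: chain the same calls in one expression.
chained = table.filter("a > 0").select("a")

# Dataclass equality compares fields recursively, so the whole trees
# are compared, not just the top node.
same_tree = stepwise == chained
print(same_tree)
```

Because nothing executes until a result is requested, naming intermediate steps is a readability choice rather than a performance one in this model.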
quarto convert '01 - Getting Started.qmd'
quarto convert '01 - Getting Started.ipynb'
Opening this issue to discuss changing the PyPI dataset for SciPy, since in our tutorial proposal for SciPy we said:
We are looking at swapping out the PyPI data exploration exercise for a more applicable set of problems for the SciPy audience and are currently vetting available datasets. The purpose of those exercises is to bring together all of the various methodologies covered in the previous sections and to demonstrate more realistic end-to-end data analysis problems. Our goal is that even if the particular problem set isn't a perfect match with an attendee's field of study, the lessons learned will be easily transferable to other data domains.
We also promise a small section showcasing some of the geospatial features.
For the geospatial part we can use some of the datasets that we use in the geospatial blogs, but I'm not sure what would be a good replacement for the PyPI part.