GithubHelp home page GithubHelp logo

data-umbrella / event-transcripts Goto Github PK

View Code? Open in Web Editor NEW
36.0 9.0 33.0 57.5 MB

transcripts from our recorded events

Home Page: https://www.youtube.com/c/dataumbrella/videos

Jupyter Notebook 100.00%
data-science python hacktoberfest career-development model-deployment open-source pytorch rstats

event-transcripts's Introduction

Event Transcripts

Subscribe to our YouTube Data Umbrella channel.

Review our Contributing Instructions before beginning editing / transcribing work.

# Date Speaker Transcript Video Transcriber Status
2020
01 2020-02-19 Hugo Bowne-Anderson Bayesian Data Science N/A N/A N/A
02 2020-03-07 Bruno Goncalves Time Series Modeling N/A N/A N/A
03 2020-05-07 Ty Shaikh Webscraping Poshmark v03_webscraping ?
04 2020-05-27 Ali Spittel Navigating Your Tech Career v04_career Janine Needs reviewer
05 2020-xx-xx Andreas Mueller Crash Course in Contributing to Scikit-learn v05_acm_sklearn Reshama Complete
06 2020-xx-xx Reshama Shaikh Example PR for Scikit-learn v06_rs_sklearn Reshama / Mark Complete
07 2020-06-11 Shailvi Wakhlu Fixing Bad Data and Using SQL Juanita Complete
08 2020-06-23 Matt Brems Data Science with Missing Data Barbara Complete
09 2020-07-10 Sam Bail Intro to Terminal Isaack Complete
10 2020-07-14 Emily Robinson Build a Career in Data Science Kevin Complete
11 2020-07-20 Rebecca Kelly kdb Time Series Data Coretta Needs reviewer (paragraphs are too long)
12 2020-08-18 Mridu Bhatnagar Build a Bot ?
13 2020-08-25 Liz DiLuzio Creating Nimble Data Processes Lily Complete
14 2020-09-15 Megan Robertson 3 Lessons From 3 Years of Data Science Sethupathy Needs reviewer (headers should not be in capital letters, etc)
15 2020-09-22 Emma Gouillart Data Visualization with Plotly ?
16 2020-10-27 Hugo / James Data Science and Machine Learning at Scale Cynthia Complete
17 2020-11-10 Carol Willing Contributing to Core Python ?
18 2020-11-17 Thomas Fan Streamlit for Data Science ?
19 2020-12-02 Matti Picus Contributing to NumPy Isaack
20 2020-12-15 Marco Gorelli Contributing to pandas ?
2021
21 2021-xx-xx Nick Janetakis Creating a Command Line Focused Development Environment ?
22 2021-xx-xx Melissa Weber Intro to Sphinx for Python Documentation ?
23 2021-xx-xx Ryan Kruse Building a Political Census with Placekey
24 2021-xx-xx Sam Bail Wonderful World of Data Quality in Python
25 2021-xx-xx Oriol Abril Pla Intuitive Bayesian Modeling with PyMC3
26 2021-xx-xx Luisa Rebull Astronomy Data & Image Processing
27 2021-xx-xx Thomas Fan 3 Components of Reviewing a Pull Request
28 2021-xx-xx Liz DiLuzio Excel for Data Analysts
29 2021-xx-xx Sam Bail Intro to Jupyter & Pandas
30 2021-xx-xx Sean Law Time Series with STUMPY
31 2021-xx-xx Brendan Collins Data Visualizations with Bokeh
32 2021-xx-xx Coonoor Behal When Quitting is Good
33 2021-xx-xx Rami Krispin COVID-19 R Dashboard in Production
34 2021-xx-xx Andreas Mueller Data Science with dabl
35 2021-xx-xx Juan Martin Loyola Pair Programming with Visual Studio Code Live Share
36 2021-xx-xx Mariatta Wijaya Continuous Integration and Unit Tests
37 2021-xx-xx Doris Lee Visualize Your Pandas Dataframe with Zero Effort with Lux
38 2021-xx-xx Ian Flores R, an Ecosystem Where Pythonistas Can Thrive
39 2021-xx-xx Eduardo Blancs Ploomber, Maintainable and Collaborative Pipelines in Jupyter
40 2021-xx-xx Meenal Jhajharia Intro to NumPy Array Operations
2022
41 2022-01-xx Austin Rochford Introduction to Probabilistic Programming with PyMC
42 2022-xx-xx Gonzalo Peña-Castellanos Automating Workflows with GitHub Actions
43 2022-xx-xx Ricardo Vieira Contributing to PyMC
44 2022-xx-xx Reshama Shaikh An Example Pull Request to PyMC
45 2022-xx-xx Oriol Abril Pla Contributing to PyMC Documentation
46 2022-03-02 Christy Heaton Geospatial Data and Maps with Python v46_geospatial_data_and_maps_python
47 2022-03-13 Laura Gutierrez Creating a Python Plotly Dashboard v47_python_plotly_dashboard
48 2022-03-15 Clair Sullivan Arrays, Linked Lists and Graphs v48_array_linked-lists_graphs
49 2022-03-22 Lauren Burke Setting up a Personal Website with Jekyll & GitHub Pages v49_jekyll_blog
50 2022-03-29 Julia Signell Introduction to Holoviz v50_Introduction_Holoviz
51 2022-04-05 Rob Lanphier Editing Wikipedia: Because Someone Has to... v51_Editing_Wikipedia
52 2022-04-26 William Lyon Introduction to GraphQL for Data Scientists v52_GraphQL_for_Data_Scientists
53 2022-05-10 Sebastian Raschka and Adrian Wälchli Introduction to PyTorch v53_PyTorch_introduction
54 2022-05-24 Melissa Weber Mendonça Contributing to SciPy v54_SciPy_contributions
55 2022-06-07 Joe Torreggiani Intro to Django v55_Django_intro
56 2022-06-14 Tereza Iofciu Short Stories of Data Visualization v56_data_visualization
57 2022-xx-xx Data Umbrella & PyMC Data Umbrella & PyMC Sprint
58 2022-07-12 Emily Lescak & Diego Saez-Trumper How Data Scientists Can Contribute To Wikimedia Projects v58_Wikimedia_project_contributions
59 2022-07-16 Nidhin Pattaniyil & Vishal Rathi Serving PyTorch Models in Production v59_PyTorch_in_production
60 2022-08-10 Abdulazeez Abdulazeez Adeshina Building APIs with FastAPI v60_API_with_FastAPI
61 2022-08-16 Joe Torreggiani Intro to Front-end Development with React v61_react
62 2022-08-23 Mitzi Morris Software Engineering for Probabilistic Programming v62_SE_probabilistic_prog
63 2022-08-30 Eric Ma Software Testing in Open Source and Data Science v63_testing_OS_DS
64 2022-09-08 Liliana Petrova Build a Personal Brand v64_personal_brand
65 2022-09-13 Shivay Lamba Machine Learning in JavaScript: An Introduction to TensorFlowJS v65_intro_tensorflowJS
66 2022-10-12 Hamel Husain How to Make Open Source Contributions to Fastai v66_fastai
67 2022-10-18 Alex de Siqueira An Overview of 3D Image Processing Using scikit-image v67_scikit-image
68 2022-10-25 Marianne Corvellec A Bioimage Data Analysis Workflow with scikit-image v68_bioimage
69 2022-11-15 Bruno Rocha Introduction to Rust Programming v69_rust
70 2022-11-29 Oriol Abril Pla Contributing to ArviZ and Open Source: Social and Technical Sides v70_arviz
71 2022-12-06 Wendy Grus & Stacey Williams Land First Data Science Job
2023
72 2023-01-10 Marco Gorelli How You (yes, you!) Can Contribute to Pandas
73 2023-01-24 Tirth Patel, Larry Dong and Oriol Abril Pla Google Summer of Code (GSOC) Experience
xx 2023-xx-xx xxx Title xxxx
xx 2023-xx-xx xxx Title xxxx
xx 2023-xx-xx xxx Title xxxx

event-transcripts's People

Contributors

beebeckzzz avatar bgb83 avatar bsenst avatar ceethinwa avatar cjohnsonjava avatar cristinamulas avatar crystal-ctrl avatar greenbrown avatar hiramatsuyuusuke avatar isaacknjama avatar ja9harper avatar jjpal avatar latte-x-sh avatar lilysu avatar lucyjimenez avatar mariam-ke avatar markhannel avatar masif2002 avatar millernoa avatar nestornav avatar nicholeboaz avatar oriolabril avatar reshamas avatar sangamswadik avatar symeneses avatar thayeylolu avatar thesunnydev avatar virgo-alpha avatar zadilkhwaja avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

event-transcripts's Issues

[Video event 65] Machine Learning in JavaScript: An Introduction to TensorFlowJS


Shivay Lamba: Machine Learning in JavaScript: An Introduction to TensorFlowJS

## Timestamps 
00:00 Data Umbrella Introduction
03:06 Speaker Introduction
04:17 Presentation Intro - Machine Learning for the Web: Introduction to TensorFlow.JS
06:27 Why do we need machine learning in JavaScript (JS)?
08:02 What is TensorFlow?
09:03 Versatility & language popularity - ML can be used on any platform JS can run
10:30 ML application ideas (e.g. accessible web apps, sound recognition, etc.) 
11:38 3 options for using TensorFlow
13:00 Option 1: Use pre-trained models with JS classes 
15:14 Real-world examples
19:15 Option 2: Retrain existing neural network models to work with your own data
19:51 Image classification example (100 images) with Teachable Machine (separate tool with downloadable code)
26:24 Pause for Q&A
26:55 Example using Cloud Auto ML for larger image datasets (100,000+)
28:29 Option 3: Coding your own model
29:32 High-level TensorFlow architecture
31:06 Backends and hardware execution
32:07 Chart - Model Inference Performance Only
32:45 Chart - performance comparison between JS and Python of Hugging Face DistilBERT (NLP-based model)
33:05 5 benefits of using TensorFlow on the front end (client side) 
33:59 4 benefits of using TensorFlow on the back end (server side)
35:00 Demo code example 1 - image detection
42:35 Demo code example 2 - TensorFlow.JS converter (converting Python model to JS model)
47:24 More resources for learning and inspiration
49:40 Join the community - #MadeWithTFJS 
50:34 What will you make? Machine learning is for everyone.
50:57 Q&A (Using PyTorch, using TensorFlow in production) & final thoughts


## Resources
Website / API: https://www.tensorflow.org/js 
Models: https://www.tensorflow.org/js/models 
GitHub Code: https://github.com/tensorflow/tfjs 
Google Group: [[email protected]](mailto:[email protected]) 
TensorFlow forum: https://discuss.tensorflow.org/tag/tjfs 
YouTube playlist: goo.gle/made-with-tfjs 
Codepen: https://codepen.io/topic/tensorflow/ 
Glitch: https://glitch.com/@TensorFlowJS 
Sample dataset: https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database 
Recommended Reading: Deep Learning with JavaScript - https://www.manning.com/books/deep-learning-with-javascript 
Book: LearningTensorFlow.js - https://github.com/GantMan/learn-tfjs 
edX: https://www.edx.org/learn/javascript/google-google-ai-for-javascript-developers-with-tensorflow-js

YT video description: R for Pythonistas (Ian)

Here is an example of the description included with the YouTube video.
video: https://youtu.be/5c4cb6kvJGE

Event: R, an Ecosystem Where Pythonistas Can Thrive

## Upcoming Events
Join our Meetup group for more events!
https://www.meetup.com/data-umbrella

## Agenda
00:00:00 to 00:05:50 Introduction to Data Umbrella 
00:05:50 Ian Introduction
00:56:30 Q&A begins
01:08:30 Demo of RStudio IDE
 
## Event
This talk will introduce Python users to the R ecosystem. Attendees should expect that by the end of the talk they will understand how to get started with reporting infrastructure in Python and R; and how to use open standards to share data across data products. As part of the discussion, we will see different use cases for both languages, their integration, and common pitfalls.

## Speaker
Ian is a data person with a background in Data Science and DevOps. He has experience consulting within the pharmaceutical industry, government sector, NGOs and educational institutions in multiple countries. As part of his academic background he holds a Master’s degree in Data Science from the University of British Columbia. Outside of work, he is a certified freediver, loves to surf in the Northeast coast of Puerto Rico and cooks spicy food.

## Slides:  
https://ian-flores.github.io/r-ecosystem-4-python-slides/slides.html#1

## Resources
https://github.com/ian-flores/r-ecosystem-4-python

Renumber transcripts

[ 44 ] Reshama's PR PyMC example
[ 45 ] Oriol's talk

To do:

  • update index
  • update transcript numbers

#80 Solving NLP (Natural Language Processing) Tasks Using LLMs (Large Language Models)

Pablo Duboue: Solving NLP (Natural Language Processing) Tasks Using LLMs (Large Language Models) 

## Timestamps 
00:00 Data Umbrella Introduction
02:43 Speaker Introduction + Land Acknowledgment
04:52 Agenda
06:15 NLP history (rule-based, statistical, deep learning)
09:00 What is a Language Model?
10:35 Large Language Models
11:55 Training LLMs - more than just language
12:48 Speaker background
13:35 About this talk - more background
14:26 Section 1: NLP / LLM Tasks - Part-of-Speech tagging
15:48 POS tagging example
16:50 NLP Tasks - Named Entity Recognition (NER), example
17:50 NLP Tasks - Information Extraction (IE), example
19:08 NLP Tasks - Sentiment Analysis, example
20:32 Q&A - data tagging
22:41 Section 2: Prompting 101
22:51: OpenAI API - intro, CLI, Python
25:44 Zero shot - no examples, temperature, output/hallucinations
28:35 Few shot - training data, output, GPT-4
30:17 Handling priors in exemplars
30:40 Chain-of-thought (CoT)
31:13 LLM role
31:43 Recursing
32:23 Learning more - additional resources
33:14 Section 3: Solving NLP Tasks with OpenAI API
33:27 OpenAI POS tagging
34:11 Output is unstable
34:21 Better prompt
34:40 Annotation Manual
36:03 NER prompt, unstable output, MUC-6 locations
38:28 ChatGPT output
38:41 GPT4 output
39:00 Q&A - AGI
40:28 IE prompt - relation extraction, stable output
42:31 Sentiment Analysis prompt
43:26 Additional discourse
44:22 Section 4: Using open source LLMs
44:39 Why open source LLMs
45:44 Issues with open source models
46:33 Examples of open source LLMs
49:29 Conclusions
51:34 Q&A - contributing to new models v. expanding on older ones, LLMs in cell phones, communication changes and abstraction, etc.

## Resources
- https://tellandshow.org/ (community-owned machine learning)
- http://textualization.com/gptwhitepaper/ 
- http://artoffeatureengineering.com/ 
- http://wiki.duboue.net/A_Dollar_Worth_Of_Ideas (project ideas)

## Connecting
- LinkedIn: https://www.linkedin.com/in/pabloduboue/
- GitHub: https://github.com/drdub 
- Twitter: @pabloduboue

Event #53 (PyTorch): add in "About the Event" and "About the Speaker"

  1. This file should have a markdown extension.
  2. Remove space before "pytorch"
  3. Can you also add a section called "About the Event" and copy the Meetup description in?

https://github.com/data-umbrella/event-transcripts/blob/main/2022/53-Sebastian-Adrian-%20PyTorch

For reference, you can see this one:
https://github.com/data-umbrella/event-transcripts/blob/main/2022/52-will-graphql.md

@Cristinamulas
We can set up a quick call to review this, and go over some missing steps.

Event list: add in date & video

Background (Prereqs)

  1. It will be helpful to be a member of this meetup group for information: https://www.meetup.com/data-umbrella
  2. Once you are a member, you will be able to see past events: https://www.meetup.com/data-umbrella/events/past/
  3. It will be helpful to subscribe to the Data Umbrella YouTube to get information: https://www.youtube.com/c/DataUmbrella/videos

Skills that are helpful to have

  • Markdown
  • Git

Instructions

  1. This is the file that needs updates: https://github.com/data-umbrella/event-transcripts/blob/main/README.md
  2. You can pick one event or more than one event.
  3. Add in the date of the meetup event.
  4. Add in the link to the video.
  5. In the PR (pull request) description, include: Towards #151 (See example)

If you have any questions, please ask here.

Remove "need to update" text in transcript files

In the following files, this text needs to be removed: ** NEED TO UPDATE **

  • 2021/30-sean-stumpy.md
  • 2021/32-coonoor-quit.md
  • 2021/22-melissa-sphinx.md
  • 2021/33-rami-dashboard.md
  • 2021/31-brendan-bokeh.md

Videos: add in timestamps

What We Are Doing: Adding Timestamps to Videos

Adding timestamps to the description section of the videos on Data Umbrella YouTube channel.

Why We Are Adding Timestamps

When timestamps are available:

  1. It makes it easy for viewers to get to the part in the video they are interested in.
  2. It also helps potential viewers find the video based on their search terms.

Your helpful contribution is greatly appreciated!!

Instructions

  1. Pick a video and indicate below the "Event #" and "Video Name" you will work on. If you pick a video to work on, please add a comment below, so two people are not working on the same video (avoid duplicating work).
  2. Watch the video and make a list of descriptive timestamps.
  3. To submit your timestamps, there are two options:
    • Open a new issue. Label the issue with the video speaker and title. Add in the timestamps and notes (Markdown txt format)
    • Create a pull request with the timestamps in the file indicated to add the timestamps.
    • This is the format needed for timestamps. Any other format will require us to do extra work so we can copy and paste it into the YouTube video description.
00:00 Introduction
10:00 example
12:23 example
  1. Below is a list of videos that need timestamps. The video descriptions can be checked to see if the timestamps are there or not: https://www.youtube.com/c/DataUmbrella

Extra

  1. Please pick one video at a time.
  2. After you pick a video, please share the timestamps within two weeks, or sooner. Thank you.

Examples

Example PR Description

Example PR: #143

Added timestamps for [topic x]

Towards #92
Closes #xxx [replace xxx with related issue, if there is one]

List of completed timestamps

List of Videos that need timestamps

[78] MLOps: from Concept to Product (Sandra Yojana Meneses)

Timestamps Description
00:00 Welcome
00:13 Sandra introduces the topic
1:00 What is MLOps?
1:59 What is DevOps?
4:08 What is Continuous Integration/Continuous Deployment(CI/CD)?
5:48 ML systems
5:53 Machine Learning Lifecycle
7:28 Data Team
8:52 Why are ML systems different
10:40 ML challenges in the Dev process
10:46 Experimentation
11:27 Reproducibility
12:59 Tracking and versioning
13:54 Git for Data Science
15:15 Automated Testing
16:27 Deployment
17:04 Monitoring
19:10 MLops Practices
19:26 Data Management
21:21 Model Management
22:14 Model Evaluation
23:15 Online ML system validation
25:06 Responsible AI
25:46 Continuous Training(CT)
28:16 MLOps Maturity Model
31:03 Automated Pipeline
32:50 What did we learn?
34:07 Books
34:23 Sources
34:50 Tools Review

video 83 [Vicuna, OS LLM]: Timestamp for From Vicuna to Human-aligned Evaluation: Comparing Open Source Large Language Models

Timestamp Description
00: 01 Agenda
00: 39 Introduction to Data Umbrella
1: 04 Code of Conduct
1: 24 How to support Data Umbrella
4: 58 Introduce the talk and speaker
6: 15 Speaker introduces herself and topic
7: 48 Background
10: 28 Our datasource: ShareGPT
11: 24 The Vicuna Project
12: 23 Evaluation: GPT-4 as a judge
14: 17 Chatbot Arena: Benchmarking LLMs in the wild
16: 17 Next steps: better benchmark
17: 23 Can we really trust LLM as a judge?
17: 43 Overview
21: 03 Limitations
23: 46 Solutions
24: 29 Positive Side: High Agreement with Humans
26: 35 Summary
30: 36 Human Preference Benchmark and Standardized Benchmark
34: 36 Questions
38: 14 Organizer wrap up
38: 52 Links

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.