anaveenan's Projects
Repository to store files of data science project
Information and content associated with "A/B Testing and Beyond", a continuing eduction certificate offered by the University of San Francisco's Data Institute.
Advanced A/B Testing Workshop
This tutorial shows users how to evaluate advertising response using last click attribution, experiments, marketing mix models and attribution models. By applying these methods to the same (synthetic) data set, users will learn how the methods compare. We also illustrate the data manipulation that is required to prepare typical raw advertising data for analysis. Examples are worked in R and slides are provided in LaTeX.
A python tutorial on bayesian modeling techniques (PyMC3)
repository to hold all blog post
Deliberate Practice for Learning Causal Inference
Causal Inference Crash Course for Scientists - contains slides and Jupyter notebooks
Has ipython notebooks related to ML UW course
Clumper will make it easier to analyze list of dictionaries
Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1
Machine Learning and Data Analysis Case Studies using Spark.
Code from live exploratory analyses of data in R
This repository contains data visualisations
Keep learning something new
The Leek group guide to data sharing
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
Very Basic Optimization of the Email Delivery Time
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
Plotting Assignment 1 for Exploratory Data Analysis