Name: Sabyasachi Dasgupta
Type: User
Company: Microsoft
Bio: I am a theoretical (bio)physicist. My tools are simulations, analytical models, numerical analysis using using statistics and data visualization.
Location: Toronto
Blog: http://individual.utoronto.ca/softbio/
Sabyasachi Dasgupta's Projects
Scripts for analyzing data from manuscript under revisions
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer.
Simple python example on how to use ARIMA models to analyze and predict time series.
Curated list of Machine Learning, NLP, Vision, Reinforcement Learning Project Ideas
List of software packages for single-cell data analysis, including RNA-seq, ATAC-seq, etc.
Open source documentation of Microsoft Azure
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
TensorFlow code and pre-trained models for BERT
Spark exercises: Spark RDD, SparkSQL, Spark ML pipelines, Spark in Cloud(AWS)
This reference implementation is based on Cloud Adoption Framework for Azure and provides an opinionated implementation that enables ITSG-33 regulatory compliance by using NIST SP 800-53 Rev. 4 and Canada Federal PBMM Regulatory Compliance Policy Sets.
Clustering-Excercise-1D-Data
My implementation of useful data structures, algorithms, as well as my solutions to programming puzzles.
Databricks - Apache Spark™ - 2X Certified Developer
Examples surrounding Databricks.
Just some of my junk data science / machine learning experimentation
a curated list of R tutorials for Data Science, NLP and Machine Learning
Enablement and examples as they relate to Delta Lake for both the Open Source and Databricks Implementation.
Implementation of spectral dimensionality reduction algorithms (PCA, MDS, Difussion maps, LLE)
Slides, Jupyter Notebooks and scripts for the Deep Learning: Do-It-Yourself! lectures at ENS
A very simple framework for state-of-the-art Natural Language Processing (NLP)
the gap statistics for automatically-finding-K value for K-mean clustering
Dynamically get the suggested clusters in the data for unsupervised learning.