dennyglee
Name: Denny Lee
Type: User
Company: @databricks
Bio: data dork, scribe, geek, ultimate frisbee fan, mountain climber (barely!), wanna be cyclist... occasionally awake
Location: Seattle, WA
Blog: dennyglee.github.io
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
Apache Arrow DataFusion SQL Query Engine
Official Rust implementation of Apache Arrow
Automated Machine Learning on Databricks
A Variant Caller, Distributed. Apache 2 licensed.
The most cited deep learning papers
Astronomy Extensions for Spark
Exploring graph queries on top of Azure Cosmos DB with Gremlin
Azure Cosmos DB's Graph API provides the graph data model and Gremlin. This tutorial shows how to get started with the Graph (Gremlin) API and the Java SDK.
Azure Cosmos DB's Graph API provides the graph data model and Gremlin. This tutorial shows how to get started with the Graph (Gremlin) API and the Node.js SDK.
Apache Spark Connector for Azure Cosmos DB
Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
Explore, transform, and analyze FHIR data with Apache Spark
Caffe: a fast open framework for deep learning.
Web Social SDK based on Scala, Node.js, Hadoop, Hive, and Analysis Services. The kit is based off of the Hadoop Summit presentation "How Klout changed the landscape of social media with Hadoop and BI".
The purpose of this project is to expand on the “Reaching Compliance: SQL Server 2008 Compliance Guide” to more easily handle larger volumes of structured and unstructured data. The end goal is to gain richer and deeper insight using the latest analytics. To achieve this, we are building a Big Data-to-BI project involving HDInsight (Hadoop on Windows or Azure), SQL Server 2012, SQL Server Analysis Services 2012 Tabular, Integration Services, PowerPivot, and Power View.
☕⛵WIP PySpark dependency management
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Run Gremlin queries over Azure Cosmos DB using Spark.
Fast n-dimensional filtering and grouping of records.
Generic curve fitting package with nonlinear mixed effects model
Repository of sample Databricks notebooks
The Leek group guide to data sharing