Cheng Feng's Projects
Distributed SQL query engine for big data
PostgreSQL protocol gateway for Presto distributed SQL query engine
A lightweight parameter server interface
Source Material for using Python and Hadoop together
Talk and demo notebooks for PyData Chicago, August 2016: http://pydata.org/chicago2016/schedule/presentation/15/
Simple Python version management
A Python library for implementing a recommender system
Python SDK for accessing Qubole Data Service
Node, Golang, Machine Learning, PostgreSQL, Deep Learning
Latest IP address database: multi-language parsers and database import scripts
Parsing and analysis of Vertica, Hive, and Presto SQL.
Simpler, Safer, Faster Unified SQL Analytics Engine for Multi-Datasources
R package for Sublime Text 2/3
Source for "RDDs, DataFrames and Datasets in Apache Spark" NEScala presentation
Implementation of user-based and item-based collaborative filtering algorithms
Spark reference applications
regex dict: a dictionary of regular expressions
These are my notes for personal use; most of the files were written in org-mode with Emacs.
Cache File System optimized for columnar formats and object stores
A real-world data mining problem
pronounced sUrplus as it's simply better if not best!
A Scala library for connecting to a Redis server, or a cluster of Redis nodes using consistent hashing on the client side.
Collaborative Filtering
IPython notebooks from the scikit-learn video series
NoSQL data store using the seastar framework, compatible with Apache Cassandra
Secor is a service implementing Kafka log persistence
Simplified implementations of deep learning related works
StuQ skill maps