Aaron Wang's Projects
Final project for 2021 fall Theory and practice of data-intensive computing
英超非洲球员数据爬虫(源自问题思考:2022年非洲杯对英超各俱乐部实力的影响)
Alluxio, data orchestration for analytics and machine learning in the cloud
vuepress搭建个人博客
ChatGPT SDK and CLI for Java
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
:zap: Dynamically generated stats for your github readmes
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
The integration of HugeGraph with artificial intelligence
HugeGraph Computer - A distributed graph processing system for hugegraph (OLAP)
HugeGraph Website and Doc
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
李航第二版 《统计学习方法》算法代码实现
Implementation of log level prediction.
Implementation of LWE-based fuzzy extractor
SpringBoot+Vue 在线考试系统
Final project for 2022 spring BigData Management
Some convenient scripts&tools for personal use
Signature verification package, for learning representations from signature data, training user-dependent classifiers.
Apache Spark - A unified analytics engine for large-scale data processing
push-based calculation for spark application