Echo🌟's Projects
6.S081课程全记录,包括课程使用的书籍、论文、实验要求的中文翻译,以及实验过程记录
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
Alluxio, data orchestration for analytics and machine learning in the cloud
Alluxio Python client - Access Any Data Source with Python
Speed up fsspec data access with Alluxio distributed caching.
A cloud native implementation for Apache RocketMQ 5.0
字节跳动第四届青训营大数据实训项目2
Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集
cloud-native distributed storage
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60多个国家的400多所大学用于教学。
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
A collection of classic computer science books from Internet
A specification that python filesystems should adhere to.
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Interview = 简历指南 + 算法题 + 八股文 + 源码分析
「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
Basic Sources for MIT 6.824 Distributed Systems Class
Netty project - an event-driven asynchronous network application framework
Text2SQL 语义解析数据集、解决方案、paper资源整合项目
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
MIT6.830
StoneDB is an Open-Source MySQL HTAP and MySQL-Native DataBase for OLTP, Real-Time Analytics, a counterpart of MySQLHeatWave. (https://stonedb.io)
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
A course to build distributed key-value service based on TiKV model