jackory Goto Github PK
Name: Yuhua Jiang
Type: User
Company: Tsinghua University
Bio: Nurture talents in obscurity.
Location: Beijing
Name: Yuhua Jiang
Type: User
Company: Tsinghua University
Bio: Nurture talents in obscurity.
Location: Beijing
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
Use ChatGPT to summarize the arXiv papers.
ChatReviewer: use ChatGPT to review papers; ChatResponse: use ChatGPT to respond to reviewers.
一个通过socket实现的聊天室程序
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
2020全国大学数学建模大赛 赛题B 穿越沙漠
Online Judge 刷题
multi-agent deep reinforcement learning for networked system control.
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
为GPT/GLM提供图形交互界面,特别优化论文阅读润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持清华chatglm等本地模型
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
A clean code base for imitation learning and reinforcment learning , written in Pytorch
图片分类
A beautiful, simple, clean, and responsive Jekyll theme for academics
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
Recorded projects completed in NJU
OmniSafe is an infrastructural framework for accelerating SafeRL research.
This is the official implementation of Multi-Agent PPO (MAPPO).
CUI版植物大战僵尸
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)
An educational resource to help anyone learn deep reinforcement learning.
应用统计与R语言大作业
天池人工智能技术创新大赛赛道三
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Train transformer language models with reinforcement learning.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.