GithubHelp home page GithubHelp logo

Hi there 👋

I am Wei Xiong, currently a first-year Ph.D. student in computer science at UIUC. I work on RLHF for aligning language models.

Previously, I have spent time on the mathematical foundation of RL, where I was fortunate to collaborate with many great senior mentors and talented peers. I also spent time on deep RL at Microsoft Research Asia.

You can find more information about me at:

Wei Xiong's Projects

awesome-rlhf icon awesome-rlhf

A curated list of reinforcement learning with human feedback resources (continually updated)

functionary icon functionary

Chat language model that can use tools and interpret the results

markdown4zhihu icon markdown4zhihu

一键解决知乎导入Markdown文件时图片和公式等问题。

mpmab_beacon icon mpmab_beacon

This is the official implementation for the paper "Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization" in NeurIPS 2021.

nemo-skills icon nemo-skills

A pipeline to improve skills of large language models

observe_then_incentivize icon observe_then_incentivize

This is the official implementation for the paper "(Almost) Free Incentivized Exploration from Decentralized Learning Agents" in NeurIPS 2021.

reward-bench icon reward-bench

RewardBench: the first evaluation tool for reward models.

tora icon tora

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

ultrafeedback icon ultrafeedback

A large-scale, fine-grained, diverse preference dataset (and models).

xwin-lm icon xwin-lm

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.