GithubHelp home page GithubHelp logo

Hi there 👋

😉 I am Siyu Ren.

🎓 I got my Bachelor degree from Tong Ji University and Ph.D degree at Shanghai Jiao Tong University.

🔎 Currently, my research interest includes Efficient Methods for NLP/Large Language Models and techniques around mechanistic understanding of LLMs.

📚 For my academic publications, please refer to https://drsy.github.io/.

DRSY's github stats主要使用语言

profile

任思宇's Projects

awesome-rl-nlp icon awesome-rl-nlp

Curated Reinforcement Learning Resources for Natural Language Processing

commonsensevae icon commonsensevae

Diverse and Informative Commonsense Inference with Relation-Specific Gaussian Mixture Prior

dgen icon dgen

[AAAI 2021]Knowledge-Driven Distractor Generation for Cloze-Style Multiple Choice Questions

dotfiles icon dotfiles

Personal dotfiles including zshrc/vimrc/tmux.conf

drsy.github.io icon drsy.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

dualrl icon dualrl

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer (IJCAI 2019)

easykv icon easykv

Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)

emo icon emo

[ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)

fastformers icon fastformers

FastFormers - highly efficient transformer models for NLU

hdegraph icon hdegraph

Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs"

kv_compression icon kv_compression

[EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens

lamp icon lamp

[NAACL 2022 Findings]Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning

llcsa icon llcsa

Lifelong Commonsense Knowledge Acquisition

maml-pytorch icon maml-pytorch

Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning (MAML)

motis icon motis

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

pins icon pins

[ACL2023 Findgins]Pruning Pre-trained Language Models with Principled Importance and Self-regularization

plmpapers icon plmpapers

Must-read Papers on pre-trained language models.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.