GithubHelp home page GithubHelp logo

patrick-tssn / lm-research-hub Goto Github PK

View Code? Open in Web Editor NEW
15.0 2.0 3.0 4.59 MB

Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)

License: MIT License

Python 53.59% Jupyter Notebook 43.93% sed 2.48%
accelerate api-wrapper instruction-tuning large-language-models multi-modal pretraining rlhf llm open-source

lm-research-hub's Introduction

Language Modeling Research Hub

Welcome to the Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs). This repository is meticulously organized to facilitate easy navigation and access to a wealth of information.

  • Learning Resources: A selection of articles, courses, open-source projects, and data designed to enhance your LM knowledge.
  • Practices: A showcase of my experiments and code, offering interactive demos and frameworks that highlight LM's potential.

Dive in to explore, contribute, and expand the horizons of language modeling with us!

"In theory, theory and practice are the same. In practice, they are not." --Albert Einstein

📑 Table of Contents

📖 Learning Resources

Reading List

Reading list and related notes for LLM research, see Reading List for details.

  • Key Findings
  • Architecture
  • Causality
  • Code Learning
  • Dialogue
  • Efficiency
  • Human Alignment
  • Information Extraction
  • Instruction Tuning
  • Interpretability
  • In Context Learning
  • Knowledge Update
  • Mixture of Experts (MoE)
  • Non-Autoregressive Generation
  • Reasoning
    • Abstract Reasoning
    • Chain of Thought
    • Symbolic Reasoning
  • Retrieval
  • Social

Notes

My notes for LM research, see notes for details.

  • Tokenization
  • Position Encoding

Courses

Books

Blogs

Talks

  • LLM-Talk, LLMs in Five Formulas, by Alexander Rush

Open Source LLMs

Collection of various open-source LLMs, see Open-source LLMs for details.

  • Pretrained Model
  • Multitask Supervised Finetuned Model
  • Instruction Finetuned Model
    • English
    • Chinese
    • Multilingual
  • Human Feedback Finetuned Model
  • Domain Finetuned Model
  • Open Source Projects
    • reproduce/framework
    • accelerate
    • evaluation
    • deployment/demo

Related Collections

  • open-llms GitHub last commit (by committer)Dynamic JSON Badge, A list of open LLMs available for commercial use.
  • LLM-Zoo GitHub last commit (by committer)Dynamic JSON Badge, collects information of various open- and closed-source LLMs
  • FindTheChatGPTer GitHub last commit (by committer)Dynamic JSON Badge, 汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等
  • **大模型 GitHub last commit (by committer)Dynamic JSON Badge, 旨在记录**大模型情况
  • Awesome-Chinese-LLM GitHub last commit (by committer)Dynamic JSON Badge, 整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等

Dataset Collections

Datasets for Pretrain/Finetune/Instruction-tune LLMs, see Datasets for details.

  • Pretraining Corpora
  • Instruction

Related Collections

  • Awesome-LLMs-Datasets GitHub last commit (by committer)Dynamic JSON Badge, Summarize existing representative LLMs text datasets.
  • LLMDataHub GitHub last commit (by committer)Dynamic JSON Badge, A quick guide (especially) for trending instruction finetuning datasets
  • Awesome Instruction Datasets GitHub last commit (by committer)Dynamic JSON Badge, A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型
  • sft_datasets GitHub last commit (by committer)Dynamic JSON Badge, 开源SFT数据集整理,随时补充

Evaluation Benchmarks

Collection of automatic evaluation benchmarks, see Evaluation Benchmarks for details.

  • English
    • Comprehensive
    • Knowledge
    • Reason
      • Hard Mathematical, Theorem
    • Code
    • Personalization
  • Chinese
    • Comprehensive
    • Safety
  • Multilingual

Related Collections

Prompt Engineering

Collection of tricks of writing a perfect prompt, see Prompt for details.

💻 Practice

API

LLM API demos (including mirror links), see API for details.

  • openai

Instruction Tuning

  1. Instruction Construct: Construct Instruction by mixture or self-instruct
  2. Fine Tuning: Instruction Tuning on 4 LLM with multilingual instructions

see Instruction Tuning for details.

  • Experiments
    • Datasets
      • Collection
      • Bootstrap
    • Model Cards
    • Usage
  • Results

Constrained Generation

constrain LLM to generate specific answer (e.g., some open ended QA, limited vocabulary tasks), see Constrained Generate for details.

  • Common method (constrain vocabulary + sample algorithm)
  • Trie + Beam search (has issues currently)

🤝 Related Collections

  • LLMSurvey GitHub last commit (by committer)Dynamic JSON Badge, A collection of papers and resources related to Large Language Models
  • LLMsPracticalGuide GitHub last commit (by committer)Dynamic JSON Badge, A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
  • Awesome-LLM GitHub last commit (by committer)Dynamic JSON Badge, a curated list of Large Language Model
  • llm-action GitHub last commit (by committer)Dynamic JSON Badge, 本项目旨在分享大模型相关技术原理以及实战经验

lm-research-hub's People

Contributors

patrick-tssn avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.