GithubHelp home page GithubHelp logo

Shida Wang's Projects

h3 icon h3

Language Modeling with the H3 State Space Model

hyena icon hyena

JAX/Flax implementation of the Hyena Hierarchy

interest icon interest

Temporal re-weighting improve the long-term memory learning.

keops icon keops

KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows

lhj icon lhj

Can we use lightning as data loader and jax as the models?

lit-gpt icon lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

mamba-minimal icon mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

picogpt icon picogpt

An unnecessarily tiny implementation of GPT-2 in NumPy.

pyprobml icon pyprobml

Python code for "Probabilistic Machine learning" book by Kevin Murphy

rwkv-cuda icon rwkv-cuda

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

rwkv-lm icon rwkv-lm

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

rwkv-v2-rnn-pile icon rwkv-v2-rnn-pile

RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.