GithubHelp home page GithubHelp logo

Hi, I'm Vachan. I like to build and train Deep Neural Networks from scratch.

Projects:

  • GPT written in jax, trained on tiny shakespeare dataset (1.1 MB text data) and scaled it on the tiny stories dataset (~2 GB text data)
    Model-Params d_model n_heads maximum_context_length num_layers vocab_size Estimated Validation Loss on tiny stories dataset
    280K 64 8 512 5 512 1.33
    15M 288 6 256 6 32000 1.19
    45M 512 8 1024 8 32000 TODO
    110M 768 12 2048 12 32000 TODO
  • Vision Transformers in jax, trained on MNIST dataset

Mugen

  • Going to make a website for music generation completely from scratch using Pytorch. On-going project...
  • Models for this Project

      • Non Autoregressive Transformer
      • Autoregressive Transformer
      • Diffusion Transformer

Vachan V Y's Projects

detr icon detr

Simplifying Object detection. DEtection TRansformer (DETR) in JAX. (To be Tested)

dreamnet icon dreamnet

Seeing What Deep Convolutional Networks Dream

gpt.jax icon gpt.jax

Generative Pretrained Model (GPT) in JAX. A step by step guide to train LLMs on large datasets from scratch

neuroforge icon neuroforge

Unveiling the Layers: Building a Neural Network (MLP) from Scratch

rotary-embeddings icon rotary-embeddings

Simple Implementation of Rotary Positional Embeddings (RoPE) and Sinusoidal Positional Embeddings in JAX

vivit icon vivit

ViViT: A Video Vision Transformer in PyTorch

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.