GithubHelp home page GithubHelp logo

sdsc-hackathon-llm-playground's Introduction

SDSC_hackathon_LLM_playground

Under development

Right now the total image size is around 7GB without including the model. The model selected will be downloaded on the first run of the script.

Follow the notebook tutorial to run your first model.

Links below only valid after starting the jupyter server.

How to use it?

docker pull caviri/sdsc-llm-playground:latest
docker pull caviri/sdsc-llm-playground:nonroot_user
docker run --rm -it  --gpus all -p 8888:8888 -e JUPYTER_TOKEN=TEST caviri/sdsc-llm-playground:nonroot_user

Enter in the server in: http://127.0.0.1:8888/ and use the password defined

How to use this on runai?

runai submit testllm4 -i caviri/sdsc-llm-playground:nonroot_user -e JUPYTER_TOKEN=TEST --service-type=portforward --port 8888:8888 --attach --interactive --node-type "A100" -g 0.2

How to build this docker?

docker build -t caviri/sdsc-llm-playground:latest .

How to run Fastchat?

  1. Open a new terminal in jupyter lab
  2. Install Fastchat with pip install fschat
  3. Run a model:
    • python3 -m fastchat.serve.cli --model-path lmsys/fastchat-t5-3b-v1.0 (~10GB VRAM)
    • python3 -m fastchat.serve.cli --model-path databricks/dolly-v2-7b (~16GB VRAM)

This will open an interactive session with the model. --load-8bit flag will reduce the size of the models in memory but is not working.

sdsc-hackathon-llm-playground's People

Contributors

caviri avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.