
modal_finetune_sql's Introduction

Finetuning Llama + Text-to-SQL

This walkthrough shows how to fine-tune Llama 2 7B on a text-to-SQL dataset and then use the fine-tuned model, through LlamaIndex, to run inference against any database of structured data.
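To make the task concrete, here is a minimal sketch of how one text-to-SQL training record might be turned into a prompt and a target completion. The field names (`question`, `context`, `answer`) and the prompt template are assumptions for illustration only; the dataset and template actually used in this repo may differ.

```python
# Hypothetical record layout: the field names are assumptions, not taken from the repo.
record = {
    "question": "Which city has the highest population?",
    "context": "CREATE TABLE cities (name TEXT, population INTEGER)",
    "answer": "SELECT name FROM cities ORDER BY population DESC LIMIT 1",
}

# Illustrative template: the model sees the schema and question, and learns to emit SQL.
PROMPT_TEMPLATE = (
    "You are given a SQL schema and a question.\n"
    "### Schema:\n{context}\n"
    "### Question:\n{question}\n"
    "### SQL:\n"
)


def format_example(rec: dict) -> tuple[str, str]:
    """Return (prompt, target) for one supervised fine-tuning example."""
    prompt = PROMPT_TEMPLATE.format(
        context=rec["context"], question=rec["question"]
    )
    return prompt, rec["answer"]


prompt, target = format_example(record)
print(prompt + target)
```

During fine-tuning the model is trained to continue `prompt` with `target`; at inference time only the prompt is supplied.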

Check out our full blog post here: https://medium.com/llamaindex-blog/easily-finetune-llama-2-for-your-text-to-sql-applications-ecd53640e10d

This code is taken and adapted from the Modal doppel-bot repo: https://github.com/modal-labs/doppel-bot.

Stack

  • LlamaIndex
  • Modal
  • Hugging Face datasets
  • OpenLLaMa
  • PEFT

Setup

To get started, clone or fork this repo:

git clone https://github.com/run-llama/modal_finetune_sql.git

Steps for Running

Open the notebook tutorial.ipynb for full instructions:

cd modal_finetune_sql
jupyter notebook tutorial.ipynb

Alternatively, you can run each step individually with the commands below:

Loading data: modal run src.load_data_sql

Finetuning: modal run --detach src.finetune_sql

Inference: modal run src.inference_sql_llamaindex::main --query "Which city has the highest population?" --sqlite-file-path "nbs/cities.db"

(Optional) Downloading model weights: modal run src.download_weights --output-dir out_model
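The inference step takes a natural-language query and a SQLite file. As a rough sketch of what happens around the model call, the example below reads the schema from a SQLite database and executes a generated query. It uses only Python's standard `sqlite3` module; the helper names, the in-memory table, and the hard-coded `generated_sql` string are all hypothetical stand-ins, and this is not the repo's actual implementation (which goes through LlamaIndex).

```python
import sqlite3


def get_schema(conn: sqlite3.Connection) -> str:
    """Collect the CREATE TABLE statements that SQLite stores in sqlite_master."""
    rows = conn.execute(
        "SELECT sql FROM sqlite_master WHERE type = 'table' AND sql IS NOT NULL"
    ).fetchall()
    return "\n".join(row[0] for row in rows)


def run_generated_sql(conn: sqlite3.Connection, sql: str) -> list:
    """Execute model output, accepting only a single SELECT statement.

    The check is deliberately conservative: any query containing an inner
    semicolon (even inside a string literal) is rejected.
    """
    statement = sql.strip().rstrip(";")
    if not statement.lower().startswith("select") or ";" in statement:
        raise ValueError(f"Refusing to run non-SELECT or multi-statement SQL: {sql!r}")
    return conn.execute(statement).fetchall()


# In-memory stand-in for a file such as nbs/cities.db; the table and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cities (name TEXT, population INTEGER)")
conn.executemany(
    "INSERT INTO cities VALUES (?, ?)",
    [("Tokyo", 13960000), ("Toronto", 2930000), ("Berlin", 3645000)],
)

# The schema text is what would be placed in the model's prompt.
schema = get_schema(conn)

# Placeholder for the model's output; a real run would obtain this from the fine-tuned model.
generated_sql = "SELECT name FROM cities ORDER BY population DESC LIMIT 1;"

rows = run_generated_sql(conn, generated_sql)
print(rows)
```

Restricting execution to a single SELECT is one simple guard against a model emitting destructive statements; opening the database in read-only mode is another option.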

modal_finetune_sql's People

Contributors

jerryjliu
