squeezeailab / llm2llm

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Home Page: https://arxiv.org/abs/2403.15042

License: MIT License


llm2llm's Introduction

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement [Paper]

Thumbnail

This is the code for the LLM2LLM paper.

Reproducing Main Experiments

We have provided code required to reproduce our main experiments for GSM8K. Instructions for other datasets will be uploaded soon.

  1. Download a copy of LLaMA-2-7B and the appropriate dataset.
  2. Clone the GSM8K dataset by running
cd GSM8K
git clone https://github.com/openai/grade-school-math.git
  3. Run generate_seed_data.py and adjust SUBSAMPLE_SPLIT to get seed data.
  4. Ensure that all settings in config.yaml are accurate.
  5. Run python GSM8K/generator_data.py GSM8K/config.yaml.
  6. cd into your experiment folder and run ./run_all.sh.
  7. After all of the iterations have finished, run
python report_results.py --results_file_name test_0.jsonl GSM8K/grade-school-math/grade_school_math/data/test.jsonl $EXP_FOLDER

to get a detailed breakdown of the performance of the model at each iteration.
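As a rough illustration of what such a per-iteration breakdown involves, the snippet below scores prediction files against GSM8K references. This is a minimal sketch, not the actual report_results.py implementation: the directory layout (one subfolder per iteration containing test_0.jsonl), the record fields `answer` and `prediction`, and the use of GSM8K's `#### <answer>` convention are all assumptions.

```python
import json
from pathlib import Path


def final_answer(text: str) -> str:
    """Extract the final answer after GSM8K's '#### ' marker."""
    return text.split("####")[-1].strip()


def accuracy_per_iteration(exp_folder: str) -> dict:
    """Score each iteration's test_0.jsonl predictions against references.

    Assumes (hypothetically) one subfolder per iteration, each holding a
    test_0.jsonl file whose records carry 'answer' and 'prediction' fields.
    """
    scores = {}
    for results_file in sorted(Path(exp_folder).glob("*/test_0.jsonl")):
        records = [
            json.loads(line)
            for line in results_file.read_text().splitlines()
            if line.strip()
        ]
        correct = sum(
            final_answer(r["prediction"]) == final_answer(r["answer"])
            for r in records
        )
        scores[results_file.parent.name] = correct / max(len(records), 1)
    return scores
```

The division guard (`max(len(records), 1)`) simply avoids a crash on an empty results file.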

This will produce an output folder that contains all the data and model checkpoints.
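The seed-data step above can be sketched as a simple random subsample of the training jsonl. This is only an illustration of the idea, assuming GSM8K's one-JSON-object-per-line format; the actual split size and output path come from SUBSAMPLE_SPLIT in generate_seed_data.py, and the function name here is hypothetical.

```python
import json
import random
from pathlib import Path


def subsample_seed_data(train_path: str, out_path: str,
                        n_seed: int, seed: int = 0) -> int:
    """Randomly pick n_seed training examples to serve as the seed dataset.

    Hypothetical stand-in for generate_seed_data.py's SUBSAMPLE_SPLIT logic.
    """
    examples = [
        json.loads(line)
        for line in Path(train_path).read_text().splitlines()
        if line.strip()
    ]
    rng = random.Random(seed)  # fixed seed keeps the subsample reproducible
    subset = rng.sample(examples, min(n_seed, len(examples)))
    Path(out_path).write_text("\n".join(json.dumps(ex) for ex in subset))
    return len(subset)
```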

Roadmap

We are planning on adding the code required to reproduce our experiments on other datasets.

Citation

LLM2LLM has been developed as part of the following paper. We would appreciate it if you cite this paper when you find this library useful for your work:

@article{lee2024llm2llm,
      title={LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement}, 
      author={Lee, Nicholas and Wattanawong, Thanakul and Kim, Sehoon and Mangalam, Karttikeya and Shen, Sheng and Anumanchipali, Gopala and Mahoney, Michael W and Keutzer, Kurt and Gholami, Amir},
      journal={arXiv},
      year={2024},
}


llm2llm's Issues

where is run_all.sh

Thank you for open-sourcing this work. I'd like to try it on my own dataset, but I cannot find the complete running pipeline run_all.sh. Is it missing?

Insightful Connection to My Previous Paper

I recently read your paper and it is a great paper. Your research provides valuable insights into LLM-based data augmentation.

As I was reading your paper, I couldn't help but notice the parallels between your findings and the work AI2 and I published last year at EMNLP, titled "Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation." Our paper delves into targeted data augmentation for MWP solving, which might complement and extend the discussions in your paper.

Therefore, I was wondering if you might consider acknowledging our work in your paper, as it could provide additional depth to the understanding and implications of your findings for the readers. I would be more than happy to discuss this further or provide any additional information you might need regarding my work.

Other Tasks

Only the code for GSM8K is provided. Will the code for other datasets be provided?
If so, what is the approximate timeline? Thanks!
