EAGLE-2 is slower than EAGLE-1 (eagle, closed)

yjdy commented on August 24, 2024
EAGLE-2 is slower than EAGLE-1

from eagle.

Comments (8)

yjdy commented on August 24, 2024

I made a mistake above: the inference speed of EAGLE-2 is 66 and that of EAGLE-1 is 69. Also, the temperature is 0.

hongyanz commented on August 24, 2024

This is not normal. Can you provide more details (e.g., whether you are running anything else on your machine, what your environment is, and which code you are running)? Without them, it is hard to debug your code.

yjdy commented on August 24, 2024

Some details of my environment are listed as follows:
1 GPU V100 32G memory
python 3.10.14
CUDA 11.7
Driver Version: 515.65.01
torch 2.1.0
triton 2.1.0
transformers 4.36.2

I just ran the evaluation script gen_ea_answer_vicuna.py as suggested in the README, with:
batch size = 1
temperature = 0
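
For anyone reporting a similar problem, the package versions listed above can be gathered with a short script. This is a minimal sketch, not part of the EAGLE repository: the function name `collect_versions` and the default package list are assumptions chosen to match the list above. It reads installed versions through the standard library's `importlib.metadata`, so it does not need to import torch itself.

```python
import platform
from importlib import metadata


def collect_versions(packages=("torch", "triton", "transformers")):
    """Return the Python version plus the installed version of each package."""
    info = {"python": platform.python_version()}
    for name in packages:
        try:
            info[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed in the current environment.
            info[name] = "not installed"
    return info


if __name__ == "__main__":
    for name, version in collect_versions().items():
        print(f"{name} {version}")
```

The GPU model and driver version are not covered by this sketch; `nvidia-smi` reports both, and `python -m torch.utils.collect_env` prints a fuller report when PyTorch is installed.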

Liyuhui-12 commented on August 24, 2024

A possible reason is that total_token was not set correctly.

yjdy commented on August 24, 2024

Thanks for the response. Can you give me some advice on setting total_token? Should I set it larger or smaller?

Lucas-TY commented on August 24, 2024

Hi, the benchmark doesn't record new tokens correctly. I don't know if that's normal.

python -m eagle.evaluation.gen_ea_answer_vicuna \
        --ea-model-path yuhuili/EAGLE-Vicuna-7B-v1.3 \
        --base-model-path lmsys/vicuna-7b-v1.3

python -m eagle.evaluation.gen_baseline_answer_vicuna \
        --ea-model-path yuhuili/EAGLE-Vicuna-7B-v1.3 \
        --base-model-path lmsys/vicuna-7b-v1.3

{"question_id": 81, "answer_id": "TP4CRrbLYBqFHdQqoeb7ug", "model_id": "ess-vicuna-70b-fp16-baseline-temperature-1.0", "choices": [{"index": 0, "turns": ["....... "idxs": [603, 603], "new_tokens": [0, 0], "wall_time": [8.09636378288269, 7.946403741836548]}], "tstamp": 1720166094.253764}

Liyuhui-12 commented on August 24, 2024

> Thanks for the response. Can you give me some advice on setting total_token? Should I set it larger or smaller?

In general, the smaller the model and the more compute your hardware has, the larger this value should be.
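
Since the best value depends on both the model and the hardware, one practical approach is to measure a few candidates and keep the fastest. The sketch below is only an illustration of that idea, not code from the EAGLE repository: `pick_total_token` and `measure_speed` are hypothetical names, the candidate values are arbitrary, and `measure_speed` stands for whatever benchmark you run with a given total_token (for example, a wrapper around the evaluation script that returns tokens per second).

```python
def pick_total_token(measure_speed, candidates=(20, 40, 60, 80, 100)):
    """Return (best_value, results), where results maps each candidate to its speed.

    measure_speed is a hypothetical callable: it takes a total_token value,
    runs your benchmark with it, and returns throughput (higher is better).
    The default candidates are arbitrary starting points, not recommendations.
    """
    results = {value: measure_speed(value) for value in candidates}
    best = max(results, key=results.get)
    return best, results


if __name__ == "__main__":
    # Stand-in for a real benchmark run; this made-up curve peaks at 60.
    def fake_measure(total_token):
        return -abs(total_token - 60)

    best, results = pick_total_token(fake_measure)
    print(best, results)
```

On a weaker GPU or a larger model, the advice above suggests extending the candidate list toward smaller values rather than larger ones.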

Liyuhui-12 commented on August 24, 2024

> Hi, the benchmark doesn't record new tokens correctly. I don't know if that's normal.

It is normal for the baseline not to return new tokens.
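
Because the baseline records new_tokens as 0, dividing new_tokens by wall_time gives no usable speed for those records. A minimal sketch of one workaround follows, assuming the record layout shown in the output above. The function `tokens_per_second` and its `count_tokens` argument are hypothetical, not part of the EAGLE repository: when no new tokens were recorded, the generated text in "turns" is counted with a caller-supplied function (for example, one built on the model's tokenizer).

```python
import json


def tokens_per_second(record, count_tokens=None):
    """Compute throughput for one answer record.

    Uses the recorded new_tokens when they are non-zero. Otherwise falls back
    to count_tokens(text), a hypothetical callable applied to each generated
    turn. Returns None if no token count or no positive wall time is available.
    """
    choice = record["choices"][0]
    total_time = sum(choice["wall_time"])
    total_tokens = sum(choice.get("new_tokens", []))
    if total_tokens == 0 and count_tokens is not None:
        total_tokens = sum(count_tokens(turn) for turn in choice["turns"])
    if total_tokens == 0 or total_time <= 0:
        return None
    return total_tokens / total_time


if __name__ == "__main__":
    # Made-up record in the same shape as the benchmark output; not real data.
    line = ('{"choices": [{"turns": ["a b c", "d e"], '
            '"new_tokens": [0, 0], "wall_time": [1.0, 1.5]}]}')
    record = json.loads(line)
    print(tokens_per_second(record))
    # Whitespace splitting is only a crude stand-in for a real tokenizer.
    print(tokens_per_second(record, lambda text: len(text.split())))
```

For a fair comparison, the same counting method should be applied to both the baseline and the EAGLE outputs.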
