GithubHelp home page GithubHelp logo

v0.4.3 Release Tracker about vllm HOT 12 OPEN

simon-mo avatar simon-mo commented on June 15, 2024 7
v0.4.3 Release Tracker

from vllm.

Comments (12)

robertgshaw2-neuralmagic avatar robertgshaw2-neuralmagic commented on June 15, 2024 3

Is there any particular PR that we're waiting for before cutting the release?

The model support for Phi and Deepseek

from vllm.

robertgshaw2-neuralmagic avatar robertgshaw2-neuralmagic commented on June 15, 2024 1

I am going to try to get these in

probably will not make it but tracking to v0.4.4:

from vllm.

njhill avatar njhill commented on June 15, 2024 1

Sounds like we may want to include #4894 @rkooo567?

from vllm.

robertgshaw2-neuralmagic avatar robertgshaw2-neuralmagic commented on June 15, 2024 1
  • Fix for mistral-v0.3: #5005

from vllm.

sasha0552 avatar sasha0552 commented on June 15, 2024 1

With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

Not only fp16, but AQLM works well too (#5058)

image

from vllm.

robertgshaw2-neuralmagic avatar robertgshaw2-neuralmagic commented on June 15, 2024 1

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

Hi @robertgshaw2-neuralmagic - was this without the patch? I couldn't get a source build to run on P100's without the patch of #4409. With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

P40 requires building with the patch.

from vllm.

sasha0552 avatar sasha0552 commented on June 15, 2024

Hi, is it possible to include the following PRs?

from vllm.

simon-mo avatar simon-mo commented on June 15, 2024

Thanks for bring these up @sasha0552!

#4167 is unlikely to be finished in time.
#4409 might need a little bit more discussion given what features are supported for Pascal GPUs and whether building from source might be a better option.
#4638 can be included if it gets merged in time.

We do commit to biweekly release cadence so don't worry many of these will get into soon enough!

from vllm.

robertgshaw2-neuralmagic avatar robertgshaw2-neuralmagic commented on June 15, 2024

Thanks for bring these up @sasha0552!

#4167 is unlikely to be finished in time. #4409 might need a little bit more discussion given what features are supported for Pascal GPUs and whether building from source might be a better option. #4638 can be included if it gets merged in time.

We do commit to biweekly release cadence so don't worry many of these will get into soon enough!

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

from vllm.

rkooo567 avatar rkooo567 commented on June 15, 2024

Yeah +1 on that PR @njhill

from vllm.

jasonacox avatar jasonacox commented on June 15, 2024

re: #4409 --> I did not have any issues running an fp16 model on a P40 when I installed from source.

Hi @robertgshaw2-neuralmagic - was this without the patch? I couldn't get a source build to run on P100's without the patch of #4409. With the patch, like you, running fp16 models (Mistral 7B for example) with no issues.

from vllm.

vrdn-23 avatar vrdn-23 commented on June 15, 2024

Is there any particular PR that we're waiting for before cutting the release?

from vllm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.