GithubHelp home page GithubHelp logo

Comments (4)

rusenask avatar rusenask commented on July 21, 2024 1

Hi @mrwyattii, I think the problem with the grpc client is this

assert self.num_gpus > 0, "GPU count must be greater than 0"
- it's checking whether GPU is available when in fact it's on the client side so it shouldn't care :) Commenting that part out makes it work with the remote GRPC server

from deepspeed-mii.

mrwyattii avatar mrwyattii commented on July 21, 2024

If I understand, you are trying to stand up a DeepSpeed-MII GRPC server and then send queries to that server remotely. Is that correct?

As for the second question, we do not currently support loading/unloading models at query time. You might be able to achieve this by using mii.terminate(old_deployment_name) and then mii.deploy(new_deployment_name, ...) when you detect that the query you are running does not match the current deployment.

from deepspeed-mii.

Thytu avatar Thytu commented on July 21, 2024

If I understand, you are trying to stand up a DeepSpeed-MII GRPC server and then send queries to that server remotely. Is that correct?

In my understanding DeepSpeed is both an optimisation solution and an inference engine (cf).

Is there a way, using DeepSpeed-MII, to have the DeepSpeed engine in a remote server and specify to the client the target host/post?

As for the second question, we do not currently support loading/unloading models at query time. You might be able to achieve this by using mii.terminate(old_deployment_name) and then mii.deploy(new_deployment_name, ...) when you detect that the query you are running does not match the current deployment.

Do you plan to handle this feature? I would be happy to help to implement this feature if needed πŸ˜‰
(Using mii.terminate(old_deployment_name) and then mii.deploy(new_deployment_name, ...) would be really slow)

from deepspeed-mii.

mrwyattii avatar mrwyattii commented on July 21, 2024

Sorry the late reply on this. There is a way to use MII on a remote server and have a client send queries and receive a response. This functionality right now is currently limited to AML deployments. You could probably adapt what we are doing in the AML deployment docker image to achieve this functionality yourself:

I realize you have asked about providing a Docker image for MII previously (#83). I've been working on automatically generating up-to-date images that we can share on DockerHub and on Azure Marketplace to make deploying MII easier. I think we could also bundle some things from our AML docker builds to enable this remote server capability on non-AML deployments.

Do you plan to handle this feature? I would be happy to help to implement this feature if needed πŸ˜‰

We currently do not have plans to add this feature, but we are always open to outside contributions. Is the main goal here to have a persistent GRPC server that can swap which model it's running on the fly?

from deepspeed-mii.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.