GithubHelp home page GithubHelp logo

Comments (6)

shrekris-anyscale avatar shrekris-anyscale commented on July 23, 2024

Could you provide a minimal repro for your setup? Do the vLLM workers die randomly, or is there a pattern?

from ray-llm.

rtwang1997 avatar rtwang1997 commented on July 23, 2024

Hi,

We are finding that the vLLM workers die after running for around 1 hour (give or take), consistently.

We tried with a very simple ray service, where a deployment creates an actor in it's init function and I get the same behaviour where the actor dies after ~1h because all reference to it get removed. Here's the sample code:

import ray
from ray import serve
from fastapi import FastAPI

app = FastAPI()


@ray.remote
class Actor:
    def __call__(self):
        return "hello world"


@serve.deployment
@serve.ingress(app)
class APIIngress:
    def __init__(self):
        self.actor = Actor.remote()

    @app.get("/health")
    async def healthcheck(self):
        """
        checks if server is up.
        """
        return {"status": "server up"}

    @app.get("/test")
    async def test(self):
        return await self.actor.remote()


SERVE_APP = APIIngress.bind()

from ray-llm.

shrekris-anyscale avatar shrekris-anyscale commented on July 23, 2024

Thanks for the repro! What Ray version are you using?

from ray-llm.

rtwang1997 avatar rtwang1997 commented on July 23, 2024

Hi,

We are using Ray version 2.7.1

from ray-llm.

shrekris-anyscale avatar shrekris-anyscale commented on July 23, 2024

Are there any updates here @rtwang1997? I synced with one of your coworkers on Slack and proposed some approaches, but I'm not sure how it went.

from ray-llm.

shrekris-anyscale avatar shrekris-anyscale commented on July 23, 2024

@rtwang1997 I'll close this issue for now. Feel free to reopen if you're still running into the problem.

from ray-llm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.