GithubHelp home page GithubHelp logo

Comments (3)

ask avatar ask commented on July 17, 2024

This is only a problem with the in-memory store, right? If so I think it's OK to not clean up as I consider the memory store a development backend, not suited for production use.

from faust.

vineetgoel avatar vineetgoel commented on July 17, 2024

Wait so if we use rocksdb, do we not keep anything in memory at that point? Is everything just in rocks?

from faust.

ask avatar ask commented on July 17, 2024

Yeah, everything is in RocksDB!† Removing a partition is as easy as deleting the database directory for that partition, which is something that we can do when using RocksDB but I'm unsure if deleting
it is such a good idea. I guess it depends on how fast the data spreads to all nodes in the cluster,
but maybe the partitioner should also take into consideration the partitions a node has data for?

It would be neat if the partitioner did this:

  1. Chose a node that already have a copy of the partition in a Rocks db. I guess the nodes could
    report list of partitions and checkpoints as metadata.
  2. If there are no nodes with the data, assign a fresh node so that it will transfer the data.

If the partitioner implemented this, I think you would rarely come across a situation
where scenario 2) happens, and then you have a more efficient implementation
for recovery, without the needed for cleanup.

† Apart from the Key->Partition index, but that's a LRU cache of max 10k objects by default.

from faust.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.