Comments (10)
Hi @greensky00 ,
Thanks for your responses. Any suggestions on the API for cluster membership tracking?
Thanks
from nuraft.
Hi @kishorekrd
"membership" itself never changes according to online/offline status, unless users explicitly do add/remove server to/from the cluster.
The term "offline" is ambiguous and cannot be defined clearly due to millions of situations as follows:
- The follower is still "online", but the leader just cannot see it due to network partition.
- The follower is still "online", but just slow so that cannot respond to a few heartbeats.
- The follower is still "online", and still responds to heartbeats, but its log index is lagging behind the majority of members. And the log gap is getting bigger.
- The leader is isolated so that it thinks all the others are "offline", but actually the other members are alive; they form a quorum and elect a new leader, but the old leader does not know about that. From the other members' point of view, the old leader is "offline".
Hence, keeping tracking of "offline" status cannot give us meaningful information for the Raft protocol itself. If you want to track "offline" status, you need to define "what offline is", and implement it outside Raft.
Please note that we provide a few callbacks for the "freshness" of followers:
NuRaft/include/libnuraft/callback.hxx
Lines 99 to 113 in 8699f3b
from nuraft.
+)
One thing NuRaft can do additionally is providing a new API that reports "the last time each follower responded", like the below:
follower1: responded 10 ms ago
follower2: responded 900 ms ago
follower3: responded 5,120 ms ago
From that information, you can decide the up/down status of each follower based on your criteria.
Please let me know if it works for you. Thanks.
from nuraft.
Thanks for the response. What is the callback function for the leader to call if any follower is not responding or some follower, which went down before came back ?
from nuraft.
@kishorekrd
There is no such callback as response failure is quite common although followers are alive. It will be super verbose and will cause lots of false positives.
from nuraft.
That make sense. Thanks.
in some thread you mentioned that , auto_forward is not used and not tested fully. Is that means clients has to send their requests to the leader always? Like 2 step method, connect to any node and find the leader details, then send request to the leader.
from nuraft.
connect to any node and find the leader details, then send request to the leader.
Yes. But usually, a leader does not change frequently so that you can remember the last leader and directly reach out to the leader next time. Then the 2-step method will happen only when the leader changes.
What about the API that I mentioned here? Do you think it will be helpful for you?
#199 (comment)
from nuraft.
Yes, API that you mentioned at #199 (comment) is helpful. Is this API already available or planing to add?
from nuraft.
Ok. It is a brand new API and will be added soon.
from nuraft.
Thanks,
from nuraft.
Related Issues (20)
- buffer overflow vulnerability HOT 2
- Whether we should add consistency test. HOT 2
- bug: runtests.sh calls missing buffer_test script HOT 2
- RPC should set connection timeout. HOT 3
- OpenSSF Security Scorecard improvements
- Wrong "src" and "dst" when cs_new<req_msg> HOT 2
- deadblock HOT 1
- Clean up CMakeLists HOT 2
- nuraft 2.0.0 cannot work on centos7 + centos8 HOT 2
- Preconditions of apply_pack HOT 1
- handle_append_entries_resp() declined append HOT 1
- How to use NuRaft with CMake's FetchContent? HOT 12
- CMake targets should be namespaced using ALIAS targets
- CMake targets are missing usage requirements
- Leadership yielding is not synchronized with replicated log HOT 1
- Should the type be uint8_t instead of size_t for serialize_v1p(...) in srv_state.hxx line 133 HOT 1
- This is a question, not an issue. About `state_machine::pre_commit` HOT 2
- Does nuraft support linearizable read if generating no raft log entry of read requests? HOT 2
- Out of order call to state_machine::create_snapshot() when manually triggering a snapshot HOT 4
- Found a MSan error in Asio HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nuraft.