Comments (8)
Updated my GitHub repo https://github.com/ericleigh007/DurableFunctionBenchmark-Ne.git in case anything there helps with testing. I've implemented and tested the compressed object to reduce the amount of data the backend has to handle.
from durabletask-netherite.
Ping!
Any thoughts on this?
The expected behavior when using large messages is that the step where Netherite calls the EH client to send packets to Event Hubs becomes the throughput bottleneck of the application. Based on what I can see in the logs, that appears to be the case here: they show that the Event Hubs sender is taking a long time to send the packets. Specifically, the log measures how long it takes one worker to send a batch of packets to one partition and prints something like `EventHubsSender partitions/13 sent batch of 1 packets (102796 bytes) in 684.28ms, throughput=0.14MB/s`.
Because sending large packets is slow, this also affects the latency of all other operations. All messages go through the same event hub partitions, so a small message (such as a query for the orchestration status) can get stuck behind a large one.
Several factors can contribute to this "message sending bottleneck", and each can be addressed to some extent:
- The EH namespace limits total ingress (this includes everything sent to any partition in the namespace, from any worker) to 1 MB/s per throughput unit. This can be changed by purchasing additional throughput units.
- On each worker, each partition sender has some throughput limit that depends on how the EH client is implemented. The sending throughput of an individual partition sender (one worker sending packets to one partition) is what is printed in the log, e.g. `EventHubsSender partitions/13 sent batch of 1 packets (102796 bytes) in 684.28ms, throughput=0.14MB/s`. Scaling out the number of workers and the number of partitions can help here, because (a) the sending work is spread across more workers, and (b) the sending work is spread across more partitions.
I don't at the moment have useful estimates on what throughput to expect from a partition sender. Also, it is possible that there are other factors that lead to low throughput on those senders. I have some suspicions but no concrete leads yet.
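To make the numbers from the log line concrete, here is a back-of-the-envelope estimate in Python. This is purely illustrative arithmetic based on the figures quoted above (102796 bytes, 684.28 ms, 0.14 MB/s), not a measurement of Netherite itself, and the "scaling" function assumes senders scale independently, which is an idealization:

```python
# Rough estimate of per-partition send time from the observed sender throughput.
# All numbers are illustrative, taken from the log line quoted in the thread.

def send_time_seconds(total_bytes: int, throughput_mb_s: float) -> float:
    """Time for one partition sender to push `total_bytes` at a given throughput."""
    return total_bytes / (throughput_mb_s * 1024 * 1024)

# The log reported 102796 bytes at ~0.14 MB/s:
observed = send_time_seconds(102_796, 0.14)
print(f"{observed:.2f}s")  # ~0.70s, consistent with the logged 684.28ms

# Spreading the same bytes over more workers/partitions divides the work,
# assuming each sender keeps the same individual throughput (an idealization):
def scaled_time(total_bytes: int, throughput_mb_s: float, senders: int) -> float:
    return send_time_seconds(total_bytes / senders, throughput_mb_s)

print(f"{scaled_time(102_796, 0.14, 4):.2f}s")  # ~0.18s with 4 idealized senders
```

This is consistent with the suggestion above that scaling out workers and partitions helps, at least until the namespace-wide ingress cap (1 MB/s per throughput unit) becomes the binding limit.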
Thanks for the analysis. Is this:
- a limitation of the design that you'd like to remove by doing a, b, c in some future version?
- a limitation that simply means Netherite is going to be slow at handling medium/large message traffic?
- a limitation that can be mitigated by purchasing more throughput units (3 TU, 4 TU), which will definitely help, and just needs to be documented?
- a limitation that suggests Netherite should be written off for these larger-message cases?
As a somewhat related question: is there some sort of sweet-spot message size that gives great throughput, beyond which throughput falls off quickly?
Our real system processes sets of banking data, with debits and credits grouped into a balancing unit. Because operating on one of these units at a time creates contention, handling them one per orchestrator is currently problematic both for our main durable function system and for downstream systems built on more dated technology like SQL Server.
My hope was to bundle more messages into a single SQL update, but given this behavior, doing so might completely kill our excellent Netherite throughput.
On another peripheral note, my data is quite compressible, so I have built an object that can be sent to orchestrators and cuts the payload down to between 50% and even 15% of its original size. I see some sort of Compression helper in the tests of the durable framework, but I don't see it implemented in the actual runtime. My main question is whether Netherite and/or the durable function backend itself compresses the data. If so, of course, my compression scheme won't help and could even hurt.
If there is no built-in compression functionality, is there a plan to add any? Interestingly, GPT-4 thinks there is an IDurableSerialization interface that lets the user override the default serializer; alas, that's another invention of the AI, but it could be cool.
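For what it's worth, the kind of client-side compression wrapper described above is straightforward to sketch. The following Python sketch (hypothetical helper names, not Netherite's or Durable Functions' API) shows why repetitive business records like grouped debits/credits compress so well:

```python
import gzip
import json

def compress_payload(obj) -> bytes:
    """Serialize to JSON and gzip-compress before handing off to the backend."""
    return gzip.compress(json.dumps(obj).encode("utf-8"))

def decompress_payload(blob: bytes):
    """Inverse of compress_payload: decompress and deserialize."""
    return json.loads(gzip.decompress(blob).decode("utf-8"))

# A repetitive payload (many near-identical records) compresses heavily:
records = [{"account": f"ACCT-{i % 100:04d}", "type": "debit", "amount": 10.0}
           for i in range(1000)]
raw = json.dumps(records).encode("utf-8")
packed = compress_payload(records)
print(len(packed), "of", len(raw), "bytes")  # compressed size is a small fraction

assert decompress_payload(packed) == records  # lossless round trip
```

The caveat raised above is real: if the backend ever compresses payloads itself, stacking a second compression layer on top yields little benefit and adds CPU cost.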
Happy to continue the conversation, and realize this was a little scatter-brained. We can split these questions into separate places, but would also still like to understand how netherite is "intended to" (now) and "hoped to" (in the future) react to large messages.
Thanks.
Thanks, @ericleigh007, for pointing this out and giving us a repro.
The handling of large messages is definitely something we can optimize, and should, based on your experience. There is a relatively simple mechanism that I think should solve this problem (store large messages in blobs and then just send the blob address through EH). I will give this a try soon.
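The "store large messages in blobs and send only the blob address" idea is the classic claim-check pattern. Here is a minimal Python sketch of the pattern itself, with an in-memory dict standing in for blob storage; all names and the size threshold are hypothetical, and this is not the actual implementation that later shipped:

```python
import uuid

BLOB_THRESHOLD = 64 * 1024  # hypothetical cutoff; a real backend picks its own

class ClaimCheckChannel:
    """Send small payloads inline; offload large ones and send just a reference."""

    def __init__(self):
        self.blob_store = {}  # stand-in for an Azure Blob container

    def send(self, payload: bytes) -> dict:
        if len(payload) <= BLOB_THRESHOLD:
            return {"inline": payload}          # small: goes straight through EH
        blob_id = str(uuid.uuid4())
        self.blob_store[blob_id] = payload      # large: "upload" to blob storage
        return {"blob_ref": blob_id}            # only a tiny reference is sent

    def receive(self, message: dict) -> bytes:
        if "inline" in message:
            return message["inline"]
        return self.blob_store[message["blob_ref"]]  # "download" on the receiver

channel = ClaimCheckChannel()
small = channel.send(b"x" * 100)
large = channel.send(b"y" * 200_000)
assert "inline" in small and "blob_ref" in large
```

The payoff is that Event Hubs only ever carries small messages, so large payloads no longer block small ones in the same partition, at the cost of an extra blob round trip for large messages.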
The latest release 1.4.0 now addresses this issue. It contains a blob-batching optimization #275 that improves performance for cases where partitions or clients transmit medium to large amounts of data (in terms of either total size or number of messages).
Apologies that I have not been able to test this one. As usual, life and other development blocked any progress.
I still plan to do a side-by-side and report back, but it could be a week or more.
Hi @ericleigh007, I've seen several issues you created regarding Netherite - thanks for your interest in this new feature! We'd love to improve it further and were wondering if it's possible to schedule a quick chat with you to learn about your experience, usage scenario, and feedback. If yes, please share your email via a LinkedIn message. (If you don't have LinkedIn, we can figure out another way.) Thanks again!