[router] Improve HAR routing in Router #1414

gaojieliu · 2024-12-20T22:48:59Z

Update Venice Router HAR least-loaded algo to take group latency into account and here
are the reasons:
a. When the total qps is relatively low, the pending request count based load rebalancing
algo doesn't work well as most of the time, the pending count is 0. One example: let us
say, the avg latency of one group is 10ms, and if the group qps is 10, that means very likely
only 10-20% of the time, there are some pending requests and even another group latency is
much lower: 1-2ms, the faster group won't be selected, and the latency-based LB algo will kick
in when the above case happens to prefer the faster group if the pending count is equal among all
the groups.
b. Completely getting rid of pending request count based LB will result in another issue as the relatively
slower group will get almost zero request.
c. A combination of pending request and latency will select the faster groups when all groups are busy or idle,
and it will still send some amount of requests to the slower groups in case it is too idle and it will help
bring back the slower group into the rotation when it is recovered from the slowness.

How was this PR tested?

CI

Does this PR introduce any user-facing changes?

No. You can skip the rest of this section.
Yes. Make sure to explain your proposed changes and call out the behavior change.

Update Venice Router HAR least-loaded algo to take group latency into account and here are the reasons: a. When the total qps is relatively low, the pending request count based load rebalancing algo doesn't work well as most of the time, the pending count is 0. One example: let us say, the avg latency of one group is 10ms, and if the group qps is 10, that means very likely only 10-20% of the time, there are some pending requests and even another group latency is much lower: 1-2ms, the faster group won't be selected, and the latency-based LB algo will kick in when the above case happens to prefer the faster group if the pending count is equal among all the groups. b. Completely getting rid of pending request count based LB will result in another issue as the relatively slower group will get almost zero request. c. A combination of pending request and latency will select the faster groups when all groups are busy or idle, and it will still send some amount of requests to the slower groups in case it is too idle and it will help bring back the slower group into the rotation when it is recovered from the slowness.

gaojieliu force-pushed the Router_HAR_enhancement branch from d82217c to f84c662 Compare January 2, 2025 19:48

gaojieliu changed the title ~~[router][server] Fixed the conn metric in server and improve HAR routing in Router~~ [router] Improve HAR routing in Router Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[router] Improve HAR routing in Router #1414

[router] Improve HAR routing in Router #1414

gaojieliu commented Dec 20, 2024 •

edited

Loading

[router] Improve HAR routing in Router #1414

Are you sure you want to change the base?

[router] Improve HAR routing in Router #1414

Conversation

gaojieliu commented Dec 20, 2024 • edited Loading

How was this PR tested?

Does this PR introduce any user-facing changes?

gaojieliu commented Dec 20, 2024 •

edited

Loading