Reconnect after all nodes were offline #375

harunzengin · 2024-12-06T12:14:46Z

Fixes the reconnection issue that occurs after all nodes have been down.

Sends a :host_up event whenever a new control connection is established so that Xandra.Cluster.Pool can start a pool and connect to that host.

Also fixes the issue where the load balancing policy state diverges from the peers state.

Closes #373.
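For readers who want the idea in isolation, here is a minimal sketch of the :host_up flow described above. It is not the code from this PR: the module name `HostUpSketch`, its struct fields, and the `:started` marker are all hypothetical stand-ins, and pools are modeled as plain map entries rather than processes.

```elixir
defmodule HostUpSketch do
  # Hypothetical stand-in for the cluster pool's state; not Xandra's real struct.
  # `peers` maps peername => :up | :down, `pools` maps peername => :started.
  defstruct peers: %{}, pools: %{}

  # Assumed hook: called once a control connection to `peername` is established.
  # It simply emits a :host_up event, which is the behavior this PR describes.
  def control_connection_established(%__MODULE__{} = data, peername) do
    handle_event(data, {:host_up, peername})
  end

  # On :host_up, mark the peer as up and make sure it has a pool.
  def handle_event(%__MODULE__{} = data, {:host_up, peername}) do
    data = %{data | peers: Map.put(data.peers, peername, :up)}
    maybe_start_pool(data, peername)
  end

  # On :host_down, mark the peer as down and drop its pool.
  def handle_event(%__MODULE__{} = data, {:host_down, peername}) do
    %{
      data
      | peers: Map.put(data.peers, peername, :down),
        pools: Map.delete(data.pools, peername)
    }
  end

  # Starting a pool is idempotent here: an existing pool is left alone.
  defp maybe_start_pool(data, peername) do
    if Map.has_key?(data.pools, peername) do
      data
    else
      %{data | pools: Map.put(data.pools, peername, :started)}
    end
  end
end
```

In this sketch, once every peer has received `{:host_down, peer}` there are no pools left, and nothing would bring one back unless some event arrives. Calling `HostUpSketch.control_connection_established(data, peer)` after the control connection reconnects is what restores a pool for that peer.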

lib/xandra/cluster/control_connection.ex

lib/xandra/cluster/pool.ex

whatyouhide · 2024-12-09T10:21:09Z

lib/xandra/cluster/pool.ex

@@ -327,8 +317,7 @@ defmodule Xandra.Cluster.Pool do
      )
      when is_peername(peername) do
    # Not connected anymore, but we're not really sure if the whole host is down.
-    data = put_in(data.peers[peername].status, :up)
-    data = stop_pool(data, data.peers[peername].host)


Wait, why were we doing this, and why are we not doing it anymore?

Good catch, we should also call maybe_start_pools after stop_pool.
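A rough sketch of that ordering, under my own assumptions rather than the actual pool.ex code: the module `RefillSketch`, the `:connected` status, and the selection by sorted peername are hypothetical, and `target_pools` here is just a struct field standing in for however the desired pool count is configured. The point it illustrates is that stopping a pool leaves the cluster one pool short, so a refill step right after it can bring the count back up.

```elixir
defmodule RefillSketch do
  # Hypothetical state: `peers` maps peername => :up | :down | :connected,
  # `target_pools` is how many pools we would like running at once.
  defstruct peers: %{}, target_pools: 2

  # The connection to `peername` was lost: stop its pool, then try to refill.
  def handle_disconnected(%__MODULE__{} = data, peername) do
    data
    |> stop_pool(peername)
    |> maybe_start_pools()
  end

  # The pool is gone, but the host itself is not known to be down, so the
  # peer goes back to :up (mirroring the comment in the diff above).
  defp stop_pool(data, peername) do
    %{data | peers: Map.put(data.peers, peername, :up)}
  end

  # Start pools on :up peers until `target_pools` peers are connected.
  defp maybe_start_pools(data) do
    connected = Enum.count(data.peers, fn {_peer, status} -> status == :connected end)
    missing = max(data.target_pools - connected, 0)

    candidates =
      data.peers
      |> Enum.filter(fn {_peer, status} -> status == :up end)
      |> Enum.map(fn {peer, _status} -> peer end)
      |> Enum.sort()
      |> Enum.take(missing)

    peers =
      Enum.reduce(candidates, data.peers, fn peer, acc ->
        Map.put(acc, peer, :connected)
      end)

    %{data | peers: peers}
  end
end
```

Without the `maybe_start_pools/1` step, `handle_disconnected/2` in this sketch would leave the number of connected peers below `target_pools` until some unrelated event triggered a refill.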

Co-authored-by: Andrea Leopardi <an.leopardi@gmail.com>

harunzengin · 2024-12-10T09:47:10Z

@whatyouhide This should be ready

whatyouhide

Last comment and we're good to go 🎉

lib/xandra/cluster/pool.ex

harunzengin · 2024-12-11T11:15:22Z

@whatyouhide Done

whatyouhide · 2024-12-11T11:28:43Z

Fantastic work @harunzengin 💟

harunzengin added 5 commits November 27, 2024 12:14

Send :host_up event after establishing control connection

a85df9a

Make sure load balancing policy state matches peers state

0a9a12f

Remove leftover debugging code

1e88f8a

check load balancing state in tests

e9b6b58

format

d1bf16c

harunzengin changed the title ~~Fix connectivity after all nodes offline~~ Reconnect after all nodes offline Dec 6, 2024

harunzengin changed the title ~~Reconnect after all nodes offline~~ Reconnect after all nodes were offline Dec 6, 2024

harunzengin mentioned this pull request Dec 6, 2024

Xandra can't recover when all nodes are down #373

Closed

whatyouhide reviewed Dec 9, 2024

harunzengin and others added 2 commits December 9, 2024 12:22

Refactor set_host_status/3

a705af1

Co-authored-by: Andrea Leopardi <an.leopardi@gmail.com>

Make sure to stop disconnected pool

579ea7d

whatyouhide reviewed Dec 11, 2024

add comments for why we're maybe_start_pools/1 after stop_pool/1

344ec94

whatyouhide merged commit 5ef30e9 into whatyouhide:main Dec 11, 2024
5 checks passed
