MDEV-30232: rpl.rpl_gtid_crash fails sporadically in BB

mirror of https://github.com/MariaDB/server.git synced 2026-05-17 20:37:12 +02:00

The root cause of the failure is a bug in the Linux network stack:

  https://lore.kernel.org/netdev/87sf0ldk41.fsf@urd.knielsen-hq.org/T/#u

If the slave does a connect(2) at the exact same time that kill -9 of the
master process closes the listening socket, the FIN or RST packet is lost in
the kernel, and the slave ends up timing out waiting for the initial
communication from the server. This timeout defaults to
--slave-net-timeout=120, which causes include/master_gtid_wait.inc to time
out first and fail the test.
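
The effect on the slave side can be sketched with plain sockets. This is only an illustration, not the MariaDB client code: a listener that never sends anything stands in for the lost FIN/RST, and the hypothetical helper `wait_for_greeting` with its `net_timeout` parameter plays the role of --slave-net-timeout. The client connects successfully, then blocks for the full timeout waiting for the server's first bytes.

```python
import socket
import time

def wait_for_greeting(host, port, net_timeout):
    """Connect, then wait up to net_timeout seconds for the server's first byte.

    Hypothetical helper for illustration; net_timeout stands in for
    --slave-net-timeout.
    """
    with socket.create_connection((host, port), timeout=net_timeout) as conn:
        conn.settimeout(net_timeout)
        start = time.monotonic()
        try:
            data = conn.recv(1)
            outcome = "data" if data else "closed"
        except socket.timeout:
            # Nothing arrived: neither a greeting nor a FIN/RST.
            outcome = "timeout"
        return outcome, time.monotonic() - start

# A listener that never accepts or sends. The kernel completes the TCP
# handshake from the backlog, so connect() succeeds but no data follows.
listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
listener.bind(("127.0.0.1", 0))
listener.listen(1)
port = listener.getsockname()[1]

outcome, waited = wait_for_greeting("127.0.0.1", port, net_timeout=0.5)
print(outcome, round(waited, 1))
listener.close()
```

With a 120 second timeout the same wait would outlast the test's own wait, which is why a shorter value lets the slave give up and reconnect in time.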

Work around this problem by reducing the --slave-net-timeout for this test
case. If this problem turns up in other tests, we can consider reducing the
default value for all tests.

Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

Kristian Nielsen

2024-04-16 10:08:31 +02:00

parent 4a2e03453a

commit 0c249ad718

1 changed file with 1 addition and 1 deletion

mysql-test/suite/rpl/t/rpl_gtid_crash-slave.opt

@@ -1 +1 @@
---master-retry-count=100
+--master-retry-count=100 --slave-net-timeout=10