mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-31 19:11:46 +01:00

Author	SHA1	Message	Date
Monty	4bad74e139	Added error checking for all calls to flush_relay_log_info() and stmt_done()	2017-02-28 16:10:47 +01:00
Monty	c5e25c8b40	Added a separate lock for start/stop/reset slave. This solves some possible dead locks when one calls stop slave while slave is starting.	2017-02-28 16:10:46 +01:00
Monty	e65f667bb6	MDEV-9573 'Stop slave' hangs on replication slave The reason for this is that stop slave takes LOCK_active_mi over the whole operation while some slave operations will also need LOCK_active_mi which causes deadlocks. Fixed by introducing object counting for Master_info and not taking LOCK_active_mi over stop slave or even stop_all_slaves() Another benefit of this approach is that it allows: - Multiple threads can run SHOW SLAVE STATUS at the same time - START/STOP/RESET/SLAVE STATUS on a slave will not block other slaves - Simpler interface for handling get_master_info() - Added some missing unlock of 'log_lock' in error condtions - Moved rpl_parallel_inactivate_pool(&global_rpl_thread_pool) to end of stop_slave() to not have to use LOCK_active_mi inside terminate_slave_threads() - Changed argument for remove_master_info() to Master_info, as we always have this available - Fixed core dump when doing FLUSH TABLES WITH READ LOCK and parallel replication. Problem was that waiting for pause_for_ftwrl was not done when deleting rpt->current_owner after a force_abort.	2017-02-28 16:10:46 +01:00
Sujatha Sivakumar	e619295e1b	Bug#24901077: RESET SLAVE ALL DOES NOT ALWAYS RESET SLAVE Description: ============ If you have a relay log index file that has ended up with some relay log files that do not exists, then RESET SLAVE ALL is not enough to get back to a clean state. Analysis: ========= In the bug scenario slave server is in stopped state and some of the relay logs got deleted but the relay log index file is not updated. During slave server restart replication initialization fails as some of the required relay logs are missing. User executes RESET SLAVE/RESET SLAVE ALL command to start a clean slave. As per the documentation RESET SLAVE command clears the master info and relay log info repositories, deletes all the relay log files, and starts a new relay log file. But in a scenario where the slave server's Relay_log_info object is not initialized slave will not purge the existing relay logs. Hence the index file still remains in a bad state. Users will not be able to start the slave unless these files are cleared. Fix: === RESET SLAVE/RESET SLAVE ALL commands should do the cleanup even in a scenario where Relay_log_info object initialization failed. Backported a flag named 'error_on_rli_init_info' which is required to identify slave's Relay_log_info object initialization failure. This flag exists in MySQL-5.6 onwards as part of BUG#14021292 fix. During RESET SLAVE/RESET SLAVE ALL execution this flag indicates the Relay_log_info initialization failure. In such a case open the relay log index/relay log files and do the required clean up.	2017-02-28 10:00:51 +05:30
Nirbhay Choubey	ee8b5c305a	Merge tag 'mariadb-10.0.29' into 10.0-galera	2017-01-13 13:53:59 -05:00
Marko Mäkelä	5044dae239	Merge 10.0 into 10.1	2017-01-10 14:30:11 +02:00
Kristian Nielsen	43378f367c	MDEV-10271: Stopped SQL slave thread doesn't print a message to error log like IO thread does Make the slave SQL thread always output to the error log the message "Slave SQL thread exiting, replication stopped in ..." whenever it previously outputted "Slave SQL thread initialized, starting replication ...". Before this patch, it was somewhat inconsistent in which cases the message would be output and in which not, depending on the exact time and cause of the condition that caused the SQL thread to stop.	2017-01-06 10:46:20 +01:00
Sergei Golubchik	4a5d25c338	Merge branch '10.1' into 10.2	2016-12-29 13:23:18 +01:00
Sergei Golubchik	2f20d297f8	Merge branch '10.0' into 10.1	2016-12-11 09:53:42 +01:00
kevg	780db8e252	fix build and some warnings	2016-11-24 17:36:02 +03:00
Kristian Nielsen	390f2a013b	Fix incorrect reading of events from relaylog in parallel replication. The SQL thread keeps track of the position in the current relay log from which to read the next event. This position is not normally used, but a certain interaction with the IO thread can cause the SQL thread to re-open the relay log and seek to the stored position. In parallel replication, there were a couple of places where the position was not updated. This created a race where a re-open of the relay log could seek to the wrong position and start re-reading and processing events already handled once, causing various kinds of problems. Fix this by moving the position update into a single place in apply_event_and_update_pos(), which should ensure that the position is always updated in the parallel replication case. This problem was found from the testcase of MDEV-10863, but it is logically a separate problem.	2016-11-16 11:00:38 +01:00
Kristian Nielsen	f1fcc1fc10	Back-port Master_info::using_parallel() to 10.0. This has no functional changes, but it helps avoid merge problems from 10.0 to 10.1. In 10.0, code that checks for parallel replication uses opt_slave_parallel_threads > 0, but this check needs to be mi->using_parallel() in 10.1. By using the same check in 10.0 (with unchanged semantics), merge problems to 10.1 are avoided.	2016-11-15 23:00:11 +01:00
Kristian Nielsen	bccd0b5e0e	Merge branch 'mdev10863' into 10.1	2016-11-15 13:10:21 +01:00
Kristian Nielsen	717f212840	MDEV-10863: parallel replication tries to continue from wrong position This occured when the SQL thread (but not the IO thread) stops while GTID and parallel replication are used with multiple domain ids in the GTID position, and is restarted. In this case, the SQL needs to start some way back in the relay log, applying or skipping events within each replication domain as appropriate. The SQL threads starts at the beginning of an old relay log file, and this position may be in the middle of an event group. The bug was that such partial event group could be re-applied, causing replication corruption. This patch fixes the issue, by making sure to skip any initial events that were part of an earlier (already applied) event group.	2016-11-04 12:33:42 +01:00
Kristian Nielsen	b002509b67	MDEV-11065: Compressed binary log. Merge code into current 10.2. Conflicts: sql/share/errmsg-utf8.txt	2016-11-03 14:48:51 +01:00
Sergei Golubchik	a98c85bb50	Merge branch '10.0-galera' into 10.1	2016-11-02 13:44:07 +01:00
vinchen	0e380c3bfe	two fix: 1.Avoid overflowing buffers in case of corrupt events 2.Check the compressed algorithm.	2016-10-29 21:59:20 +08:00
Nirbhay Choubey	5db2195a35	Merge tag 'mariadb-10.0.28' into 10.0-galera	2016-10-28 15:50:13 -04:00
Sergei Golubchik	22490a0d70	MDEV-8345 STOP SLAVE should not cause an ERROR to be logged to the error log cherry-pick from 5.7: commit 6b24763 Author: Manish Kumar <manish.4.kumar@oracle.com> Date: Tue Mar 27 13:10:42 2012 +0530 BUG#12977988 - ON STOP SLAVE: ERROR READING PACKET FROM SERVER: LOST CONNECTION TO MYSQL SERVER BUG#11761457 - ERROR 2013 + "ERROR READING RELAY LOG EVENT" ON STOP SLAVEBUG#12977988 - ON STOP SLAVE: ERROR READING PACKET FROM SERVER: LOST CONNECTION TO MYSQL SERVER	2016-10-26 18:44:34 +02:00
vinchen	07f09df92b	fix the ABI and stop slave hang problem	2016-10-21 13:37:48 +02:00
Kristian Nielsen	c06bc66816	MDEV-11065: Compressed binary log Minor review comments/changes: - A bunch of style-fixes. - Change macros to static inline functions. - Update check_event_type() with compressed event types. - Small .result file update.	2016-10-20 18:00:59 +02:00
vinchen	d4b2c9bb1a	optimize the memory allocation for compressed binlog event	2016-10-19 20:20:47 +02:00
vinchen	640051e06a	Binlog compressed Add some event types for the compressed event, there are: QUERY_COMPRESSED_EVENT, WRITE_ROWS_COMPRESSED_EVENT_V1, UPDATE_ROWS_COMPRESSED_EVENT_V1, DELETE_POWS_COMPRESSED_EVENT_V1, WRITE_ROWS_COMPRESSED_EVENT, UPDATE_ROWS_COMPRESSED_EVENT, DELETE_POWS_COMPRESSED_EVENT. These events inheritance the uncompressed editor events. One of their constructor functions and write function have been overridden for uncompressing and compressing. Anything but this is totally the same. On slave, The IO thread will uncompress and convert them When it receiving the events from the master. So the SQL and worker threads can be stay unchanged. Now we use zlib as compress algorithm. It maybe support other algorithm in the future.	2016-10-19 20:20:35 +02:00
vinchen	0fa39ffba7	fix code style..	2016-10-19 13:52:17 +02:00
vinchen	c334f4fe46	fix the code style for read_binlog_speed_limit	2016-10-19 13:52:17 +02:00
vinchen	43789901c7	Control the binlog read speed for compressed protocol	2016-10-19 13:51:08 +02:00
vinchen	8eb0f5ca1a	Control the Maximum speed(KB/s) to read binlog from master	2016-10-19 13:51:08 +02:00
Kristian Nielsen	e1ef99c3dc	MDEV-7145: Delayed replication Merge feature into 10.2 from feature branch. Delayed replication adds an option CHANGE MASTER TO master_delay=<seconds> Replication will then delay applying events with that many seconds. This creates a replication slave that reflects the state of the master some time in the past. Feature is ported from MySQL source tree. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-16 23:44:44 +02:00
Kristian Nielsen	3011060b2a	MDEV-7145: Delayed slave. Extend to work also for parallel replication. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 23:15:59 +02:00
Kristian Nielsen	814880711f	BUG#56442: Slave executes delayed statements when STOP SLAVE is issued Problem: When using the delayed slave feature, and the SQL thread is delaying, and the user issues STOP SLAVE, the event we wait for was executed. It should not be executed. Fix: Check the return value from the delay function, slave.cc:slave_sleep(). If the return value is 1, it means the thread has been stopped, in this case we don't execute the statement. Also, refactored the test case for delayed slave a little: added the test script include/rpl_assert.inc, which asserts that a condition holds and prints a message if not. Made rpl_delayed_slave.test use this. The advantage is that the test file is much easier to read and maintain, because it is clear what is an assertion and what is not, and also the expected result can be found in the test file, you don't have to compare it to the result file. Manually merged into MariaDB from MySQL commit fd2b210383358fe7697f201e19ac9779879ba72a Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 23:15:59 +02:00
Kristian Nielsen	b2bc6dadee	MDEV-7145: Delayed replication, cleanup some code The original MySQL patch left some refactoring todo's, possibly because of known conflicts with other parallel development (like info-repository feature perhaps). This patch fixes those todos/refactorings. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 23:15:59 +02:00
Kristian Nielsen	a9fb480fd6	MDEV-7145: Delayed replication, fixing test failures. Two merge error fixed, and testsuite updated to removed some other test failues. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 23:15:58 +02:00
Kristian Nielsen	19abe79fd1	MDEV-7145: Delayed replication, intermediate commit. Initial merge of delayed replication from MySQL git. The code from the initial push into MySQL is merged, and the associated test case passes. A number of tasks are still pending: 1. Check full test suite run for any regressions or .result file updates. 2. Extend the feature to also work for parallel replication. 3. There are some todo-comments about future refactoring left from MySQL, these should be located and merged on top. 4. There are some later related MySQL commits, these should be checked and merged. These include: e134b9362ba0b750d6ac1b444780019622d14aa5 b38f0f7857c073edfcc0a64675b7f7ede04be00f fd2b210383358fe7697f201e19ac9779879ba72a afc397376ec50e96b2918ee64e48baf4dda0d37d 5. The testcase from MySQL relies heavily on sleep and timing for testing, and seems likely to sporadically fail on heavily loaded test servers in buildbot or distro build farms. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 23:15:58 +02:00
Kristian Nielsen	50f19ca809	Remove unnecessary global mutex in parallel replication. The function apply_event_and_update_pos() is called with the rli->data_lock mutex held. However, there seems to be nothing in the function actually needing the mutex to be held. Certainly not in the parallel replication case, where sql_slave_skip_counter is always 0 since the non-zero case is handled by the SQL driver thread. So this patch makes parallel replication use a variant of apply_event_and_update_pos() without the need to take the rli->data_lock mutex. This avoids one contended global mutex for each event executed, which might improve performance on CPU-bound workloads somewhat. Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2016-10-14 22:44:40 +02:00
Sergei Golubchik	ec59220f2c	post-merge fixes for `ec47bea`	2016-09-12 13:54:44 +02:00
Kristian Nielsen	ec47beaba6	Merge parallel replication async deadlock kill into 10.2. Conflicts: sql/mysqld.cc sql/slave.cc	2016-09-09 12:15:53 +02:00
Sergei Golubchik	06b7fce9f2	Merge branch '10.1' into 10.2	2016-09-09 08:33:08 +02:00
Kristian Nielsen	7e0c9de864	Parallel replication async deadlock kill When a deadlock kill is detected inside the storage engine, the kill is not done immediately, to avoid calling back into the storage engine kill_query method with various lock subsystem mutexes held. Instead the kill is queued and done later by a slave background thread. This patch in preparation for fixing TokuDB optimistic parallel replication, as well as for removing locking hacks in InnoDB/XtraDB in 10.2. Signed-off-by: Kristian Nielsen <knielsen at knielsen-hq.org>	2016-09-08 15:25:40 +02:00
Monty	96e95b5465	Better SHOW PROCESSLIST for replication - When waiting for events, start time is now counted from start of wait - Instead of having "Connect" as "Command" for all replication threads we now have: - Slave_IO for Slave thread reading relay log - Slave_SQL for slave executing SQL commands or distribution queries to Slave workers - Slave_worker for slave threads executin SQL commands in parallel replication	2016-08-29 13:10:17 +03:00
Sergei Golubchik	6b1863b830	Merge branch '10.0' into 10.1	2016-08-25 12:40:09 +02:00
Nirbhay Choubey	c309e99ff9	Merge branch '10.0' into 10.0-galera	2016-08-24 19:30:32 -04:00
Vicențiu Ciorbaru	4eb898bb16	MDEV-10563 Crash during shutdown in Master_info_index::any_slave_sql_running In well defined C code, the "this" pointer is never NULL. Currently, we were potentially dereferencing a NULL pointer (master_info_index). GCC v6 removes any "if (!this)" conditions as it assumes this is always a non-null pointer. In order to prevent undefined behaviour, check the pointer before dereferencing and remove the check within member functions.	2016-08-23 21:24:36 +03:00
Monty	8d5a0d650b	Cleanups and minor fixes - Fixed typos - Added --core-on-failure to mysql-test-run - More DBUG_PRINT in viosocket.c - Don't forget CLIENT_REMEMBER_OPTIONS for compressed slave protocol - Removed not used stage variables	2016-08-21 20:14:13 +03:00
Vladislav Vaintroub	31a8cf54c8	Revert "MDEV-9293 Connector/C integration" This reverts commit `7b89b9f510`.	2016-08-19 15:46:27 +00:00
Vladislav Vaintroub	7b89b9f510	MDEV-9293 Connector/C integration	2016-08-19 15:27:37 +00:00
Oleksandr Byelkin	66ac894c40	MDEV-10455: libmariadbclient18 + MySQL-python leaks memory on failed connections Support of CLIENT_REMEMBER_OPTIONS and freeing options added.	2016-08-11 17:50:21 +02:00
Kristian Nielsen	fb076581f6	MDEV-10271: Stopped SQL slave thread doesn't print a message to error log like IO thread does Make the slave SQL thread always output to the error log the message "Slave SQL thread exiting, replication stopped in ..." whenever it previously outputted "Slave SQL thread initialized, starting replication ...". Before this patch, it was somewhat inconsistent in which cases the message would be output and in which not, depending on the exact time and cause of the condition that caused the SQL thread to stop.	2016-07-25 13:07:50 +02:00
Sergei Golubchik	932646b1ff	Merge branch '10.1' into 10.2	2016-06-30 16:38:05 +02:00
Alexander Barkov	3f32bf627f	More tests for "MDEV-7563 Support CHECK constraint". Testing non-ASCII string literals.	2016-06-30 11:43:02 +02:00
Sergei Golubchik	62e0a4552f	Merge branch '10.0-galera' into 10.1	2016-06-28 22:06:22 +02:00

1 2 3 4 5 ...

2449 commits