mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-26 08:44:33 +01:00

Author	SHA1	Message	Date
unknown	bd4153a8c2	MDEV-5262, MDEV-5914, MDEV-5941, MDEV-6020: Deadlocks during parallel replication causing replication to fail. Remove the temporary fix for MDEV-5914, which used READ COMMITTED for parallel replication worker threads. Replace it with a better, more selective solution. The issue is with certain edge cases of InnoDB gap locks, for example between INSERT and ranged DELETE. It is possible for the gap lock set by the DELETE to block the INSERT, if the DELETE runs first, while the record lock set by INSERT does not block the DELETE, if the INSERT runs first. This can cause a conflict between the two in parallel replication on the slave even though they ran without conflicts on the master. With this patch, InnoDB will ask the server layer about the two involved transactions before blocking on a gap lock. If the server layer tells InnoDB that the transactions are already fixed wrt. commit order, as they are in parallel replication, InnoDB will ignore the gap lock and allow the two transactions to proceed in parallel, avoiding the conflict. Improve the fix for MDEV-6020. When InnoDB itself detects a deadlock, it now asks the server layer for any preferences about which transaction to roll back. In case of parallel replication with two transactions T1 and T2 fixed to commit T1 before T2, the server layer will ask InnoDB to roll back T2 as the deadlock victim, not T1. This helps in some cases to avoid excessive deadlock rollback, as T2 will in any case need to wait for T1 to complete before it can itself commit. Also some misc. fixes found during development and testing: - Remove thd_rpl_is_parallel(), it is not used or needed. - Use KILL_CONNECTION instead of KILL_QUERY when a parallel replication worker thread is killed to resolve a deadlock with fixed commit ordering. There are some cases, eg. in sql/sql_parse.cc, where a KILL_QUERY can be ignored if the query otherwise completed successfully, and this could cause the deadlock kill to be lost, so that the deadlock was not correctly resolved. - Fix random test failure due to missing wait_for_binlog_checkpoint.inc. - Make sure that deadlock or other temporary errors during parallel replication are not printed to the the error log; there were some places around the replication code with extra error logging. These conditions can occur occasionally and are handled automatically without breaking replication, so they should not pollute the error log. - Fix handling of rgi->gtid_sub_id. We need to be able to access this also at the end of a transaction, to be able to detect and resolve deadlocks due to commit ordering. But this value was also used as a flag to mark whether record_gtid() had been called, by being set to zero, losing the value. Now, introduce a separate flag rgi->gtid_pending, so rgi->gtid_sub_id remains valid for the entire duration of the transaction. - Fix one place where the code to handle ignored errors called reset_killed() unconditionally, even if no error was caught that should be ignored. This could cause loss of a deadlock kill signal, breaking deadlock detection and resolution. - Fix a couple of missing mysql_reset_thd_for_next_command(). This could cause a prior error condition to remain for the next event executed, causing assertions about errors already being set and possibly giving incorrect error handling for following event executions. - Fix code that cleared thd->rgi_slave in the parallel replication worker threads after each event execution; this caused the deadlock detection and handling code to not be able to correctly process the associated transactions as belonging to replication worker threads. - Remove useless error code in slave_background_kill_request(). - Fix bug where wfc->wakeup_error was not cleared at wait_for_commit::unregister_wait_for_prior_commit(). This could cause the error condition to wrongly propagate to a later wait_for_prior_commit(), causing spurious ER_PRIOR_COMMIT_FAILED errors. - Do not put the binlog background thread into the processlist. It causes too many result differences in mtr, but also it probably is not useful for users to pollute the process list with a system thread that does not really perform any user-visible tasks...	2014-06-10 10:13:15 +02:00
unknown	629b822913	MDEV-5262, MDEV-5914, MDEV-5941, MDEV-6020: Deadlocks during parallel replication causing replication to fail. In parallel replication, we run transactions from the master in parallel, but force them to commit in the same order they did on the master. If we force T1 to commit before T2, but T2 holds eg. a row lock that is needed by T1, we get a deadlock when T2 waits until T1 has committed. Usually, we do not run T1 and T2 in parallel if there is a chance that they can have conflicting locks like this, but there are certain edge cases where it can occasionally happen (eg. MDEV-5914, MDEV-5941, MDEV-6020). The bug was that this would cause replication to hang, eventually getting a lock timeout and causing the slave to stop with error. With this patch, InnoDB will report back to the upper layer whenever a transactions T1 is about to do a lock wait on T2. If T1 and T2 are parallel replication transactions, and T2 needs to commit later than T1, we can thus detect the deadlock; we then kill T2, setting a flag that causes it to catch the kill and convert it to a deadlock error; this error will then cause T2 to roll back and release its locks (so that T1 can commit), and later T2 will be re-tried and eventually also committed. The kill happens asynchroneously in a slave background thread; this is necessary, as the reporting from InnoDB about lock waits happen deep inside the locking code, at a point where it is not possible to directly call THD::awake() due to mutexes held. Deadlock is assumed to be (very) rarely occuring, so this patch tries to minimise the performance impact on the normal case where no deadlocks occur, rather than optimise the handling of the occasional deadlock. Also fix transaction retry due to deadlock when it happens after a transaction already signalled to later transactions that it started to commit. In this case we need to undo this signalling (and later redo it when we commit again during retry), so following transactions will not start too early. Also add a missing thd->send_kill_message() that got triggered during testing (this corrects an incorrect fix for MySQL Bug#58933).	2014-06-03 10:31:11 +02:00
Jan Lindström	d2098b964d	MDEV-6318: MariaDB with XtraDB uses times more of IO events than with InnoDB plugin Fix: os0file.h in XtraDB had OS_AIO_N_PENDING_IOS_PER_THREAD 256 when on InnoDB it is OS_AIO_N_PENDING_IOS_PER_THREAD 32. Changed XtraDB also to use 32.	2014-07-04 08:09:27 +03:00
Jan Lindström	6bd2f900b2	MDEV-6288: Innodb causes server crash after disk full, then can't ALTER TABLE any more.	2014-07-03 14:55:03 +03:00
Sergey Vojtovich	23a5b2eb6d	MDEV-6103 - Adding/removing non-materialized virtual column triggers table recreation Relaxed InnoDB/XtraDB checks to allow online add/drop of non-materialized virtual columns.	2014-06-03 16:57:29 +04:00
Jan Lindström	dabf471547	MDEV-4791: Assertion range_end >= range_start fails in log0online.c on select from I_S.INNODB_CHANGED_PAGES Analysis: limit_lsn_range_from_condition() incorrectly parses start_lsn and/or end_lsn conditions. Fix from SergeyP. Added some test cases.	2014-05-09 11:43:53 +03:00
Sergei Golubchik	99027efd14	post-fix for the merge of "Bug#16216513 INPLACE ALTER DISABLED FOR PARTITIONED TABLES" make this innodb-only patch work for other engines as well	2014-05-08 10:25:09 +02:00
Sergei Golubchik	9927b36e87	merge of "Bug#16216513 INPLACE ALTER DISABLED FOR PARTITIONED TABLES" revno: 4777 committer: Marko Mäkelä <marko.makela@oracle.com> branch nick: mysql-5.6 timestamp: Fri 2013-02-15 10:32:25 +0200 message: Bug#16216513 INPLACE ALTER DISABLED FOR PARTITIONED TABLES	2014-05-08 10:01:31 +02:00
Sergei Golubchik	a2807e41e8	xtradb 5.6.17-65.0	2014-05-07 17:33:33 +02:00
Sergei Golubchik	b968363aac	MDEV-6184 10.0.11 merge XtraDB 5.6.16-64.2	2014-05-06 10:21:34 +02:00
Sergei Golubchik	dc23a9501a	Solaris compilation failure: xtradb is linked in statically, ha_innodb.so needs the linker script.	2014-05-01 14:05:52 +02:00
Sergei Golubchik	948056c535	MDEV-5787 Server crashes in in row_mysql_convert_row_to_innobase on CREATE .. SELECT XtraDB: don't accept MYSQL_TYPE_NULL as a column type	2014-03-19 09:57:57 +01:00
Jan Lindström	cdf6d3ec04	MDEV-5949: Performance of XtraDB slows down significantly on long benchmarks when compressed tables are used. Analysis: Number of flushed pages is incorrectly calculated at buf_do_LRU_batch. This leads to problem when utility function flushes dirty blocks from the end of the flush list of all buffer pool instances in a loop until enough pages are flushed or time limit is reached. As number of flushed pages is incorrectly calculated, the loop mostly try to flush until time limit is reached because the number of pages limit is not reached. Fix: Fix the calculation of flushed pages (very short). This fix was provided by Alexey Stroganov (Percona).	2014-03-26 15:17:12 +02:00
unknown	b352969118	MDEV-5914: Parallel replication deadlock due to InnoDB lock conflicts Due to how gap locks work, two transactions could group commit together on the master, but get lock conflicts and then deadlock due to different thread scheduling order on slave. For now, remove these deadlocks by running the parallel slave in READ COMMITTED mode. And let InnoDB/XtraDB allow statement-based binlogging for the parallel slave in READ COMMITTED. We are also investigating a different solution long-term, which is based on relaxing the gap locks only between the transactions running in parallel for one slave, but not against possibly external transactions.	2014-03-21 13:30:55 +01:00
Jan Lindström	affe1731a1	MDEV-5830: Assertion failure mutex_get_waiters(mutex) == 0 at shutdown. Analysis: XtraDB merge regression, at the end of mutex_spin_wait before goto mutex_loop there is missing if (prio_mutex) { os_atomic_decrement_ulint(&prio_mutex->high_priority_waiters, 1); } Hence we get unbalanced waiter count. Thanks to Laurynas Biveinis for finding this.	2014-03-21 08:39:04 +02:00
Jan Lindström	8250824a12	Remove assertions now that the actual bug has been repeated.	2014-03-20 09:32:37 +02:00
Jan Lindström	a60f227c04	Better to use ut_ad macro.	2014-03-19 19:35:42 +02:00
Jan Lindström	a092a40334	MDEV-5830: Assertion failure mutex_get_waiters(mutex) == 0 at shutdown.	2014-03-19 17:23:38 +02:00
Jan Lindström	f1ca1f37c9	MDEV-5878: Failing assertion: mutex_own(mutex) with innodb_use_fallocate=ON. Analysis: This was merge error on file fil0fil.cc. fil_system mutex was taken twice because of this. Fix: Remove unnecessary mutex_enter and fixed the issue with slow posix_fallocate usage.	2014-03-17 15:49:41 +02:00
Sergei Golubchik	68916bcab3	workaround for xtradb on gcc 4.1.2 RHEL5/x86, gcc atomic ops only work under -march=i686	2014-03-07 17:47:47 +01:00
Sergei Golubchik	a5fdd75980	XtraDB made the default	2014-03-07 15:21:07 +01:00
Sergei Golubchik	75124c5d2b	xtradb, windows, aio: fix the bad merge	2014-03-04 22:25:34 +01:00
Sergei Golubchik	4b3cf4aa26	XtraDB compilation failures on Windows (again)	2014-02-28 21:04:58 +01:00
Sergei Golubchik	41c760b121	merge	2014-02-28 10:00:31 +01:00
Alexander Barkov	57cdc561fc	Fixing AIX compilation failires	2014-02-27 19:44:00 +04:00
Jan Lindström	11826b1bcf	Enable windows builds for XtraDB.	2014-02-27 16:41:49 +02:00
Sergei Golubchik	570c1a6fef	MDEV-5672 MariaDB 10.0.8 doesn't compile without perfschema apply the upstream patch	2014-02-27 12:25:51 +01:00
Sergei Golubchik	ac585e9ed5	Percona-Server-5.6.15-rel63.0.tar.gz merge	2014-02-26 19:21:23 +01:00
Sergei Golubchik	1b3c15f199	merge with 10.0-monty	2014-02-06 16:38:40 +01:00
Sergei Golubchik	37b8691cec	fix the fix and update test results for MDEV-4439	2014-02-06 16:27:23 +01:00
Michael Widenius	313f18be5a	Fixed errors and warnings found by buildbot mysql-test/r/lowercase_table2.result: Updated result (The change happend because we don't try to open the table anymore as part of create table) mysql-test/suite/rpl/r/create_or_replace_mix.result: Fixed result file mysql-test/suite/rpl/r/create_or_replace_row.result: Fixed result file mysql-test/suite/rpl/r/create_or_replace_statement.result: Fixed result file mysql-test/suite/rpl/t/create_or_replace.inc: Drop open temporary table mysys/my_delete.c: Added missing newline plugin/metadata_lock_info/mysql-test/metadata_lock_info/r/user_lock.result: Fixed result (Lock names was before off by one. Was corrected by my previous patch) sql/sql_select.cc: Fixed compiler warnings by adding missing casts storage/connect/ha_connect.cc: Fixed compiler warnings storage/innobase/os/os0file.cc: Fixed compiler warnings storage/xtradb/btr/btr0btr.cc: Fixed compiler warnings storage/xtradb/handler/ha_innodb.cc: removed not used function strings/ctype-uca.c: Fixed compiler warnings support-files/compiler_warnings.supp: Added suppression for warnings that are wrong or are not serious andthat we don't plan to fix.	2014-02-06 16:14:09 +02:00
Sergei Golubchik	5eb145858d	more solaris fixes. xtradb and spider.	2014-02-05 17:27:41 +01:00
Sergei Golubchik	342b098cfb	ha_xtradb.so fix for solaris, gcc 3.4.3	2014-02-04 19:29:58 +01:00
Sergei Golubchik	d929342b0f	Merge the server part of MySQL WL#5522 - InnoDB transportable tablespaces. Syntax. Server support. Test cases. InnoDB bugfixes: * don't mess around with system sprintf's, always use my_error() for errors. * don't use InnoDB internal error codes where OS error codes are expected. * don't say "file not found", when it was.	2014-02-02 10:00:36 +01:00
Sergei Golubchik	27d45e4696	MDEV-5574 Set AUTO_INCREMENT below max value of column. Update InnoDB to 5.6.14 Apply MySQL-5.6 hack for MySQL Bug#16434374 Move Aria-only HA_RTREE_INDEX from my_base.h to maria_def.h (breaks an assert in InnoDB) Fix InnoDB memory leak	2014-02-01 09:33:26 +01:00
Sergei Golubchik	348c962c49	fix xtradb I_S tables to load	2013-12-22 17:08:22 +01:00
Sergei Golubchik	ffa8c4cfcc	Percona-Server-5.6.14-rel62.0 merge support ha_innodb.so as a dynamic plugin. * remove obsolete ,innodb_plugin.rdiff files s/--plugin-load=/--plugin-load-add=/ * MYSQL_PLUGIN_IMPORT glob_hostname[] * use my_error instead of push_warning_printf(ER_DEFAULT) * don't use tdc_size and tc_size in a module update test cases (XtraDB is 5.6.14, InnoDB is 5.6.10) * copy new tests over * disable some tests for (old) InnoDB * delete XtraDB tests that no longer apply small compatibility changes: * s/HTON_EXTENDED_KEYS/HTON_SUPPORTS_EXTENDED_KEYS/ * revert unnecessary InnoDB changes to make it a bit closer to the upstream fix XtraDB to compile on Windows (both as a static and a dynamic plugin) disable XtraDB on Windows (deadlocks) and where no atomic ops are available (e.g. CentOS 5) storage/innobase/handler/ha_innodb.cc: revert few unnecessary changes to make it a bit closer to the original InnoDB storage/innobase/include/univ.i: correct the version to match what it was merged from	2013-12-22 17:06:50 +01:00
Sergei Golubchik	e27c34f9e4	undelete a file	2013-12-16 15:52:36 +01:00
Sergei Golubchik	d28d3ba40d	10.0-base merge	2013-12-16 13:02:21 +01:00
Sergei Golubchik	6bf10fac44	5.5 merge	2013-12-15 15:57:26 +01:00
Sergei Golubchik	c6d30805db	5.5 merge	2013-11-23 00:50:54 +01:00
Alexander Barkov	1345a75922	MroongaSE: addint thd_autoinc and thd_error_context plugin services	2013-12-12 19:18:49 +04:00
Jan Lindström	a5e236db54	Add additional srv_use_fallocate guard for completing the IO with read.	2013-11-28 11:34:43 +02:00
Jan Lindström	57a70a635a	MDEV-5355: InnoDB assertion at shutdown if posix_fallocate is used in ut_a(node->n_pending == 0 \|\| node->space->stop_new_ops); Analysis: When filespace is extended there is first prepare for IO. But if posix_fallocate is used there was no complete IO causing assertion at shutdown indicating that all IO is not finished. Fix: If posix_fallocate is used to extend the filespace, there is no need to wait for IO to complete, thus we treat this operation as a read operation. We need to mark IO as completed or there would be assertion on shutdown at fil_node_close_file() because all pending IO is not finished.	2013-11-27 20:24:52 +02:00
Sergei Golubchik	af2848a423	Percona-Server-5.5.34-rel32.0 merge	2013-11-19 15:43:22 +01:00
Sergei Golubchik	efab095c7f	MDEV-5236 Status variables are not all listed alphabetically sort xtradb status variables	2013-11-19 13:11:42 +01:00
Jan Lindström	e730c91688	Added test case for new system variable innodb_use_stacktrace and made sure that it can be used only on startup. Fixed compiler problems on solaris and other platforms that do not contain necessary headers and functions.	2013-11-15 15:24:42 +02:00
Jan Lindström	338587d2f4	MDEV-5247: DB locked up at btr0cur.c line 568. Additional fixes to inconsistent usage of have_LRU_mutex and added additional debug assertions to guard incorrect usage of this mutex. Fixes issues found on additional testing and mysql test suite.	2013-11-15 11:32:02 +02:00
Jan Lindström	10467ec7b3	Fix compiler error introduced on revision 3937, make sure that stackdump is compiled only on __linux__.	2013-11-14 14:27:46 +02:00
Jan Lindström	5d1ec1b951	MDEV-5247: DB locked up at btr0cur.c line 568. There is inconsistent and non logical usage of have_LRU_mutex and incorrect value on ha_innodb.cc when buf_LRU_free_block is called. Additionally, for future long semaphore wait cases added a new configuration variable innodb_use_stacktrace. If this variable is true a signal handler for SIGUSR2 is installed when InnoDB server starts and when a long semaphore wait is detected at sync/sync0array.c we send SIGUSR2 signal to waiting thread and thread that has acuired RW-latch. For both threads a full stacktrace is produced as well as its is possible.	2013-11-14 12:57:28 +02:00

1 2 3 4 5 ...

454 commits