mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-28 17:54:16 +01:00

Author	SHA1	Message	Date
Marko Mäkelä	759deaa0a2	MDEV-26010 fixup: Use acquire/release memory order In commit `5f22511e35` we depend on Total Store Ordering. For correct operation on ISAs that implement weaker memory ordering, we must explicitly use release/acquire stores and loads on buf_page_t::oldest_modification_ to prevent a race condition when buf_page_t::list does not happen to be on the same cache line. buf_page_t::clear_oldest_modification(): Assert that the block is not in buf_pool.flush_list, and use std::memory_order_release. buf_page_t::oldest_modification_acquire(): Read oldest_modification_ with std::memory_order_acquire. In this way, if the return value is 0, the caller may safely assume that it will not observe the buf_page_t as being in buf_pool.flush_list, even if it is not holding buf_pool.flush_list_mutex. buf_flush_relocate_on_flush_list(), buf_LRU_free_page(): Invoke buf_page_t::oldest_modification_acquire().	2021-06-26 11:16:40 +03:00
Marko Mäkelä	5f22511e35	MDEV-26010: Assertion lsn > 2 failed in buf_pool_t::get_oldest_modification In commit `22b62edaed` (MDEV-25113) we introduced a race condition. buf_LRU_free_page() would read buf_page_t::oldest_modification() as 0 and assume that buf_page_t::list can be used (for attaching the block to the buf_pool.free list). In the observed race condition, buf_pool_t::delete_from_flush_list() had cleared the field, and buf_pool_t::delete_from_flush_list_low() was executing concurrently with buf_LRU_block_free_non_file_page(), which resulted in buf_pool.flush_list.end becoming corrupted. buf_pool_t::delete_from_flush_list(), buf_flush_relocate_on_flush_list(): First remove the block from buf_pool.flush_list, and only then invoke buf_page_t::clear_oldest_modification(), to ensure that reading oldest_modification()==0 really implies that the block no longer is in buf_pool.flush_list.	2021-06-24 21:55:10 +03:00
Marko Mäkelä	e329dc8d86	MDEV-25948 fixup: Demote a warning to a note buf_dblwr_t::recover(): Issue a note, not a warning, about pages whose FIL_PAGE_LSN is in the future. This was supposed to be part of commit `762bcb81b5` (MDEV-25948) but had been accidentally omitted.	2021-06-24 18:51:05 +03:00
Marko Mäkelä	60ed479711	MDEV-26004 Excessive wait times in buf_LRU_get_free_block() buf_LRU_get_free_block(): Initially wait for a single block to be freed, signaled by buf_pool.done_free. Only if that fails and no LRU eviction flushing batch is already running, we initiate a flushing batch that should serve all threads that are currently waiting in buf_LRU_get_free_block(). Note: In an extreme case, this may introduce a performance regression at larger numbers of connections. We observed this in sysbench oltp_update_index with 512MiB buffer pool, 4GiB of data on fast NVMe, and 1000 concurrent connections, on a 20-thread CPU. The contention point appears to be buf_pool.mutex, and the improvement would turn into a regression somewhere beyond 32 concurrent connections. On slower storage, such regression was not observed; instead, the throughput was improving and maximum latency was reduced. The excessive waits were pointed out by Vladislav Vaintroub.	2021-06-24 11:01:18 +03:00
Marko Mäkelä	6441bc614a	MDEV-25113: Introduce a page cleaner mode before 'furious flush' MDEV-23855 changed the way how the page cleaner is signaled by user threads. If a threshold is exceeded, a mini-transaction commit would invoke buf_flush_ahead() in order to initiate page flushing before all writers would eventually grind to halt in log_free_check(), waiting for the checkpoint age to reduce. However, buf_flush_ahead() would always initiate 'furious flushing', making the buf_flush_page_cleaner thread write innodb_io_capacity_max pages per batch, and sleeping no time between batches, until the limit LSN is reached. Because this could saturate the I/O subsystem, system throughput could significantly reduce during these 'furious flushing' spikes. With this change, we introduce a gentler version of flush-ahead, which would write innodb_io_capacity_max pages per second until the 'soft limit' is reached. buf_flush_ahead(): Add a parameter to specify whether furious flushing is requested. buf_flush_async_lsn: Similar to buf_flush_sync_lsn, a limit for the less intrusive flushing. buf_flush_page_cleaner(): Keep working until buf_flush_async_lsn has been reached. log_close(): Suppress a warning message in the event that a new log is being created during startup, when old logs did not exist. Return what type of page cleaning will be needed. mtr_t::finish_write(): Also when m_log.is_small(), invoke log_close(). Return what type of page cleaning will be needed. mtr_t::commit(): Invoke buf_flush_ahead() based on the return value of mtr_t::finish_write().	2021-06-23 19:06:52 +03:00
Marko Mäkelä	22b62edaed	MDEV-25113: Make page flushing faster buf_page_write_complete(): Reduce the buf_pool.mutex hold time, and do not acquire buf_pool.flush_list_mutex at all. Instead, mark blocks clean by setting oldest_modification to 1. Dirty pages of temporary tables will be identified by the special value 2 instead of the previous special value 1. (By design of the ib_logfile0 format, actual LSN values smaller than 2048 are not possible.) buf_LRU_free_page(), buf_pool_t::get_oldest_modification() and many other functions will remove the garbage (clean blocks) from buf_pool.flush_list while holding buf_pool.flush_list_mutex. buf_pool_t::n_flush_LRU, buf_pool_t::n_flush_list: Replaced with non-atomic variables, protected by buf_pool.mutex, to avoid unnecessary synchronization when modifying the counts. export_vars: Remove unnecessary indirection for innodb_pages_created, innodb_pages_read, innodb_pages_written.	2021-06-23 19:06:52 +03:00
Marko Mäkelä	8af538979b	MDEV-25801: buf_flush_dirty_pages() is very slow In commit `7cffb5f6e8` (MDEV-23399) the implementation of buf_flush_dirty_pages() was replaced with a slow one, which would perform excessive scans of the buf_pool.flush_list and make little progress. buf_flush_list(), buf_flush_LRU(): Split from buf_flush_lists(). Vladislav Vaintroub noticed that we will not need to invoke log_flush_task.wait() for the LRU eviction flushing. buf_flush_list_space(): Replaces buf_flush_dirty_pages(). This is like buf_flush_list(), but operating on a single tablespace at a time. Writes at most innodb_io_capacity pages. Returns whether some of the tablespace might remain in the buffer pool.	2021-06-23 19:06:52 +03:00
Marko Mäkelä	762bcb81b5	MDEV-25948 Remove log_flush_task Vladislav Vaintroub suggested that invoking log_flush_up_to() for every page could perform better than invoking a log write between buf_pool.flush_list batches, like we started doing in commit `3a9a3be1c6` (MDEV-23855). This could depend on the sequence in which pages are being modified. The buf_pool.flush_list is ordered by oldest_modification, while the FIL_PAGE_LSN of the pages is theoretically independent of that. In the pathological case, we will wait for a log write before writing each individual page. It turns out that we can defer the call to log_flush_up_to() until just before submitting the page write. If the doublewrite buffer is being used, we can submit a write batch of "future" pages to the doublewrite buffer, and only wait for the log write right before we are writing an already doublewritten page. The next doublewrite batch will not be initiated before the last page write from the current batch has completed. When a future version introduces asynchronous writes if the log, we could initiate a write at the start of a flushing batch, to reduce waiting further.	2021-06-23 19:06:52 +03:00
Marko Mäkelä	6dfd44c828	MDEV-25954: Trim os_aio_wait_until_no_pending_writes() It turns out that we had some unnecessary waits for no outstanding write requests to exist. They were basically working around a bug that was fixed in MDEV-25953. On write completion callback, blocks will be marked clean. So, it is sufficient to consult buf_pool.flush_list to determine which writes have not been completed yet. On FLUSH TABLES...FOR EXPORT we must still wait for all pending asynchronous writes to complete, because buf_flush_file_space() would merely guarantee that writes will have been initiated.	2021-06-23 19:06:49 +03:00
Marko Mäkelä	344e59904d	Merge 10.4 into 10.5	2021-06-23 08:17:49 +03:00
Marko Mäkelä	09b03ff31b	Merge 10.3 into 10.4	2021-06-23 08:05:27 +03:00
Daniel Bartholomew	55b3a3f4dd	bump the VERSION	2021-06-22 23:00:01 -04:00
Daniel Bartholomew	bf2680ea09	bump the VERSION	2021-06-22 22:42:42 -04:00
Daniel Bartholomew	1deb630484	bump the VERSION	2021-06-22 22:21:24 -04:00
Igor Babaev	7f24e37fbe	MDEV-25679 Wrong result selecting from simple view with LIMIT and ORDER BY Cherry-picking only test case.	2021-06-22 12:23:13 -07:00
Marko Mäkelä	35a9aaebe2	MDEV-25981 InnoDB upgrade fails trx_undo_mem_create_at_db_start(): Relax too strict upgrade checks that were introduced in commit `e46f76c974` (MDEV-15912). On commit, pages will typically be set to TRX_UNDO_CACHED state. Having the type TRX_UNDO_INSERT in such pages is common and unproblematic; the type would be reset in trx_undo_reuse_cached(). trx_rseg_array_init(): On failure, clean up the rollback segments that were initialized so far, to avoid an assertion failure later during shutdown.	2021-06-22 09:30:25 +03:00
Marko Mäkelä	e07f0a2d80	Merge 10.2 into 10.3	2021-06-22 09:15:38 +03:00
Marko Mäkelä	19716ad5a8	MDEV-25982 Upgrade of MariaDB 10.1 log crashes due to missing encryption key init_crypt_key(): On failure, set info->key_version to ENCRYPTION_KEY_VERSION_INVALID. log_crypt_101_read_block(): Refuse to attempt decryption if info->key_version is ENCRYPTION_KEY_VERSION_INVALID.	2021-06-22 09:11:41 +03:00
Igor Babaev	6e94ef4185	MDEV-25679 Wrong result selecting from simple view with LIMIT and ORDER BY This bug affected queries with views / derived_tables / CTEs whose specifications were of the form (SELECT ... LIMIT <n>) ORDER BY ... Units representing such specifications contains one SELECT_LEX structure for (SELECT ... LIMIT <n>) and additionally SELECT_LEX structure for fake_select_lex. This fact should have been taken into account in the function mysql_derived_fill(). This patch has to be applied to 10.2 and 10.3 only.	2021-06-21 22:25:37 -07:00
Igor Babaev	cc0bd8431f	MDEV-25679 Wrong result selecting from simple view with LIMIT and ORDER BY This bug affected queries with views / derived_tables / CTEs whose specifications were of the form (SELECT ... LIMIT <n>) ORDER BY ... Units representing such specifications contains one SELECT_LEX structure for (SELECT ... LIMIT <n>) and additionally SELECT_LEX structure for fake_select_lex. This fact should have been taken into account in the function mysql_derived_fill(). This patch has to be applied to 10.2 and 10.3 only.	2021-06-21 16:21:17 -07:00
Marko Mäkelä	a1907fed60	Merge 10.4 into 10.5	2021-06-21 18:49:36 +03:00
Marko Mäkelä	ce868cd89e	Merge 10.3 into 10.4	2021-06-21 18:44:14 +03:00
Marko Mäkelä	9dc50ea229	MDEV-25979 Invalid page number written to DB_ROLL_PTR trx_undo_report_row_operation(): Fix a race condition that was introduced in commit `f74023b955` (MDEV-15090). We must not access undo_block after the page latch has been released in mtr_t::commit(), because the block could be evicted or replaced.	2021-06-21 18:13:28 +03:00
Marko Mäkelä	a42c80bd48	Merge 10.4 into 10.5	2021-06-21 14:22:22 +03:00
Marko Mäkelä	baf0ef9a18	After-merge fix: Remove duplicated code In the merge commit `d3e4fae797` a message about innodb_force_recovery was accidentally duplicated.	2021-06-21 14:05:43 +03:00
Anel Husakovic	bcedb4200f	MDEV-25878: mytop bugs: check for mysql driver and sockets - Adding socket check for MariaDB/Mysql driver Reviewed by: serg@mariadb.com	2021-06-21 11:55:16 +02:00
Jean Weisbuch	59e3ac2e67	MDEV-25878: mytop bugs: check for mysql driver and sockets mytop fall-back to DBD::mysql if DBD::MariaDB is not available Apply #1546	2021-06-21 11:55:16 +02:00
Marko Mäkelä	d3e4fae797	Merge 10.3 into 10.4	2021-06-21 12:38:25 +03:00
Marko Mäkelä	e46f76c974	MDEV-15912: Remove traces of insert_undo Let us simply refuse an upgrade from earlier versions if the upgrade procedure was not followed. This simplifies the purge, commit, and rollback of transactions. Before upgrading to MariaDB 10.3 or later, a clean shutdown of the server (with innodb_fast_shutdown=1 or 0) is necessary, to ensure that any incomplete transactions are rolled back. The undo log format was changed in MDEV-12288. There is only one persistent undo log for each transaction.	2021-06-21 12:34:07 +03:00
Marko Mäkelä	241d30d3fa	After-merge fixes for MDEV-14180	2021-06-21 12:09:00 +03:00
Marko Mäkelä	c9a85fb1b1	Merge 10.2 into 10.3	2021-06-21 09:07:40 +03:00
Marko Mäkelä	773a07b655	Remove Travis CI status Builds on travis-ci.org ceased on 2021-06-15.	2021-06-21 08:18:34 +03:00
Sergei Golubchik	c3a1ba0fd9	fix spider tests for --ps in 10.5 see also `068246c006` and `690ae1de45`	2021-06-19 14:06:11 +02:00
Sergei Golubchik	690ae1de45	fix spider tests for --ps in 10.4 see also `068246c006`	2021-06-19 13:46:11 +02:00
Sergei Golubchik	a4f485917e	spider tests aren't big in 10.4 see also `a5f6eca50d`	2021-06-19 13:46:11 +02:00
Sergei Golubchik	068246c006	fix spider tests for --ps spider tests use --let $var= many;sql;statements --eval $var and this doesn't work in ps	2021-06-19 00:45:49 +02:00
Sergei Golubchik	a5f6eca50d	spider tests aren't big and never disable tests in suite.pm based on $::opt_big_test, this will make the test skipped both as too big (for ./mtr) and as too small (for ./mtr --big --big).	2021-06-19 00:22:15 +02:00
Sergei Golubchik	dc82effa5d	MDEV-25867 main.trigger-trans failed in bb, Assertion `ticket->m_duration == MDL_EXPLICIT' failed in MDL_context::release_lock release MDL on the trigger as early as possible, after everything that cannot be done concurrently was done (under MDL), but before relocking tables.	2021-06-18 10:55:46 +02:00
Vladislav Vaintroub	78bd7d86a4	MDEV-25953 Tpool - prevent potential deadlock in simulated AIO Do not execute user callback just after pwrite. Instead, submit user function as task into thread pool. This way, the IO thread would not hog aiocb, which is a limited (in Innodb) resource	2021-06-17 20:39:47 +02:00
Thirunarayanan Balathandayuthapani	cd1a195b22	MDEV-25947 innodb_fts.misc_debug fails in buildbot - Make innodb_fts.misc_debug test case more stable.	2021-06-17 17:22:30 +05:30
Daniel Black	48938c57c7	MDEV-25894: support AIX as a platform in mtr Add fixed for tests mysqld--help,aix.rdiff and sysvars_server_notembedded,aix.rdiff AIX couldn't compile in embedded mode so leaving sysvars_server_embedded for later (if required).	2021-06-16 15:44:18 +10:00
Sergei Golubchik	35b57c37bb	MDEV-25617 10.5.10 upgrade: "scriptlet / line 6 : [: is-active : binary operator expected"	2021-06-16 12:26:43 +07:00
Marko Mäkelä	c307dc6efd	Remove a unused variable In commit `1c35a3f6fd` a useless computation that used the variable was removed.	2021-06-16 07:50:04 +03:00
Julius Goryavsky	2edb8e12e1	MDEV-25880 part 2: Improving reliability of the SST scripts Additional improvements aimed at improving operational reliability of the SST scripts: 1) Script need to give rsync and stunnel a short time to terminate after "kill -9" before the first PID check using ps utility; 2) The temporary file used to create the binlog index could sometimes remain in the data directory if tar failed and then may be reused without being cleaned up (the next time when SST was run) - now it's fixed; 3) The temporary file used to build the binlog index is now created using mktemp and, if this variable is present in the configuration file, in tmpdir; 4) Checking the secret tag in SST via rsync is made faster and does not require creating a temporary file, which could remain in the data directory in case of failure; 5) Added "-F" option to grep to check the tag when using mariabackup/xtrabackup-v2 - to avoid possible collisions in case of special characters in the tag value (unlikely scenario, but the new check is more reliable).	2021-06-15 14:27:22 +02:00
Julius Goryavsky	18d5be5b54	MDEV-25880: rsync may be mistakenly killed when overlapping SST This commit fixes a bug was originally discovered during the galera_nbo_sst_slave mtr test for 10.6 branch. However it is relevant for all versions and can lead to intermittent SST crashes via rsync on very fast server restarts - when a new SST process (for example, after starting a new server instance) overlaps the old SST process started by the previous, already terminated server. This overlap can result in the new rsync being killed instead of the old rsync, or the pid file from the new rsync being killed, which then lead to problems.	2021-06-15 14:27:22 +02:00
Eugene Kosov	1c35a3f6fd	fix clang build a new warning -Wunused-but-set-variable was introduced recently to clang	2021-06-15 13:10:16 +03:00
Thirunarayanan Balathandayuthapani	7d591cf850	MDEV-24713 Assertion `dict_table_is_comp(index->table)' failed in row_merge_buf_add() - During online alter conversion from compact to redundant, virtual column field length already set during innobase_get_computed_value(). Skip the char(n) check for virtual column in row_merge_buf_add()	2021-06-15 13:15:32 +05:30
Thirunarayanan Balathandayuthapani	7229107e3e	MDEV-25872 InnoDB: Assertion failure in row_merge_read_clustered_index upon ALTER on table with indexed virtual columns - InnoDB fails to check DB_COMPUTE_VALUE_FAILED error in row_merge_read_clustered_index() and wrongly asserts that the buffer shouldn't be ran out of memory. Alter table should give warning when the column value is being truncated.	2021-06-15 13:15:32 +05:30
Thirunarayanan Balathandayuthapani	8c7d8b716c	MDEV-14180 Automatically disable key rotation checks for file_key_managment plugin Problem: ======= - InnoDB iterates the fil_system space list to encrypt the tablespace in case of key rotation. But it is not necessary for any encryption plugin which doesn't do key version rotation. Solution: ========= - Introduce a new variable called srv_encrypt_rotate to indicate whether encryption plugin does key rotation fil_space_crypt_t::key_get_latest_version(): Enable the srv_encrypt_rotate only once if current key version is higher than innodb_encyrption_rotate_key_age fil_crypt_must_default_encrypt(): Default encryption tables should be added to default_encryp_tables list if innodb_encyrption_rotate_key_age is zero and encryption plugin doesn't do key version rotation fil_space_create(): Add the newly created space to default_encrypt_tables list if fil_crypt_must_default_encrypt() returns true Removed the nondeterministic select from innodb-key-rotation-disable test. By default, InnoDB adds the tablespace to the rotation list and background crypt thread does encryption of tablespace. So these select doesn't give reliable results.	2021-06-15 13:15:32 +05:30
Thirunarayanan Balathandayuthapani	7d2c338c48	MDEV-14180 preparation: Rename key_rotation_list - Rename the rotation list to default_encrypt_tables in fil_system_t. Because rotation list naming could be misleading when it comes to key version rotation - Rename is_in_rotation_list to is_in_default_encrypt in fil_space_t - Rename keyrotate_next function to default_encrypt_next fil_system_t::default_encrypt_next(): Find the next suitable default encrypt table if beginning of default_encrypt_tables list has been scheduled to be deleted	2021-06-15 13:15:32 +05:30

1 2 3 4 5 ...

192505 commits