mariadb

mirror of https://github.com/MariaDB/server.git synced 2026-05-15 19:37:16 +02:00

Author	SHA1	Message	Date
Marko Mäkelä	a635c40648	MDEV-27774 Reduce scalability bottlenecks in mtr_t::commit() A prominent bottleneck in mtr_t::commit() is log_sys.mutex between log_sys.append_prepare() and log_close(). User-visible change: The minimum innodb_log_file_size will be increased from 1MiB to 4MiB so that some conditions can be trivially satisfied. log_sys.latch (log_latch): Replaces log_sys.mutex and log_sys.flush_order_mutex. Copying mtr_t::m_log to log_sys.buf is protected by a shared log_sys.latch. Writes from log_sys.buf to the file system will be protected by an exclusive log_sys.latch. log_sys.lsn_lock: Protects the allocation of log buffer in log_sys.append_prepare(). sspin_lock: A simple spin lock, for log_sys.lsn_lock. Thanks to Vladislav Vaintroub for suggesting this idea, and for reviewing these changes. mariadb-backup: Replace some use of log_sys.mutex with recv_sys.mutex. buf_pool_t::insert_into_flush_list(): Implement sorting of flush_list because ordering is otherwise no longer guaranteed. Ordering by LSN is needed for the proper operation of redo log checkpoints. log_sys.append_prepare(): Advance log_sys.lsn and log_sys.buf_free by the length, and return the old values. Also increment write_to_buf, which was previously done in log_close(). mtr_t::finish_write(): Obtain the buffer pointer from log_sys.append_prepare(). log_sys.buf_free: Make the field Atomic_relaxed, to simplify log_flush_margin(). Use only loads and stores to avoid costly read-modify-write atomic operations. buf_pool.flush_list_requests: Replaces export_vars.innodb_buffer_pool_write_requests and srv_stats.buf_pool_write_requests. Protected by buf_pool.flush_list_mutex. buf_pool_t::insert_into_flush_list(): Do not invoke page_cleaner_wakeup(). Let the caller do that after a batch of calls. recv_recover_page(): Invoke a minimal part of buf_pool.insert_into_flush_list(). ReleaseBlocks::modified: A number of pages added to buf_pool.flush_list. ReleaseBlocks::operator(): Merge buf_flush_note_modification() here. log_t::set_capacity(): Renamed from log_set_capacity().	2022-02-10 16:37:12 +02:00
Marko Mäkelä	8c7c92adf3	MDEV-27787 mariadb-backup --backup is allocating extra memory for log records In commit `685d958e38` (MDEV-14425), the log parsing in mariadb-backup --backup was rewritten. The parameter STORE_IF_EXISTS that is being passed to recv_sys.parse_mtr() or recv_sys.parse_pmem() instead of STORE_NO caused unnecessary additional memory allocation for redo log records.	2022-02-10 15:39:27 +02:00
Vincent Milum Jr	e375f51924	MDEV-27790: Fix mis-matched braces for non-Linux targets Ran into this while compiling on FreeBSD 13.0-RELEASE After this one change, it compiles and runs just fine on my FreeBSD Aarch64 server.	2022-02-10 17:18:37 +11:00
Marko Mäkelä	c75e3770dc	Merge 10.7 into 10.8	2022-02-09 16:24:19 +02:00
Marko Mäkelä	70a8875564	Merge 10.6 into 10.7	2022-02-09 16:04:49 +02:00
Marko Mäkelä	cce994057b	Merge 10.5 into 10.6	2022-02-09 15:49:50 +02:00
Marko Mäkelä	fd101daa84	MDEV-27716 mtr_t::commit() acquires log_sys.mutex when writing no log mtr_t::is_block_dirtied(), mtr_t::memo_push(): Never set m_made_dirty for pages of the temporary tablespace. Ever since commit `5eb539555b` we never add those pages to buf_pool.flush_list. mtr_t::commit(): Implement part of mtr_t::prepare_write() here, and avoid acquiring log_sys.mutex if no log is written. During IMPORT TABLESPACE fixup, we do not write log, but we must add pages to buf_pool.flush_list and for that, be prepared to acquire log_sys.flush_order_mutex. mtr_t::do_write(): Replaces mtr_t::prepare_write().	2022-02-09 15:10:10 +02:00
Oleksandr Byelkin	12cd3dc78d	Merge branch '10.8' into bb-10.8-release	2022-02-09 09:13:22 +01:00
Oleksandr Byelkin	bbd4837f1c	Merge branch '10.7' into bb-10.7-release	2022-02-09 09:09:40 +01:00
Oleksandr Byelkin	1bed56400e	Merge branch '10.6' into bb-10.6-release	2022-02-09 09:05:27 +01:00
Oleksandr Byelkin	34c5019698	Merge branch '10.5' into bb-10.5-release	2022-02-09 08:57:41 +01:00
Marko Mäkelä	5c46751f23	MDEV-27734 Set innodb_change_buffering=none by default The aim of the InnoDB change buffer is to avoid delays when a leaf page of a secondary index is not present in the buffer pool, and a record needs to be inserted, delete-marked, or purged. Instead of reading the page into the buffer pool for making such a modification, we may insert a record to the change buffer (a special index tree in the InnoDB system tablespace). The buffered changes are guaranteed to be merged if the index page actually needs to be read later. The change buffer could be useful when the database is stored on a rotational medium (hard disk) where random seeks are slower than sequential reads or writes. Obviously, the change buffer will cause write amplification, due to potentially large amount of metadata that is being written to the change buffer. We will have to write redo log records for modifying the change buffer tree as well as the user tablespace. Furthermore, in the user tablespace, we must maintain a change buffer bitmap page that uses 2 bits for estimating the amount of free space in pages, and 1 bit to specify whether buffered changes exist. This bitmap needs to be updated on every operation, which could reduce performance. Even if the change buffer were free of bugs such as MDEV-24449 (potentially causing the corruption of any page in the system tablespace) or MDEV-26977 (corruption of secondary indexes due to a currently unknown reason), it will make diagnosis of other data corruption harder. Because of all this, it is best to disable the change buffer by default.	2022-02-09 08:36:41 +02:00
Daniel Bartholomew	ac07749042	bump the VERSION	2022-02-08 18:33:56 -05:00
Daniel Bartholomew	9055db2f28	bump the VERSION	2022-02-08 18:15:35 -05:00
Daniel Bartholomew	fa73117bf8	bump the VERSION	2022-02-08 17:51:29 -05:00
Daniel Bartholomew	f7704d74cb	bump the VERSION	2022-02-08 17:31:40 -05:00
Monty	38058c04a4	MDEV-26585 Wrong query results when `using index for group-by` The problem was that "group_min_max optimization" does not work if some aggregate functions, like COUNT(), is used. The function get_best_group_min_max() is using the join->sum_funcs array to check which aggregate functions are used. The bug was that aggregates in HAVING where not yet added to join->sum_funcs at the time get_best_group_min_max() was called. Fixed by populate join->sum_funcs already in prepare, which means that all sum functions will be in join->sum_funcs in get_best_group_min_max(). A benefit of this approach is that we can remove several calls to make_sum_func_list() from the code and simplify the function. I removed some wrong setting of 'sort_and_group'. This variable is set when alloc_group_fields() is called, as part of allocating the cache needed by end_send_group() and does not need to be set by other functions. One problematic thing was that Spider is using join->sum_funcs to detect at which stage the optimizer is and do internal calculations of aggregate functions. Updating join->sum_funcs early caused Spider to fail when trying to find min/max values in opt_sum_query(). Fixed by temporarily resetting sum_funcs during opt_sum_query(). Reviewer: Sergei Petrunia	2022-02-08 14:32:29 +02:00
Monty	d314bd2664	MDEV-27442 Wrong result upon query with DISTINCT and EXISTS subquery The problem was that get_best_group_min_max() did not check if fields used by the "group_min_max optimization" where used in sub queries. Because of this, it did not detect that a key (b,a) was used in the WHERE clause for the statement: SELECT DISTINCT b FROM t1 WHERE EXISTS ( SELECT 1 FROM DUAL WHERE a > 1 ). Fixed by also traversing the sub queries when checking if a field is used. This disables group_min_max_optimization for the above query. Reviewer: Sergei Petrunia	2022-02-08 14:32:28 +02:00
Monty	a1c2380753	MENT-328 Retry BACKUP STAGE BLOCK DDL in case of deadlocks MENT-328 wrongly assumed that the backup failed because of warnings from mariabackup about not found files. This is normal (and the error message should be deleted). randgen failed because mariabackup didn't retry BACKUP STAGE BLOCK DDL if it failed with a deadlock. To simplify things, I implemented the retry loop in the server as this particular deadlock should be quickly resolved.	2022-02-08 14:32:28 +02:00
Monty	0ec27d7b1f	Don't run innodb_defgragment under valgrind (too slow)	2022-02-08 14:32:28 +02:00
Monty	88fb89acb7	Fixes some compiler issues on AIX (	2022-02-08 14:32:28 +02:00
Monty	df02de68f3	Fixed my_addr_resolve (cherry picked from 10.6) When a server is compiled with -fPIE, my_addr_resolve needs to subtract the info.dli_fbase from symbol addresses in memory for addr2line to recognize them. When a server is compiled without -fPIE, my_addr_resolve should not do it. Unfortunately not all compilers define __PIE__ when -fPIE was used (e.g. older gcc doesn't), so we have to resort to run-time detection.	2022-02-08 14:32:28 +02:00
Vladislav Vaintroub	881918bf77	MDEV-27754 : Assertion with innodb_flush_method=O_DSYNC If innodb_flush_method=O_DSYNC, log_sys.flushed_to_disk_lsn is changed without 'flush_lock' protection inside log_write(). This leads to a race condition, if there are 2 threads running in parallel, doing log_write_up_to() with different values for 'flush_to_disk' In this case, log_write() and log_write_flush_to_disk_low() can execute at the same time, and both would change flushed_lsn. The fix is to remove special treatment of durable writes from log_write(). There is no apparent reason for this special treatment, log_write_flush_to_disk_low() is already optimized for durable writes. Nor there is an apparent reason to call log_flush_notify() more often in for O_DSYNC.	2022-02-07 09:14:00 +01:00
Oleksandr Byelkin	307b2991d6	Fix JSON statistics time format and added tests for it and server version. mariadb-10.8.1	2022-02-07 08:44:32 +01:00
Sergei Golubchik	a319220e62	update test result	2022-02-06 23:00:34 +01:00
Sergei Golubchik	34564587f4	Merge branch '10.7' into 10.8	2022-02-06 18:05:12 +01:00
Sergei Golubchik	cb1316b8d2	wrong merge mariadb-10.7.2	2022-02-06 12:28:49 +01:00
Sergei Golubchik	2150ad3fdb	Merge branch '10.6' into 10.7	2022-02-06 10:14:47 +01:00
Sergei Golubchik	4ffffd98a5	update columnstore mariadb-10.6.6	2022-02-05 14:50:25 +01:00
Vladislav Vaintroub	5ded88ebb3	Remove incorrect narrowing size_t->ulong casts. Fix printf format error.	2022-02-05 02:03:37 +01:00
Sergei Golubchik	4c2c1e6185	enable main.mysqldump-system test	2022-02-05 02:03:04 +01:00
Sergei Golubchik	6009f9b859	make zstd in C/C optional and disable it for now in RPM/DEB	2022-02-05 02:03:04 +01:00
Sergei Golubchik	e70bd5f695	.gitignore	2022-02-04 15:43:52 +01:00
Oleksandr Byelkin	2f29d0eaab	Merge branch '10.7' into 10.8	2022-02-04 14:53:58 +01:00
Oleksandr Byelkin	47f42ce130	Merge branch '10.6' into 10.7	2022-02-04 14:53:19 +01:00
Oleksandr Byelkin	64e358821e	Revert "don't build with OpenSSL 3.0, it doesn't work before MDEV-25785" This reverts commit `c9beef4315`, because we have OpenSSL 3.0 support here.	2022-02-04 14:52:03 +01:00
Oleksandr Byelkin	4fb2cb1a30	Merge branch '10.7' into 10.8	2022-02-04 14:50:25 +01:00
Oleksandr Byelkin	a806c993e7	Fix for compiling under clang.	2022-02-04 14:40:42 +01:00
Oleksandr Byelkin	9ed8deb656	Merge branch '10.6' into 10.7	2022-02-04 14:11:46 +01:00
Oleksandr Byelkin	d87979b48c	Merge branch '10.5' into 10.6	2022-02-04 10:01:08 +01:00
Oleksandr Byelkin	ad3ac55641	fix 32bit embedded result file. mariadb-10.5.14	2022-02-04 09:55:04 +01:00
Oleksandr Byelkin	2cf52736de	Fix for clang compilation	2022-02-04 09:54:45 +01:00
Marko Mäkelä	82f5981e72	MDEV-27058 fixup: Crash in innodb.leaf_page_corrupted_during_recovery buf_page_get_low(): If the page was read-fixed, validate the page ID because the page could have been marked as corrupted. We should retry the page read in this case, instead of returning a soon-to-be-evicted corrupted page to the caller. This was initially only observed on Microsoft Windows. On Linux, this was repeated after adding a sleep to buf_pool_t::corrupted_evict() between bpage->zip.fix.fetch_sub() and bpage->lock.x_unlock().	2022-02-03 17:02:27 +01:00
Marko Mäkelä	05c33d6216	MDEV-27736 Allow seamless upgrade despite ROW_FORMAT=COMPRESSED In commit `9bc874a594` (MDEV-23497) the configuration option innodb_read_only_compressed was introduced to giver users advance notice of a plan to remove ROW_FORMAT=COMPRESSED support for InnoDB. Based on user feedback, this plan has been scrapped. Even though ROW_FORMAT=COMPRESSED is a dead end and causes some overhead for InnoDB data structures, we can live with that. Now that we know that some users really want to keep using ROW_FORMAT=COMPRESSED, the previous default value of the parameter innodb_read_only_compressed=ON should be changed to OFF, to allow smooth upgrades to 10.6 and later versions, without requiring users to update any configuration file.	2022-02-03 17:02:14 +01:00
Oleksandr Byelkin	f5c5f8e41e	Merge branch '10.5' into 10.6	2022-02-03 17:01:31 +01:00
Sergei Golubchik	c0f5fd2754	MDEV-27683 EXCHANGE PARTITION allows different index direction, but causes further errors	2022-02-03 15:28:12 +01:00
Sergei Golubchik	a450d58ad0	fix a copy-paste error LEX_CSTRING table_name= { table->s->db.str, table->s->table_name.length }; and misc cleanups	2022-02-03 15:28:12 +01:00
Andrei	e4d7886cc5	MDEV-11675. rpl_start_alter_ftwrl.test is refined The test could fail sporadically because of not anticipated race on slave between CREATE and ALTER queries. Fixed to synchronize slave and master wrt CREATE.	2022-02-02 17:17:27 +02:00
Marko Mäkelä	12f29a4bc0	MDEV-11675 fixup: GCC -Og -Wmaybe-uninitialized save_restore_context_apply_event(): Because compilers cannot infer that ev->apply_event(rgi) will not affect ev->get_type_code(), let us test that condition only once and allow the compiler to emit a tail call. Also, replace a goto with an early return for error handling.	2022-02-02 11:30:16 +02:00
Vladislav Vaintroub	2b95c36b4b	fix clang-cl warnings	2022-02-02 01:35:40 +01:00

1 2 3 4 5 ...

195,032 commits