mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-27 09:14:17 +01:00

Author	SHA1	Message	Date
Thirunarayanan Balathandayuthapani	2d42e9ff7d	MDEV-34703 LOAD DATA INFILE using Innodb bulk load aborts Problem: ======= - During a LOAD statement, the InnoDB bulk operation relies on the temporary directory, and it crashed when tmpdir was exhausted. Solution: ======== During bulk insert, the LOAD statement builds the clustered index one record at a time instead of one page at a time. By doing this, InnoDB does the following: 1) Avoids creating a temporary file for the clustered index. 2) Writes the undo log for the first insert operation alone	2025-01-15 23:49:13 +05:30
Marko Mäkelä	46aaf328ce	MDEV-35830 Fix innodb_undo_log_truncate in backup recv_sys_t::parse(): Correctly handle the storing==BACKUP case, and simplify some logic around storing==YES as well. The added test mariabackup.undo_truncate is based on an idea of Thirunarayanan Balathandayuthapani. It nondeterministically (not on every run) covers this logic, including the function backup_undo_trunc(), for both innodb_encrypt_log=ON and innodb_encrypt_log=OFF. Reviewed by: Debarun Banerjee	2025-01-13 16:57:11 +02:00
Marko Mäkelä	aa35f62f1c	MDEV-35810 Missing error handling in log resizing log_t::resize_start(): If the ib_logfile101 cannot be created, be sure to reset log_sys.resize_lsn. log_t::resize_abort(): In case SET GLOBAL innodb_log_file_size is aborted, delete the ib_logfile101.	2025-01-13 10:41:40 +02:00
Marko Mäkelä	4fc3a44bab	MDEV-33447 fixup: pmem_persist() on RISC-V and LoongArch Let us enable pmem_persist() on RISC-V and LoongArch, because those are available in the Debian CI. In commit `3f9f5ca48e` these were initially disabled by default. According to the available documentation, these instructions are available in all ISA versions. On LoongArch there would also be __builtin_loongarch_dbar() that generates the same code.	2025-01-13 07:28:40 +02:00
Marko Mäkelä	42e6058629	MDEV-35785 innodb_log_file_mmap is not defined on 32-bit systems innodb_log_file_mmap: Use a constant documentation string that refers to persistent memory also when it is not available in the build. HAVE_INNODB_MMAP: Remove, and unconditionally enable this code. log_mmap(): On 32-bit systems, ensure that the size fits in 32 bits. log_t::resize_start(), log_t::resize_abort(): Only handle memory-mapping if HAVE_PMEM is defined. The generic memory-mapped interface is only for reading the log in recovery. Writable memory mappings are only for persistent memory, that is, Linux file systems with mount -o dax. Reviewed by: Debarun Banerjee, Otto Kekäläinen	2025-01-13 07:27:17 +02:00
ParadoxV5	f361ad75b3	MDEV-35431: fix InnoDB flags error size specifier This came after I prepared #3518.	2025-01-12 13:56:06 +11:00
Sergei Golubchik	221aa5e08f	Merge branch '10.6' into 10.11	2025-01-10 13:14:42 +01:00
Sergei Golubchik	311e88c67a	fix rocksdb tests for buildbot	2025-01-10 13:13:53 +01:00
Marko Mäkelä	4704435068	MDEV-35802 Race condition in log_t::persist() log_t::persist(): Remove the parameter holding_latch, and assert latch_holding_any(). We used to avoid acquiring a latch when log resizing was not in progress. That allowed a race condition to occur where log_t::write_checkpoint() has just completed log resizing. In that case, we could wrongly invoke pmem_persist() on the old log_sys.buf instead of the new one, which was shortly before known as log_sys.resize_buf. log_write_persist(): A non-inline wrapper function that will invoke log_sys.persist() while holding a shared log_sys.latch.	2025-01-10 08:15:09 +02:00
Marko Mäkelä	ff1f611a0d	Avoid assert() By default, CMAKE_BUILD_TYPE RelWithDebInfo or Release implies -DNDEBUG, which disables the assert() macro. MariaDB is deviating from that. Let us be explicit to use assert() only in debug builds. This fixes up `1b8358d943`	2025-01-10 06:50:50 +02:00
Marko Mäkelä	1b8358d943	Use assert() on RMW arguments The macros ut_ad() and DBUG_ASSERT() can evaluate their argument twice. That is wrong for any read-modify-write arguments. Thanks to Nikita Malyavin for pointing this out.	2025-01-09 14:27:13 +02:00
Marko Mäkelä	81633f47c3	MDEV-35796 OPT_PAGE_CHECKSUM is ignored if innodb_encrypt_log=ON recv_sys_t::parse(): When parsing an OPTION record, invoke l.copy_if_needed() before checking if the payload is OPT_PAGE_CHECKSUM followed by a 32-bit page checksum. This fixes up the merge `57d4a242da` of commit `4179f93d28` (MDEV-18976). The impact of this can be observed by running a debug instrumented build on the test encryption.recovery_memory. There should be over 5,000 invocations of log_phys_t::page_checksum(). Without this fix, there should be less than 100 of them (when the OPT_PAGE_CHECKSUM byte happens to encrypt to itself). Reviewed by: Debarun Banerjee Tested by: Matthias Leich	2025-01-09 13:21:38 +02:00
Marko Mäkelä	ed13d93a25	Fix mariadb-backup --backup with innodb_undo_log_truncate=ON This fixes another regression that had been introduced in commit `b249a059da` (MDEV-34850). This should prevent failures of mariadb-backup --backup of the following type: mariabackup: Failed to read undo log tablespace space id … and there is no undo tablespace truncation redo record. This error has not been hit by our internal testing, and we currently have no regression test to cover this.	2025-01-09 13:18:42 +02:00
Marko Mäkelä	ea19a6b38c	MDEV-35699 Multi-batch recovery occasionally fails recv_sys_t::parse<storing=NO>(): Do invoke fil_space_set_recv_size_and_flags() and do parse enough of page 0 to facilitate that. This fixes a regression that had been introduced in commit `b249a059da` (MDEV-34850). In a multi-batch crash recovery, we would fail to invoke fil_space_set_recv_size_and_flags() while parsing the remaining log, before starting the first recovery batch. Reviewed by: Debarun Banerjee Tested by: Matthias Leich	2025-01-09 13:18:30 +02:00
Sergei Golubchik	9929a0a76e	MDEV-32576 increase query length in the InnoDB deadlock output * increase target buffer size to 3072 * remove the parameter, just use the buffer size as a limit	2025-01-09 10:00:36 +01:00
Sergei Golubchik	c478b1ba08	MDEV-35598 foreign key error is unnecessarily truncated truncate it at 512 bytes (max allowed by the protocol), not 192	2025-01-09 10:00:36 +01:00
Sergei Golubchik	a0e5dd5433	mysqltest: fix --sorted_results only sort actual results not warnings or metadata also work for vertical results warnings are sorted separately	2025-01-09 10:00:36 +01:00
Sergei Golubchik	74532f2355	MCOL-5819 disable lto for ColumnStore	2025-01-09 10:00:35 +01:00
Sergei Golubchik	b79723ffe3	MDEV-35384 Table performance_schema.session_status and other two tables are not shown in information_schema.tables for normal users get_all_tables() skipped tables if the user has no privileges on the schema itself and no granted privilege on any tables in the schema. that is, it was skipping performance_schema tables (privileges on them aren't explicitly granted, but internally hard-coded) To fix: * extend ACL_internal_table_access::check() method with `bool any_combination_will_do` * fix all perfschema privilege checks to take it into account. * don't reuse table_acl_check object for all tables, initialize it for every table otherwise GRANT_INTERNAL_INFO will leak * remove incorrect privilege check from get_all_tables()	2025-01-09 10:00:35 +01:00
Marko Mäkelä	990b010b09	MDEV-35438 Annotate InnoDB I/O functions with noexcept Most InnoDB functions do not throw any exceptions, not even indirectly std::bad_alloc, which could be thrown by a C++ memory allocation function. Let us annotate many functions with noexcept in order to reduce the code footprint related to exception handling. Reviewed by: Thirunarayanan Balathandayuthapani	2025-01-09 07:43:24 +02:00
Marko Mäkelä	6d4841ae26	MDEV-35647 Possible hang during CREATE TABLE…SELECT error handling ha_innobase::delete_table(): Clear trx->dict_operation_lock_mode after, not before invoking trx->rollback(), so that row_undo_mod_parse_undo_rec() will be invoked with dict_locked=true and dict_sys_t::freeze() will not be invoked for loading a table definition. Inside dict_sys_t::freeze(), an assertion !have_any() would fail when the current thread is already holding the latch. This fixes up commit `c5fd9aa562` (MDEV-25919). Reviewed by: Debarun Banerjee	2025-01-08 13:29:16 +02:00
Marko Mäkelä	420d9eb27f	Merge 10.6 into 10.11	2025-01-08 12:51:26 +02:00
Marko Mäkelä	b251cb6a4f	Merge 10.5 into 10.6	2025-01-08 08:48:21 +02:00
Sergei Golubchik	9508a44c37	enforce no trailing \n in Diagnostic_area messages that is in my_error(), push_warning(), etc	2025-01-07 16:31:39 +01:00
Thirunarayanan Balathandayuthapani	f8cf493290	MDEV-34898 Doublewrite recovery of innodb_checksum_algorithm=full_crc32 encrypted pages does not work - InnoDB fails to recover the full crc32 encrypted page from doublewrite buffer. The reason is that buf_dblwr_t::recover() fails to identify the space id from the page because the page has been encrypted from FIL_PAGE_FILE_FLUSH_LSN_OR_KEY_VERSION bytes. Fix: === buf_dblwr_t::recover(): preserve any pages whose space_id does not match a known tablespace. These could be encrypted pages of tablespaces that had been created with innodb_checksum_algorithm=full_crc32. buf_page_t::read_complete(): If the page looks corrupted and the tablespace is encrypted and in full_crc32 format, try to restore the page from doublewrite buffer. recv_dblwr_t::recover_encrypted_page(): Find the page which has the same page number and try to decrypt the page using space->crypt_data. After decryption, compare the space id. Write the recovered page back to the file.	2025-01-07 19:33:56 +05:30
Monty	cc5d738999	Disable mmap usage in Aria and MyISAM when compiling with valgrind This removes a valgrind warning "cannot read program header" while it tries to search for memory leaks.	2025-01-07 12:13:14 +02:00
Monty	a2d37705ca	Only print "InnoDB: Transaction was aborted..." if log_warnings >= 4 This is a minor fixup for MDEV-24035 Failing assertion UT_LIST_GET_LEN(lock.trx_locks) == 0 causing disruption and replication failure	2025-01-05 16:40:12 +02:00
Monty	88d9348dfc	Remove dates from all rdiff files	2025-01-05 16:40:11 +02:00
Monty	52c29f3bdc	MDEV-35469 Heap tables are calling mallocs too often Heap tables allocate blocks to store rows according to my_default_record_cache (mapped to the server global variable read_buffer_size). This causes performance issues when the record length is big (> 1000 bytes) and the my_default_record_cache is small. Changed to instead split the default heap allocation to 1/16 of the allowed space and not use my_default_record_cache anymore when creating the heap. The allocation is also aligned to be just under a power of 2. For some test that I have been running, which was using record length=633, the speed of the query doubled thanks to this change. Other things: - Fixed calculation of max_records passed to hp_create() to take into account padding between records. - Updated calculation of memory needed by heap tables. Before we did not take into account internal structures needed to access rows. - Changed block size for memory_table from 1 to 16384 to get less fragmentation. This also avoids a problem where we need 1K to manage index and row storage which was not accounted for before. - Moved heap memory usage to a separate test for 32 bit. - Allocate all data blocks in heap in powers of 2. Change reported memory usage for heap to reflect this. Reviewed-by: Sergei Golubchik <serg@mariadb.org>	2025-01-05 16:40:11 +02:00
Marko Mäkelä	f20ee931d8	Merge 10.5 into 10.6 Note: Changes to the test innodb.stats_persistent in commit `e5c4c0842d` (MDEV-35443) are not merged, because the test scenario is impossible due to commit `e66928ab28` (MDEV-33462).	2025-01-03 09:10:25 +02:00
Thirunarayanan Balathandayuthapani	48b724047e	MDEV-34119 Assertion `page_dir_get_n_heap(new_page) == 2U' failed in dberr_t PageBulk::init() Problem: ======= - insert..select statement on partition table fails to use bulk insert for the transaction. Solution: ======== - Enable the bulk insert operation for insert..select statement for partition table.	2025-01-02 17:34:24 +05:30
Marko Mäkelä	3f914afd3a	Merge 10.6 into 10.11	2025-01-02 12:39:56 +02:00
Marko Mäkelä	a54d151fc1	Merge 10.6 into 10.11	2024-12-19 15:38:53 +02:00
Marko Mäkelä	e5c4c0842d	MDEV-35443: opt_search_plan_for_table() may degrade to full table scan opt_calc_index_goodness(): Correct an inaccurate condition. We can very well use a clustered index of a table that is subject to online rebuild. But we must not choose an index that has not been committed (it is a secondary index that was not fully created) or that is corrupted or not a normal B-tree index. opt_search_plan_for_table(): Remove some redundant code, now that opt_calc_index_goodness() checks against corrupted indexes. The test case allows this code to be exercised. The main observation in the following: ./mtr --rr innodb.stats_persistent rr replay var/log/mysqld.1.rr/latest-trace should be that when opt_search_plan_for_table() is being invoked by dict_stats_update_persistent() on the being-altered statistics table in the 2nd call after ha_innobase::inplace_alter_table(), and the fix in opt_calc_index_goodness() is absent, it would choose the code path if (n_fields == 0), that is, a full table scan, instead of searching for the record. The GDB commands to execute in "rr replay" would be as follows: break ha_innobase::inplace_alter_table continue break opt_search_plan_for_table continue continue next next … Reviewed by: Vladislav Lesin	2024-12-19 14:05:16 +02:00
Daniele Sciascia	07b77e862c	MDEV-35660 Assertion `trx->xid.is_null()' failed The assertion fails during the wsrep recovery step, in function innobase_rollback_by_xid(). The transaction's xid is normally cleared as part of lookup by xid, unless the transaction has a wsrep specific xid. This is a regression from MDEV-24035 (commit `ddd7d5d8e3`) which removed the part that clears the xid before rollback for a transaction with a wsrep specific xid.	2024-12-19 08:55:59 +01:00
mariadb-DebarunBanerjee	3f22f5f2fe	MDEV-35679 Potential issue in Secondary Index with ROW_FORMAT=COMPRESSED and Change buffering enabled In function buf_page_create_low(), remove duplicate code that incorrectly overwrites the ibuf_exist variable when only the compressed page is loaded in the buffer pool. This helps remove any old change buffer record immediately before reusing the page.	2024-12-18 20:46:26 +05:30
Yuchen Pei	671f80c738	Merge branch '10.5' into 10.6	2024-12-17 11:06:09 +11:00
Yuchen Pei	77c9917663	MDEV-34716 Fix mysql.servers socket max length too short The limit of socket length on unix according to libc is 108, see sockaddr_un::sun_path, but in the table it is a string of max length 64, which results in truncation of socket and failure to connect by plugins using servers such as spider.	2024-12-17 10:40:57 +11:00
Marko Mäkelä	c982a143fc	MDEV-35494 fixup: Always initialize latch It turns out that init() always checks in debug builds that some fields of the latch had been filled with zero.	2024-12-16 13:23:13 +02:00
Marko Mäkelä	c391fb1ff1	MDEV-35577 Broken recovery after SET GLOBAL innodb_log_file_size If InnoDB is killed in such a way that there had been no writes to a newly resized ib_logfile101 after it replaced ib_logfile0 in log_t::write_checkpoint(), it is possible that recovery will accidentally interpret some garbage at the end of the log as valid. log_t::write_buf(): To prevent the corruption, write an extra NUL byte at the end of log_sys.resize_buf, like we always did for the main log_sys.buf. To remove some conditional branches from a time critical code path, we instantiate a separate template for the rare case that the log is being resized. Define as __attribute__((always_inline)) so that this will be inlined also in the rare case the log is being resized. log_t::writer: Pointer to the current implementation of log_t::write_buf(). For quick access, this is located in the same cache line with log_sys.latch, which protects it. log_t::writer_update(): Update log_sys.writer. log_t::resize_write_buf(): Remove ATTRIBUTE_NOINLINE ATTRIBUTE_COLD. Now that log_t::write_buf() will be instantiated separately for the rare case of log resizing being in progress, there is no need to forbid this code from being inlined. Thanks to Thirunarayanan Balathandayuthapani for finding the root cause of this bug and suggesting the fix of writing an extra NUL byte. Reviewed by: Debarun Banerjee	2024-12-16 11:50:00 +02:00
mariadb-DebarunBanerjee	c7698a0b70	MDEV-35626 Race condition between buf_page_create_low() and read completion This regression was introduced in 10.6 by the following commit: commit `35d477dd1d` MDEV-34453 Trying to read 16384 bytes at 70368744161280 The page state could change after being buffer-fixed and needs to be read again after locking the page.	2024-12-13 18:36:47 +05:30
Marko Mäkelä	1097164d3f	MDEV-35619 Assertion failure in row_purge_del_mark_error trx_sys_t::find_same_or_older_in_purge(): Correct a mistake that was made in commit `19acb0257e` (MDEV-35508) and make the caching logic correspond to the one in trx_sys_t::find_same_or_older(). In the more common code path for 64-bit systems, the condition !hot was inadvertently inverted, making us wrongly skip calls to find_same_or_older_low() when the transaction may still be active. Furthermore, the call should have been to find_same_or_older_low() and not the wrapper find_same_or_older().	2024-12-13 11:41:47 +02:00
Marko Mäkelä	ddd7d5d8e3	MDEV-24035 Failing assertion: UT_LIST_GET_LEN(lock.trx_locks) == 0 causing disruption and replication failure Under unknown circumstances, the SQL layer may wrongly disregard an invocation of thd_mark_transaction_to_rollback() when an InnoDB transaction had been aborted (rolled back) due to one of the following errors: * HA_ERR_LOCK_DEADLOCK * HA_ERR_RECORD_CHANGED (if innodb_snapshot_isolation=ON) * HA_ERR_LOCK_WAIT_TIMEOUT (if innodb_rollback_on_timeout=ON) Such an error used to cause a crash of InnoDB during transaction commit. These changes aim to catch and report the error earlier, so that not only this crash can be avoided but also the original root cause be found and fixed more easily later. The idea of this fix is from Michael 'Monty' Widenius. HA_ERR_ROLLBACK: A new error code that will be translated into ER_ROLLBACK_ONLY, signalling that the current transaction has been aborted and the only allowed action is ROLLBACK. trx_t::state: Add TRX_STATE_ABORTED that is like TRX_STATE_NOT_STARTED, but noting that the transaction had been rolled back and aborted. trx_t::is_started(): Replaces trx_is_started(). ha_innobase: Check the transaction state in various places. Simplify the logic around SAVEPOINT. ha_innobase::is_valid_trx(): Replaces ha_innobase::is_read_only(). The InnoDB logic around transaction savepoints, commit, and rollback was unnecessarily complex and might have contributed to this inconsistency. So, we are simplifying that logic as well. trx_savept_t: Replace with const undo_no_t*. When we rollback to a savepoint, all we need to know is the number of undo log records that must survive. trx_named_savept_t, DB_NO_SAVEPOINT: Remove. We can store undo_no_t directly in the space allocated at innobase_hton->savepoint_offset. fts_trx_create(): Do not copy previous savepoints. fts_savepoint_rollback(): If a savepoint was not found, roll back everything after the default savepoint of fts_trx_create(). The test innodb_fts.savepoint is extended to cover this code. Reviewed by: Vladislav Lesin Tested by: Matthias Leich	2024-12-12 18:02:00 +02:00
Dave Gosselin	9aa84cf57f	MDEV-35587 unit.innodb_sync leaks memory on mac unit.innodb_sync calls my_end to cleanup its memory	2024-12-12 10:27:36 +11:00
Marko Mäkelä	7bcd6c610a	MDEV-35618 Bogus assertion failure 'recv_sys.scanned_lsn < max_lsn + 32 * 512U' during recovery buf_dblwr_t::recover(): Correct a debug assertion failure that had been added in commit `bb47e575de` (MDEV-34830). The server may have been killed while a log write was in progress, and therefore recv_sys.scanned_lsn may be up to RECV_PARSING_BUF_SIZE bytes ahead of recv_sys.recovered_lsn. Thanks to Matthias Leich for providing "rr replay" traces and testing this.	2024-12-11 14:47:39 +02:00
Marko Mäkelä	69e20cab28	Merge 10.5 into 10.6	2024-12-11 14:46:43 +02:00
Marko Mäkelä	bfe7c8ff0a	MDEV-35494 fil_space_t::fil_space_t() may be unsafe with GCC -flifetime-dse fil_space_t::create(): Instead of invoking the default fil_space_t constructor on a zero-filled buffer, allocate an uninitialized buffer and invoke an explicitly defined constructor on it. Also, specify initializer expressions for all constant data members, so that all of them will be initialized in the constructor. fil_space_t::being_imported: Replaces part of fil_space_t::purpose. fil_space_t::is_being_imported(), fil_space_t::is_temporary(): Replaces fil_space_t::purpose. fil_space_t::id: Changed the type from ulint to uint32_t to reduce incompatibility with later branches that include commit `ca501ffb04` (MDEV-26195). fil_space_t::try_to_close(): Do not attempt to close files that are in an I/O bound phase of ALTER TABLE…IMPORT TABLESPACE. log_file_op, first_page_init: recv_spaces_t: Use uint32_t for the tablespace id. Reviewed by: Debarun Banerjee	2024-12-11 14:44:42 +02:00
Daniel Black	807e4f320f	Change my_umask{,_dir} to mode_t and remove os_innodb_umask os_innodb_umask was of the incorrect type resulting in warnings in clang-19. The correct type is mode_t. As os_innodb_umask was set during innodb_init from my_umask, corrected the type there along with its companion my_umask_dir. Because of this, the default mask values in innodb never had an effect. The resulting change also found signed differences in my_create{,_nosymlink}, open_nosymlinks: mysys/my_create.c:47:20: error: operand of ?: changes signedness from ‘int’ to ‘mode_t’ {aka ‘unsigned int’} due to unsignedness of other operand [-Werror=sign-compare] 47 | CreateFlags ? CreateFlags : my_umask); Ref: clang-19 warnings: [55/123] Building CXX object storage/innobase/CMakeFiles/innobase.dir/os/os0file.cc.o storage/innobase/os/os0file.cc:1075:46: warning: implicit conversion loses integer precision: 'ulint' (aka 'unsigned long') to 'mode_t' (aka 'unsigned int') [-Wshorten-64-to-32] 1075 | file = open(name, create_flag | O_CLOEXEC, os_innodb_umask); | ~~~~ ^~~~~~~~~~~~~~~ storage/innobase/os/os0file.cc:1249:46: warning: implicit conversion loses integer precision: 'ulint' (aka 'unsigned long') to 'mode_t' (aka 'unsigned int') [-Wshorten-64-to-32] 1249 | file = open(name, create_flag | O_CLOEXEC, os_innodb_umask); | ~~~~ ^~~~~~~~~~~~~~~ storage/innobase/os/os0file.cc:1381:45: warning: implicit conversion loses integer precision: 'ulint' (aka 'unsigned long') to 'mode_t' (aka 'unsigned int') [-Wshorten-64-to-32] 1381 | file = open(name, create_flag | O_CLOEXEC, os_innodb_umask); | ~~~~ ^~~~~~~~~~~~~~~	2024-12-11 17:21:01 +11:00
Daniel Black	bf7cfa2535	MDEV-35574 remove obsolete pthread_exit calls Threads can normally exit without an explicit pthread_exit call. These seem to date to old glibc bugs, many around 2.2.5. The semi related bug was https://bugs.mysql.com/bug.php?id=82886. To improve safety in the signal handlers DBUG_* code was removed. These were also needed to avoid some MSAN unresolved stack issues. This is effectively a backport of `2719cc4925`.	2024-12-10 12:12:20 +11:00
Thirunarayanan Balathandayuthapani	b9e592a786	MDEV-35475 Assertion `!rec_offs_nth_extern(offsets1, n)' failed in cmp_rec_rec_simple_field Problem: ======= InnoDB wrongly stores the primary key field in externally stored off page during bulk insert operation. This leads to assert failure. Solution: ======== row_merge_buf_blob(): Should store the primary key fields inline. Store the variable length field data externally based on the row format of the table. row_merge_buf_write(): check whether the record size exceeds the maximum record size. row_merge_copy_blob_from_file(): Construct the tuple based on the variable length field	2024-12-09 20:27:12 +05:30
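
Commit `1b8358d943` above notes that a checking macro which evaluates its argument twice is wrong for read-modify-write arguments. The following is a minimal sketch of that hazard, not code from the MariaDB tree: `CHECK_TWICE`, `check_once()`, `trace_checks`, `traced_failures` and the two `increments_with_*` helpers are hypothetical names invented for illustration, and the macro body is an assumption about how a double evaluation could arise, not the real definition of `ut_ad()` or `DBUG_ASSERT()`.

```cpp
#include <cstdlib>

// Hypothetical state for an optional "trace" branch in the macro below.
static bool trace_checks = true;
static int traced_failures = 0;

// Hypothetical macro for illustration only; NOT the actual definition of
// ut_ad() or DBUG_ASSERT(). Its argument appears twice in the expansion,
// so any side effect in the argument happens twice.
#define CHECK_TWICE(expr)                            \
  do {                                               \
    if (trace_checks && !(expr)) ++traced_failures;  \
    if (!(expr)) std::abort();                       \
  } while (0)

// A function argument is evaluated exactly once, before the call.
inline void check_once(bool ok) { if (!ok) std::abort(); }

// Returns how often the counter was incremented by a single macro check.
int increments_with_macro() {
  int counter = 0;
  CHECK_TWICE(++counter > 0);  // ++counter runs in both expansions
  return counter;
}

// Returns how often the counter was incremented by a single function check.
int increments_with_function() {
  int counter = 0;
  check_once(++counter > 0);   // ++counter runs once
  return counter;
}
```

In this sketch the macro variant increments the counter twice per check and the function variant increments it once, which is the kind of discrepancy that matters when the checked expression is an atomic read-modify-write operation.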

27974 commits