mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-17 12:32:27 +01:00

Author	SHA1	Message	Date
Marko Mäkelä	2027c482de	MDEV-32050: Hold exclusive purge_sys.rseg->latch longer Let the purge_coordinator_task acquire purge_sys.rseg->latch less frequently and hold it longer at a time. This may throttle concurrent DML and prevent purge lag a little. Remove an unnecessary std::this_thread::yield(), because the trx_purge_attach_undo_recs() is supposed to terminate the scan when running out of undo log records. Ultimately, this will result in purge_coordinator_state::do_purge() and purge_coordinator_callback() returning control to the thread pool. Reviewed by: Vladislav Lesin and Vladislav Vaintroub	2023-10-25 09:38:49 +03:00
Marko Mäkelä	44689eb7d8	MDEV-32050: Improve srv_wake_purge_thread_if_not_active() purge_sys_t::wake_if_not_active(): Replaces srv_wake_purge_thread_if_not_active(). innodb_ddl_recovery_done(): Move the wakeup call to srv_init_purge_tasks(). purge_coordinator_timer: Remove. The srv_master_callback() already invokes purge_sys.wake_if_not_active() once per second. Reviewed by: Vladislav Lesin and Vladislav Vaintroub	2023-10-25 09:38:21 +03:00
Marko Mäkelä	14685b10df	MDEV-32050: Deprecate&ignore innodb_purge_rseg_truncate_frequency The motivation of introducing the parameter innodb_purge_rseg_truncate_frequency in mysql/mysql-server@28bbd66ea5 and mysql/mysql-server@8fc2120fed seems to have been to avoid stalls due to freeing undo log pages or truncating undo log tablespaces. In MariaDB Server, innodb_undo_log_truncate=ON should be a much lighter operation than in MySQL, because it will not involve any log checkpoint. Another source of performance stalls should be trx_purge_truncate_rseg_history(), which is shrinking the history list by freeing the undo log pages whose undo records have been purged. To alleviate that, we will introduce a purge_truncation_task that will offload this from the purge_coordinator_task. In that way, the next innodb_purge_batch_size pages may be parsed and purged while the pages from the previous batch are being freed and the history list being shrunk. The processing of innodb_undo_log_truncate=ON will still remain the responsibility of the purge_coordinator_task. purge_coordinator_state::count: Remove. We will ignore innodb_purge_rseg_truncate_frequency, and act as if it had been set to 1 (the maximum shrinking frequency). purge_coordinator_state::do_purge(): Invoke an asynchronous task purge_truncation_callback() to free the undo log pages. purge_sys_t::iterator::free_history(): Free those undo log pages that have been processed. This used to be a part of trx_purge_truncate_history(). purge_sys_t::clone_end_view(): Take a new value of purge_sys.head as a parameter, so that it will be updated while holding exclusive purge_sys.latch. This is needed for race-free access to the field in purge_truncation_callback(). Reviewed by: Vladislav Lesin	2023-10-25 09:11:58 +03:00
Marko Mäkelä	21bec97044	MDEV-32050: Clean up online ALTER UndorecApplier::assign_rec(): Remove. We will pass the undo record to UndorecApplier::apply_undo_rec(). There is no need to copy the undo record, because nothing else can write to the undo log pages that belong to an active or incomplete transaction. trx_t::apply_log(): Buffer-fix the undo page across mini-transaction boundary in order to avoid repeated page lookups. Reviewed by: Vladislav Lesin	2023-10-25 08:27:27 +03:00
Marko Mäkelä	9bb5d9fe8b	MDEV-32050: Clean up log parsing purge_node_t, undo_node_t: Change the type of rec_type and cmpl_info to byte, because this data is being extracted from a single byte. UndoRecApplier: Change type and cmpl_info to be of type byte, and move them next to the 16-bit offset field to minimize alignment bloat. row_purge_parse_undo_rec(): Remove some redundant code. Purge will be started by innodb_ddl_recovery_done(), at which point all necessary subsystems will have been initialized. trx_purge_rec_t::undo_rec: Point to const. Reviewed by: Vladislav Lesin	2023-10-25 08:27:08 +03:00
Marko Mäkelä	ea42c4baac	MDEV-32050 preparation: Simplify ROLLBACK undo_node_t::state: Replaced with bool is_temp. row_undo_rec_get(): Do not copy the undo log record. The motivation of the copying was to not hold latches on the undo pages and therefore to avoid deadlocks due to lock order inversion a.k.a. latching order violation: It is not allowed to wait for an index page latch while holding an undo page latch, because MVCC reads would first acquire an index page latch and then an undo page latch. But, in rollback, we do not actually need any latch on our own undo pages. The transaction that is being rolled back is the exclusive owner of its undo log records. They cannot be overwritten by other threads until the rollback is complete. Therefore, a buffer fix will protect the undo log record just fine, by preventing page eviction. We still must initially acquire a shared latch on each undo page, to avoid a race condition like the one that was fixed in commit `b102872ad5`. row_undo_ins_parse_undo_rec(): The first two bytes of the undo log record now are the pointer to the next record within the page, not a length. Reviewed by: Vladislav Lesin	2023-10-25 08:26:34 +03:00
Marko Mäkelä	b78b77e77d	MDEV-32530 Race condition in lock_wait_rpl_report() After acquiring lock_sys.latch, always load trx->lock.wait_lock. It could have changed by another thread that did lock_rec_move() and released lock_sys.latch right before lock_sys.wr_lock_try() succeeded. This regression was introduced in commit `e039720bf3` (MDEV-32096). Reviewed by: Vladislav Lesin	2023-10-24 14:33:14 +03:00
Marko Mäkelä	b21f52ee73	Merge 10.5 into 10.6	2023-10-23 16:43:48 +03:00
Marko Mäkelä	b5e43a1d35	MDEV-32552 Write-ahead logging is broken for freed pages buf_page_free(): Flag the freed page as modified if it is found in the buffer pool. buf_flush_page(): If the page has been freed, ensure that the log for it has been durably written, before removing the page from buf_pool.flush_list. FindBlockX: Find also MTR_MEMO_PAGE_X_MODIFY in order to avoid an occasional failure of innodb.innodb_defrag_concurrent, which involves freeing and reallocating pages in the same mini-transaction. This fixes a regression that was introduced in commit `a35b4ae898` (MDEV-15528). This logic was tested by commenting out the $shutdown_timeout line from a test and running the following: ./mtr --rr innodb.scrub rr replay var/log/mysqld.1.rr/mariadbd-0 A breakpoint in the modified buf_flush_page() was hit, and the FIL_PAGE_LSN of that page had been last modified during the mtr_t::commit() of a mini-transaction where buf_page_free() had been executed on that page.	2023-10-23 16:13:16 +03:00
Oleksandr Byelkin	0a4103e6bb	new CC v3.3	2023-10-23 13:39:00 +02:00
Sergei Petrunia	4941ac9192	MDEV-32113: utf8mb3_key_col=utf8mb4_value cannot be used for ref (Variant#3: Allow cross-charset comparisons, use a special CHARSET_INFO to create lookup keys. Review input addressed.) Equalities that compare utf8mb{3,4}_general_ci strings, like: WHERE ... utf8mb3_key_col=utf8mb4_value (MB3-4-CMP) can now be used to construct ref[const] access and also participate in multiple-equalities. This means that utf8mb3_key_col can be used for key-lookups when compared with an utf8mb4 constant, field or expression using '=' or '<=>' comparison operators. This is controlled by optimizer_switch='cset_narrowing=on', which is OFF by default. IMPLEMENTATION Item value comparison in (MB3-4-CMP) is done using utf8mb4_general_ci. This is valid as any utf8mb3 value is also an utf8mb4 value. When making index lookup value for utf8mb3_key_col, we do "Charset Narrowing": characters that are in the Basic Multilingual Plane (=BMP) are copied as-is, as they can be represented in utf8mb3. Characters that are outside the BMP cannot be represented in utf8mb3 and are replaced with U+FFFD, the "Replacement Character". In utf8mb4_general_ci, the Replacement Character compares as equal to any character that's not in BMP. Because of this, the constructed lookup value will find all index records that would be considered equal by the original condition (MB3-4-CMP). Approved-by: Monty <monty@mariadb.org>	2023-10-19 17:24:30 +03:00
Monty	6a674c3142	MDEV-32476 LeakSanitizer errors in get_quick_select or Assertion ... Problem was that JOIN_TAB::cleanup() was not run because JOIN::top_join_tab_count was not set in case of early errors. Fixed by setting JOIN::tab_join_tab_count when JOIN_TAB's are allocated. Something that should eventually be fixed: - Cleaning up JOIN_TAB's is now done in 3 different loops. JOIN_TAB::cleanup() is only doing a partial cleanup. Other cleanups are done outside of JOIN_TAB::cleanup(). The above should be fixed so that JOIN_TAB::cleanup() is freeing everything related to it's own memory, including all its sub JOIN_ TAB's. JOIN::cleanup() should only loop over all it's top JOIN_TAB's and call JOIN_TAB::cleanup() on these. This will greatly simplify and speedup the current code (as we now do some cleanup's twice).	2023-10-19 16:17:01 +03:00
Monty	a1b6befc78	Fixed crash in is_stat_table() when using hash joins. Other usage if persistent statistics is checking 'stats_is_read' in caller, which is why this was not noticed earlier. Other things: - Simplified no_stat_values_provided	2023-10-19 16:17:01 +03:00
Marko Mäkelä	6991b1c47c	Merge 10.5 into 10.6	2023-10-19 13:50:00 +03:00
Thirunarayanan Balathandayuthapani	85751ed81d	MDEV-31851 After crash recovery, undo tablespace fails to open srv_all_undo_tablespaces_open(): While opening the extra unused undo tablespaces, InnoDB should use ULINT_UNDEFINED instead of SRV_SPACE_ID_UPPER_BOUND.	2023-10-19 15:39:44 +05:30
Thirunarayanan Balathandayuthapani	dbba1bb1c3	MDEV-31851 After crash recovery, undo tablespace fails to open recv_recovery_from_checkpoint_start(): InnoDB should add the redo log block header + trailer size while checking the log sequence number in log file with log sequence number in the system tablespace first page.	2023-10-19 13:12:10 +05:30
Marko Mäkelä	2d6dc65de5	MDEV-32144 fixup In commit `384eb570a6` the debug check was relaxed in trx_undo_header_create(), not in the intended function trx_undo_write_xid().	2023-10-19 08:24:37 +03:00
Marko Mäkelä	cfd1788182	MDEV-32511: Race condition between checkpoint and page write fil_aio_callback(): Invoke fil_node_t::complete_write() before releasing any page latch, so that in case a log checkpoint is executed roughly concurrently with the first write into a file since the previous checkpoint, we will not miss a fdatasync() or fsync() call to make the write durable.	2023-10-18 16:51:04 +03:00
Marko Mäkelä	bf7c6fc20b	MDEV-32511 Assertion !os_aio_pending_writes() failed In MemorySanitizer builds of 10.10 and 10.11, we would rather often have the assertion fail in innodb_init() during mariadb-backup --prepare. The assertion could also fail during InnoDB startup, but less often. Before commit `685d958e38` in 10.8 the log file cleanup after a successfully applied backup is different, and the os_aio_pending_writes() assertion is in srv0start.cc. IORequest::write_complete(): Invoke node->complete_write() before releasing the page latch, so that a log checkpoint that is about to execute concurrently will not miss a fdatasync() or fsync() on the file, in case this was the first write since the last such call. create_log_file(), srv_start(): Replace the debug assertion with a debug check. For all intents and purposes, all writes could have been completed but some write_io_callback() may not have invoked io_slots::release() yet.	2023-10-18 16:33:11 +03:00
Thirunarayanan Balathandayuthapani	3da5d047b8	MDEV-31851 After crash recovery, undo tablespace fails to open Problem: ======== - InnoDB fails to open undo tablespace when page0 is corrupted and fails to throw error. Solution: ========= - InnoDB throws DB_CORRUPTION error when InnoDB encounters page0 corruption of undo tablespace. - InnoDB restores the page0 of undo tablespace from doublewrite buffer if it encounters page corruption - Moved Datafile::restore_from_doublewrite() to recv_dblwr_t::restore_first_page(). So that undo tablespace and system tablespace can use this function instead of duplicating the code srv_undo_tablespace_open(): Returns 0 if file doesn't exist or ULINT_UNDEFINED if page0 is corrupted.	2023-10-17 18:41:21 +05:30
Thirunarayanan Balathandayuthapani	ee5cadd5c8	MDEV-28122 Optimize table crash while applying online log - InnoDB fails to check the overflow buffer while applying the operation to the table that was rebuilt. This is caused by commit `3cef4f8f0f` (MDEV-515).	2023-10-16 20:17:09 +05:30
Monty	cca9547892	Post fix for MDEV-32449	2023-10-16 12:55:17 +03:00
Monty	1c554459b3	MDEV-32449 Server crashes in Alter_info::add_stat_drop_index upon CREATE TABLE Fixed missing initialization of Alter_info() This could cause crashes in some create table like scenarios where some generated indexes where automatically dropped. I also added a test that we do not try to drop from index_stats for temporary tables.	2023-10-14 15:46:29 +03:00
Monty	ec277a70e8	Do not create histograms for single column unique key The intentention was always to not create histograms for single value unique keys (as histograms is not useful in this case), but because of a bug in the code this was still done. The changes in the test cases was mainly because hist_size is now NULL for these kind of columns.	2023-10-14 13:43:26 +03:00
Sergei Golubchik	ea0b1ccd41	Revert "MDEV-29091: Correct event_name in PFS for wait caused by FOR UPDATE" This reverts commit `03c9a4ef4a`. The fix is wrong. It was doing this: if the uninitialized wait->m_class has some specific value, then don't initialize it.	2023-10-14 11:03:00 +02:00
Sergei Golubchik	c378efeeb9	make perfschema.show_aggregate test more reliable	2023-10-13 18:13:12 +02:00
Vlad Lesin	18fa00a54c	MDEV-32272 lock_release_on_prepare_try() does not release lock if supremum bit is set along with other bits set in lock's bitmap The error is caused by MDEV-30165 fix with the following commit: `d13a57ae81` There is logical error in lock_release_on_prepare_try(): if (supremum_bit) lock_rec_unlock_supremum(*cell, lock); else lock_rec_dequeue_from_page(lock, false); Because there can be other bits set in the lock's bitmap, and the lock type can be suitable for releasing criteria, but the above logic releases only supremum bit of the lock. The fix is to release lock if it suits for releasing criteria and unlock supremum if supremum is locked otherwise. Tere is also the test for the case, which was reported by QA team. I placed it in a separate files, because it requires debug build. Reviewed by: Marko Mäkelä	2023-10-13 16:29:04 +03:00
Sergei Golubchik	e3e66a575b	make perfschema.show_aggregate test more debuggable	2023-10-13 14:51:43 +02:00
Thirunarayanan Balathandayuthapani	cbad0bcd41	MDEV-31098 InnoDB Recovery doesn't display encryption message when no encryption configuration passed - InnoDB fails to report the error when encryption configuration wasn't passed. This patch addresses the issue by adding the error while loading the tablespace and deferring the tablespace creation.	2023-10-13 17:27:27 +05:30
Daniel Black	fbd11d5f29	MDEV-18200 MariaBackup full backup failed with InnoDB: Failing assertion: success Review cleanups.	2023-10-13 09:48:57 +11:00
Daniel Black	c79ca7c7ad	MDEV-18200 MariaBackup full backup failed with InnoDB: Failing assertion: success There are many filesystem related errors that can occur with MariaBackup. These already outputed to stderr with a good description of the error. Many of these are permission or resource (file descriptor) limits where the assertion and resulting core crash doesn't offer developers anything more than the log message. To the user, assertions and core crashes come across as poor error handling. As such we return an error and handle this all the way up the stack.	2023-10-12 21:37:27 +11:00
Marko Mäkelä	f9d471e2d5	Cleanup: Remove innobase_init_vc_templ() This fixes up a merge of commit `4fb8f7d07a` with respect to commit `ea37b14409`.	2023-10-12 09:48:54 +03:00
Monty	8bf17c579b	MDEV-32388 MSAN / Valgrind errors in Item_func_like::get_mm_leaf upon query from partitioned table The problem was that RANGE_OPT_PARAM was not completely initialized in some cases. Added bzero() to ensure that all elements are always initialized.	2023-10-10 14:05:02 +03:00
Monty	55534a2616	Removed warning from ssl_cipher.test	2023-10-10 14:05:02 +03:00
Monty	b159f05a63	MDEV-31957 Concurrent ALTER and ANALYZE collecting statistics can result in stale statistical data Fixed hang when renaming index to original name	2023-10-10 11:12:26 +03:00
Monty	fdcb443e62	Remember first error in Dummy_error_handler Use Dummy_error_handler in open_stat_tables() to ignore all errors when opening statistics tables.	2023-10-10 11:12:26 +03:00
Monty	b04af64882	Fixed that log_slow.test works with view_protocol Part of the test did not work with view_protocol as the query written to the slow_log table is changed because of view_protocol.	2023-10-08 22:48:27 +03:00
Monty	1dd6d9a0bf	Fixed compiler warnings in connect/odbconn.cpp	2023-10-08 22:48:27 +03:00
Monty	9d19b65269	MDEV-22243 type_test.type_test_double fails with 'NUMERIC_SCALE NULL' There where several reasons why the test failed: - Constructors for Field_double and Field_float changed an argument to the constructor instead of a the correct class variable. - gcc 7.5.0 produced wrong code when inlining Field_double constructor into Field_test_double constructor. Fixed by changing the correct class variable and make the constructors not inline to go around the gcc bug.	2023-10-08 22:46:44 +03:00
Otto Kekalainen	8941bdc474	Fix merge commit `5ea5291`: No test file or result files should be executable In commit `5ea5291` @sanja-byelkin for unknown reason switched the file mode for 3 Galera tzinfo related test files from 644 -> 755. This exists only from branch 10.6 onward: $ git checkout 10.5 $ find mysql-test -executable -name .test -or -executable -name .result (no results) $ git checkout 10.6 $ find mysql-test -executable -name .test -or -executable -name .result mysql-test/suite/galera/t/mysql_tzmysql-test/suite/galera/t/mysql_tzinfo_to_sql.test mysql-test/suite/galera/t/mariadb_tzinfo_to_sql.test mysql-test/suite/galera/r/mariadb_tzinfo_to_sql.resultinfo_to_sql.test mysql-test/suite/galera/t/mariadb_tzinfo_to_sql.test mysql-test/suite/galera/r/mariadb_tzinfo_to_sql.result No test file nor test result file should be executable, so run chmod -x on them. All new code of the whole pull request, including one or several files that are either new files or modified ones, are contributed under the BSD-new license. I am contributing on behalf of my employer Amazon Web Services, Inc.	2023-10-09 03:28:45 +11:00
Monty	185591c1c0	MDEV-31349 test maria.maria-purge failed on 'aria_log.00000002 not found' The bug was in the test case. The problem was that maria_empty_logs.inc deleted aria log files before the server was properly shutdown. Fixed by waiting for pid file to disappear before starting to delete log files. Other things: - Fixed that translog_purge_at_flush() will not stop deleting files even if one file could not be deleted.	2023-10-08 18:16:20 +03:00
Monty	424a7a2620	Fixed randomly failing test main.order_by_optimizer_innodb The problem was that sometimes InnoDB returned sligtly wrong record count for table, which causes the optimizer to disregard the result from the range optimizer. The end result was that the optimizer choosed a ref access instead of a range access which caused errors in buildbot. Fixed by adding more rows to the table to ensure that table scan is more costly than range scan of the given interval.	2023-10-08 18:16:08 +03:00
Marko Mäkelä	625a150a86	Merge 10.5 into 10.6	2023-10-06 14:34:01 +03:00
Marko Mäkelä	6e9b421f77	MDEV-32364 Server crashes when starting server with high innodb_log_buffer_size log_t::create(): Return whether the initialisation succeeded. It may fail if too large an innodb_log_buffer_size is specified.	2023-10-06 14:16:01 +03:00
Marko Mäkelä	0e0a19b9f6	MDEV-32361 mariadb-backup --move-back leaves out ib_logfile0 copy_back(): Also copy the dummy empty ib_logfile0 so that MariaDB Server 10.8 or later can be started after --copy-back or --move-back. Thanks to Daniel Black for reporting this. This is a 10.5 version of commit `ebf3649259`	2023-10-06 12:58:52 +03:00
Marko Mäkelä	10a368d35a	Fix GCC 13.2.0 -Wmismatched-new-delete Table_cache_instance::operator new[](size_t): Reverted the changes that were made in commit `8edef482a7` and move them to the only caller.	2023-10-06 08:19:20 +03:00
Vladislav Vaintroub	9e62ab7aaf	MDEV-31095 tpool - do not create new worker, if thread creation is pending. Use an std::atomic_flag to track thread creation in progress. This is mainly a cleanup, the effect of this change was not measureable in my tests.	2023-10-04 17:44:13 +02:00
Vladislav Vaintroub	e33e2fa949	MDEV-31095 tpool - restrict threadpool concurrency during bufferpool load Add threadpool functionality to restrict concurrency during "batch" periods (where tasks are added in rapid succession). This will throttle thread creation more agressively than usual, while keeping performance at least on-par. One of these cases is bufferpool load, where async read IOs are executed without any throttling. There can be as much as 650K read IOs for loading 10GB buffer pool. Another one is recovery, where "fake read" IOs are executed. Why there are more threads than we expect? Worker threads are not be recognized as idle, until they return to the standby list, and to return to that list, they need to acquire mutex currently held in the submit_task(). In those cases, submit_task() has no worker to wake, and would create threads until default concurrency level (2*ncpus) is satisfied. Only after that throttling would happen.	2023-10-04 17:44:02 +02:00
Michael Widenius	9ba8dc1413	MDEV-32164 Server crashes in JOIN::cleanup after erroneous query with view The problem was that we did not handle errors properly in JOIN::get_best_combination. In case an early error, JOIN->join_tab would contain unintialized values, which would cause errors on cleanup(). The error in question was reported earlier, but not noticed until later. One cause of this is that most of the sql_select.cc code just checks thd->fatal_error and not thd->is_error(). Fixed by changing of checks of fatal_error to is_error().	2023-10-03 08:25:31 +03:00
Monty	d4347177c7	Change SEL_ARG::MAX_SEL_ARGS to a user defined variable optimizer_max_sel_args This allows a user to to change the default value of MAX_SEL_ARGS (16000) in the rare case where they neeed more generated SEL_ARGS (as part of the range optimizer)	2023-10-03 08:25:31 +03:00

1 2 3 4 5 ...

197229 commits