mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-01-16 03:52:35 +01:00

Author	SHA1	Message	Date
Sergei Golubchik	2efd9b17ba	mhnsw: cache start node too, don't push too much in pg_discard	2024-11-05 14:00:49 -08:00
Sergei Golubchik	687bfa7691	bugfix: properly reset db_plugin when hlindex discovery fails otherwise it'll be free'd twice	2024-11-05 14:00:49 -08:00
Sergei Golubchik	613542dceb	mhnsw: build indexes with the columns of exactly right size	2024-11-05 14:00:49 -08:00
Sergei Golubchik	1029d2f245	cleanups	2024-11-05 14:00:49 -08:00
Sergei Golubchik	c42ec1932f	mhnsw: remove EXTEND_CANDIDATES and KEEP_PRUNED_CONNECTIONS	2024-11-05 14:00:49 -08:00
Sergei Golubchik	a8fd1199e2	mhnsw: search intermediate layers with ef=1 also add missing candidates.empty();	2024-11-05 14:00:49 -08:00
Sergei Golubchik	42804ad5a5	mhnsw: fix the heuristic neighbor selection algorithm	2024-11-05 14:00:49 -08:00
Sergei Golubchik	ad94c8d714	mhnsw: don't prefix blob ref array with its length	2024-11-05 14:00:49 -08:00
Sergei Golubchik	c91cab3b6f	mhnsw: don't create many empty layers	2024-11-05 14:00:49 -08:00
Sergei Golubchik	058976533f	mhnsw: remove a redundant loop and ha_update_row	2024-11-05 14:00:49 -08:00
Sergei Golubchik	e0bdddad74	mhnsw: modify target's neighbors directly	2024-11-05 14:00:49 -08:00
Sergei Golubchik	27bfa21a58	mhnsw: cache neighbors too	2024-11-05 14:00:49 -08:00
Sergei Golubchik	b492025c6c	mhnsw: don't guess whether it's insert or update we know it every time	2024-11-05 14:00:49 -08:00
Sergei Golubchik	267092d4a1	mhnsw: refactor FVector* classes Now there's an FVector class which is a pure vector, an array of floats. It doesn't necessarily corresponds to a row in the table, and usually there is only one FVector instance - the one we're searching for. And there's an FVectorNode class, which is a node in the graph. It has a ref (identifying a row in the source table), possibly an array of floats (or not — in which case it will be read lazily from the source table as needed). There are many FVectorNodes and they're cached to avoid re-reading them from the disk.	2024-11-05 14:00:48 -08:00
Sergei Golubchik	10de659020	mhnsw: fix memory management move everything into a query-local memroot which is freed at the end	2024-11-05 14:00:48 -08:00
Sergei Golubchik	51c8cdcbb2	mhnsw: simplify memory management of returned results instead of pointers to FVectorRef's (which are stored elsewhere) let's return one big array of all refs. Freeing this array will free the complete result set.	2024-11-05 14:00:48 -08:00
Sergei Golubchik	3ff7f04fd4	misc changes * sysvars should be REQUIRED_ARG * fix a mix of US and UK spelling (use US) * use consistent naming * work if VEC_DISTANCE arguments are in the swapped order (const, col) * work if VEC_DISTANCE argument is NULL/invalid or wrong length * abort INSERT if the value is invalid or wrong length * store the "number of neighbors" in a blob in endianness-independent way * use field->store(longlong, bool) not field->store(double) * a lot more error checking everywhere * cleanup after errors * simplify calling conventions, remove reinterpret_cast's * todo/XXX comments * whitespaces * use float consistently memory management is still totally PoC quality	2024-11-05 14:00:48 -08:00
Vicențiu Ciorbaru	88839e71a3	Initial HNSW implementation This commit includes the work done in collaboration with Hugo Wen from Amazon: MDEV-33408 Alter HNSW graph storage and fix memory leak This commit changes the way HNSW graph information is stored in the second table. Instead of storing connections as separate records, it now stores neighbors for each node, leading to significant performance improvements and storage savings. Comparing with the previous approach, the insert speed is 5 times faster, search speed improves by 23%, and storage usage is reduced by 73%, based on ann-benchmark tests with random-xs-20-euclidean and random-s-100-euclidean datasets. Additionally, in previous code, vector objects were not released after use, resulting in excessive memory consumption (over 20GB for building the index with 90,000 records), preventing tests with large datasets. Now ensure that vectors are released appropriately during the insert and search functions. Note there are still some vectors that need to be cleaned up after search query completion. Needs to be addressed in a future commit. All new code of the whole pull request, including one or several files that are either new files or modified ones, are contributed under the BSD-new license. I am contributing on behalf of my employer Amazon Web Services, Inc. As well as the commit: Introduce session variables to manage HNSW index parameters Three variables: hnsw_max_connection_per_layer hnsw_ef_constructor hnsw_ef_search ann-benchmark tool is also updated to support these variables in commit https://github.com/HugoWenTD/ann-benchmarks/commit/e09784e for branch https://github.com/HugoWenTD/ann-benchmarks/tree/mariadb-configurable All new code of the whole pull request, including one or several files that are either new files or modified ones, are contributed under the BSD-new license. I am contributing on behalf of my employer Amazon Web Services, Inc. Co-authored-by: Hugo Wen <wenhug@amazon.com>	2024-11-05 14:00:48 -08:00
Vicențiu Ciorbaru	26e5654301	cleanup: simplify Queue<>, add const also add const to methods in List<> and Hash_set<> while we're at it	2024-11-05 14:00:48 -08:00
Sergei Golubchik	553815ea24	cleanup: C++11 range-based for loop for Hash_set<>	2024-11-05 14:00:48 -08:00
Sergei Golubchik	d6add9a03d	initial support for vector indexes MDEV-33407 Parser support for vector indexes The syntax is create table t1 (... vector index (v) ...); limitation: * v is a binary string and NOT NULL * only one vector index per table * temporary tables are not supported MDEV-33404 Engine-independent indexes: subtable method added support for so-called "high level indexes", they are not visible to the storage engine, implemented on the sql level. For every such an index in a table, say, t1, the server implicitly creates a second table named, like, t1#i#05 (where "05" is the index number in t1). This table has a fixed structure, no frm, not accessible directly, doesn't go into the table cache, needs no MDLs. MDEV-33406 basic optimizer support for k-NN searches for a query like SELECT ... ORDER BY func() optimizer will use item_func->part_of_sortkey() to decide what keys can be used to resolve ORDER BY.	2024-11-05 14:00:48 -08:00
Sergei Golubchik	9ccf02a9a7	MDEV-32885 VEC_DISTANCE() function	2024-11-05 14:00:48 -08:00
Sergei Golubchik	08a7f18b19	cleanup: init_tmp_table_share(bool thread_specific) let the caller tell init_tmp_table_share() whether the table should be thread_specific or not. In particular, internal tmp tables created in the slave thread are perfectly thread specific	2024-11-05 14:00:48 -08:00
Sergei Golubchik	44c6328cbb	cleanup: thd->alloc<>() and thd->calloc<>() create templates thd->alloc<X>(n) to use instead of (X)thd->alloc(sizeof(X)n) and the same for thd->calloc(). By the default the type is char, so old usage of thd->alloc(size) works too.	2024-11-05 14:00:48 -08:00
Sergei Golubchik	eff16d7593	Revert "MDEV-15458 Segfault in heap_scan() upon UPDATE after ADD SYSTEM VERSIONING" This partially reverts `43623f04a9` Engines have to set ::position() after ::write_row(), otherwise the server won't be able to refer to the row just inserted. This is important for high-level indexes. heap part isn't reverted, so heap doesn't support high-level indexes. to fix this, it'll need info->lastpos in addition to info->current_ptr	2024-11-05 14:00:48 -08:00
Sergei Golubchik	07ec1a9e37	cleanup: unused function argument	2024-11-05 14:00:48 -08:00
Sergei Golubchik	aa09cb3b11	open frm for DROP TABLE needed to get partitioning and information about secondary objects	2024-11-05 14:00:48 -08:00
Sergei Golubchik	c1b4f3a32c	cleanup: extract ha_create_table_from_share()	2024-11-05 14:00:48 -08:00
Sergei Golubchik	1fe8a1bb76	cleanup: generalize ER_INNODB_NO_FT_TEMP_TABLE	2024-11-05 14:00:48 -08:00
Sergei Golubchik	fd69abe44f	cleanup: generalize ER_SPATIAL_CANT_HAVE_NULL	2024-11-05 14:00:48 -08:00
Sergei Golubchik	356baeea4b	cleanup: make_long_hash_field_name() and add_hash_field()	2024-11-05 14:00:47 -08:00
Sergei Golubchik	062f8eb37d	cleanup: key algorithm vs key flags the information about index algorithm was stored in two places inconsistently split between both. BTREE index could have key->algorithm == HA_KEY_ALG_BTREE, if the user explicitly specified USING BTREE or HA_KEY_ALG_UNDEF, if not. RTREE index had key->algorithm == HA_KEY_ALG_RTREE and always had key->flags & HA_SPATIAL FULLTEXT index had key->algorithm == HA_KEY_ALG_FULLTEXT and always had key->flags & HA_FULLTEXT HASH index had key->algorithm == HA_KEY_ALG_HASH or HA_KEY_ALG_UNDEF long unique index always had key->algorithm == HA_KEY_ALG_LONG_HASH In this commit: All indexes except BTREE and HASH always have key->algorithm set, HA_SPATIAL and HA_FULLTEXT flags are not used anymore (except for storage to keep frms backward compatible). As a side effect ALTER TABLE now detects FULLTEXT index renames correctly	2024-11-05 14:00:47 -08:00
Sergei Golubchik	f5e9c4e9ef	cleanup: Queue and Bounded_queue Bounded_queue<> pretended to be a typesafe C++ wrapper on top of pure C queues.h. But it wasn't, it was tightly bounded to filesort and only useful there. * implement Queue<> - a typesafe C++ wrapper on top of QUEUE * move Bounded_queue to filesort.cc, remove pointless "generalizations" change it to use Queue. * remove bounded_queue.h * change subselect_rowid_merge_engine to use Queue, not QUEUE	2024-11-05 14:00:47 -08:00
Sergei Golubchik	ae59127158	cleanup: lex_string_set3()	2024-11-05 14:00:47 -08:00
Sergei Golubchik	32e6f8ff2e	cleanup: remove unconditional #ifdef's	2024-11-05 14:00:47 -08:00
Sergei Golubchik	570daecf49	cleanup: const in List::push_front()	2024-11-05 14:00:47 -08:00
Sergei Golubchik	44ff2f7831	reject invalid spatial key declarations in the parser	2024-11-05 14:00:47 -08:00
Sergei Golubchik	0cc01bde45	cleanup: pass TABLE_SHARE to store_key_options() preparation for indexes that can be in TABLE_SHARE but not in TABLE	2024-11-05 14:00:47 -08:00
Sergei Golubchik	949fed514a	cleanup: get_float convenience helper more helpers like that can be added as needed	2024-11-05 14:00:47 -08:00
Sergei Golubchik	115d3e050c	cleanup: engine_option_value::Value::find_in_list() helper	2024-11-05 14:00:47 -08:00
Sergei Golubchik	d046aca0c7	cleanup: CREATE_TYPELIB_FOR() helper	2024-11-05 14:00:47 -08:00
Sergei Golubchik	9fa31c1bd9	cleanup: spaces, casts, comments	2024-11-05 14:00:47 -08:00
Sergei Golubchik	9f2adffcca	memroot improvement: fix savepoint support savepoint support was added a year ago as get_last_memroot_block() and free_all_new_blocks(), but it didn't work and was disabled. * fix it to work * instead of freeing the memory, only mark blocks free - this feature is supposed to be used inside a loop (otherwise there is no need to free anything, end of statement will do it anyway). And freeing blocks inside a loop is a bad idea, as they'll be all malloc-ed on the next iteration again. So, don't. * fix a bug in mark_blocks_free() - it doesn't change the number of blocks	2024-11-05 14:00:47 -08:00
Sergei Golubchik	4f4c5a2ba9	fix a typo and an old bug in prefschema.transaction test	2024-11-05 14:00:47 -08:00
Sergei Golubchik	9ddac64188	make INFORMATION_SCHEMA.STATISTICS.COMMENT not nullable as it can never be null (only "" or "disabled")	2024-11-05 14:00:46 -08:00
Oleg Smirnov	a914087fab	MDEV-35307 Unexpected error WARN_SORTING_ON_TRUNCATED_LENGTH or assertion failure in diagnostics area #2 When strict mode is enabled, all warnings during `INSERT` are converted to errors regardless of their actual severity. `WARN_SORTING_ON_TRUNCATED_LENGTH` is not considered severe enough to be elevated to the ERROR level, and this commit fixes that	2024-11-05 14:52:20 +07:00
Alexander Barkov	a4cb03ec56	Adding a comment near the keyword NOCOPY	2024-11-05 09:14:06 +04:00
Monty	40810baffe	MDEV-33144 Implement the Percona variable slow_query_log_always_write_time This task is inspired by the Percona implementation of slow_query_log_always_write_time. This task implements the variable log_slow_always_query_time (name matching other MariaDB variables using the slow query log). The default value for the variable is 31536000, which makes MariaDB compatible with older installations. For queries with execution time longer than log_slow_always_query_time the variables log_slow_rate_limit and log_slow_min_examined_row_limit will be ignored and the query will be written to the slow query log if there is no other limitations (like log_slow_filter etc). Other things: - long_query_time internal variable renamed to log_slow_query_time. - More descriptive information for "log_slow_query_time".	2024-11-01 08:58:37 +01:00
Oleg Smirnov	bf9662f6fa	MDEV-35275 Unexpected WARN_SORTING_ON_TRUNCATED_LENGTH or assertion failure in diagnostics area MDEV-27277 added warnings on truncation during sorting for SELECTs but did not for DML operations. However, UPDATEs and DELETEs may also perform sorting and thus produce warnings. This commit fixes that	2024-10-30 18:47:11 +07:00
Alexander Barkov	556a40dce0	MDEV-35229 NOCOPY has become reserved word bringing wide incompatibility This patch was suggested by Sergei Golubchik. It reverts the second patch from the PR: commit `fa5eeb4931` Fixed ALTER TABLE NOCOPY keyword failure and adds NOCOPY_SYM into keyword_func_sp_var_and_label. The price is one extra shift/recuce conflict in yy_oracle.yy. This should to tolerable.	2024-10-30 13:58:20 +04:00
Alexander Barkov	8c0a260a5b	MDEV-35250 Assertion `dec <= 6' failed in my_timestamp_binary_length The TIMESTAMP related code did not handle AUTO_SEC_PART_DIGITS. FROM_UNIXTIME() sets its member 'decimals' to AUTO_SEC_PART_DIGITS. So some scripts involving FROM_UNIXTIME() crashed on assert in debug builds and returned unexpected results in release builds.	2024-10-30 11:09:21 +04:00
Aleksey Midenkov	cc183489da	MDEV-27293 Allow converting a versioned table from implicit to explicit row_start/row_end columns In case of adding both system fields of same type (length, unsigned flag) as old implicit system fields do the rename of implicit system fields to the ones specified in ALTER, remove SYSTEM_INVISIBLE flag in that case. Correct PERIOD clause must be specified in ALTER as well. MDEV-34904 Inplace alter for implicit to explicit versioning is broken Whether ALTER goes inplace and how it goes inplace depends on handler_flags which goes from alter_info->flags by this logic: ha_alter_info->handler_flags\|= (alter_info->flags & ~flags_to_remove); ALTER_VERS_EXPLICIT was not in flags_to_remove and its value (1ULL << 35) clashed with ALTER_ADD_NON_UNIQUE_NON_PRIM_INDEX. ALTER_VERS_EXPLICIT must not affect inplace, it is SQL-only so we remove it from handler_flags.	2024-10-29 17:46:40 +03:00
Teemu Ollakka	47dd617c7f	MDEV-35265 wsrep.wsrep-recover, wsrep.wsrep-recover-v25 fail on assertion The tests fail on assertion ut_ad(!wsrep_is_wsrep_xid(&trx->xid)); in `innobase_recover_rollback_by_xid()`. The fix is to avoid async rollback for prepared transactions when wsrep is ON or wsrep recovery is in progress. The rationale is that the rollback of prepared transactions must complete before the node starts applying write sets after SST, or in case of wsrep recovery, the recovery must complete before the process exists. Change the assertion into stronger one ut_ad(!(WSREP_ON \|\| wsrep_recovery)); to catch if the async rollback codepath is taken when wsrep is enabled.	2024-10-29 12:15:53 +02:00
Yuchen Pei	4b6922a315	MDEV-25008: UPDATE/DELETE: Cost-based choice IN->EXISTS vs Materialization Single-table UPDATE/DELETE didn't provide outer_lookup_keys value for subqueries. This didn't allow to make a meaningful choice between IN->EXISTS and Materialization strategies for subqueries. Fix this: * Make UPDATE/DELETE save Sql_cmd_dml::scanned_rows, * Then, subquery's JOIN::choose_subquery_plan() can fetch it from there for outer_lookup_keys Details: UPDATE/DELETE now calls select_lex->optimize_unflattened_subqueries() twice, like SELECT does (first call optimize_constant_subquries() in JOIN::optimize_inner(), then call optimize_unflattened_subqueries() in JOIN::optimize_stage2()): 1. Call with const_only=true before any optimizations. This allows range optimizer and others to use the values of cheap const subqueries. 2. Call it with const_only=false after range optimizer, partition pruning, etc. outer_lookup_keys value is provided, so it's possible to pick a good subquery strategy. Note: PROTECT_STATEMENT_MEMROOT requires that first SP execution performs subquery optimization for all subqueries, even for degenerate query plans like "Impossible WHERE". Due to that, we ensure that the call to optimize_unflattened_subqueries (with const_only=false) even for degenerate query plans still happens, as was the case before this change.	2024-10-23 23:51:24 +11:00
Oleg Smirnov	fd87e01f38	MDEV-27277 Add a warning when max_sort_length is reached During a query execution some sorting and grouping operations on strings may be involved. System variable max_sort_length defines the maximum number of bytes to use when comparing strings during sorting/grouping. Thus, the comparable parts of strings may be less than their actual size, so the results of the query may be not sorted/grouped properly. To indicate that some comparisons were done on a truncated lengths, a new warning has been introduced with this commit.	2024-10-22 22:39:36 +07:00
Alexander Barkov	0d17c540a5	MDEV-27277 Add a warning when max_sort_length is reached Step#1: fixing the return type of strnxfrm() from size_t to this structure: typedef struct { size_t m_output_length; size_t m_source_length_used; uint m_warnings; } my_strnxfrm_ret_t;	2024-10-22 21:42:53 +07:00
Alexander Barkov	e1cd3c4033	MDEV-12252 ROW data type for stored function return values Adding support for the ROW data type in the stored function RETURNS clause: - explicit ROW(..members...) for both sql_mode=DEFAULT and sql_mode=ORACLE CREATE FUNCTION f1() RETURNS ROW(a INT, b VARCHAR(32)) ... - anchored "ROW TYPE OF [db1.]table1" declarations for sql_mode=DEFAULT CREATE FUNCTION f1() RETURNS ROW TYPE OF test.t1 ... - anchored "[db1.]table1%ROWTYPE" declarations for sql_mode=ORACLE CREATE FUNCTION f1() RETURN test.t1%ROWTYPE ... Adding support for anchored scalar data types in RETURNS clause: - "TYPE OF [db1.]table1.column1" for sql_mode=DEFAULT CREATE FUNCTION f1() RETURNS TYPE OF test.t1.column1; - "[db1.]table1.column1" for sql_mode=ORACLE CREATE FUNCTION f1() RETURN test.t1.column1%TYPE; Details: - Adding a new sql_mode_t parameter to sp_head::create() sp_head::sp_head() sp_package::create() sp_package::sp_package() to guarantee early initialization of sp_head::m_sql_mode. Before this change, this member was not initialized at all during CREATE FUNCTION/PROCEDURE/PACKAGE statements, and was not used. Now it needs to be initialized to write properly the mysql.proc.returns column, according to the create time sql_mode. - Code refactoring to make the things simpler and functions smaller: * Adding a new method Field_row::row_create_fields(THD thd, List<Spvar_definition> list) to make a Virtual_tmp_table with Fields for ROW members from an explicit definition. * Adding a new method Field_row::row_create_fields(THD thd, const Spvar_definition &def) to make a Virtual_tmp_table with Fields for ROW members from an explicit or a table anchored definition. Adding a new method Item_args::add_array_of_item_field(THD thd, const Virtual_tmp_table &vtable) to create and array of Item_field corresponding to all Field instances in a Virtual_tmp_table Removing Item_field_row::row_create_items(). It was decomposed into the new methods described above. * Moving the code from the loop body in sp_rcontext::init_var_items() into a separate method Spvar_definition::make_item_field_row(), to make the code clearer (smaller functions). make_item_field_row() itself uses the new methods described above. - Changing the data type of sp_head::m_return_field_def from Column_definition to Spvar_definition. So now it supports not only SQL column field types, but also explicit ROW and anchored ROW data types, as well as anchored column types. - Adding a new Column_definition parameter to sp_head::create_result_field(). Before this patch, create_result_field() took the definition only from m_return_field_def. Now it's also called with a local Column_definition variable which contains the explicit definition resolved from an anchored defition. - Modifying sql_yacc.yy to support the new grammar. Adding new helper methods: * sf_return_fill_definition_row() * sf_return_fill_definition_rowtype_of() * sf_return_fill_definition_type_of() - Fixing tests in: * Virtual_tmp_table::setup_field_pointers() in sql_select.cc * Send_field::normalize() in field.h * store_column_type() to prevent calling Type_handler_row::field_type(), which is implemented a DBUG_ASSERT(0). Before this patch the affected methods and functions were called only for scalar data types. Now ROW is also possible. - Adding a new virtual method Field::cols() - Overriding methods: Item_func_sp::cols() Item_func_sp::element_index() Item_func_sp::check_cols() Item_func_sp::bring_value() to support the ROW data type. - Extending the rule sp_return_type to support * explicit ROW and anchored ROW data types * anchored scalar data types - Overriding Field_row::sql_type() to print the data type of an explicit ROW.	2024-10-21 07:59:29 +04:00
Alexander Barkov	dfaf7e2eb4	MDEV-15751 CURRENT_TIMESTAMP should return a TIMESTAMP [WITH TIME ZONE?] Changing the return type of the following functions: - CURRENT_TIMESTAMP, CURRENT_TIMESTAMP(), NOW() - SYSDATE() - FROM_UNIXTIME() from DATETIME to TIMESTAMP. Note, the old function NOW() returning DATETIME is still available as LOCALTIMESTAMP or LOCALTIMESTAMP(), e.g.: SELECT LOCALTIMESTAMP, -- DATETIME CURRENT_TIMESTAMP; -- TIMESTAMP The change in the functions return data type fixes some problems that occurred near a DST change: - Problem #1 INSERT INTO t1 (timestamp_field) VALUES (CURRENT_TIMESTAMP); INSERT INTO t1 (timestamp_field) VALUES (COALESCE(CURRENT_TIMESTAMP)); could result into two different values inserted. - Problem #2 INSERT INTO t1 (timestamp_field) VALUES (FROM_UNIXTIME(1288477526)); INSERT INTO t1 (timestamp_field) VALUES (FROM_UNIXTIME(1288477526+3600)); could result into two equal TIMESTAMP values near a DST change. Additional changes: - FROM_UNIXTIME(0) now returns SQL NULL instead of '1970-01-01 00:00:00' (assuming time_zone='+00:00') - UNIX_TIMESTAMP('1970-01-01 00:00:00') now returns SQL NULL instead of 0 (assuming time_zone='+00:00' These additional changes are needed for consistency with TIMESTAMP fields, which cannot store '1970-01-01 00:00:00 +00:00'	2024-10-19 22:48:23 +02:00
Brandon Nesterenko	a812dba6dc	MDEV-20153: Slave error message incorrectly mentions server_uuid Correct an error message thrown by the slave to remove the descriptor "slave_uuid", as it was never ported to MariaDB.	2024-10-17 15:53:00 -06:00
Brandon Nesterenko	7a7c338a0b	MDEV-34930: MDEV-32014 Galera and SST/no binlog fixes 1. Binlog commit by rotate (MDEV-32014) should not be used with Galera, yet while WSREP binlog emulation is active, the code path could lead into binlog_cache_data::write_prepare() in an invalid state, leading to errors in MTR. To fix, an extra check is added to ensure the binlog is actually active before calling write_prepare(). 2. If the #binlog_cache_files directory exists on a mariadbd run without opt_log_bin, the directory was treated as a table/database, leading to errors. To fix, on startup, if opt_log_bin is disabled and #binlog_cache_files exists (in the default log directory), the directory is deleted (and an informational message is provided in the error log) Reviewed By: ============ Andrei Elkin <andrei.elkin@mariadb.com>	2024-10-17 07:53:59 -06:00
Brandon Nesterenko	3ebe317f9b	MDEV-32014: Reduce min val of large_commit_threshold for debug builds To help in the testing of MDEV-32014, allow debug_builds to set a lower value for binlog_large_commit_threshold	2024-10-17 07:53:59 -06:00
Libing Song	72cc58bb71	MDEV-32014 Rename binlog cache temporary file to binlog file for large transaction Description =========== When a transaction commits, it copies the binlog events from binlog cache to binlog file. Very large transactions (eg. gigabytes) can stall other transactions for a long time because the data is copied while holding LOCK_log, which blocks other commits from binlogging. The solution in this patch is to rename the binlog cache file to a binlog file instead of copy, if the commiting transaction has large binlog cache. Rename is a very fast operation, it doesn't block other transactions a long time. Design ====== * binlog_large_commit_threshold type: ulonglong scope: global dynamic: yes default: 128MB Only the binlog cache temporary files large than 128MB are renamed to binlog file. * #binlog_cache_files directory To support rename, all binlog cache temporary files are managed as normal files now. `#binlog_cache_files` directory is in the same directory with binlog files. It is created at server startup if it doesn't exist. Otherwise, all files in the directory is deleted at startup. The temporary files are named with ML_ prefix and the memorary address of the binlog_cache_data object which guarantees it is unique. * Reserve space To supprot rename feature, It must reserve enough space at the begin of the binlog cache file. The space is required for Format description, Gtid list, checkpoint and Gtid events when renaming it to a binlog file. Since binlog_cache_data's cache_log is directly accessed by binlog log, online alter and wsrep. It is not easy to update all the code. Thus binlog cache will not reserve space if it is not session binlog cache or wsrep session is enabled. - m_file_reserved_bytes Stores the bytes reserved at the begin of the cache file. It is initialized in write_prepare() and cleared by reset(). The reserved file header is hide to callers. Thus there is no change for callers. E.g. - get_byte_position() still get the length of binlog data written to the cache, but not the file length. - truncate(0) will truncate the file to m_file_reserved_bytes but not 0. - write_prepare() write_prepare() is called everytime when anything is being written into the cache. It will call init_file_reserved_bytes() to create the cache file (if it doesn't exist) and reserve suitable space if the data written exceeds buffer's size. * Binlog_commit_by_rotate It is used to encapsulate the code for remaing a binlog cache tempoary file to binlog file. - should_commit_by_rotate() it is called by write_transaction_to_binlog_events() to check if a binlog cache should be rename to a binlog file. - commit() That is the entry to rename a binlog cache and commit the transaction. Both rename and commit are protected by LOCK_log, Thus not other transactions can write anything into the renamed binlog before it. Rename happens in a rotation. After the new binlog file is generated, replace_binlog_file() is called to: - copy data from the new binlog file to its binlog cache file. - write gtid event. - rename the binlog cache file to binlog file. After that the rotation will continue to succeed. Then the transaction is committed in a seperated group itself. Its cache file will be detached and cache log will be reset before calling trx_group_commit_with_engines(). Thus only Xid event be written.	2024-10-17 07:53:59 -06:00
Yuchen Pei	35cebfdc51	MDEV-15696 Implement SHOW CREATE SERVER One change is that if the port is not supplied or out of bound, the old behaviour is to print 3306. The new behaviour is to not print it (if not supplied) or the out of bound value.	2024-10-15 10:50:23 +11:00
Yuchen Pei	d2eba35653	MDEV-34716 Allow arbitrary options in CREATE SERVER The existing syntax for CREATE SERVER CREATE [OR REPLACE] SERVER [IF NOT EXISTS] server_name FOREIGN DATA WRAPPER wrapper_name OPTIONS (option [, option] ...) option: { HOST character-literal \| DATABASE character-literal \| USER character-literal \| PASSWORD character-literal \| SOCKET character-literal \| OWNER character-literal \| PORT numeric-literal } With this change we have: option: { HOST character-literal \| DATABASE character-literal \| USER character-literal \| PASSWORD character-literal \| SOCKET character-literal \| OWNER character-literal \| PORT numeric-literal \| PORT quoted-numerical-literal \| identifier character-literal} We store these options as a JSON field in the mysql.servers system table. We retain the restriction that PORT needs to be a number, but also allow it to be a quoted number, so that SHOW CREATE SERVER can be used for dumping. Without an accompanied implementation of SHOW CREATE SERVER, some mysqldump tests will fail. Therefore this commit should be immediately followed by the one implementating SHOW CREATE SERVER, with testing covering both.	2024-10-15 10:50:22 +11:00
Rex	9315452ea0	MDEV-34941 MDEV-31466-fix column count issue with union in derived table In specifying a derived table with a union, for example CREATE TABLE t (c1 INT KEY,c2 INT,c3 INT) ENGINE=MyISAM; SELECT * FROM (SELECT * FROM t UNION SELECT * FROM t) AS d (d1,d2); we bypass an earlier check for the correct number of specified column names, causing a crash. Fixed by adding a check for the correct number of supplied arguments in st_select_lex_unit::rename_types_list()	2024-10-15 06:50:19 +12:00
Rex	e90aab7acc	MDEV-34931 MDEV-31466 name resolution fails in --view Fix for MDEV-31466 - add optional derived table column names. Column names within a SELECT_LEX structure can be left in a non-reparsable state (as printed out from *::print) after JOIN::prepare. This caused an incorrect view definition to be written into the .FRM file. Fixed by resetting item list names in SELECT_LEX structures representing derived tables before writing out the view definition. Reviewed by Igor Babaev (igor@mariadb.com)	2024-10-15 06:08:46 +12:00
Rex	10008b3d3e	MDEV-31466 Add optional correlation column list for derived tables Extend derived table syntax to support column name assignment. (subquery expression) [as\|=] ident [comma separated column name list]. Prior to this patch, the optional comma separated column name list is not supported. Processing within the unit of the subquery expression will use original column names, outside the unit will use the new names. For example, in the query select a1, a2 from (select c1, c2, c3 from t1 where c2 > 0) as dt (a1, a2, a3) where a2 > 10; we see the second column of the derived table dt being used both within, (where c2 > 0), and outside, (where a2 > 10), the specification. Both conditions apply to t1.c2. When multiple unit preparations are required, such as when being used within a prepared statement or procedure, original column names are needed for correct resolution. Original names are reset within mysql_derived_reinit(). Item_holder items, used for result tables in both TVC and union preparations are renamed before use within st_select_lex_unit::prepare(). During wildcard expansion, if column names are present, items names are set directly after creation. Reviewed by Igor Babaev (igor@mariadb.com)	2024-10-15 06:08:46 +12:00
Christian Gonzalez	fd0cc2b1fd	Make SESSION_USER() comparable with CURRENT_USER() Update `SESSION_USER()` behaviour to be comparable with `CURRENT_USER()`. `SESSION_USER()` will return the user and host columns from `mysql.user` used to authenticate the user when the session was created. Historically `SESSION_USER()` was an alias of `USER()` function. The main difference with `USER()` behaviour after this changes is that `SESSION_USER()` now returns the host column from `mysql.user` instead of the client host or ip. NOTE: `SESSION_USER_IS_USER` old mode is added to make the change backward compatible. All new code of the whole pull request, including one or several files that are either new files or modified ones, are contributed under the BSD-new license. I am contributing on behalf of my employer Amazon Web Services, Inc.	2024-10-04 13:22:40 +02:00
sts-kokseng.wong	383d1f90dd	The revision corresponds to the review comments. 1. Move the unit tests into the compat/oracle suite, sp-param.test file. 2. Remove the added unit test file and result file. 3. Add type, Alter_info::enum_alter_table_algorithm, into the union. 4. Remove the extra switch case	2024-10-04 00:17:37 +02:00
sts-kokseng.wong	fa5eeb4931	Fixed ALTER TABLE NOCOPY keyword failure	2024-10-04 00:17:37 +02:00
sts-kokseng.wong	43825af101	MDEV-34316 sql_mode=ORACLE: Ignore the NOCOPY keyword in stored routine parameters During sql_mode=ORACLE, ignore the NOCOPY keyword in stored routine parameters. The optimization (pass-by-reference instead of pass-by-value) helping to avoid value copying will be done in a separate task when needed.	2024-10-04 00:17:37 +02:00
Marko Mäkelä	f493e46494	Merge 11.6 into 11.7	2024-10-03 18:15:13 +03:00
Marko Mäkelä	43465352b9	Merge 11.4 into 11.6	2024-10-03 16:09:56 +03:00
Marko Mäkelä	b53b81e937	Merge 11.2 into 11.4	2024-10-03 14:32:14 +03:00
Marko Mäkelä	12a91b57e2	Merge 10.11 into 11.2	2024-10-03 13:24:43 +03:00
Marko Mäkelä	63913ce5af	Merge 10.6 into 10.11	2024-10-03 10:55:08 +03:00
Marko Mäkelä	7e0afb1c73	Merge 10.5 into 10.6	2024-10-03 09:31:39 +03:00
Yuchen Pei	ba7088d462	Merge '11.4' into 11.6	2024-10-03 15:59:20 +10:00
Sergei Petrunia	1cda4726ca	MDEV-34993, part2: backport optimizer_adjust_secondary_key_costs ...and make the fix for MDEV-34993 switchable. It is enabled by default and controlled with @optimizer_adjust_secondary_key_costs=fix_card_multiplier	2024-10-02 10:52:09 +03:00
Sergei Petrunia	8166a5d33d	MDEV-34993: Incorrect cardinality estimation causes poor query plan When calculate_cond_selectivity_for_table() takes into account multi- column selectivities from range access, it tries to take-into account that selectivity for some columns may have been already taken into account. For example, for range access on IDX1 using {kp1, kp2}, the selectivity of restrictions on "kp2" might have already been taken into account to some extent. So, the code tries to "discount" that using rec_per_key[] estimates. This seems to be wrong and unreliable: the "discounting" may produce a rselectivity_multiplier number that hints that the overall selectivity of range access on IDX1 was greater than 1. Do a conservative fix: if we arrive at conclusion that selectivity of range access on condition in IDX1 >1.0, clip it down to 1.	2024-10-02 10:52:09 +03:00
Sergei Golubchik	9021f40b8e	MDEV-35050 Found wrong usage of mutex upon setting plugin session variables	2024-10-01 18:29:11 +02:00
Sergei Golubchik	b1bbdbab9e	cleanup: remove redundant if() likely, a result of auto-merge of two fixes in different versions	2024-10-01 18:29:11 +02:00
Sergei Golubchik	5bf543fd43	MDEV-24193 UBSAN: sql/sql_acl.cc:9985:29: runtime error: member access within null pointer of type 'struct TABLE' , ASAN: use-after-poison in handle_grant_table privilege tables do not always have to exist	2024-10-01 18:29:11 +02:00
Sergei Golubchik	2cdcfb644c	MDEV-26314 ST_EQUALS listed twice in information_schema.SQL_FUNCTIONS	2024-10-01 18:29:11 +02:00
Oleksandr Byelkin	8d810e9426	MDEV-29537 Creation of view with UNION and SELECT ... FOR UPDATE in definition is failed with error lock_type is writen in the last SELECT of the unit even if it parsed last, so it should be printed last from the last select of the unit.	2024-10-01 11:07:45 +02:00
ParadoxV5	d8d80bd503	Fix a couple of `my_snprintf` arg mismatches This commit backports two fixes from #3360’s vast discovery.	2024-10-01 09:53:13 +01:00
Rucha Deodhar	753e7d6d7c	MDEV-27412: JSON_TABLE doesn't properly unquote strings Analysis: The value gets appended as string instead of unescaped json value Fix: Append the value of json in a temporary string and then store it in the field instead of directly storing as string.	2024-10-01 13:45:46 +05:30
Thirunarayanan Balathandayuthapani	cc810e64d4	MDEV-34392 Inplace algorithm violates the foreign key constraint Don't allow the referencing key column from NULL TO NOT NULL when 1) Foreign key constraint type is ON UPDATE SET NULL 2) Foreign key constraint type is ON DELETE SET NULL 3) Foreign key constraint type is UPDATE CASCADE and referenced column declared as NULL Don't allow the referenced key column from NOT NULL to NULL when foreign key constraint type is UPDATE CASCADE and referencing key columns doesn't allow NULL values get_foreign_key_info(): InnoDB sends the information about nullability of the foreign key fields and referenced key fields. fk_check_column_changes(): Enforce the above rules for COPY algorithm innobase_check_foreign_drop_col(): Checks whether the dropped column exists in existing foreign key relation innobase_check_foreign_low() : Enforce the above rules for INPLACE algorithm dict_foreign_t::check_fk_constraint_valid(): This is used by CREATE TABLE statement to check nullability for foreign key relation.	2024-10-01 09:41:56 +05:30
Meng-Hsiu Chiang	21b20712a3	Replace using of internal fmt lib API with public API The commit `cd5808eb` introduced a union as a storage for the format argument passed to the internal API fmt::detail::make_arg. This was done to solve the issue that the internal API no longer accepted temporary variables. However, it's generally better to avoid using internal APIs, as they are more likely to have breaking changes in the future. Instead, we can use the public API fmt::dynamic_format_arg_store to dynamically build the argument list. This API accepts temporary variables, and its behavior is more stable than the internal API. `libfmt.cmake` is updated to reflect the change as well. All new code of the whole pull request, including one or several files that are either new files or modified ones, are contributed under the BSD-new license. I am contributing on behalf of my employer Amazon Web Services, Inc.	2024-09-30 16:25:13 +01:00
Max Kellermann	45298b730b	sql/handler: referenced_by_foreign_key() returns bool The method was declared to return an unsigned integer, but it is really a boolean (and used as such by all callers). A secondary change is the addition of "const" and "noexcept" to this method. In ha_mroonga.cpp, I also added "inline" to the two helper methods of referenced_by_foreign_key(). This allows the compiler to flatten the method.	2024-09-30 16:33:25 +03:00
Oleksandr Byelkin	da322f19c6	MDEV-26459 Assertion `block_size <= 0xFFFFFFFFL' failed in calculate_block_sizes for 10.7 only Limit default allocation block in tree of Unique class	2024-09-30 15:18:00 +02:00
Oleksandr Byelkin	20f57a8529	MDEV-33373 part 1: Unexpected ER_FILE_NOT_FOUND upon reading from logging table after crash recovery We have found that my_errno can be "passed" to the next commad in some cases. It is practically impossible to check/fix all cases of my_errno in the server, plugins and engines so we will reset it as we reset other errors. The test case will be fixed by CSV engine fix so will be added with it (see part2).	2024-09-30 12:53:07 +02:00
Yuchen Pei	b7bca3ff71	MDEV-34518 Initialise THD::max_tmp_space_used	2024-09-30 16:33:15 +10:00
sjaakola	cf0c3ec274	MDEV-30307 KILL command inside a transaction causes problem for galera replication Added new test scenario in galera.galera_bf_kill test to make the issue surface. The tetst scenario has a multi statement transaction containing a KILL command. When the KILL is submitted, another transaction is replicated, which causes BF abort for the KILL command processing. Handling BF abort rollback while executing KILL command causes node hanging, in this scenario. sql_kill() and sql_kill_user() functions have now fix, to perform implicit commit before starting the KILL command execution. BEcause of the implicit commit, the KILL execution will not happen inside transaction context anymore. Signed-off-by: Julius Goryavsky <julius.goryavsky@mariadb.com>	2024-09-27 19:26:26 +02:00
Kristian Nielsen	35c732cdde	MDEV-25611: RESET MASTER causes the server to hang RESET MASTER waits for storage engines to reply to a binlog checkpoint requests. If this response is delayed for a long time for some reason, then RESET MASTER can hang. Fix this by forcing a log sync in all engines just before waiting for the checkpoint reply. (Waiting for old checkpoint responses is needed to preserve durability of any commits that were synced to disk in the to-be-deleted binlog but not yet synced in the engine.) Reviewed-by: Andrei Elkin <andrei.elkin@mariadb.com> Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>	2024-09-27 15:10:06 +02:00
Sergei Petrunia	ce272f7c29	Remove unused Table_function_json_table::m_text_literal_cs	2024-09-27 13:52:17 +03:00
Sergei Petrunia	d0a6a7886b	MDEV-25822 JSON_TABLE: default values should allow non-string literals (Polished initial patch by Alexey Botchkov) Make the code handle DEFAULT values of any datatype - Make Json_table_column::On_response::m_default be Item*, not LEX_STRING. - Change the parser to use string literal non-terminals for producing the DEFAULT value -- Also, stop updating json_table->m_text_literal_cs for the DEFAULT value literals as it is not used.	2024-09-27 13:52:17 +03:00
Denis Protivensky	9f61aa4f8a	MDEV-34822 pre-fix: Make wsrep_ready flag read lock-free It's read for every command execution, and during slave replication for every applied event. It's also planned to be used during write set applying, so it means mostly every server thread is going to compete for the mutex covering this variable, especially considering how rarely it changes. Converting wsrep_ready to atomic relaxes the things. Signed-off-by: Julius Goryavsky <julius.goryavsky@mariadb.com>	2024-09-26 00:04:56 +02:00
Teemu Ollakka	b2429e2025	MDEV-34976 Server crash report broken if Galera is not loaded The crash report terminates prematurely when Galera library was not loaded. As a fix, check whether the provider is loaded before shutting down Galera connections. Signed-off-by: Julius Goryavsky <julius.goryavsky@mariadb.com>	2024-09-26 00:04:56 +02:00
Dave Gosselin	f7c5182b7c	MDEV-31636 Memory leak in Sys_var_gtid_binlog_state::do_check() Move memory allocations performed during Sys_var_gtid_binlog_state::do_check to Sys_var_gtid_binlog_state::global_update where they will be freed before the latter method returns.	2024-09-25 14:17:21 -04:00

1 2 3 4 5 ...

76116 commits