mariadb

mirror of https://github.com/MariaDB/server.git synced 2025-04-03 13:55:38 +02:00

Author	SHA1	Message	Date
Rucha Deodhar	95a9078efc	MDEV-28071: JSON_EXISTS returns always 1 if it is used range notation for json path Analysis: When searching for the given path in json string, if the current step is of array range type, then path was considered reached which meant path exists. So output was always true. The end indexes of range were not evaluated. Fix: If the current step type for a path is array range, then check if the value array_counter[] is in range of n_item and n_item_end. If it is, then path exists. Only then return true. If the range criteria is never met then return false.	2022-04-15 01:04:25 +05:30
Rucha Deodhar	e6511a39f8	vcol.wrong_arena failing on buildbot when current date is '2022-03-17' Analysis: When current date is '2022-03-17', dayname() gives 'Thursday'. The previous json state is PS_KEYX which means key started with quote. So now json parser for path is supposed to parse the key. The keyname starts with 'T'. But the path transition table has JE_SYN when previous state is PS_KEYX and next letter is 'T'. So it gives error. Fix: We want to continue parsing the quoted keyname. So JE_SYN is incorrect. Replaced it with PS_KNMX.	2022-04-15 01:02:44 +05:30
Rucha Deodhar	c781cefd8a	MDEV-27911: Implement range notation for json path Range can be thought about in similar manner as wildcard (*) where more than one element is processed. To implement range notation, extended json parser to parse the 'to' keyword and added JSON_PATH_ARRAY_RANGE for path type. If there is 'to' keyword then use JSON_PATH_ARRAY_RANGE for path type along with existing type. The new integer that stores the end index of the range is n_item_end. When there is 'to' keyword, store the integer in n_item_end else store in n_item.	2022-04-15 01:02:44 +05:30
Rucha Deodhar	abe9712194	MDEV-27972: Unexpected behavior with negative zero (-0) in JSON Path Analysis: When we have '-' followed by 0, then the state is changed to JE_SYN, meaning syntax error. Fix: Change the state to PS_INT instead, because we are reading '0' next (integer) and it is not a syntax error.	2022-04-13 21:16:32 +05:30
Rucha Deodhar	dfcbb30a92	MDEV-22224: Support JSON Path negative index This patch can be viewed as combination of two parts: 1) Enabling '-' in the path so that the parser does not give out a warning. 2) Setting the negative index to a correct value and returning the appropriate value. 1) To enable using the negative index in the path: To make the parser not return warning when negative index is used in path '-' needs to be allowed in json path characters. P_NEG is added to enable this and is made recognizable by setting the 45th index of json_path_chr_map[] to P_NEG (instead of previous P_ETC) because 45 corresponds to '-' in unicode. When the path is being parsed and '-' is encountered, the parser should recognize it as parsing '-' sign, so a new json state PS_NEG is required. When the state is PS_NEG, it means that a negative integer is going to be parsed so set is_negative_index of current step to 1 and n_item is set accordingly when integer is encountered after '-'. Next proceed with parsing rest of the path and get the correct path. Next thing is parsing the json and returning correct value. 2) Setting the negative index to a correct value and returning the value: While parsing json if we encounter array and the path step for the array is a negative index (n_item < 0), then we can count the number of elements in the array and set n_item to correct corresponding value. This is done in json_skip_array_and_count.	2022-04-13 21:16:32 +05:30
Marko Mäkelä	8680eedb26	Merge 10.8 into 10.9	2022-03-30 09:41:14 +03:00
Marko Mäkelä	5c69e93630	Merge 10.7 into 10.8	2022-03-30 09:34:07 +03:00
Marko Mäkelä	a4d753758f	Merge 10.6 into 10.7	2022-03-30 08:52:05 +03:00
Marko Mäkelä	b242c3141f	Merge 10.5 into 10.6	2022-03-29 16:16:21 +03:00
Marko Mäkelä	d62b0368ca	Merge 10.4 into 10.5	2022-03-29 12:59:18 +03:00
Marko Mäkelä	ae6e214fd8	Merge 10.3 into 10.4	2022-03-29 11:13:18 +03:00
Marko Mäkelä	020e7d89eb	Merge 10.2 into 10.3	2022-03-29 09:53:15 +03:00
Alexander Barkov	0c4c064f98	MDEV-27743 Remove Lex::charset This patch also fixes: MDEV-27690 Crash on `CHARACTER SET csname COLLATE DEFAULT` in column definition MDEV-27853 Wrong data type on column `COLLATE DEFAULT` and table `COLLATE some_non_default_collation` MDEV-28067 Multiple conflicting column COLLATE clauses are not rejected MDEV-28118 Wrong collation of `CAST(.. AS CHAR COLLATE DEFAULT)` MDEV-28119 Wrong column collation on MODIFY + CONVERT	2022-03-22 17:12:15 +04:00
Alexey Botchkov	6277e7df6b	MDEV-22742 UBSAN: Many overflow issues in strings/decimal.c - runtime error: signed integer overflow: x * y cannot be represented in type 'long long int' (on optimized builds). Avoid integer overflow, do the check before the calculation.	2022-03-21 15:05:42 +04:00
Marko Mäkelä	18bb95b608	Merge 10.7 into 10.8	2022-03-14 11:52:11 +02:00
Marko Mäkelä	e67d46e4a1	Merge 10.6 into 10.7	2022-03-14 11:30:32 +02:00
Marko Mäkelä	572e34304e	Merge 10.5 into 10.6	2022-03-14 10:59:46 +02:00
Marko Mäkelä	59359fb44a	MDEV-24841 Build error with MSAN use-of-uninitialized-value in comp_err The MemorySanitizer implementation in clang includes some built-in instrumentation (interceptors) for GNU libc. In GNU libc 2.33, the interface to the stat() family of functions was changed. Until the MemorySanitizer interceptors are adjusted, any MSAN code builds will act as if the stat() family of functions failed to initialize the struct stat. A fix was applied in https://reviews.llvm.org/rG4e1a6c07052b466a2a1cd0c3ff150e4e89a6d87a but it fails to cover the 64-bit variants of the calls. For now, let us work around the MemorySanitizer bug by defining and using the macro MSAN_STAT_WORKAROUND().	2022-03-14 09:28:55 +02:00
Sergei Golubchik	a4f0ae7c18	UBSAN: out of bound array read in json json_lib.c:847:25: runtime error: index 200 out of bounds for type 'json_string_char_classes [128]' json_lib.c:847:25: runtime error: load of address 0x56286f7175a0 with insufficient space for an object of type 'json_string_char_classes' fixes main.json_equals and main.json_normalize	2022-02-24 19:18:19 +01:00
Oleksandr Byelkin	4fb2cb1a30	Merge branch '10.7' into 10.8	2022-02-04 14:50:25 +01:00
Oleksandr Byelkin	9ed8deb656	Merge branch '10.6' into 10.7	2022-02-04 14:11:46 +01:00
Oleksandr Byelkin	f5c5f8e41e	Merge branch '10.5' into 10.6	2022-02-03 17:01:31 +01:00
Oleksandr Byelkin	cf63eecef4	Merge branch '10.4' into 10.5	2022-02-01 20:33:04 +01:00
Sergei Golubchik	bc10f58a58	MDEV-24909 JSON functions don't respect KILL QUERY / max_statement_time limit pass the pointer to thd->killed down to the json library, check it while scanning, use thd->check_killed() to generate the proper error message	2022-01-30 12:07:31 +01:00
Oleksandr Byelkin	a576a1cea5	Merge branch '10.3' into 10.4	2022-01-30 09:46:52 +01:00
Oleksandr Byelkin	41a163ac5c	Merge branch '10.2' into 10.3	2022-01-29 15:41:05 +01:00
Alexander Barkov	b915f79e4e	MDEV-25904 New collation functions to compare InnoDB style trimmed NO PAD strings	2022-01-21 12:16:07 +04:00
Alexander Barkov	47463e5796	MDEV-27552 Change the return type of my_uca_context_weight_find() to MY_CONTRACTION*	2022-01-20 15:44:13 +04:00
Sergei Petrunia	ce4956f322	Code cleanup	2022-01-19 18:14:07 +03:00
Sergei Petrunia	3936dc3353	MDEV-26724 Endless loop in json_escape_to_string upon ... empty string Part#3: - make json_escape() return different errors on conversion error and on out-of-space condition. - Make histogram code handle conversion errors.	2022-01-19 18:10:11 +03:00
Sergei Petrunia	dde6d76995	Trivial code cleanup	2022-01-19 18:10:09 +03:00
Sergei Petrunia	72c0ba43b2	Code cleanup part #1	2022-01-19 18:10:09 +03:00
Michael Okoko	fb2edab3eb	Extract json parser functions from class Signed-off-by: Michael Okoko <okokomichaels@outlook.com>	2022-01-19 18:10:08 +03:00
Michael Okoko	6bc2df5fa4	Add parser to read JSON array (of histograms) into string vector Signed-off-by: Michael Okoko <okokomichaels@outlook.com>	2022-01-19 18:10:08 +03:00
Vladislav Vaintroub	47e18af906	MDEV-27494 Rename .ic files to .inl	2022-01-17 16:41:51 +01:00
Marko Mäkelä	4f7574b10c	MDEV-27042 fixup: GCC 11 -Og -Wmaybe-uninitialized	2021-11-29 09:24:58 +02:00
Alexander Barkov	f9ad8072cd	MDEV-27042 UCA: Resetting contractions to ignorable does not work well The weight scanner routine scanner_next() did not properly handle the cases when a contraction produces no weights (is ignorable). Adding a helper routine my_uca_scanner_set_weight() and using it in all cases: - A single ASCII character - A contraction starting with an ASCII character - A multi-byte character - A contraction starting with a multi-byte character Also adding two other helper routines: - my_uca_scanner_next_expansion_weight() - my_uca_scanner_set_weight_outside_maxchar() to avoid using scanner->wbeg directly inside scanner_next(). This reduces the probability of similar future bugs.	2021-11-24 13:45:35 +04:00
Alexander Barkov	0a3d1d106a	Refactoring for MDEV-27042 and MDEV-27009 This patch prepares the code for upcoming changes: MDEV-27009 Add UCA-14.0.0 collations MDEV-27042 UCA: Resetting contractions to ignorable does not work well 1. Adding "const" qualifiers to return type and parameters in functions: - my_uca_contraction2_weight() - my_wmemcmp() - my_uca_contraction_weight() - my_uca_scanner_contraction_find() - my_uca_previous_context_find() - my_uca_context_weight_find() 2. Adding a helper function my_uca_true_contraction_eq() 3. Changing how scanner->wbeg is set during context weight handling. It was previously set inside functions: - my_uca_scanner_contraction_find() - my_uca_previous_context_find() Now it's set inside scanner_next(), which makes the code more symmetric for context-free and context-dependent sequences. This makes the upcoming fix for MDEV-27042 simpler.	2021-11-24 13:35:57 +04:00
Marko Mäkelä	06988bdcaa	Merge 10.6 into 10.7	2021-11-09 09:40:29 +02:00
Marko Mäkelä	25ac047baf	Merge 10.5 into 10.6	2021-11-09 09:11:50 +02:00
Marko Mäkelä	9c18b96603	Merge 10.4 into 10.5	2021-11-09 08:50:33 +02:00
Marko Mäkelä	47ab793d71	Merge 10.3 into 10.4	2021-11-09 08:40:14 +02:00
Marko Mäkelä	524b4a89da	Merge 10.2 into 10.3	2021-11-09 08:26:59 +02:00
Alexander Barkov	d0b611a76d	MDEV-24335 Unexpected question mark in the end of a TINYTEXT column my_copy_fix_mb() passed MIN(src_length,dst_length) to my_append_fix_badly_formed_tail(). It could break a multi-byte character in the middle, which put the question mark to the destination. Fixing the code to pass the true src_length to my_append_fix_badly_formed_tail().	2021-11-02 09:00:49 +04:00
Alexander Barkov	059797ed44	MDEV-24901 SIGSEGV in fts_get_table_name, SIGSEGV in ib_vector_size, SIGSEGV in row_merge_fts_doc_tokenize, stack smashing strmake() puts one extra 0x00 byte at the end of the string. The code in my_strnxfrm_tis620[_nopad] did not take this into account, so in the reported scenario the 0x00 byte was put outside of a stack variable, which made ASAN crash. This problem is already fixed in MySQL: commit 19bd66fe43c41f0bde5f36bc6b455a46693069fb Author: bin.x.su@oracle.com <> Date: Fri Apr 4 11:35:27 2014 +0800 But the fix does not seem to be correct, as it breaks when it finds a zero byte in the source string. Using memcpy() instead of strmake(). - Unlike strmake(), memcpy() does not write beyond the destination size passed. - Unlike the MySQL fix, memcpy() does not break on the first 0x00 byte found in the source string.	2021-10-29 12:37:29 +04:00
Eric Herman	401ff6994d	MDEV-26221: DYNAMIC_ARRAY use size_t for sizes https://jira.mariadb.org/browse/MDEV-26221 my_sys DYNAMIC_ARRAY and DYNAMIC_STRING inconsistency The DYNAMIC_STRING uses size_t for sizes, but DYNAMIC_ARRAY used uint. This patch adjusts DYNAMIC_ARRAY to use size_t like DYNAMIC_STRING. As the MY_DIR member number_of_files is copied from a DYNAMIC_ARRAY, this is changed to be size_t. As MY_TMPDIR members 'cur' and 'max' are copied from a DYNAMIC_ARRAY, these are also changed to be size_t. The lists of plugins and stored procedures use DYNAMIC_ARRAY, but their APIs assume a size of 'uint'; these are unchanged.	2021-10-19 16:00:26 +03:00
Marko Mäkelä	b36d6f92a8	Merge 10.6 into 10.7	2021-09-30 11:01:07 +03:00
Alexander Barkov	0d68b0a2d6	MDEV-26669 Add MY_COLLATION_HANDLER functions min_str() and max_str()	2021-09-27 17:10:22 +04:00
Alexander Barkov	0629711db4	MDEV-26572 Improve simple multibyte collation performance on the ASCII range	2021-09-13 08:03:25 +04:00
Eric Herman	105e4148bf	Add json_normalize function to json_lib This patch implements a library for normalizing json documents. The algorithm is: * Recursively sort json keys according to utf8mb4_bin collation. * Normalize numbers to be of the form [-]<digit>.<frac>E<exponent> * All unneeded whitespace and line endings are removed. * Arrays are not sorted. Co-authored-by: Vicențiu Ciorbaru <vicentiu@mariadb.org>	2021-07-21 16:32:11 +03:00

1647 commits