Follow-up: remove this line from get_column_range_cardinality():
set_if_bigger(res, col_stats->get_avg_frequency());
and make sure it is only used with binary histograms.
For JSON histograms, it makes the estimates unnecessarily imprecise.
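A minimal sketch of the intended shape (standalone model; set_if_bigger and
get_avg_frequency come from the line above, everything else is a placeholder,
not the actual server code):

  #include <algorithm>

  enum class HistType { BINARY, JSON_HB };

  // Apply the avg_frequency floor only for binary histograms; JSON
  // histograms keep their finer per-bucket estimate.
  double apply_avg_frequency_floor(double res, double avg_frequency,
                                   HistType type)
  {
    if (type == HistType::BINARY)
      res= std::max(res, avg_frequency);   // same effect as set_if_bigger()
    return res;
  }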
The previous JSON parser used an API which made the parsing
inefficient: the same JSON content was parsed again and again.
Switch to a lower-level parsing API which allows parsing
to be done efficiently.
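To illustrate the cost difference (a standalone model with made-up helper
names; this is not the actual json_lib interface): with a lookup-by-key
style API every field is searched from the start of the document, while a
cursor that only moves forward reads the whole histogram in one pass.

  #include <cstddef>
  #include <string_view>

  // Old style: each lookup rescans from offset 0, so reading k fields
  // of an n-byte histogram costs O(k * n).
  inline std::size_t find_field(std::string_view doc, std::string_view key)
  {
    return doc.find(key);
  }

  // New style: the position only advances, so one pass over the
  // document is enough, i.e. O(n) in total.
  struct Cursor
  {
    std::string_view doc;
    std::size_t pos= 0;

    bool next(std::string_view key)
    {
      std::size_t p= doc.find(key, pos);
      if (p == std::string_view::npos)
        return false;
      pos= p + key.size();
      return true;
    }
  };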
- Make Histogram_json_hb::range_selectivity handle singleton buckets
specially when computing selectivity of the max. endpoint bound.
(For the min. endpoint, we already do that.)
- Also, fix the comments for Histogram_json_hb::find_bucket
- Use String::c_ptr_safe() instead of String::c_ptr()
- Do proper datatype conversions in Histogram_json_hb::parse
- Remove Histogram_json_hb::Bucket::end_value. Introduce
get_end_value() instead (a sketch of the resulting layout follows).
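A sketch of the likely resulting layout (member names here are assumptions,
not the actual class definition): a bucket stores only its start value, and
get_end_value() derives the right endpoint from the next bucket's start,
with the last bucket's end kept separately.

  #include <cstddef>
  #include <string>
  #include <vector>

  class HistogramSketch
  {
    struct Bucket
    {
      std::string start_value;   // left endpoint of the bucket
      long long   ndv;           // distinct values inside the bucket
      // note: no end_value member is stored anymore
    };
    std::vector<Bucket> buckets;
    std::string last_bucket_end; // right endpoint of the final bucket

  public:
    const std::string& get_end_value(std::size_t idx) const
    {
      return idx + 1 < buckets.size() ? buckets[idx + 1].start_value
                                      : last_bucket_end;
    }
  };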
Basic ideas:
1. Store "popular" values in their own buckets.
2. Also store ndv (Number of Distinct Values) in each bucket.
Because of #1, the buckets are now variable-size, so store the size in
each bucket.
Adjust selectivity estimation functions accordingly.
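A standalone sketch of how the two ideas feed into an equality estimate
(field names are illustrative, not the server code): a popular value sits
alone in its bucket (ndv == 1) and gets the whole bucket's fraction, while
other buckets split their fraction across their distinct values.

  struct Bucket
  {
    double    fraction;  // share of all rows in this bucket (the "size")
    long long ndv;       // number of distinct values in the bucket
  };

  // Estimated selectivity of "col = const" once the bucket holding
  // 'const' has been located.
  inline double eq_selectivity(const Bucket &b)
  {
    return b.ndv > 0 ? b.fraction / static_cast<double>(b.ndv) : 0.0;
  }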