mirror of
https://github.com/MariaDB/server.git
synced 2025-01-20 22:12:30 +01:00
d33b523271
Fixed BUGS:

#3300: "UPDATE statement with no index column in where condition locks all rows"
Implement semi-consistent read to reduce lock conflicts at the cost of breaking serializability.
ha_innobase::unlock_row(): reset the "did semi consistent read" flag
ha_innobase::was_semi_consistent_read(), ha_innobase::try_semi_consistent_read(): new methods
row_prebuilt_t, row_create_prebuilt(): add field row_read_type for keeping track of semi-consistent reads
row_vers_build_for_semi_consistent_read(), row_sel_build_committed_vers_for_mysql(): new functions
row_search_for_mysql(): implement semi-consistent reads

#9802: "Foreign key checks disallow alter table"
Added test cases.

#12456: "Cursor shows incorrect data - DML does not affect, probably caching"
This patch implements a high-granularity read view to be used with cursors. In this high-granularity consistent read view, modifications made by the creating transaction after the cursor is created, or by future transactions, are not visible, but modifications that the transaction made before the cursor was created are visible.

#12701: "Support >4GB buffer pool and log files on 64-bit Windows"
Do not call os_file_create_tmpfile() at runtime. Instead, create all tempfiles at startup and guard access to them with mutexes.

#13778: "If FOREIGN_KEY_CHECKS=0, one can create inconsistent FOREIGN KEYs"
When FOREIGN_KEY_CHECKS=0 we still need to check that datatypes between foreign key references are compatible.

#14189: "VARBINARY and BINARY variables: trailing space ignored with InnoDB"
innobase_init(): Assert that DATA_MYSQL_BINARY_CHARSET_COLL == my_charset_bin.number.
dtype_get_pad_char(): Do not pad VARBINARY or BINARY columns.
row_ins_cascade_calc_update_vec(): Refuse ON UPDATE CASCADE when trying to change the length of a VARBINARY column that refers to or is referenced by a BINARY column. BINARY columns are no longer padded on comparison, and thus they cannot be padded on storage either.

#14747: "Race condition can cause btr_search_drop_page_hash_index() to crash"
Note that buf_block_t::index should be protected by btr_search_latch or an s-latch or x-latch on the index page.
btr_search_drop_page_hash_index(): Read block->index while holding btr_search_latch and use the cached value in the loop. Remove some redundant assertions.

#15108: "mysqld crashes when innodb_log_file_size is set > 4G"

#15308: "Problem of Order with Enum Column in Primary Key"

#15550: "mysqld crashes in printing a FOREIGN KEY error in InnoDB"
row_ins_foreign_report_add_err(): When printing the parent record, use the index in the parent table rather than the index in the child table.

#15653: "Slow inserts to InnoDB if many thousands of .ibd files"
Keep track of unflushed modifications to file spaces. When there are tens of thousands of file spaces, flushing all files in fil_flush_file_spaces() would be very slow.
fil_flush_file_spaces(): Only flush unflushed file spaces.
fil_space_t, fil_system_t: Add a list of unflushed spaces.

#15991: "innodb-file-per-table + symlink database + rename = cr"
os_file_handle_error(): Map the error codes EXDEV, ENOTDIR, and EISDIR to the new code OS_FILE_PATH_ERROR. Treat this code as OS_FILE_PATH_ERROR. This fixes the crash on RENAME TABLE when the .ibd file is a symbolic link to a different file system.

#16157: "InnoDB crashes when main location settings are empty"
This patch is from Heikki.

#16298: "InnoDB segfaults in INSERTs in upgrade of 4.0 -> 5.0 tables with VARCHAR BINARY"
dict_load_columns(): Set the charset-collation code DATA_MYSQL_BINARY_CHARSET_COLL for those binary string columns that lack a charset-collation code, i.e., the tables were created with an older version of MySQL/InnoDB than 4.1.2.

#16229: "MySQL/InnoDB uses full explicit table locks in trigger processing"
Take an InnoDB table lock only if the user has explicitly requested a table lock. Added some additional comments to store_lock() and external_lock().

#16387: "InnoDB crash when dropping a foreign key <table>_ibfk_0"
Do not mistake TABLENAME_ibfk_0 for an auto-generated id.
dict_table_get_highest_foreign_id(): Ignore foreign constraint identifiers starting with the pattern TABLENAME_ibfk_0.

#16582: "InnoDB: Error in an adaptive hash index pointer to page"
Account for a race condition when dropping the adaptive hash index for a B-tree page.
btr_search_drop_page_hash_index(): Retry the operation if a hash index with different parameters was built meanwhile. Add diagnostics for the case that hash node pointers to the page remain.
btr_search_info_update_hash(), btr_search_info_update_slow(): Document the parameter "info" as in/out.

#16814: "SHOW INNODB STATUS format error in LATEST FOREIGN KEY ERROR section"
Add a missing newline to the LATEST FOREIGN KEY ERROR section in SHOW INNODB STATUS output.
dict_foreign_error_report(): Always print a newline after invoking dict_print_info_on_foreign_key_in_create_format().

#16827: "Better InnoDB error message if ibdata files omitted from my.cnf"

#17126: "CHECK TABLE on InnoDB causes a short hang during check of adaptive hash"
Fix CHECK TABLE blocking other queries by releasing the btr_search_latch periodically during the adaptive hash table validation.

#17405: "Valgrind: conditional jump or move depends on unititialised values"
buf_block_init(): Reset magic_n, buf_fix_count and io_fix to avoid testing uninitialized variables.

439 lines
11 KiB
C
/************************************************************************
Memory primitives

(c) 1994, 1995 Innobase Oy

Created 5/11/1994 Heikki Tuuri
*************************************************************************/

#include "ut0mem.h"

#ifdef UNIV_NONINL
#include "ut0mem.ic"
#endif

#include "mem0mem.h"
#include "os0sync.h"
#include "os0thread.h"

/* This struct is placed first in every allocated memory block */
typedef struct ut_mem_block_struct ut_mem_block_t;

/* The total amount of memory currently allocated from the OS with malloc */
ulint	ut_total_allocated_memory	= 0;

struct ut_mem_block_struct{
	UT_LIST_NODE_T(ut_mem_block_t) mem_block_list;
			/* mem block list node */
	ulint	size;	/* size of allocated memory */
	ulint	magic_n;
};

#define UT_MEM_MAGIC_N	1601650166

/* List of all memory blocks allocated from the operating system
with malloc */
UT_LIST_BASE_NODE_T(ut_mem_block_t)	ut_mem_block_list;

os_fast_mutex_t	ut_list_mutex;	/* this protects the list */

ibool	ut_mem_block_list_inited = FALSE;

ulint*	ut_mem_null_ptr	= NULL;

/**************************************************************************
Initializes the mem block list at database startup. */
static
void
ut_mem_block_list_init(void)
/*========================*/
{
	os_fast_mutex_init(&ut_list_mutex);
	UT_LIST_INIT(ut_mem_block_list);
	ut_mem_block_list_inited = TRUE;
}

/**************************************************************************
Allocates memory. Sets it also to zero if UNIV_SET_MEM_TO_ZERO is
defined and set_to_zero is TRUE. */

void*
ut_malloc_low(
/*==========*/
					/* out, own: allocated memory */
	ulint	n,			/* in: number of bytes to allocate */
	ibool	set_to_zero,		/* in: TRUE if allocated memory should be
					set to zero if UNIV_SET_MEM_TO_ZERO is
					defined */
	ibool	assert_on_error)	/* in: if TRUE, we crash mysqld if the
					memory cannot be allocated */
{
	ulint	retry_count	= 0;
	void*	ret;

	ut_ad((sizeof(ut_mem_block_t) % 8) == 0); /* check alignment ok */

	if (!ut_mem_block_list_inited) {
		ut_mem_block_list_init();
	}
retry:
	os_fast_mutex_lock(&ut_list_mutex);

	ret = malloc(n + sizeof(ut_mem_block_t));

	if (ret == NULL && retry_count < 60) {
		if (retry_count == 0) {
			ut_print_timestamp(stderr);

			fprintf(stderr,
"  InnoDB: Error: cannot allocate %lu bytes of\n"
"InnoDB: memory with malloc! Total allocated memory\n"
"InnoDB: by InnoDB %lu bytes. Operating system errno: %lu\n"
"InnoDB: Check if you should increase the swap file or\n"
"InnoDB: ulimits of your operating system.\n"
"InnoDB: On FreeBSD check you have compiled the OS with\n"
"InnoDB: a big enough maximum process size.\n"
"InnoDB: Note that in most 32-bit computers the process\n"
"InnoDB: memory space is limited to 2 GB or 4 GB.\n"
"InnoDB: We keep retrying the allocation for 60 seconds...\n",
				(ulong) n, (ulong) ut_total_allocated_memory,
#ifdef __WIN__
				(ulong) GetLastError()
#else
				(ulong) errno
#endif
				);
		}

		os_fast_mutex_unlock(&ut_list_mutex);

		/* Sleep for a second and retry the allocation; maybe this is
		just a temporary shortage of memory */

		os_thread_sleep(1000000);

		retry_count++;

		goto retry;
	}

	if (ret == NULL) {
		/* Flush stderr to make more probable that the error
		message gets in the error file before we generate a seg
		fault */

		fflush(stderr);

		os_fast_mutex_unlock(&ut_list_mutex);

		/* Make an intentional seg fault so that we get a stack
		trace */
		/* Intentional segfault on NetWare causes an abend. Avoid this
		by graceful exit handling in ut_a(). */
#if (!defined __NETWARE__)
		if (assert_on_error) {
			ut_print_timestamp(stderr);

			fprintf(stderr,
"  InnoDB: We now intentionally generate a seg fault so that\n"
"InnoDB: on Linux we get a stack trace.\n");

			if (*ut_mem_null_ptr) ut_mem_null_ptr = 0;
		} else {
			return(NULL);
		}
#else
		ut_a(0);
#endif
	}

	if (set_to_zero) {
#ifdef UNIV_SET_MEM_TO_ZERO
		memset(ret, '\0', n + sizeof(ut_mem_block_t));
#endif
	}

	((ut_mem_block_t*)ret)->size = n + sizeof(ut_mem_block_t);
	((ut_mem_block_t*)ret)->magic_n = UT_MEM_MAGIC_N;

	ut_total_allocated_memory += n + sizeof(ut_mem_block_t);

	UT_LIST_ADD_FIRST(mem_block_list, ut_mem_block_list,
			  ((ut_mem_block_t*)ret));
	os_fast_mutex_unlock(&ut_list_mutex);

	return((void*)((byte*)ret + sizeof(ut_mem_block_t)));
}

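/* Illustration only, not part of ut0mem.c: a minimal sketch of the bounded
retry loop that ut_malloc_low() builds with "goto retry". The names
demo_alloc_fn, demo_alloc_with_retry and demo_flaky_alloc are hypothetical,
and the allocator is passed in as a function pointer only so that failures
can be simulated. The sketch leaves out the mutex, the error message and the
one-second sleep of the real function. */

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical allocator signature used only by this sketch. */
typedef void* (*demo_alloc_fn)(size_t n);

/* Calls alloc(n) once and then retries up to max_retries more times while
it keeps returning NULL. Stores the number of calls made in *attempts if
attempts is not NULL. Returns the allocation, or NULL if every call failed. */
void*
demo_alloc_with_retry(
	demo_alloc_fn	alloc,
	size_t		n,
	unsigned	max_retries,
	unsigned*	attempts)
{
	unsigned	tries = 0;
	void*		ret;

	assert(alloc != NULL);

	for (;;) {
		ret = alloc(n);
		tries++;

		if (ret != NULL || tries > max_retries) {
			break;
		}

		/* ut_malloc_low() releases its mutex and sleeps for one
		second at this point before trying again. */
	}

	if (attempts != NULL) {
		*attempts = tries;
	}

	return(ret);
}

/* Demo allocator: fails on its first two calls, then defers to malloc(). */
void*
demo_flaky_alloc(size_t n)
{
	static unsigned	calls = 0;

	return(++calls <= 2 ? NULL : malloc(n));
}
```

/* With max_retries = 60 the sketch makes at most 61 calls, which matches the
"retry_count < 60" test above: one initial attempt plus 60 retries. */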
/**************************************************************************
Allocates memory. Sets it also to zero if UNIV_SET_MEM_TO_ZERO is
defined. */

void*
ut_malloc(
/*======*/
			/* out, own: allocated memory */
	ulint	n)	/* in: number of bytes to allocate */
{
	return(ut_malloc_low(n, TRUE, TRUE));
}

/**************************************************************************
Tests if malloc of n bytes would succeed. ut_malloc() asserts if memory runs
out. It cannot be used if we want to return an error message. Prints to
stderr a message if fails. */

ibool
ut_test_malloc(
/*===========*/
			/* out: TRUE if succeeded */
	ulint	n)	/* in: try to allocate this many bytes */
{
	void*	ret;

	ret = malloc(n);

	if (ret == NULL) {
		ut_print_timestamp(stderr);
		fprintf(stderr,
"  InnoDB: Error: cannot allocate %lu bytes of memory for\n"
"InnoDB: a BLOB with malloc! Total allocated memory\n"
"InnoDB: by InnoDB %lu bytes. Operating system errno: %d\n"
"InnoDB: Check if you should increase the swap file or\n"
"InnoDB: ulimits of your operating system.\n"
"InnoDB: On FreeBSD check you have compiled the OS with\n"
"InnoDB: a big enough maximum process size.\n",
			(ulong) n,
			(ulong) ut_total_allocated_memory,
			(int) errno);
		return(FALSE);
	}

	free(ret);

	return(TRUE);
}

/**************************************************************************
Frees a memory block allocated with ut_malloc. */

void
ut_free(
/*====*/
	void* ptr)  /* in, own: memory block */
{
	ut_mem_block_t* block;

	block = (ut_mem_block_t*)((byte*)ptr - sizeof(ut_mem_block_t));

	os_fast_mutex_lock(&ut_list_mutex);

	ut_a(block->magic_n == UT_MEM_MAGIC_N);
	ut_a(ut_total_allocated_memory >= block->size);

	ut_total_allocated_memory -= block->size;

	UT_LIST_REMOVE(mem_block_list, ut_mem_block_list, block);
	free(block);

	os_fast_mutex_unlock(&ut_list_mutex);
}

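/* Illustration only, not part of ut0mem.c: a minimal sketch of the
header-prefix technique that ut_malloc_low() and ut_free() share. Each block
starts with a small header holding its size and a magic number, the caller
receives the address just past the header, and the free routine steps back
by the header size to validate and release the block. All demo_* names are
hypothetical. The sketch assumes C11 (for max_align_t) and omits the mutex
and the linked list of blocks that the real code maintains. */

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

#define DEMO_MAGIC	1601650166UL

/* Header placed first in every block. The union with max_align_t keeps the
payload that follows suitably aligned for any type. */
typedef union demo_hdr {
	struct {
		size_t		size;	/* total bytes, header included */
		unsigned long	magic;	/* overwritten on free */
	} h;
	max_align_t	align;
} demo_hdr_t;

static size_t	demo_total = 0;	/* bytes currently allocated */

/* Allocates n payload bytes behind a header; returns NULL on failure. */
void*
demo_malloc(size_t n)
{
	demo_hdr_t*	hdr;

	if (n > SIZE_MAX - sizeof(demo_hdr_t)) {
		return(NULL);	/* n + header would overflow */
	}

	hdr = malloc(n + sizeof(demo_hdr_t));

	if (hdr == NULL) {
		return(NULL);
	}

	hdr->h.size = n + sizeof(demo_hdr_t);
	hdr->h.magic = DEMO_MAGIC;
	demo_total += hdr->h.size;

	return((void*)(hdr + 1));	/* first byte after the header */
}

/* Frees a block returned by demo_malloc(). */
void
demo_free(void* ptr)
{
	demo_hdr_t*	hdr = (demo_hdr_t*)ptr - 1;

	assert(hdr->h.magic == DEMO_MAGIC);	/* catches foreign pointers */
	assert(demo_total >= hdr->h.size);

	demo_total -= hdr->h.size;
	hdr->h.magic = 0;	/* makes a double free trip the assert */
	free(hdr);
}

/* Returns the number of bytes currently accounted for. */
size_t
demo_total_allocated(void)
{
	return(demo_total);
}
```

/* As in the real code, the accounted size includes the header, so the total
returns to zero only when every block has been freed. */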
/**************************************************************************
Implements realloc. This is needed by /pars/lexyy.c. Otherwise, you should not
use this function because the allocation functions in mem0mem.h are the
recommended ones in InnoDB.

man realloc in Linux, 2004:

       realloc()  changes the size of the memory block pointed to
       by ptr to size bytes.  The contents will be  unchanged  to
       the minimum of the old and new sizes; newly allocated
       memory will be uninitialized.  If ptr is NULL,  the  call  is
       equivalent  to malloc(size); if size is equal to zero, the
       call is equivalent to free(ptr).  Unless ptr is  NULL,  it
       must  have  been  returned by an earlier call to malloc(),
       calloc() or realloc().

RETURN VALUE
       realloc() returns a pointer to the newly allocated memory,
       which is suitably aligned for any kind of variable and may
       be different from ptr, or NULL if the  request  fails.  If
       size  was equal to 0, either NULL or a pointer suitable to
       be passed to free() is returned.  If realloc()  fails  the
       original  block  is  left  untouched  - it is not freed or
       moved. */

void*
ut_realloc(
/*=======*/
			/* out, own: pointer to new mem block or NULL */
	void*	ptr,	/* in: pointer to old block or NULL */
	ulint	size)	/* in: desired size */
{
	ut_mem_block_t*	block;
	ulint		old_size;
	ulint		min_size;
	void*		new_ptr;

	if (ptr == NULL) {

		return(ut_malloc(size));
	}

	if (size == 0) {
		ut_free(ptr);

		return(NULL);
	}

	block = (ut_mem_block_t*)((byte*)ptr - sizeof(ut_mem_block_t));

	ut_a(block->magic_n == UT_MEM_MAGIC_N);

	old_size = block->size - sizeof(ut_mem_block_t);

	if (size < old_size) {
		min_size = size;
	} else {
		min_size = old_size;
	}

	new_ptr = ut_malloc(size);

	if (new_ptr == NULL) {

		return(NULL);
	}

	/* Copy the old data from ptr */
	ut_memcpy(new_ptr, ptr, min_size);

	ut_free(ptr);

	return(new_ptr);
}

/**************************************************************************
Frees in shutdown all allocated memory not freed yet. */

void
ut_free_all_mem(void)
/*=================*/
{
	ut_mem_block_t* block;

	os_fast_mutex_free(&ut_list_mutex);

	while ((block = UT_LIST_GET_FIRST(ut_mem_block_list))) {

		ut_a(block->magic_n == UT_MEM_MAGIC_N);
		ut_a(ut_total_allocated_memory >= block->size);

		ut_total_allocated_memory -= block->size;

		UT_LIST_REMOVE(mem_block_list, ut_mem_block_list, block);
		free(block);
	}

	if (ut_total_allocated_memory != 0) {
		fprintf(stderr,
"InnoDB: Warning: after shutdown total allocated memory is %lu\n",
			(ulong) ut_total_allocated_memory);
	}
}

/**************************************************************************
Copies up to size - 1 characters from the NUL-terminated string src to
dst, NUL-terminating the result. Returns strlen(src), so truncation
occurred if the return value >= size. */

ulint
ut_strlcpy(
/*=======*/
				/* out: strlen(src) */
	char*		dst,	/* in: destination buffer */
	const char*	src,	/* in: source buffer */
	ulint		size)	/* in: size of destination buffer */
{
	ulint	src_size = strlen(src);

	if (size != 0) {
		ulint	n = ut_min(src_size, size - 1);

		memcpy(dst, src, n);
		dst[n] = '\0';
	}

	return(src_size);
}

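/* Illustration only, not part of ut0mem.c: a minimal sketch of how a caller
can use the return value of a strlcpy-style copy to detect truncation.
demo_strlcpy repeats the logic of ut_strlcpy() with standard C types, and
demo_copy_truncated is a hypothetical helper introduced for this example. */

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Same contract as ut_strlcpy(): copies at most size - 1 characters,
NUL-terminates when size != 0, and returns strlen(src). */
size_t
demo_strlcpy(char* dst, const char* src, size_t size)
{
	size_t	src_size = strlen(src);

	if (size != 0) {
		size_t	n = src_size < size - 1 ? src_size : size - 1;

		memcpy(dst, src, n);
		dst[n] = '\0';
	}

	return(src_size);
}

/* Copies src into dst and returns 1 if the copy was truncated, else 0. */
int
demo_copy_truncated(char* dst, const char* src, size_t size)
{
	assert(dst != NULL && src != NULL);

	return(demo_strlcpy(dst, src, size) >= size);
}
```

/* Copying "hello" into a 4-byte buffer stores "hel" and reports truncation,
because the returned length 5 is not smaller than the buffer size 4. */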
/**************************************************************************
Like ut_strlcpy, but if src doesn't fit in dst completely, copies the last
(size - 1) bytes of src, not the first. */

ulint
ut_strlcpy_rev(
/*===========*/
				/* out: strlen(src) */
	char*		dst,	/* in: destination buffer */
	const char*	src,	/* in: source buffer */
	ulint		size)	/* in: size of destination buffer */
{
	ulint	src_size = strlen(src);

	if (size != 0) {
		ulint	n = ut_min(src_size, size - 1);

		memcpy(dst, src + src_size - n, n + 1);
	}

	return(src_size);
}

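/* Illustration only, not part of ut0mem.c: a minimal sketch of the
tail-preserving copy performed by ut_strlcpy_rev(), written with standard C
types. The name demo_strlcpy_rev is hypothetical. Keeping the tail may be
useful when the end of a string carries the most information, for example
the file name at the end of a long path; that use case is an assumption,
not something the source states. */

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Copies the last min(strlen(src), size - 1) characters of src to dst,
including the terminating NUL, and returns strlen(src). */
size_t
demo_strlcpy_rev(char* dst, const char* src, size_t size)
{
	size_t	src_size = strlen(src);

	assert(dst != NULL);

	if (size != 0) {
		size_t	n = src_size < size - 1 ? src_size : size - 1;

		/* The n + 1 bytes starting n characters before the end of
		src end with the NUL terminator, so dst needs no separate
		termination step. */
		memcpy(dst, src + src_size - n, n + 1);
	}

	return(src_size);
}
```

/* Copying "hello" into a 4-byte buffer stores "llo", where the forward
variant would store "hel". The return value is strlen(src) in both cases. */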
/**************************************************************************
Make a quoted copy of a NUL-terminated string.  Leading and trailing
quotes will not be included; only embedded quotes will be escaped.
See also ut_strlenq() and ut_memcpyq(). */

char*
ut_strcpyq(
/*=======*/
				/* out: pointer to end of dest */
	char*		dest,	/* in: output buffer */
	char		q,	/* in: the quote character */
	const char*	src)	/* in: null-terminated string */
{
	while (*src) {
		if ((*dest++ = *src++) == q) {
			*dest++ = q;
		}
	}

	return(dest);
}

/**************************************************************************
Make a quoted copy of a fixed-length string.  Leading and trailing
quotes will not be included; only embedded quotes will be escaped.
See also ut_strlenq() and ut_strcpyq(). */

char*
ut_memcpyq(
/*=======*/
				/* out: pointer to end of dest */
	char*		dest,	/* in: output buffer */
	char		q,	/* in: the quote character */
	const char*	src,	/* in: string to be quoted */
	ulint		len)	/* in: length of src */
{
	const char*	srcend = src + len;

	while (src < srcend) {
		if ((*dest++ = *src++) == q) {
			*dest++ = q;
		}
	}

	return(dest);
}
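
/* Illustration only, not part of ut0mem.c: a minimal sketch of how the
quoting loop above can be wrapped to build a complete quoted string. The
loop itself writes neither the surrounding quotes nor a terminating NUL, so
the caller has to add both and size the buffer for the worst case, in which
every character is a quote and the body doubles in length. demo_strcpyq
repeats the loop of ut_strcpyq(); demo_quote is a hypothetical helper. */

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Same loop as ut_strcpyq(): copies src, doubling each occurrence of q.
Returns a pointer to the end of the output, which is not NUL-terminated. */
static char*
demo_strcpyq(char* dest, char q, const char* src)
{
	while (*src) {
		if ((*dest++ = *src++) == q) {
			*dest++ = q;
		}
	}

	return(dest);
}

/* Returns a newly allocated, NUL-terminated copy of src wrapped in the
quote character q with embedded quotes doubled, or NULL on failure. The
caller frees the result with free(). */
char*
demo_quote(char q, const char* src)
{
	size_t	len = strlen(src);
	char*	buf;
	char*	end;

	assert(q != '\0');

	if (len > (SIZE_MAX - 3) / 2) {
		return(NULL);	/* 2 * len + 3 would overflow */
	}

	/* Worst case: 2 * len body bytes, two outer quotes, one NUL. */
	buf = malloc(2 * len + 3);

	if (buf == NULL) {
		return(NULL);
	}

	buf[0] = q;
	end = demo_strcpyq(buf + 1, q, src);
	*end++ = q;
	*end = '\0';

	return(buf);
}
```

/* For q = ' and src = it's the helper produces 'it''s' with the embedded
quote doubled. */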