mariadb/mysql-test/suite/perfschema/t
Dmitry Lenev 00496b7acd Fix for bug #52044 "FLUSH TABLES WITH READ LOCK and FLUSH
TABLES <list> WITH READ LOCK are incompatible".

The problem was that FLUSH TABLES <list> WITH READ LOCK
which was issued when other connection has acquired global
read lock using FLUSH TABLES WITH READ LOCK was blocked
and has to wait until global read lock is released.

This issue stemmed from the fact that FLUSH TABLES <list>
WITH READ LOCK implementation has acquired X metadata locks
on tables to be flushed. Since these locks required acquiring
of global IX lock this statement was incompatible with global
read lock.

This patch addresses problem by using SNW metadata type of
lock for tables to be flushed by FLUSH TABLES <list> WITH
READ LOCK. It is OK to acquire them without global IX lock
as long as we won't try to upgrade those locks. Since SNW
locks allow concurrent statements using same table FLUSH
TABLE <list> WITH READ LOCK now has to wait until old
versions of tables to be flushed go away after acquiring
metadata locks. Since such waiting can lead to deadlock
MDL deadlock detector was extended to take into account
waits for flush and resolve such deadlocks.

As a bonus code in open_tables() which was responsible for
waiting old versions of tables to go away was refactored.
Now when we encounter old version of table in open_table()
we don't back-off and wait for all old version to go away,
but instead wait for this particular table to be flushed.
Such approach supported by deadlock detection should reduce
number of scenarios in which FLUSH TABLES aborts concurrent
multi-statement transactions.

Note that active FLUSH TABLES <list> WITH READ LOCK still
blocks concurrent FLUSH TABLES WITH READ LOCK statement
as the former keeps tables open and thus prevents the
latter statement from doing flush.

mysql-test/include/handler.inc:
  Adjusted test case after changing status which is set
  when FLUSH TABLES waits for tables to be flushed from
  "Flushing tables" to "Waiting for table".
mysql-test/r/flush.result:
  Added test which checks that "flush tables <list> with
  read lock" is compatible with active "flush tables with
  read lock" but not vice-versa. This test also covers
  bug #52044 "FLUSH TABLES WITH READ LOCK and FLUSH TABLES
  <list> WITH READ LOCK are incompatible".
mysql-test/r/mdl_sync.result:
  Added scenarios in which wait for table to be flushed
  causes deadlocks to the coverage of MDL deadlock detector.
mysql-test/suite/perfschema/r/dml_setup_instruments.result:
  Adjusted test results after removal of COND_refresh
  condition variable.
mysql-test/suite/perfschema/r/server_init.result:
  Adjusted test and its results after removal of COND_refresh
  condition variable.
mysql-test/suite/perfschema/t/server_init.test:
  Adjusted test and its results after removal of COND_refresh
  condition variable.
mysql-test/t/flush.test:
  Added test which checks that "flush tables <list> with
  read lock" is compatible with active "flush tables with
  read lock" but not vice-versa. This test also covers
  bug #52044 "FLUSH TABLES WITH READ LOCK and FLUSH TABLES
  <list> WITH READ LOCK are incompatible".
mysql-test/t/kill.test:
  Adjusted test case after changing status which is set
  when FLUSH TABLES waits for tables to be flushed from
  "Flushing tables" to "Waiting for table".
mysql-test/t/lock_multi.test:
  Adjusted test case after changing status which is set
  when FLUSH TABLES waits for tables to be flushed from
  "Flushing tables" to "Waiting for table".
mysql-test/t/mdl_sync.test:
  Added scenarios in which wait for table to be flushed
  causes deadlocks to the coverage of MDL deadlock detector.
sql/ha_ndbcluster.cc:
  Adjusted code after adding one more parameter for
  close_cached_tables() call - timeout for waiting for
  table to be flushed.
sql/ha_ndbcluster_binlog.cc:
  Adjusted code after adding one more parameter for
  close_cached_tables() call - timeout for waiting for
  table to be flushed.
sql/lock.cc:
  Removed COND_refresh condition variable. See comment
  for sql_base.cc for details.
sql/mdl.cc:
  Now MDL deadlock detector takes into account information
  about waits for table flushes when searching for deadlock.
  To implement this change:
  - Declaration of enum_deadlock_weight and
    Deadlock_detection_visitor were moved to mdl.h header
    to make them available to the code in table.cc which
    implements deadlock detector traversal through edges
    of waiters graph representing waiting for flush.
  - Since now MDL_context may wait not only for metadata
    lock but also for table to be flushed an abstract
    Wait_for_edge class was introduced. Its descendants
    MDL_ticket and Flush_ticket incapsulate specifics
    of inspecting waiters graph when following through
    edge representing wait of particular type.
  
  We no longer require global IX metadata lock when acquiring
  SNW or SNRW locks. Such locks are needed only when metadata
  locks of these types are upgraded to X locks. This allows
  to use SNW locks in FLUSH TABLES <list> WITH READ LOCK
  implementation and keep the latter compatible with global
  read lock.
sql/mdl.h:
  Now MDL deadlock detector takes into account information
  about waits for table flushes when searching for deadlock.
  To implement this change:
  - Declaration of enum_deadlock_weight and
    Deadlock_detection_visitor were moved to mdl.h header
    to make them available to the code in table.cc which
    implements deadlock detector traversal through edges
    of waiters graph representing waiting for flush.
  - Since now MDL_context may wait not only for metadata
    lock but also for table to be flushed an abstract
    Wait_for_edge class was introduced. Its descendants
    MDL_ticket and Flush_ticket incapsulate specifics
    of inspecting waiters graph when following through
    edge representing wait of particular type.
  - Deadlock_detection_visitor now has m_table_shares_visited
    member which allows to support recursive locking for
    LOCK_open. This is required when deadlock detector
    inspects waiters graph which contains several edges
    representing waits for flushes or needs to come through
    the such edge more than once.
sql/mysqld.cc:
  Removed COND_refresh condition variable. See comment
  for sql_base.cc for details.
sql/mysqld.h:
  Removed COND_refresh condition variable. See comment
  for sql_base.cc for details.
sql/sql_base.cc:
  Changed approach to how threads are waiting for table
  to be flushed. Now thread that wants to wait for old
  table to go away subscribes for notification by adding
  Flush_ticket to table's share and waits using
  MDL_context::m_wait object. Once table gets flushed
  (i.e. all tables are closed and table share is ready
  to be destroyed) all such waiters are notified
  individually.
  Thanks to this change MDL deadlock detector can take
  such waits into account.
  
  To implement this/as result of this change:
  - tdc_wait_for_old_versions() was replaced with
    tdc_wait_for_old_version() which waits for individual
    old share to go away and which is called by open_table()
    after finding out that share is outdated. We don't
    need to perform back-off before such waiting thanks
    to the fact that deadlock detector now sees such waits.
  - As result Open_table_ctx::m_mdl_requests became
    unnecessary and was removed. We no longer allocate
    copies of MDL_request objects on MEM_ROOT when
    MYSQL_OPEN_FORCE_SHARED/SHARED_HIGH_PRIO flags are
    in effect.
  - close_cached_tables() and tdc_wait_for_old_version()
    share code which implements waiting for share to be
    flushed - the both use TABLE_SHARE::wait_until_flush()
    method. Thanks to this close_cached_tables() supports
    timeouts and has extra parameter for this.
  - Open_table_context::OT_MDL_CONFLICT enum element was
    renamed to OT_CONFLICT as it is now also used in cases
    when back-off is required to resolve deadlock caused
    by waiting for flush and not metadata lock.
  - In cases when we discover that current connection tries
    to open tables from different generation we now simply
    back-off and restart process of opening tables. To
    support this Open_table_context::OT_REOPEN_TABLES enum
    element was added.
  - COND_refresh condition variable became unnecessary and
    was removed.
  - mysql_notify_thread_having_shared_lock() no longer wakes
    up connections waiting for flush as all such connections
    can be waken up by deadlock detector if necessary.
sql/sql_base.h:
  - close_cached_tables() now has one more parameter -
    timeout for waiting for table to be flushed.
  - Open_table_context::OT_MDL_CONFLICT enum element was
    renamed to OT_CONFLICT as it is now also used in cases
    when back-off is required to resolve deadlock caused
    by waiting for flush and not metadata lock.
    Added new OT_REOPEN_TABLES enum element to be used in
    cases when we need to restart open tables process even
    in the middle of transaction.
  - Open_table_ctx::m_mdl_requests became unnecessary and
    was removed.
sql/sql_class.h:
  Added assert ensuring that we won't use LOCK_open mutex
  with THD::enter_cond(). Otherwise deadlocks can arise in
  MDL deadlock detector.
sql/sql_parse.cc:
  Changed FLUSH TABLES <list> WITH READ LOCK to take SNW
  metadata locks instead of X locks on tables to be flushed.
  Since we no longer require global IX lock to be taken
  when SNW locks are taken this makes this statement
  compatible with FLUSH TABLES WITH READ LOCK statement.
  Since SNW locks allow other connections to have table
  opened FLUSH TABLES <list> WITH READ LOCK now has to
  wait during open_tables() for old version to go away.
  Such waits can lead to deadlocks which will be detected
  by MDL deadlock detector which now takes waits for table
  to be flushed into account.
  
  Also adjusted code after adding one more parameter for
  close_cached_tables() call - timeout for waiting for
  table to be flushed.
sql/sql_yacc.yy:
  FLUSH TABLES <list> WITH READ LOCK now needs only SNW
  metadata locks on tables.
sql/sys_vars.cc:
  Adjusted code after adding one more parameter for
  close_cached_tables() call - timeout for waiting for
  table to be flushed.
sql/table.cc:
  Implemented new approach to how threads are waiting for
  table to be flushed. Now thread that wants to wait for
  old table to go away subscribes for notification by
  adding Flush_ticket to table's share and waits using
  MDL_context::m_wait object. Once table gets flushed
  (i.e. all tables are closed and table share is ready
  to be destroyed) all such waiters are notified
  individually. This change allows to make such waits
  visible inside of MDL deadlock detector.
  To do it:
  
  - Added list of waiters/Flush_tickets to TABLE_SHARE
    class.
  - Changed free_table_share() to postpone freeing of
    share memory until last waiter goes away and to
    wake up subscribed waiters.
  - Added TABLE_SHARE::wait_until_flushed() method which
    implements subscription to the list of waiters for
    table to be flushed and waiting for this event.
  
  Implemented interface which allows to expose waits for
  flushes to MDL deadlock detector:
  
  - Introduced Flush_ticket class a descendant of
    Wait_for_edge class.
  - Added TABLE_SHARE::find_deadlock() method which allows
    deadlock detector to find out what contexts are still
    using old version of table in question (i.e. to find
    out what contexts are waited for by owner of
    Flush_ticket).
sql/table.h:
  In order to support new strategy of waiting for table flush
  (see comment for table.cc for details) added list of
  waiters/Flush_tickets to TABLE_SHARE class.
  
  Implemented interface which allows to expose waits for
  flushes to MDL deadlock detector:
  - Introduced Flush_ticket class a descendant of
    Wait_for_edge class.
  - Added TABLE_SHARE::find_deadlock() method which allows
    deadlock detector to find out what contexts are still
    using old version of table in question (i.e. to find
    out what contexts are waited for by owner of
    Flush_ticket).
2010-07-27 17:34:58 +04:00
..
aggregate.test Misc cleanup 2010-01-27 08:26:05 -07:00
bad_option_1.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
bad_option_2.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
binlog_mix.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
binlog_row.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
binlog_stmt.test BUG#50670: Slave stops with error code 1644 2010-04-28 14:47:49 +02:00
cnf_option.cnf WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
cnf_option.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
column_privilege.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_cond_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_events_waits_current.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_events_waits_history.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_events_waits_history_long.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_ews_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_ews_by_instance.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_ews_by_thread_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_file_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_fs_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_fs_by_instance.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_mutex_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_performance_timers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_processlist.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_rwlock_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_setup_consumers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_setup_instruments.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_setup_objects.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
ddl_setup_timers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
disabled.def WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_cond_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_events_waits_current.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_events_waits_history.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_events_waits_history_long.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_ews_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_ews_by_instance.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_ews_by_thread_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_file_instances.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_file_summary_by_event_name.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_file_summary_by_instance.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_mutex_instances.test Bug#50596 Spurious test failures in perfschema.dml_mutex_instances 2010-01-25 20:12:20 -07:00
dml_performance_timers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_processlist.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_rwlock_instances.test Bug#50596 Spurious test failures in perfschema.dml_mutex_instances 2010-01-25 20:12:20 -07:00
dml_setup_consumers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_setup_instruments.test Fixed merge problem (naming of CRYPTO_dynlock_value::lock in the tests) 2010-01-21 17:57:57 -07:00
dml_setup_objects.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
dml_setup_timers.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
func_file_io.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
func_mutex.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
global_read_lock.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
information_schema.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
misc.test Bug#52586 Misleading error message on attempt to access a P_S table using a wrong name 2010-07-15 18:50:39 -06:00
myisam_file_io.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
myisam_file_io.test Bug#53394 Tests: perfschema.myisam_file_io fails 2010-07-16 08:21:07 -06:00
no_threads-master.opt Post-push fixups for WL#5349 2010-06-22 12:13:47 +02:00
no_threads.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
one_thread_per_con-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
one_thread_per_con.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
pfs_upgrade.test pfs_upgrade.test is a case-sensitive test. 2010-03-09 18:05:19 +03:00
pool_of_threads-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
privilege.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
query_cache.test Bug#53392 Tests: perfschema.query_cache fails 2010-07-16 08:28:19 -06:00
read_only.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
schema.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
selects.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
server_init.test Fix for bug #52044 "FLUSH TABLES WITH READ LOCK and FLUSH 2010-07-27 17:34:58 +04:00
start_server_no_cond_class-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_cond_class.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_cond_inst-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_cond_inst.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_file_class-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_file_class.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_file_inst-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_file_inst.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_mutex_class-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_mutex_class.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_mutex_inst-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_mutex_inst.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_rwlock_class-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_rwlock_class.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_rwlock_inst-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_rwlock_inst.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_thread_class-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_thread_class.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_thread_inst-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_no_thread_inst.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_off-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_off.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_on-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
start_server_on.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
tampered_perfschema_table1-master.opt WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
tampered_perfschema_table1.test WL#2360 Performance schema 2010-01-11 18:47:27 -07:00
thread_cache.test Bug#54782 Performance schema per thread accounting and thread cache 2010-07-16 07:50:50 -06:00