2009-10-09 14:21:29 +02:00
|
|
|
#ifndef MY_ATOMIC_INCLUDED
|
|
|
|
#define MY_ATOMIC_INCLUDED
|
|
|
|
|
2011-06-30 17:46:53 +02:00
|
|
|
/* Copyright (c) 2006, 2010, Oracle and/or its affiliates. All rights reserved.
|
2006-05-31 18:44:09 +02:00
|
|
|
|
|
|
|
This program is free software; you can redistribute it and/or modify
|
|
|
|
it under the terms of the GNU General Public License as published by
|
2006-12-27 02:23:51 +01:00
|
|
|
the Free Software Foundation; version 2 of the License.
|
2006-05-31 18:44:09 +02:00
|
|
|
|
|
|
|
This program is distributed in the hope that it will be useful,
|
|
|
|
but WITHOUT ANY WARRANTY; without even the implied warranty of
|
|
|
|
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
|
|
|
|
GNU General Public License for more details.
|
|
|
|
|
|
|
|
You should have received a copy of the GNU General Public License
|
|
|
|
along with this program; if not, write to the Free Software
|
2011-06-30 17:46:53 +02:00
|
|
|
Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA */
|
2006-05-31 18:44:09 +02:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
/*
|
|
|
|
This header defines five atomic operations:
|
|
|
|
|
|
|
|
my_atomic_add#(&var, what)
|
Bug#22320: my_atomic-t unit test fails
Bug#52261: 64 bit atomic operations do not work on Solaris i386
gcc in debug compilation
One of the various problems was that the source operand to
CMPXCHG8b was marked as a input/output operand, causing GCC
to use the EBX register as the destination register for the
CMPXCHG8b instruction. This could lead to crashes as the EBX
register is also implicitly used by the instruction, causing
the value to be potentially garbaged and a protection fault
once the value is used to access a position in memory.
Another problem was the lack of proper clobbers for the atomic
operations and, also, a discrepancy between the implementations
for the Compare and Set operation. The specific problems are
described and fixed by Kristian Nielsen patches:
Patch: 1
Fix bugs in my_atomic_cas*(val,cmp,new) that *cmp is accessed
after CAS succeds.
In the gcc builtin implementation, problem was that *cmp was
read again after atomic CAS to check if old *val == *cmp;
this fails if CAS is successful and another thread modifies
*cmp in-between.
In the x86-gcc implementation, problem was that *cmp was set
also in the case of successful CAS; this means there is a
window where it can clobber a value written by another thread
after successful CAS.
Patch 2:
Add a GCC asm "memory" clobber to primitives that imply a
memory barrier.
This signifies to GCC that any potentially aliased memory
must be flushed before the operation, and re-read after the
operation, so that read or modification in other threads of
such memory values will work as intended.
In effect, it makes these primitives work as memory barriers
for the compiler as well as the CPU. This is better and more
correct than adding "volatile" to variables.
2010-07-23 14:37:10 +02:00
|
|
|
'Fetch and Add'
|
2009-10-09 14:21:29 +02:00
|
|
|
add 'what' to *var, and return the old value of *var
|
|
|
|
|
|
|
|
my_atomic_fas#(&var, what)
|
|
|
|
'Fetch And Store'
|
|
|
|
store 'what' in *var, and return the old value of *var
|
|
|
|
|
|
|
|
my_atomic_cas#(&var, &old, new)
|
Bug#22320: my_atomic-t unit test fails
Bug#52261: 64 bit atomic operations do not work on Solaris i386
gcc in debug compilation
One of the various problems was that the source operand to
CMPXCHG8b was marked as a input/output operand, causing GCC
to use the EBX register as the destination register for the
CMPXCHG8b instruction. This could lead to crashes as the EBX
register is also implicitly used by the instruction, causing
the value to be potentially garbaged and a protection fault
once the value is used to access a position in memory.
Another problem was the lack of proper clobbers for the atomic
operations and, also, a discrepancy between the implementations
for the Compare and Set operation. The specific problems are
described and fixed by Kristian Nielsen patches:
Patch: 1
Fix bugs in my_atomic_cas*(val,cmp,new) that *cmp is accessed
after CAS succeds.
In the gcc builtin implementation, problem was that *cmp was
read again after atomic CAS to check if old *val == *cmp;
this fails if CAS is successful and another thread modifies
*cmp in-between.
In the x86-gcc implementation, problem was that *cmp was set
also in the case of successful CAS; this means there is a
window where it can clobber a value written by another thread
after successful CAS.
Patch 2:
Add a GCC asm "memory" clobber to primitives that imply a
memory barrier.
This signifies to GCC that any potentially aliased memory
must be flushed before the operation, and re-read after the
operation, so that read or modification in other threads of
such memory values will work as intended.
In effect, it makes these primitives work as memory barriers
for the compiler as well as the CPU. This is better and more
correct than adding "volatile" to variables.
2010-07-23 14:37:10 +02:00
|
|
|
An odd variation of 'Compare And Set/Swap'
|
2009-10-09 14:21:29 +02:00
|
|
|
if *var is equal to *old, then store 'new' in *var, and return TRUE
|
|
|
|
otherwise store *var in *old, and return FALSE
|
Bug#22320: my_atomic-t unit test fails
Bug#52261: 64 bit atomic operations do not work on Solaris i386
gcc in debug compilation
One of the various problems was that the source operand to
CMPXCHG8b was marked as a input/output operand, causing GCC
to use the EBX register as the destination register for the
CMPXCHG8b instruction. This could lead to crashes as the EBX
register is also implicitly used by the instruction, causing
the value to be potentially garbaged and a protection fault
once the value is used to access a position in memory.
Another problem was the lack of proper clobbers for the atomic
operations and, also, a discrepancy between the implementations
for the Compare and Set operation. The specific problems are
described and fixed by Kristian Nielsen patches:
Patch: 1
Fix bugs in my_atomic_cas*(val,cmp,new) that *cmp is accessed
after CAS succeds.
In the gcc builtin implementation, problem was that *cmp was
read again after atomic CAS to check if old *val == *cmp;
this fails if CAS is successful and another thread modifies
*cmp in-between.
In the x86-gcc implementation, problem was that *cmp was set
also in the case of successful CAS; this means there is a
window where it can clobber a value written by another thread
after successful CAS.
Patch 2:
Add a GCC asm "memory" clobber to primitives that imply a
memory barrier.
This signifies to GCC that any potentially aliased memory
must be flushed before the operation, and re-read after the
operation, so that read or modification in other threads of
such memory values will work as intended.
In effect, it makes these primitives work as memory barriers
for the compiler as well as the CPU. This is better and more
correct than adding "volatile" to variables.
2010-07-23 14:37:10 +02:00
|
|
|
Usually, &old should not be accessed if the operation is successful.
|
2009-10-09 14:21:29 +02:00
|
|
|
|
|
|
|
my_atomic_load#(&var)
|
|
|
|
return *var
|
|
|
|
|
|
|
|
my_atomic_store#(&var, what)
|
|
|
|
store 'what' in *var
|
|
|
|
|
2009-10-12 11:00:39 +02:00
|
|
|
'#' is substituted by a size suffix - 8, 16, 32, 64, or ptr
|
2009-10-09 14:21:29 +02:00
|
|
|
(e.g. my_atomic_add8, my_atomic_fas32, my_atomic_casptr).
|
|
|
|
|
|
|
|
NOTE This operations are not always atomic, so they always must be
|
|
|
|
enclosed in my_atomic_rwlock_rdlock(lock)/my_atomic_rwlock_rdunlock(lock)
|
|
|
|
or my_atomic_rwlock_wrlock(lock)/my_atomic_rwlock_wrunlock(lock).
|
|
|
|
Hint: if a code block makes intensive use of atomic ops, it make sense
|
|
|
|
to take/release rwlock once for the whole block, not for every statement.
|
|
|
|
|
|
|
|
On architectures where these operations are really atomic, rwlocks will
|
|
|
|
be optimized away.
|
|
|
|
8- and 16-bit atomics aren't implemented for windows (see generic-msvc.h),
|
|
|
|
but can be added, if necessary.
|
|
|
|
*/
|
|
|
|
|
2006-06-17 16:20:39 +02:00
|
|
|
#ifndef my_atomic_rwlock_init
|
2006-05-31 18:44:09 +02:00
|
|
|
|
2006-06-17 16:20:39 +02:00
|
|
|
#define intptr void *
|
2009-10-09 14:21:29 +02:00
|
|
|
/**
|
2009-12-15 17:07:43 +01:00
|
|
|
Currently we don't support 8-bit and 16-bit operations.
|
|
|
|
It can be added later if needed.
|
2009-10-09 14:21:29 +02:00
|
|
|
*/
|
2009-12-15 17:07:43 +01:00
|
|
|
#undef MY_ATOMIC_HAS_8_16
|
2006-05-31 18:44:09 +02:00
|
|
|
|
|
|
|
#ifndef MY_ATOMIC_MODE_RWLOCKS
|
2008-10-13 22:03:12 +02:00
|
|
|
/*
|
|
|
|
* Attempt to do atomic ops without locks
|
|
|
|
*/
|
2006-05-31 18:44:09 +02:00
|
|
|
#include "atomic/nolock.h"
|
|
|
|
#endif
|
|
|
|
|
2009-12-23 09:27:41 +01:00
|
|
|
#ifndef make_atomic_cas_body
|
2009-10-09 14:21:29 +02:00
|
|
|
/* nolock.h was not able to generate even a CAS function, fall back */
|
2006-05-31 18:44:09 +02:00
|
|
|
#include "atomic/rwlock.h"
|
2009-12-19 12:48:39 +01:00
|
|
|
#endif
|
2009-12-19 17:44:45 +01:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
/* define missing functions by using the already generated ones */
|
2006-06-29 15:39:53 +02:00
|
|
|
#ifndef make_atomic_add_body
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_add_body(S) \
|
2006-06-29 15:39:53 +02:00
|
|
|
int ## S tmp=*a; \
|
2010-02-19 15:20:29 +01:00
|
|
|
while (!my_atomic_cas ## S(a, &tmp, tmp+v)) ; \
|
2006-06-29 15:39:53 +02:00
|
|
|
v=tmp;
|
|
|
|
#endif
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifndef make_atomic_fas_body
|
|
|
|
#define make_atomic_fas_body(S) \
|
|
|
|
int ## S tmp=*a; \
|
2010-02-19 15:20:29 +01:00
|
|
|
while (!my_atomic_cas ## S(a, &tmp, v)) ; \
|
2009-10-09 14:21:29 +02:00
|
|
|
v=tmp;
|
|
|
|
#endif
|
|
|
|
#ifndef make_atomic_load_body
|
|
|
|
#define make_atomic_load_body(S) \
|
|
|
|
ret= 0; /* avoid compiler warning */ \
|
|
|
|
(void)(my_atomic_cas ## S(a, &ret, ret));
|
|
|
|
#endif
|
|
|
|
#ifndef make_atomic_store_body
|
|
|
|
#define make_atomic_store_body(S) \
|
|
|
|
(void)(my_atomic_fas ## S (a, v));
|
|
|
|
#endif
|
|
|
|
|
|
|
|
/*
|
|
|
|
transparent_union doesn't work in g++
|
|
|
|
Bug ?
|
|
|
|
|
|
|
|
Darwin's gcc doesn't want to put pointers in a transparent_union
|
|
|
|
when built with -arch ppc64. Complains:
|
|
|
|
warning: 'transparent_union' attribute ignored
|
|
|
|
*/
|
|
|
|
#if defined(__GNUC__) && !defined(__cplusplus) && \
|
2010-01-28 11:09:05 +01:00
|
|
|
! (defined(__APPLE__) && (defined(_ARCH_PPC64) ||defined (_ARCH_PPC)))
|
2009-10-09 14:21:29 +02:00
|
|
|
/*
|
|
|
|
we want to be able to use my_atomic_xxx functions with
|
|
|
|
both signed and unsigned integers. But gcc will issue a warning
|
|
|
|
"passing arg N of `my_atomic_XXX' as [un]signed due to prototype"
|
|
|
|
if the signedness of the argument doesn't match the prototype, or
|
|
|
|
"pointer targets in passing argument N of my_atomic_XXX differ in signedness"
|
|
|
|
if int* is used where uint* is expected (or vice versa).
|
|
|
|
Let's shut these warnings up
|
|
|
|
*/
|
|
|
|
#define make_transparent_unions(S) \
|
|
|
|
typedef union { \
|
|
|
|
int ## S i; \
|
|
|
|
uint ## S u; \
|
|
|
|
} U_ ## S __attribute__ ((transparent_union)); \
|
|
|
|
typedef union { \
|
|
|
|
int ## S volatile *i; \
|
|
|
|
uint ## S volatile *u; \
|
|
|
|
} Uv_ ## S __attribute__ ((transparent_union));
|
|
|
|
#define uintptr intptr
|
|
|
|
make_transparent_unions(8)
|
|
|
|
make_transparent_unions(16)
|
|
|
|
make_transparent_unions(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_transparent_unions(64)
|
2009-10-09 14:21:29 +02:00
|
|
|
make_transparent_unions(ptr)
|
|
|
|
#undef uintptr
|
|
|
|
#undef make_transparent_unions
|
|
|
|
#define a U_a.i
|
|
|
|
#define cmp U_cmp.i
|
|
|
|
#define v U_v.i
|
|
|
|
#define set U_set.i
|
|
|
|
#else
|
|
|
|
#define U_8 int8
|
|
|
|
#define U_16 int16
|
|
|
|
#define U_32 int32
|
2009-10-12 11:00:39 +02:00
|
|
|
#define U_64 int64
|
2009-10-09 14:21:29 +02:00
|
|
|
#define U_ptr intptr
|
|
|
|
#define Uv_8 int8
|
|
|
|
#define Uv_16 int16
|
|
|
|
#define Uv_32 int32
|
2009-10-12 11:00:39 +02:00
|
|
|
#define Uv_64 int64
|
2009-10-09 14:21:29 +02:00
|
|
|
#define Uv_ptr intptr
|
|
|
|
#define U_a volatile *a
|
|
|
|
#define U_cmp *cmp
|
|
|
|
#define U_v v
|
|
|
|
#define U_set set
|
|
|
|
#endif /* __GCC__ transparent_union magic */
|
2006-06-29 15:39:53 +02:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_cas(S) \
|
2010-07-23 22:18:36 +02:00
|
|
|
static inline int my_atomic_cas ## S(Uv_ ## S U_a, \
|
2009-10-09 14:21:29 +02:00
|
|
|
Uv_ ## S U_cmp, U_ ## S U_set) \
|
|
|
|
{ \
|
|
|
|
int8 ret; \
|
|
|
|
make_atomic_cas_body(S); \
|
|
|
|
return ret; \
|
2006-06-17 16:20:39 +02:00
|
|
|
}
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_add(S) \
|
2010-07-23 22:18:36 +02:00
|
|
|
static inline int ## S my_atomic_add ## S( \
|
2009-10-09 14:21:29 +02:00
|
|
|
Uv_ ## S U_a, U_ ## S U_v) \
|
|
|
|
{ \
|
|
|
|
make_atomic_add_body(S); \
|
|
|
|
return v; \
|
2006-06-17 16:20:39 +02:00
|
|
|
}
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_fas(S) \
|
2010-07-23 22:18:36 +02:00
|
|
|
static inline int ## S my_atomic_fas ## S( \
|
2009-10-09 14:21:29 +02:00
|
|
|
Uv_ ## S U_a, U_ ## S U_v) \
|
|
|
|
{ \
|
|
|
|
make_atomic_fas_body(S); \
|
|
|
|
return v; \
|
2006-06-17 16:20:39 +02:00
|
|
|
}
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_load(S) \
|
2010-07-23 22:18:36 +02:00
|
|
|
static inline int ## S my_atomic_load ## S(Uv_ ## S U_a) \
|
2009-10-09 14:21:29 +02:00
|
|
|
{ \
|
|
|
|
int ## S ret; \
|
|
|
|
make_atomic_load_body(S); \
|
|
|
|
return ret; \
|
2006-06-17 16:20:39 +02:00
|
|
|
}
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#define make_atomic_store(S) \
|
2010-07-23 22:18:36 +02:00
|
|
|
static inline void my_atomic_store ## S( \
|
2009-10-09 14:21:29 +02:00
|
|
|
Uv_ ## S U_a, U_ ## S U_v) \
|
|
|
|
{ \
|
|
|
|
make_atomic_store_body(S); \
|
2006-06-17 16:20:39 +02:00
|
|
|
}
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifdef MY_ATOMIC_HAS_8_16
|
|
|
|
make_atomic_cas(8)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_cas(16)
|
2009-10-09 14:21:29 +02:00
|
|
|
#endif
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_cas(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_atomic_cas(64)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_cas(ptr)
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifdef MY_ATOMIC_HAS_8_16
|
|
|
|
make_atomic_add(8)
|
2006-06-29 15:39:53 +02:00
|
|
|
make_atomic_add(16)
|
2009-10-09 14:21:29 +02:00
|
|
|
#endif
|
2006-06-29 15:39:53 +02:00
|
|
|
make_atomic_add(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_atomic_add(64)
|
2006-06-29 15:39:53 +02:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifdef MY_ATOMIC_HAS_8_16
|
|
|
|
make_atomic_load(8)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_load(16)
|
2009-10-09 14:21:29 +02:00
|
|
|
#endif
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_load(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_atomic_load(64)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_load(ptr)
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifdef MY_ATOMIC_HAS_8_16
|
|
|
|
make_atomic_fas(8)
|
|
|
|
make_atomic_fas(16)
|
|
|
|
#endif
|
|
|
|
make_atomic_fas(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_atomic_fas(64)
|
2009-10-09 14:21:29 +02:00
|
|
|
make_atomic_fas(ptr)
|
|
|
|
|
|
|
|
#ifdef MY_ATOMIC_HAS_8_16
|
|
|
|
make_atomic_store(8)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_store(16)
|
2009-10-09 14:21:29 +02:00
|
|
|
#endif
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_store(32)
|
2009-10-12 11:00:39 +02:00
|
|
|
make_atomic_store(64)
|
2006-06-17 16:20:39 +02:00
|
|
|
make_atomic_store(ptr)
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#ifdef _atomic_h_cleanup_
|
|
|
|
#include _atomic_h_cleanup_
|
|
|
|
#undef _atomic_h_cleanup_
|
|
|
|
#endif
|
2006-06-17 16:20:39 +02:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#undef U_8
|
|
|
|
#undef U_16
|
|
|
|
#undef U_32
|
2009-10-12 11:00:39 +02:00
|
|
|
#undef U_64
|
2009-10-09 14:21:29 +02:00
|
|
|
#undef U_ptr
|
|
|
|
#undef Uv_8
|
|
|
|
#undef Uv_16
|
|
|
|
#undef Uv_32
|
2009-10-12 11:00:39 +02:00
|
|
|
#undef Uv_64
|
2009-10-09 14:21:29 +02:00
|
|
|
#undef Uv_ptr
|
|
|
|
#undef a
|
|
|
|
#undef cmp
|
|
|
|
#undef v
|
|
|
|
#undef set
|
|
|
|
#undef U_a
|
|
|
|
#undef U_cmp
|
|
|
|
#undef U_v
|
|
|
|
#undef U_set
|
2006-06-17 16:20:39 +02:00
|
|
|
#undef make_atomic_add
|
|
|
|
#undef make_atomic_cas
|
|
|
|
#undef make_atomic_load
|
|
|
|
#undef make_atomic_store
|
2009-10-09 14:21:29 +02:00
|
|
|
#undef make_atomic_fas
|
2006-06-29 15:39:53 +02:00
|
|
|
#undef make_atomic_add_body
|
|
|
|
#undef make_atomic_cas_body
|
|
|
|
#undef make_atomic_load_body
|
|
|
|
#undef make_atomic_store_body
|
2009-10-09 14:21:29 +02:00
|
|
|
#undef make_atomic_fas_body
|
2009-12-19 18:24:52 +01:00
|
|
|
#undef intptr
|
2006-06-17 16:20:39 +02:00
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
/*
|
|
|
|
the macro below defines (as an expression) the code that
|
|
|
|
will be run in spin-loops. Intel manuals recummend to have PAUSE there.
|
|
|
|
It is expected to be defined in include/atomic/ *.h files
|
|
|
|
*/
|
|
|
|
#ifndef LF_BACKOFF
|
|
|
|
#define LF_BACKOFF (1)
|
2006-06-17 16:20:39 +02:00
|
|
|
#endif
|
|
|
|
|
2006-05-31 18:44:09 +02:00
|
|
|
#define MY_ATOMIC_OK 0
|
|
|
|
#define MY_ATOMIC_NOT_1CPU 1
|
|
|
|
extern int my_atomic_initialize();
|
|
|
|
|
|
|
|
#endif
|
|
|
|
|
2009-10-09 14:21:29 +02:00
|
|
|
#endif /* MY_ATOMIC_INCLUDED */
|