I encountered an issue with a (basic) spinlock mutex that does not seem to work as expected.
我遇到了一个(基本的)自旋锁互斥锁的问题,似乎没有按预期工作。
4 threads are incrementing a non-atomic counter that is protected by this mutex. The outcome does not match with the expected result which makes the mutex seem broken.
4个线程正在递增受此互斥锁保护的非原子计数器。结果与使互斥体看起来破碎的预期结果不匹配。
example output:
示例输出:
result: 2554230
expected: 10000000
In my environment it happens under the following conditions:
在我的环境中,它发生在以下条件下:
-
flag
isstd::atomic<bool>
, anything else such asstd::atomic<int>
orstd::atomic_flag
(withtest_and_set
) works fine.flag是std :: atomic
,其他任何东西,如std :: atomic 或std :: atomic_flag(带有test_and_set)都可以正常工作。 -
compiled on X86_64 with gcc 6.3.1 and
-O3
flag在X86_64上使用gcc 6.3.1和-O3标志编译
My question is, what could explain this behavior ?
我的问题是,有什么可以解释这种行为?
#include <iostream>
#include <vector>
#include <atomic>
#include <thread>
#include <mutex>
#include <assert.h>
class my_mutex {
std::atomic<bool> flag{false};
public:
void lock()
{
while (flag.exchange(true, std::memory_order_acquire));
}
void unlock()
{
flag.store(false, std::memory_order_release);
}
};
my_mutex mut;
static int counter = 0;
void increment(int cycles)
{
for (int i=0; i < cycles; ++i)
{
std::lock_guard<my_mutex> lck(mut);
++counter;
}
}
int main()
{
std::vector<std::thread> vec;
const int n_thr = 4;
const int n_cycles = 2500000;
for (int i = 0; i < n_thr; ++i)
vec.emplace_back(increment, n_cycles);
for(auto &t : vec)
t.join();
std::cout << " result: " << counter << std::endl;
std::cout << "expected: " << n_cycles * n_thr << std::endl;
}
edit
Per request from Voo, here is the assembly output for increment()
..
根据Voo的请求,这里是increment()的汇编输出。
$ g++ -O3 increment.cpp
$ gdb a.out
Reading symbols from a.out...done.
(gdb) disassemble increment
Dump of assembler code for function increment(int):
0x0000000000401020 <+0>: mov 0x20122a(%rip),%ecx # 0x602250 <_ZL7counter>
0x0000000000401026 <+6>: test %edi,%edi
0x0000000000401028 <+8>: mov $0x1,%edx
0x000000000040102d <+13>: lea (%rdi,%rcx,1),%esi
0x0000000000401030 <+16>: jle 0x401058 <increment(int)+56>
0x0000000000401032 <+18>: nopw 0x0(%rax,%rax,1)
0x0000000000401038 <+24>: mov %edx,%eax
0x000000000040103a <+26>: xchg %al,0x20120c(%rip) # 0x60224c <mut>
0x0000000000401040 <+32>: test %al,%al
0x0000000000401042 <+34>: jne 0x401038 <increment(int)+24>
0x0000000000401044 <+36>: add $0x1,%ecx
0x0000000000401047 <+39>: cmp %ecx,%esi
0x0000000000401049 <+41>: mov %ecx,0x201201(%rip) # 0x602250 <_ZL7counter>
0x000000000040104f <+47>: movb $0x0,0x2011f6(%rip) # 0x60224c <mut>
0x0000000000401056 <+54>: jne 0x401038 <increment(int)+24>
0x0000000000401058 <+56>: repz retq
End of assembler dump.
1 个解决方案
#1
3
Your code is correct. It's a bug 80004 - [6 Regression] non-atomic load moved to before atomic load with std::memory_order_acquire
你的代码是正确的。这是一个错误80004 - [6回归]非原子加载在原子加载之前移动到std :: memory_order_acquire
#1
3
Your code is correct. It's a bug 80004 - [6 Regression] non-atomic load moved to before atomic load with std::memory_order_acquire
你的代码是正确的。这是一个错误80004 - [6回归]非原子加载在原子加载之前移动到std :: memory_order_acquire