RE: 4.7 vs 5.2.1 SMP/UP bridging performance

From: Andrew Gallatin <gallatin_at_cs.duke.edu>
Date: Thu, 6 May 2004 10:15:44 -0400 (EDT)
Don Bowman writes:
 > 
 > On the P4, there are mfence,lfence,sfence instructions to enforce
 > memory ordering. These are cheaper than "lock; andl" or "cpuid",
 > which are the traditional 'sync' instructions.

For what its worth, using those operations yeilds these results
on my 2.53GHz P4 (for UP)

Mutex (atomic_store_rel_int) cycles per iteration: 208 
Mutex (sfence) cycles per iteration: 85 
Mutex (lfence) cycles per iteration: 63 
Mutex (mfence) cycles per iteration: 169 
Mutex (none) cycles per iteration: 18 

lfence looks like a winner..

Drew
Received on Thu May 06 2004 - 05:15:53 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:37:53 UTC