Re: 4.7 vs 5.2.1 SMP/UP bridging performance

From: Bruce Evans <bde_at_zeta.org.au>
Date: Fri, 7 May 2004 03:18:47 +1000 (EST)
On Thu, 6 May 2004, Bruce M Simpson wrote:

> On Thu, May 06, 2004 at 10:15:44AM -0400, Andrew Gallatin wrote:
> > For what its worth, using those operations yeilds these results
> > on my 2.53GHz P4 (for UP)
> >
> > Mutex (atomic_store_rel_int) cycles per iteration: 208
> > Mutex (sfence) cycles per iteration: 85
> > Mutex (lfence) cycles per iteration: 63
> > Mutex (mfence) cycles per iteration: 169
> > Mutex (none) cycles per iteration: 18
> >
> > lfence looks like a winner..
>
> Please be aware, though, that the different FENCE instructions are acting
> as fences against different things. The NASM documentation has a good
> quick reference for what each of the instructions do, but the definitive
> reference is Intel's IA-32 programmer's reference manuals.

They are also documented in amd64 manuals.

Don't they all act as fences only on the same CPU, so they are no help
for SMP?  They are still almost twice as slow than full locks on Athlons,
so hopefully they do more.

Bruce
Received on Thu May 06 2004 - 08:19:01 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:37:53 UTC