Garrett Wollman wrote: > The problem is that the P4 is not very wide to begin with, and it's very > hard to optimize well for that 23-stage pipeline. I'll say. I spent months tuning some assembly code for P3 and P4 and was quite disappointed that the P4 consistently required more CPU cycles for the same code. Only the P4s faster clock kept it from actually being slower than the P3. I attribute a lot of that to the P4s long pipeline. TimReceived on Mon Aug 25 2003 - 09:06:59 UTC
This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:37:20 UTC