Summary
The document contains tables of CPU performance from 80486 to Intel Core 2 Duo and AMD Athlon 64, including speed via caches and RAM, derived from my benchmarks and results reports, which are available from Compuserve PC Hardware Forum and my web site.
In analysing results of the different benchmarks, in virtually all cases (except for RAM), CPU speed, including using data in cache, is shown to be proportional to CPU MHz for a particular type of processor. So it has been decided to show the MHz of available CPUs and provide simple tables with one set of numbers for each type of CPU. For results depending on memory speed, results are provided for a range of RAM types.
As the number and type of instructions used in the benchmarks is known, performance results can be easily converted to Millions of Instructions Per Second (MIPS) or Millions of Floating Point Operations Per Second (MFLOPS). Comparison numbers represent percentages of MIPS or MFLOPS divided by CPU MHz.
Contents
1. CPU Performance Tables - Introduction
In analysing results of the different benchmarks, in virtually all
cases, CPU speed, including using data in cache, is shown to be
proportional to CPU MHz for a particular type of processor. So it
has been decided to show the MHz of available CPUs and provide simple
tables with one set of numbers for each type of CPU. As the number
and type of instructions used in the benchmarks is known, performance
results can be easily converted to Millions of Instructions Per
Second (MIPS) or Millions of Floating Point Operations Per Second
(MFLOPS). Comparison numbers represent percentages of MIPS or MFLOPS
divided by CPU MHz.
CPUs can have more than one Integer or Floating Point Pipeline, each
of which may be able to produce one result per clock cycle. Also, P4
Integer Pipelines can execute two instructions per cycle. Thus it is
possible for the percentage calculations to be greater than 100.
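The conversion described above can be sketched as follows. This is only an illustration of the arithmetic, not the benchmark code itself; the function name and the sample figures are assumptions made for the example.

```python
def percent_per_mhz(operations, seconds, cpu_mhz):
    """Millions of operations per second as a percentage of CPU MHz."""
    millions_per_second = operations / seconds / 1e6
    return 100.0 * millions_per_second / cpu_mhz

# Hypothetical run: 400 million instructions in 1 second on a 200 MHz CPU
# is 400 MIPS, which is 200% of MHz (two instructions per clock cycle).
print(percent_per_mhz(400e6, 1.0, 200))
```

A result above 100 simply means more than one instruction or floating point operation completed per clock cycle.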
Overall processor performance depends on the type/version and speed
of CPU and cache sizes along with memory speed, which is dependent on
the system board used. No attempt is made to provide an overall
performance rating, the overall aim being to show some strengths and
weaknesses of each type of processor.
Note that these benchmarks reflect performance of one CPU on multi-core processors.
2. CPU Characteristics
Some of the following have been gathered from benchmark results and magazines
and may not be completely accurate. Mobile CPUs have the same speeds
using mains power but might have larger L2 caches and slower RAM.
Typical type of RAM is shown but some will have faster or slower
variations, including dual memory channels.
Intel
XEON CPUs could be Pentium Pro, III or P4 usually with options
for larger L2 caches.
CPU   MHz From   MHz To   KB L1 Cache   KB L2 Cache   L2 Cache Type   FSB MHz   Typical RAM
Pentium 75 200 8 256-512 External 50-66 FPM
Pentium Pro 180 200 8 256-512 Internal 60-66 EDO
Pentium MMX 166 233 16 256-512 External 66 SDRAM
Pentium II 233 333 16 512 Half Speed 66 SDRAM
Pentium II 350 500 16 512 Half Speed 100 PC100
Celeron 266 300 16 0 None 66 SDRAM
Celeron A 300 550 16 128 Full Speed 66 SDRAM
Celeron 2 566 733 16 128 Full Speed 66 SDRAM
Celeron 3 800 1300 16 128 Full Speed 100 PC100
Celeron 4 1000 1400 16 256 Full Speed 100 PC100
Celeron P4 1700 2700 8 128 Full Speed 400 PC133
Cel D 315-351 2260 3260 16 256 Full Speed 533
Cel M 310-340 1200 1500 32 512 Full Speed 400
Cel M 350-380 1300 1600 32 1024 Full Speed 400
Pentium III 450 600 16 512 Half Speed 100 PC100
Pentium IIIB 533 600 16 512 Half Speed 133 PC133
Pentium IIIE 550 1100 16 256 Full Speed 100 PC100
Pentium IIIEB 533 1330 16 256 Full Speed 133 PC133
Pentium IIIT 800 1400 16 512 Full Speed 133 PC133
All the following have full speed L2 cache
CPU   MHz From   MHz To   KB L1 Cache   KB L2 Cache   FSB MHz   Typical RAM
Pentium M 900 1600 32 1024 400
PM 735-765 1700 2100 32 2048 400
PM 730-780 1600 2267 32 2048 533
CS U1300-1400 1060 1200 32 2048 533
CS T1300-1400 1666 1833 32 2048 667
CD T2300-2700 1666 2333 32*2 2048 667
CM U7500-7600 1066 1333 32*2 2048 533
CM T5200- 1600 32*2 2048 533
CM T5500-5600 1666 1833 32*2 2048 667
CM L7300-7500 1600 1800 32*2 4096 800
CM T7100 1800 32*2 2048 800
CM T7200-7700 2000 2400 32*2 4096 800
Mob P4 518+ 2800 3460 16 1024 533
Pentium 4 1300 2000 8 256 400 RDRAM
Pentium 4N 1600 2200 8 512 400 PC2100
Pentium 4N2 2200 3060 8 512 533 PC2700
Pentium 4N3 2400 3200 8 512 800 PC3200
Pentium 4EE 3200 8 512+L3# 800 PC3200
Pentium 4E 2800 3600 16 1024 800 PC3200
P4 505-519 2666 3060 16 1024 533
P4 520-571 2800 3600 16 1024 800
P4 630-672 3000 3800 16 2048 800
P4 820-840 D 2800 3200 16*2 1024*2 800
P4 920-960 D 2800 3600 16*2 2048*2 800
P4EE 840 3200 16*2 1024*2 800
P4EE 955-965 3460 3730 16*2 2048*2 1066
C2D 4300-4500 1800 2200 16*2 2048*2 800
C2D 6300-6400 1860 2130 32*2 2048 1066
C2D 6600-6700 2400 2666 32*2 4096 1066
C2D 6540-6850 2333 3000 32*2 4096 1333
C2E 6800 2930 32*2 4096 1066
C2Q 6600-6700 2400 2666 32*4 8192 1066
QXE 6700-6800 2666 2930 32*4 8192 1066
QXE 6850 3000 32*4 8192 1333
# = 2048 KB L3 cache, Full Speed
CS = Core Solo Mobile Processor
CD = Core Duo Mobile Processor
CM = Core 2 Duo Mobile Processor
D = Dual CPU chip, caches on each
C2D = Core 2 Duo Processor, shared L2 cache
C2E = Core 2 Extreme, shared L2 cache
C2Q = Core 2 Quad Processor
QXE = Core 2 Extreme Quad Processor
AMD
Athlon MP varieties are essentially the same as XP.
CPU   MHz From   MHz To   KB L1 Cache   KB L2 Cache   L2 Cache Type   FSB MHz   Typical RAM
K5 75 100 8 512 External 50-66 EDO
K6 166 300 32 512 External 66 SDRAM
K62 300 475 32 512-1024 External 100 PC100
K63 400 500 32 256+L3 Full Speed 100 PC100
Duron 550 1800 64 64 Full Speed 200 PC100
Athlon 500 750 64 512 Half or < 200 PC100
Athlon Tbird 650 1400 64 256 Full Speed 200 PC133
Athlon Tbird 1000 1400 64 256 Full Speed 266 PC2100
AXP 1500-2100 1333 1733 64 256 Full Speed 266 PC2100
AXP 1700-2200 1467 1800 64 256 Full Speed 266 PC2100
AXP 2600-2700 2083 2167 64 256 Full Speed 333 PC2700
AXP 2500-3000 1833 2167 64 512 Full Speed 333 PC2700
AXP 3000-3200 2100 2200 64 512 Full Speed 400 PC3200
Sem 2200-2800 1500 2000 64 256 Full Speed 333 PC2700
Sem 3000 2000 64 512 Full Speed 333 PC2700
All the following have full speed L2 cache and HyperTransport bus
CPU   MHz From   MHz To   KB L1 Cache   KB L2 Cache   Maximum RAM MB/Sec
Sem 3100 1800 64 256 3200
Sem 3000 1800 64 128 3200
Sem 2600-3300 1600 2000 64 128 3200
Sem 2500-3600 1400 2200 64 256 3200
SMo 2600-3500 1600 2000 64 128
SMo 2800-3800 1600 2200 64 256
SMo 3200-3600 1600 1800 64 512
A64 FX51-74 2200 3000 64 1024 3200 * 2 Chan
A64 2800-3700 1600 2400 64 1024 DD2 1 Chan
A64 2800-3700 1800 2400 64 512 DD2 1 Chan
A64 3000-4000 1800 2600 64 512 DD2 2 Chan
A64 3700-4000 2200 2400 64 1024 DD2 2 Chan
AX2 3800-5400 2000 2800 64*2 512*2 DD2 2 Chan
AX2 4400-6000 2200 3000 64*2 1024*2 DD2 2 Chan
AX2 BE2300-50 1900 2100 64*2 512*2 DD2
AMo 2700-3000 1600 2000 64 512 DD2 1 Chan
AMo 2800-4000 1600 2600 64 1024 DD2 1 Chan
TMo ML28-ML32 1600 1800 64 512 DD2 1 Chan
TMo ML30-ML34 1600 1800 64 1024 DD2 1 Chan
TMo MT32 1800 64 512 DD2 1 Chan
TMo MT34-MT37 1800 2000 64 1024 DD2 1 Chan
TMd TL50 1600 64*2 256*2 DD2 1 Chan
TMd TL52-TL66 1600 2300 64*2 512*2 DD2 1 Chan
Opt 140-150 1400 2400 64 1024 DD2 2 Chan
Opt 840-850 1400 2400 64 1024 DD2 2 Chan
Opt 842-856 1600 3000 64 1024 DD2 2 Chan
Opt 242-256 1600 3000 64 1024 DD2 2 Chan
Opt 144-154 1800 2800 64 1024 DD2 2 Chan
Opt 1210-1222 1800 3000 64 2048 667 2 Chan
Opt 2210-2222 1800 3000 64 2048 533 2 Chan
Opt 8212-8222 2000 3000 64 2048 533 2 Chan
O2C 265-290 1800 2800 64*2 1024*2 DD2 2 Chan
O2C 865-890 1800 2800 64*2 1024*2 DD2 2 Chan
O2C 165-185 1800 2600 64*2 1024*2 DD2 2 Chan
AXP = Athlon XP, Sem = Sempron, SMo = Mobile Sempron
A64 = Athlon 64, AX2 = A64 Dual core, AMo = Mobile A64
Opt = Opteron, O2C = Opt Dual Core. TMo = Turion 64 Mobile
TMd = Dual Core Turion 64 Mobile
DD2 - Latest DDR2 up to 800 MHz
Cyrix
CPU   MHz From   MHz To   KB L1 Cache   KB L2 Cache   L2 Cache Type   FSB MHz   Typical RAM
Cyrix 120 200 16 256-512 External 60-66 SDRAM
Cyrix MX 150 233 64 512 External 66 SDRAM
Cyrix M2 225 333 64 512-1024 External 75-83 SDRAM
3. Benchmark Results Used
CPUID (WhatCPU.zip) has tests using 1, 2, 3 and 4 registers, simply
adding 1. Tests are on integers, 32 bit SP and 64 bit DP floating
point, MMX 32 bit integers, SSE SPFP, 3DNow SPFP and SSE2 DPFP.
Assembly code loop has 20 adds and decrement/jump. The latter are
included in instruction count for integer speed.
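The instruction counting for this loop can be sketched as below. This is a hedged illustration only: it assumes the decrement and jump count as two instructions per pass, and the function names and sample pass count are made up for the example.

```python
ADDS_PER_PASS = 20   # adds in the assembly loop, from the description above
LOOP_OVERHEAD = 2    # assumption: one decrement plus one jump per pass

def integer_mips(passes, seconds):
    """Integer speed: adds plus loop control instructions are counted."""
    instructions = passes * (ADDS_PER_PASS + LOOP_OVERHEAD)
    return instructions / seconds / 1e6

def million_adds_per_second(passes, seconds):
    """Speed counting the add results only."""
    return passes * ADDS_PER_PASS / seconds / 1e6

# Hypothetical timing: 10 million passes of the loop in 1 second.
print(integer_mips(10_000_000, 1.0), million_adds_per_second(10_000_000, 1.0))
```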
BusSpd2K (.zip) is a read only test using streamed sequential integer
data. The assembly code timed loops use 64 AND from memory integer
instructions and 512 MMX MOV 64 bit instructions. The MMX test
is intended to show maximum data transfer rate from memory.
RandMem (.zip) uses C code with eight & xi[xi[i+0]] | xi[xi[i+2]]
statements (+, - and * for floating point) reading data or eight
xi[xi[i+2]] = xi[xi[i+0]] statements for read/write. The pair are
used for reading/writing 128 bytes in the loop with the indexing
producing either sequential or random accessing. Four tests cover
32 bit integer operation and four 64 bit double precision floating
point. With indexing calculations, this benchmark has more
instructions per word read or written than the other memory tests.
Data transfer rate may be less but the MIPS/MHz ratio higher.
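The indexed access pattern can be illustrated with a short sketch. It is not the RandMem source: the helper names, the step of 4 between statements and the way the index array is filled are assumptions, and only the integer read form is shown.

```python
import random

def make_index(words, sequential=True, seed=1):
    """Index array xi: the value at each position selects the word to read."""
    xi = list(range(words))
    if not sequential:
        random.Random(seed).shuffle(xi)   # same words, random visiting order
    return xi

def indexed_read(xi):
    """Integer read pass using the & xi[xi[i+0]] | xi[xi[i+2]] form."""
    total = 0
    for i in range(0, len(xi) - 2, 4):
        total = total & xi[xi[i + 0]] | xi[xi[i + 2]]
    return total

serial = make_index(8)                      # sequential accessing
shuffled = make_index(8, sequential=False)  # random accessing
print(indexed_read(serial), sorted(shuffled) == serial)
```

Each statement needs two loads to fetch indexes and two more to fetch the selected words, which is why the instruction count per word is higher than in the streamed tests.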
BenchNT.zip provides Whetstone, Dhrystone, Linpack and Livermore
Loops Classic benchmarks, representing old code, much of it with a
small number of instructions in loops. Benchmark codes are provided
compiled with full optimisation and no optimisation.
SSE3DNow (.zip) uses the same read and read/write functions as
MemSpeed - C code - s=s+x[m]*y[m] and x[m]=x[m]+y[m]. The tests
are run via the C code in Single Precision Floating Point with 16
unrolled statements in the inner loop (read 128 bytes, read/write
64/64 bytes). When available, the tests are also run using assembly
instructions for SSE (SP), 3DNow (SP) and SSE2 (DP) handling the
same amount of data in the inner loop.
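The two kernels quoted above can be written out as a small sketch. This is an illustration rather than the benchmark source: the loops are not unrolled, and the function names and the MFLOPS helper are assumptions.

```python
def read_test(x, y):
    """s = s + x[m] * y[m]: a multiply and an add per element, reads only."""
    s = 0.0
    for m in range(len(x)):
        s = s + x[m] * y[m]
    return s

def read_write_test(x, y):
    """x[m] = x[m] + y[m]: one add per element, result written back to x."""
    for m in range(len(x)):
        x[m] = x[m] + y[m]

def mflops(elements, flops_per_element, passes, seconds):
    """Millions of floating point operations per second for a timed run."""
    return elements * flops_per_element * passes / seconds / 1e6

x = [1.0, 2.0]
print(read_test(x, [3.0, 4.0]))
read_write_test(x, [3.0, 4.0])
print(x)
```

With 16 unrolled single precision statements, the read kernel touches 16 x values and 16 y values of 4 bytes each, which gives the 128 bytes per pass quoted in the text.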
FFTGraf (.zip) calculates Fast Fourier Transforms using single and
double precision floating point numbers. Version 1 uses optimised
C code with later versions using assembly code instructions. Memory
accessing is on a random/skipped sequential basis where performance
suffers due to burst reading from RAM. Earlier systems read in 32
byte bursts with later ones using 64 bytes with potentially slower
speed on this and RandMem benchmarks.
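The effect of burst size on skipped or random accessing can be estimated with a simple model. This is a rough sketch under stated assumptions (aligned accesses, a fixed stride, every burst fetched in full and no reuse of a burst later); the function name is hypothetical.

```python
def burst_utilisation(word_bytes, stride_bytes, burst_bytes):
    """Fraction of each burst read from RAM that the program actually uses."""
    if stride_bytes >= burst_bytes:
        # Only one word is used from every burst that is fetched.
        return word_bytes / burst_bytes
    words_per_burst = burst_bytes // stride_bytes
    return min(1.0, words_per_burst * word_bytes / burst_bytes)

# 4 byte words read sequentially, then with a 64 byte skip, for both burst sizes.
for burst in (32, 64):
    print(burst, burst_utilisation(4, 4, burst), burst_utilisation(4, 64, burst))
```

Under this model a widely skipped 4 byte access uses one eighth of a 32 byte burst but only one sixteenth of a 64 byte burst, which is one way the larger burst can lead to slower results on these tests.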
BusSpd2K, RandMem and SSE3DNow run tests with increasing memory
demands to measure performance via caches and RAM. Similarly,
FFTGraf uses increasing FFT sizes.
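How a series of increasing data sizes maps onto caches and RAM can be sketched as follows. The doubling sequence, the starting size and the function names are assumptions for illustration; the real benchmarks choose their own sizes.

```python
def test_sizes_kb(start_kb=4, limit_kb=65536):
    """Data sizes in KB, doubling from start_kb up to limit_kb."""
    sizes = []
    kb = start_kb
    while kb <= limit_kb:
        sizes.append(kb)
        kb *= 2
    return sizes

def level_for(size_kb, l1_kb, l2_kb):
    """Memory level expected to serve a working set of size_kb."""
    if size_kb <= l1_kb:
        return "L1"
    if size_kb <= l2_kb:
        return "L2"
    return "RAM"

# Example using the Athlon XP cache sizes from section 2 (64 KB L1, 256 KB L2).
for kb in test_sizes_kb(16, 1024):
    print(kb, level_for(kb, 64, 256))
```

This simple model ignores cache details such as associativity and shared or exclusive designs, so measured speeds near a boundary may fall between the two levels.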
4. CPU/L1 Cache Speed %MIPS/MHz - 32 bit integers
CPUM is CPUID
BUS1 and BUS2 are BusSpd2K ANDing to 1 and 2 registers
RSRD and RSWR are RandMem serial read, read/write
DOPT and DNOP are Dhrystone 2 Optimised and Non-optimised
WHET is from Whetstone average integer MIPS
MMX1 is CPUID and MMX2 BusSpd2K
CPU CPUM BUS1 BUS2 RSRD RSWR DOPT DNOP WHET MMX1 MMX2
80486 90 45 47 53 45 53 19 44 N/A
Pentium 198 49 95 114 89 135 29 98 N/A
Pentium Pro 196 95 95 181 57 156 45 157 N/A
Pentium MMX 200 49 96 121 96 137 30 105 363 197
Celeron 196 98 96 186 89 160 45 159 292 196
Pentium II 196 97 97 186 89 160 45 159 293 196
Pentium III 197 97 98 186 89 160 45 159 289 197
Celeron M 196 97 99 193 90 176 50 156 292 199
Pentium M 196 97 99 198 91 211 52 181 286 199
Pentium 4 287 95 93 198 82 120 14 47 200 187
Pentium 4E/D 275 94 93 178 69 118 19 63 200 190
Core 2 Duo 293 99 99 204 146 269 52 180 363 197
AMD K6X 197 94 96 130 109 144 22 99 197 197
Duron 244 99 187 129 93 167 61 160 399 380
Athlon 244 99 184 129 93 167 61 160 399 376
Athlon XP 244 100 187 129 93 167 61 160 389 379
Turion 64 292 99 191 137 97 197 61 169 395 395
Athlon 64 293 100 193 126 92 200 62 170 400 397
Cyrix MX 86 128 86 69 135 30 85 163
Note: assuming CPUs of the same MHz, Pentium 4 is faster than
AMD 64 on only one example and AMD CPUs have larger L1 cache.
Pentiums could also be faster than P4s, probably due to shorter
pipelines. Pentium 4E has even longer pipelines and performance
characteristics are different to earlier P4s, some tests producing
slower results and others faster.
Pentium M/Celeron M and Core 2 Duo CPUs show superior performance
to the P4 range on many of these tests, influenced by shorter
pipelines.
5. CPU/L1 Cache Speed %MFLOPS/MHz - 32/64 bit SP/DP floating point
CPUF is CPUID where SP and DP results are usually the same
SSRD and SSRW are SSE3DNow normal SP Read and Read/Write
RSRD is RandMem serial read, usually same as random read
WHET is Whetstone SP benchmark average
LMAX and LAVG are Livermore Loops maximum and average
FFSP and FFDP are from FFTGraf version 1 for smallest FFTs
FFS2 and FFD2 are from FFTGraf optimised version 2
CPU CPUF SSRD SSRW RSRD WHET LMAX LAVG FFSP FFDP FFS2 FFD2
80486 8 8 5 6 6 8 4 5 4 5 4
Pentium 94 33 9 22 17 33 12 18 11 17 14
Pen Pro 100 66 18 33 21 60 17 24 19 40 30
Pent MMX 95 39 11 24 17 34 13 21 17 19 16
Celeron 100 65 17 33 21 59 18 37 31 46 40
PII 100 66 17 33 21 59 18 37 28 44 33
PIII 100 66 18 33 21 59 18 37 32 46 42
Celeron M 90 90 19 43 21 87 21 46 39 56 50
Pentium M 90 90 19 43 21 88 22 45 39 55 49
P4 80 32 9 28 9 60 11 26 19 38 32
P4E/D 66 51 14 22 8 50 10 25 23 29 29
Core 2 Duo 100 91 19 43 21 93 22 52 43 63 54
AMD K6X 47 29 9 15 15 36 12 14 14
Duron 100 87 23 34 17 113 20 44 38 60 59
Athlon 100 86 22 34 17 113 20 44 38
Athlon XP 100 88 23 34 17 113 20 47 42 65 58
Turion 64 100 84 22 35 17 114 21 46 40 64 57
Athlon 64 100 88 23 34 17 116 21 49 44 64 59
Cyrix MX 11 10 18 8
AMD CPUs show superior performance to P4 of same MHz on all these
floating point tests, average 1.9 times. They also have a larger
L1 cache. Pentium M/Celeron M and Core 2 Duo CPU scores are similar
to AMD’s but L1 cache sizes are smaller.
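The average ratio can be checked from the table with a few lines of code. This sketch assumes the comparison is between the Athlon 64 and P4 rows across all eleven columns; which rows were used for the quoted figure is not stated, so treat the choice as an assumption.

```python
# %MFLOPS/MHz rows copied from the table above (CPUF through FFD2).
P4 = [80, 32, 9, 28, 9, 60, 11, 26, 19, 38, 32]
ATHLON_64 = [100, 88, 23, 34, 17, 116, 21, 49, 44, 64, 59]

def mean_ratio(numerators, denominators):
    """Average of the column by column ratios between two table rows."""
    ratios = [n / d for n, d in zip(numerators, denominators)]
    return sum(ratios) / len(ratios)

print(round(mean_ratio(ATHLON_64, P4), 2))
```

For these two rows the result comes out close to the 1.9 times quoted, with individual ratios ranging from about 1.2 to about 2.8.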
6. CPU/L1 Cache Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP
xxSP is either 32 bit SSE or 3DNow
xxDP is 64 bit SSE2
CPxx is CPUID
RDxx is SSE3DNow Read and RWxx Read/Write
FFxx is FFTGraf version 3 single and double precision
CPU CPSP CPDP RDSP RDDP RWSP RWDP FFSP FFDP
80486 -
Pentium -
Pen Pro -
Pent MMX -
Celeron -
PII -
Cyrix MX -
PIII 184 - 160 - 76 - 54 -
Celeron M 187 94 191 89 93 48 61 40
Pentium M 189 94 193 90 97 49 60 40
P4 200 100 122 61 65 33 38 32
P4E/D 200 100 146 74 86 43 34 31
Core 2 Duo 400 200 264 132 198 99 93 72
AMD K6X 199 - 96 - 81 - ?? -
Duron 199 - 177 - 105 - 63 -
Athlon 200 - 171 - 102 - 63 -
Athlon XP* 200 - 196 - 99 - 63 -
Turion 64+ 200 100 187 90 96 48 63 44
Athlon 64+ 197 100 194 97 99 50 71 44
- Extensions N/A
* has SSE and 3DNow, similar performance
+ has SSE, 3DNow and SSE2
Again AMD performance is superior to P4s but similar to
Pentium M/Celeron M. Core 2 Duo scores are by far the best
on these SSE/SSE2 tests.
7. CPU/L2 Cache Speed %MIPS/MHz - 32 bit integers
BUS2 is BusSpd2K ANDing to 2 registers
RSRD is RandMem serial read
RSWR is RandMem serial read/write
RRRD is RandMem random read
MMX2 is BusSpd2K
CPU BUS2 RSRD RSWR RRRD MMX2
80486 11 19 9 10
Pentium 29 48 24 20
Pentium Pro 57 135 49 62
Pentium MMX 29 54 25 24 33
Celeron 54 122 63 75 91
Celeron 2 61 147 81 123 106
Pentium II 19 93 26 48 30
Pentium III 19 93 26 48 30
Pentium IIIE 61 148 81 123 107
Celeron M 63 143 75 96 108
Pentium M 63 146 76 98 108
Pentium 4 69 150 42 83 106
Pentium 4E/D 63 117 36 64 111
Core 2 Duo 85 179 138 95 110
AMD K62 21 62 20 13 21
AMD K62+ 63 107 79 29 82
Duron 55 89 56 54 74
Athlon 19 40 19 16 30
Athlon TB 63 93 60 55 80
Athlon XP 63 93 60 55 80
Turion 64 72 78 50 70 80
Athlon 64 78 91 57 81 121
Cyrix MX 45 22 22 38
Pentium 4 is faster than AMD CPUs of the same MHz up to Athlon
XP on these L2 cache tests. Athlon 64 shows improvements on
these speeds. Core 2 Duo is better than both P4 and Athlon 64
on most tests and has a much larger L2 cache.
8. CPU/L2 Cache Speed %MFLOPS/MHz - 32/64b SP/DP floating point
LINP is Linpack (Large L1 cache helps)
SSRD and SSRW are SSE3DNow normal SP Read and Read/Write
RSRD is RandMem serial read, usually same as random read
FFSP and FFDP are from FFTGraf version 1 for FFTs that use L2
FFS2 and FFD2 are from FFTGraf optimised version 2
CPU LINP SSRD SSRW RSRD FFSP FFDP FFS2 FFD2
80486 4 5 2 3 5 4 5 4
Pentium 13 17 6 7 12 9 16 13
Pentium Pro 24 60 13 26 20 19 39 30
Pentium MMX 12 23 7 9 12 11 16 13
Celeron 27 52 16 17 25 22 40 33
Celeron 2 31 56 17 17 30 27
Pentium II 15 40 11 11 16 13 26 18
Pentium III 15 40 11 11 16 13 26 18
Pentium IIIE 31 56 17 18 30 27 44 39
Celeron M 42 70 16 29 31 29 50 45
Pentium M 45 71 16 31 31 29 50 45
Pentium 4 28 31 9 30 23 18 39 32
Pentium 4E/D 21 40 12 21 22 20 21 22
Core 2 Duo 59 87 21 37 33 28 52 48
AMD K62 12 8 7 5
AMD K63 14 23 7 11
Duron 38 40 12 14 16 15 26 16
Athlon 36 8
Athlon TB 38 40 12 20 16 15 28 28
Athlon XP 38 40 12 20 16 15 37 29
Turion 64 37 43 12 21 16 15 42 28
Athlon 64 38 48 12 22 19 19 45 32
Cyrix MX 11 7
Latest AMD CPUs have similar average performance to P4s of the same
MHz, the former benefiting from faster floating point and the latter
from more efficient L2 cache. Pentium M/Celeron M are relatively
better but top marks go to Core 2 Duo.
9. CPU/L2 Cache Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP
xxSP is either 32 bit SSE or 3DNow, xxDP is 64 bit SSE2
RDxx is SSE3DNow Read and RWxx Read/Write
FFxx is FFTGraf version 3 single and double precision
CPU RDSP RDDP RWSP RWDP FFSP FFDP
Pentium III - - -
Pentium IIIE 106 - 42 - 52 -
Celeron M 110 55 42 21 51 36
Pentium M 110 55 40 20 51 36
Pentium 4 118 60 46 23 40 34
Pentium 4E/D 139 69 54 27 36 30
Core 2 Duo 178 87 86 43 79 61
AMD K62 - - -
AMD K63 62 - 36 - -
Duron 64 - 27 - 27 -
Athlon - - -
Athlon TB 68 - 26 - 30 -
Athlon XP 68 - 26 - 37 -
Turion 64 73 35 21 11 43 25
Athlon 64 94 45 25 12 47 28
- Extensions N/A
Performance of these SIMD streaming instructions is superior on the
P4 versus AMD CPUs of the same MHz. Athlon 64 shows improvements
over earlier AMD CPUs. Again, Core 2 Duo is best.
10. RAM Speed %MIPS/MHz - 32 bit integers
Performance via data in RAM may also depend on CPU MHz besides
RAM/bus speeds. This can result in a reduction in %MIPS/MHz with
higher speed CPUs. There can also be wide variations according to
mainboard/chipset.
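The reduction in %MIPS/MHz at higher CPU speeds is easier to see when the percentages are converted back to absolute MIPS. A minimal sketch, using the BUS2 figures from the two Celeron A rows (66 MHz bus) in this section's table; the function name is an assumption.

```python
def absolute_mips(percent_mips_per_mhz, cpu_mhz):
    """Convert a %MIPS/MHz table entry back to MIPS."""
    return percent_mips_per_mhz * cpu_mhz / 100.0

# Celeron A, BUS2 column: 23% at 300 MHz and 14% at 550 MHz.
slow_cpu = absolute_mips(23, 300)
fast_cpu = absolute_mips(14, 550)
print(slow_cpu, fast_cpu)
```

The faster CPU delivers only slightly more MIPS from RAM (77 against 69) despite running at nearly twice the MHz, so its percentage figure is much lower.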
BUS2 is BusSpd2K ANDing to 2 registers
RSRD is RandMem serial read
RSWR is RandMem serial read/write
RRRD is RandMem random read
MMX2 is BusSpd2K MMX instructions
CPU MHz RAM BUS2 RSRD RSWR RRRD MMX2
80486 66 9 18 6 5
Pentium 100 18 31 13 6
Pentium Pro 200 30 36 14 9
Pentium MMX 200 17 34 16 6 17
Celeron A 300 66 23 42 14 11 37
Celeron A 550 66 14 26 10 8 20
Celeron 2 566 66 15 27 11 8 23
Celeron 2 800 PC100 14 25 9 6 23
Pentium II 450 PC100 18 37 12 11 27
Pentium III 450 PC100 18 37 12 11 27
Pentium III 800 PC100 15 34 9 9 24
Pentium IIIEB 666 PC133 17 32 10 8 24
Pentium IIIEB 1000 PC133 15 28 6 6 23
Celeron M 1295 44 13 4
Pentium M 1862 PC2700D 31 62 16 10 34
Pentium 4 1400 PC800 24 52 17 4 29
Pentium 4 2400 PC800 16 34 11 3 20
Pentium 4N 2533 PC1066 21 43 15 4 23
Pentium 4N 1800 PC2100 20 43 13 3 23
Pentium 4N 2533 PC2100 16 34 9 2 17
Pentium 4N 3000 PC2700 18 37 8 2 19
Pentium 4N 3300 PC3200D 23 49 19 3 26
Pentium 4E/D 3000 PC3200D 30 65 21 3 32
Core 2 Duo XX 2400 DD2533D 45 95 9 4 51
Core 2 Duo 2400 DD2533D 45 102 29 9 54
Core 2 Duo 2400 DD2800D 51 109 35 11 60
AMD K6X 500 PC100 13 31 8 3 13
Duron 600 PC133 18 23 12 3 38
Duron 1000 PC133 10 15 7 2 26
Athlon 800 PC100 11 16 8 2 31
Athlon TB 750 PC133 17 18 9 3 31
Athlon TB 1000 PC133 11 14 7 2 26
Athlon XP 1200 PC2100 13 18 9 4 23
Athlon XP 1733 PC2100 13 19 9 5 19
Athlon XP 2167 PC2700D 12 22 13 5 18
Turion 64 1900 DD2320D 33 50 25 7 35
Athlon 64 2000 PC3200D 34 43 25 3 37
Athlon 64 2210 PC3200D 32 54 24 4 33
Cyrix MX 225 66 30 10 7
D = dual channel, DD2 = DDR2
XX - slow nForce 570 chipset on some tests
Comparing CPUs of the same MHz and the same memory speed, P4s show
superior speed via data in RAM compared with AMD CPUs prior to
Athlon 64. Core 2 Duo performance is outstanding on some of these
tests but poor on others, perhaps due to using the nForce 570
chipset. Later results via the Intel 965 chipset confirmed this
suspicion, with Core 2 Duo and DDR2 RAM taking the lead on performance.
11. RAM Speed %MFLOPS/MHz - 32/64b SP/DP floating point
Examples - see paragraph 10 notes, but FP variations are less
than with integers.
SSRD and SSRW are SSE3DNow normal SP Read and Read/Write
RSRD and RRRD are RandMem serial read and random read
FFSP and FFDP are from FFTGraf version 1 for largest FFTs
FFS2 and FFD2 are from FFTGraf optimised version 2
CPU MHz SSRD SSRW RSRD RRRD FFSP FFDP FFS2 FFD2
80486 66 5 2 2 1.0 3 3 4 3
Pentium 100 10 4 4 1.6 6 6 12 9
Pentium Pro 200 16 5 6 2.2 7 6 23 15
Pentium MMX 200 14 5 5 1.3 7 6 13 11
Celeron 300 19 5 4 2.2 8 6 22 12
Pentium II 450 16 5 4 2.0 5 5 15 10
Pentium III 450 16 5 4 2.0 5 5 15 10
Pentium IIIEB 800 12 4 4 1.6 5 4 22 16
Celeron M 1295 28 7 9 0.9 3 3 19 14
Pentium M DCDDR 1862 33 9 11 2.8 6 5 26 21
Pentium 4 RDRAM 2000 22 5 6 0.9 3 3 15 12
Pentium 4 DDR 2533 17 3 5 0.4 2 2 10 10
Pentium 4 DCDDR 3300 25 7 7 0.8
Pentium 4E DCDDR 3000 27 9 10 0.9 4 3 13 12
Core 2 Duo DDR2a 2400 8 3 5 1.2 3 3 26 17 XX
Core 2 Duo DDR2a 2400 50 16 16 2.5 8 7 35 31
Core 2 Duo DDR2c 2400 54 18 18 2.8 10 9 37 33
AMD K63 400 6 2 4 0.7 3 3
Duron 950 8 3 4 0.5 3 2 13 8
Athlon 550 5 0.8
Athlon TB 1000 4 0.5 2 2 13 8
Athlon XP 2167 15 6 5 1.3 3 3 16 12
Turion 64 DDR2d 1900 32 10 10 1.9 5 5 22 17
Athlon 64 2000 32 9 11 1.2
Athlon 64 DCDDR 2210 33 10 11 1.5 5 5 23 18
Cyrix MX 225 4 1.2
XX - slow nForce 570 chipset on some tests
DDR2a 533 MHz, DDR2c 800 MHz, DDR2d 320 MHz
Superior floating point performance with AMD CPUs helps to overcome
inferior RAM speeds. Again, some Core 2 Duo scores with the nForce 570
chipset are disappointing.
12. RAM Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP
Examples - see paragraph 10 notes, but FP variations are less than
with integers.
xxSP is either 32 bit SSE or 3DNow, xxDP is 64 bit SSE2
RDxx is SSE3DNow Read and RWxx Read/Write
FFxx is FFTGraf version 3 single and double precision
CPU MHz RDSP RDDP RWSP RWDP FFSP FFDP
Pentium IIIE 800 26 - 8 - 23 -
Celeron M 1295 28 14 8 4 22 13
Pentium M DCDDR 1862 31 16 9 5 34 20
Pentium 4 DDR 2533 18 9 4 2 12 9
Pentium 4 RDRAM 2411 24 12 7 4 14 11
Pentium 4 DCDDR 2533 22 11 6 3 15 11
Pentium 4E DCDDR 3000 31 15 10 5 22 16
Core 2 Duo DDR2a 2400 7 4 3 2 26 17 XX
Core 2 Duo DDR2a 2400 51 26 16 8 46 33
Core 2 Duo DDR2c 2400 60 30 20 10 49 39
AMD K63 400 7 - 2 - -
Duron 950 15 - 5 - 13 -
Athlon Tb 1200 9 - 4 - 12 -
Athlon XP 2167 19 - 7 - 17 -
Turion 64 1900 38 19 14 7 25 17
Athlon 64 2000 34 17 10 5 25 16
Athlon 64 DCDDR 2210 39 20 12 6 26 17
- Extensions N/A
XX - slow nForce 570 chipset on some tests
DDR2a 533 MHz, DDR2c 800 MHz, DDR2d 320 MHz
All these results could justify the claim that a 2 GHz Athlon 64 is
equivalent to a 3 GHz P4 when running SIMD streaming instructions
with data in RAM. Yet again, some Core 2 Duo results with the
nForce 570 chipset are very poor.
END
More Historic Computer Speeds
Computer Speeds From Instruction Mixes pre-1960 to 1971
Whetstone Benchmark History and Results (1960's to 1990's, PCs to 2006)
Computer Speed Claims 1980 to 1996 - PCs and IBM Mainframes to 2004
Main Page Roy Longbottom's PC Benchmark Collection
Produced by Roy Longbottom December 2004
Updated December 2007