
PC CPU Performance Comparisons

From Roy Longbottom's Benchmark Results

This page was set up as 770 pixels wide and accommodates preformatted text <PRE> results tables. Some browsers
produce monospaced font of an unexpected size but this might be adjustable via browser Preferences.

Summary

The document contains tables of CPU performance from 80486 to Intel Core 2 Duo and AMD Athlon 64, including speed via caches and RAM. The figures are derived from my benchmarks and results reports, which are available from the CompuServe PC Hardware Forum and my web site.


In analysing results of the different benchmarks, in virtually all cases (except for RAM), CPU speed, including using data in cache, is shown to be proportional to CPU MHz for a particular type of processor. So it has been decided to show the MHz of available CPUs and provide simple tables with one set of numbers for each type of CPU. For results depending on memory speed, results are provided for a range of RAM types.

As the number and type of instructions used in the benchmarks is known, performance results can be easily converted to Millions of Instructions Per Second (MIPS) or Millions of Floating Point Operations Per Second (MFLOPS). Comparison numbers represent percentages of MIPS or MFLOPS divided by CPU MHz.

Contents

Introduction
CPU Characteristics Intel AMD Cyrix
Benchmark results used
CPU/L1 Cache Speed 32 bit integers 32/64 bit floating point SSE, SSE2 floating point
CPU/L2 Cache Speed 32 bit integers 32/64 bit floating point SSE, SSE2 floating point
RAM Speed 32 bit integers 32/64 bit floating point SSE, SSE2 floating point


1. CPU Performance Tables - Introduction

In analysing results of the different benchmarks, in virtually all cases, CPU speed, including using data in cache, is shown to be proportional to CPU MHz for a particular type of processor. So it has been decided to show the MHz of available CPUs and provide simple tables with one set of numbers for each type of CPU. As the number and type of instructions used in the benchmarks is known, performance results can be easily converted to Millions of Instructions Per Second (MIPS) or Millions of Floating Point Operations Per Second (MFLOPS). Comparison numbers represent percentages of MIPS or MFLOPS divided by CPU MHz.

CPUs can have more than one integer or floating point pipeline, each of which may be able to produce one result per clock cycle. Also, P4 integer pipelines can execute two instructions per cycle. Thus the percentage calculations can exceed 100.
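As a minimal sketch of the calculation described above (written in Python purely for illustration; the function and parameter names are hypothetical and are not taken from the benchmark sources), the comparison figure can be derived from an operation count, a run time and the CPU clock:

```python
def pct_per_mhz(operations, seconds, cpu_mhz):
    """Return MIPS (or MFLOPS) as a percentage of CPU MHz."""
    millions_per_second = operations / seconds / 1e6
    return 100.0 * millions_per_second / cpu_mhz

# Illustrative figures only: one result per clock cycle on a 200 MHz CPU.
one_pipe = pct_per_mhz(200e6, 1.0, 200)    # 100.0
# Two pipelines, each producing one result per cycle, exceed 100%.
two_pipes = pct_per_mhz(400e6, 1.0, 200)   # 200.0
```

The second call shows how a figure above 100 can arise without any change in clock speed.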

Overall processor performance depends on the type/version and speed of CPU and cache sizes along with memory speed, which is dependent on the system board used. No attempt is made to provide an overall performance rating, the overall aim being to show some strengths and weaknesses of each type of processor.

Note that these benchmarks measure the performance of a single core on multi-core processors.



2. CPU Characteristics

Some of the following have been gathered from benchmark results and magazines and may not be completely accurate. Mobile CPUs have the same speeds using mains power but might have larger L2 caches and slower RAM.

Typical type of RAM is shown but some will have faster or slower variations, including dual memory channels.


Intel

XEON CPUs could be Pentium Pro, III or P4 usually with options for larger L2 caches.


 CPU            MHz   MHz KB L1  KB L2     Type L2     FSB Typical
               From    To Cache  Cache       Cache     MHz  RAM   

 Pentium         75   200    8  256-512    External  50-66  FPM   
 Pentium Pro    180   200    8  256-512    Internal  60-66  EDO   
 Pentium MMX    166   233   16  256-512    External     66  SDRAM 
 Pentium II     233   333   16      512  Half Speed     66  SDRAM 
 Pentium II     350   500   16      512  Half Speed    100  PC100 
 Celeron        266   300   16        0        None     66  SDRAM 
 Celeron A      300   550   16      128  Full Speed     66  SDRAM 
 Celeron 2      566   733   16      128  Full Speed     66  SDRAM 
 Celeron 3      800  1300   16      128  Full Speed    100  PC100 
 Celeron 4     1000  1400   16      256  Full Speed    100  PC100 
 Celeron P4    1700  2700    8      128  Full Speed    400  PC133 
 Cel D 315-351 2260  3260   16      256  Full Speed    533        
 Cel M 310-340 1200  1500   32      512  Full Speed    400        
 Cel M 350-380 1300  1600   32     1024  Full Speed    400        
 Pentium III    450   600   16      512  Half Speed    100  PC100 
 Pentium IIIB   533   600   16      512  Half Speed    133  PC133 
 Pentium IIIE   550  1100   16      256  Full Speed    100  PC100 
 Pentium IIIEB  533  1330   16      256  Full Speed    133  PC133 
 Pentium IIIT   800  1400   16      512  Full Speed    133  PC133 

 All the following have full speed L2 cache

 CPU            MHz   MHz KB L1  KB L2     FSB Typical
               From    To Cache  Cache     MHz  RAM

 Pentium M      900  1600   32     1024    400
 PM 735-765    1700  2100   32     2048    400
 PM 730-780    1600  2267   32     2048    533
 CS U1300-1400 1060  1200   32     2048    533
 CS T1300-1400 1666  1833   32     2048    667
 CD T2300-2700 1666  2333  32*2    2048    667
 CM U7500-7600 1066  1333  32*2    2048    533
 CM T5200-     1600        32*2    2048    533
 CM T5500-5600 1666  1833  32*2    2048    667
 CM L7300-7500 1600  1800  32*2    4096    800
 CM T7100      1800        32*2    2048    800
 CM T7200-7700 2000  2400  32*2    4096    800 
 Mob P4 518+   2800  3460   16     1024    533
 Pentium 4     1300  2000    8      256    400  RDRAM 
 Pentium 4N    1600  2200    8      512    400  PC2100
 Pentium 4N2   2200  3060    8      512    533  PC2700
 Pentium 4N3   2400  3200    8      512    800  PC3200
 Pentium 4EE   3200          8   512+L3#   800  PC3200
 Pentium 4E    2800  3600   16     1024    800  PC3200
 P4 505-519    2666  3060   16     1024    533
 P4 520-571    2800  3600   16     1024    800
 P4 630-672    3000  3800   16     2048    800
 P4 820-840 D  2800  3200  16*2   1024*2   800
 P4 920-960 D  2800  3600  16*2   2048*2   800
 P4EE 840      3200        16*2   1024*2   800
 P4EE 955-965  3460  3730  16*2   2048*2  1066
 C2D 4300-4500 1800  2200  16*2   2048*2   800 
 C2D 6300-6400 1860  2130  32*2    2048   1066
 C2D 6600-6700 2400  2666  32*2    4096   1066
 C2D 6540-6850 2333  3000  32*2    4096   1333
 C2E 6800      2930        32*2    4096   1066
 C2Q 6600-6700 2400  2666  32*4    8192   1066
 QXE 6700-6800 2666  2930  32*4    8192   1066
 QXE 6850      3000        32*4    8192   1333

 #   = L3 cache 2048 KB, Full Speed

 CS  = Core Solo Mobile Processor
 CD  = Core Duo Mobile Processor
 CM  = Core 2 Duo Mobile Processor
 D   = Dual CPU chip, caches on each
 C2D = Core 2 Duo Processor, shared L2 cache
 C2E = Core 2 Extreme, shared L2 cache
 C2Q = Core 2 Quad Processor
 QXE = Core 2 Extreme Quad Processor
                     



AMD

Athlon MP varieties are essentially the same as XP.


 CPU            MHz   MHz KB L1  KB L2     Type L2     FSB  Typical
               From    To Cache  Cache       Cache     MHz  RAM 
        
 K5              75   100    8      512    External  50-66  EDO
 K6             166   300   32      512    External     66  SDRAM
 K62            300   475   32 512-1024    External    100  PC100
 K63            400   500   32   256+L3  Full Speed    100  PC100
 Duron          550  1800   64       64  Full Speed    200  PC100
 Athlon         500   750   64      512  Half or <     200  PC100
 Athlon Tbird   650  1400   64      256  Full Speed    200  PC133
 Athlon Tbird  1000  1400   64      256  Full Speed    266  PC2100
 AXP 1500-2100 1333  1733   64      256  Full Speed    266  PC2100
 AXP 1700-2200 1467  1800   64      256  Full Speed    266  PC2100
 AXP 2600-2700 2083  2167   64      256  Full Speed    333  PC2700
 AXP 2500-3000 1833  2167   64      512  Full Speed    333  PC2700
 AXP 3000-3200 2100  2200   64      512  Full Speed    400  PC3200
 Sem 2200-2800 1500  2000   64      256  Full Speed    333  PC2700
 Sem 3000      2000         64      512  Full Speed    333  PC2700

All the following have full speed L2 cache and HyperTransport bus

 CPU            MHz   MHz KB L1  KB L2    Maximum 
               From    To Cache  Cache    RAM MB/Sec

 Sem 3100      1800         64      256    3200
 Sem 3000      1800         64      128    3200
 Sem 2600-3300 1600  2000   64      128    3200
 Sem 2500-3600 1400  2200   64      256    3200
 SMo 2600-3500 1600  2000   64      128
 SMo 2800-3800 1600  2200   64      256
 SMo 3200-3600 1600  1800   64      512
 A64 FX51-74   2200  3000   64     1024    3200 * 2 Chan
 A64 2800-3700 1600  2400   64     1024    DD2  1 Chan
 A64 2800-3700 1800  2400   64      512    DD2  1 Chan
 A64 3000-4000 1800  2600   64      512    DD2  2 Chan
 A64 3700-4000 2200  2400   64     1024    DD2  2 Chan
 AX2 3800-5400 2000  2800  64*2    512*2   DD2  2 Chan
 AX2 4400-6000 2200  3000  64*2   1024*2   DD2  2 Chan
 AX2 BE2300-50 1900  2100  64*2    512*2   DD2  
 AMo 2700-3000 1600  2000   64      512    DD2  1 Chan
 AMo 2800-4000 1600  2600   64     1024    DD2  1 Chan
 TMo ML28-ML32 1600  1800   64      512    DD2  1 Chan
 TMo ML30-ML34 1600  1800   64     1024    DD2  1 Chan
 TMo MT32      1800         64      512    DD2  1 Chan
 TMo MT34-MT37 1800  2000   64     1024    DD2  1 Chan
 TMd TL50      1600        64*2    256*2   DD2  1 Chan
 TMd TL52-TL66 1600  2300  64*2    512*2   DD2  1 Chan
 Opt  140-150  1400  2400   64     1024    DD2  2 Chan
 Opt  840-850  1400  2400   64     1024    DD2  2 Chan
 Opt  842-856  1600  3000   64     1024    DD2  2 Chan
 Opt  242-256  1600  3000   64     1024    DD2  2 Chan
 Opt  144-154  1800  2800   64     1024    DD2  2 Chan
 Opt 1210-1222 1800  3000   64     2048    667  2 Chan
 Opt 2210-2222 1800  3000   64     2048    533  2 Chan
 Opt 8212-8222 2000  3000   64     2048    533  2 Chan
 O2C  265-290  1800  2800  64*2   1024*2   DD2  2 Chan
 O2C  865-890  1800  2800  64*2   1024*2   DD2  2 Chan
 O2C  165-185  1800  2600  64*2   1024*2   DD2  2 Chan

 AXP = Athlon XP,    Sem = Sempron,       SMo = Mobile Sempron         
 A64 = Athlon 64,    AX2 = A64 Dual core, AMo = Mobile A64 
 Opt = Opteron,      O2C = Opt Dual Core, TMo = Turion 64 Mobile
 TMd = Dual Core Turion 64 Mobile
 DD2 - Latest DDR2 up to 800 MHz  


Cyrix


 CPU            MHz   MHz KB L1  KB L2     Type L2     FSB  Typical
               From    To Cache  Cache       Cache     MHz  RAM

 Cyrix          120   200   16  256-512    External  60-66  SDRAM
 Cyrix MX       150   233   64      512    External     66  SDRAM
 Cyrix M2       225   333   64 512-1024    External  75-83  SDRAM


3. Benchmark Results Used

CPUID (WhatCPU.zip) has tests using 1, 2, 3 and 4 registers, simply adding 1. Tests cover integers, 32 bit SP and 64 bit DP floating point, MMX 32 bit integers, SSE SPFP, 3DNow SPFP and SSE2 DPFP. The assembly code loop has 20 adds plus a decrement and jump. The latter two are included in the instruction count for integer speed.
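The instruction counting described above can be sketched as follows. This is an illustration in Python, not the benchmark's assembly code; the function name and the timing figures are hypothetical, and counting the decrement/jump as two instructions is an assumption.

```python
ADDS_PER_PASS = 20   # adds in the timed loop, from the description above
LOOP_OVERHEAD = 2    # assumed: decrement + jump counted as two instructions

def cpuid_integer_mips(passes, seconds):
    """MIPS for the integer test, including loop overhead instructions."""
    instructions = passes * (ADDS_PER_PASS + LOOP_OVERHEAD)
    return instructions / seconds / 1e6

# Hypothetical timing: 10 million passes of the loop in 0.5 seconds.
mips = cpuid_integer_mips(10_000_000, 0.5)   # 440.0
```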

BusSpd2K (.zip) is a read only test using streamed sequential integer data. The assembly code timed loops use 64 integer AND from memory instructions or 512 MMX 64 bit MOV instructions. The MMX test is intended to show maximum data transfer rate from memory.
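A rough sketch of the data volumes implied by these loops is given below, in Python for illustration only. It assumes that each integer AND reads one 32 bit word, that each MMX MOV reads 64 bits, and that loop overhead instructions are ignored; the names are hypothetical.

```python
AND_INSTRUCTIONS = 64    # integer AND-from-memory instructions per pass
AND_BYTES = 4            # assumed: each AND reads one 32 bit word
MMX_INSTRUCTIONS = 512   # MMX MOV instructions per pass
MMX_BYTES = 8            # each MOV reads 64 bits

def bytes_per_pass(instructions, bytes_each):
    """Bytes read from memory in one pass of the timed loop."""
    return instructions * bytes_each

def mbytes_per_second(mips, bytes_each):
    """Millions of bytes per second, one memory read per instruction."""
    return mips * bytes_each

integer_bytes = bytes_per_pass(AND_INSTRUCTIONS, AND_BYTES)   # 256
mmx_bytes = bytes_per_pass(MMX_INSTRUCTIONS, MMX_BYTES)       # 4096
```

Under these assumptions an MMX result in MIPS converts to a transfer rate by multiplying by 8.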

RandMem (.zip) uses C code with eight & xi[xi[i+0]] | xi[xi[i+2]] statements (FP +, -, *) for reading data or eight xi[xi[i+2]] = xi[xi[i+0]] statements for read/write. The pair are used for reading/writing 128 bytes in the loop, with the indexing producing either sequential or random access. Four tests cover 32 bit integer operation and four cover 64 bit double precision floating point. With indexing calculations, this benchmark has more instructions per word read or written than the other memory tests, so the data transfer rate may be lower but the MIPS/MHz ratio higher.
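The indexed accessing can be illustrated loosely as below. This is a Python sketch, not the RandMem C source: the helper names, the array size and the way the index array is filled are all assumptions. The point is that the same loop touches memory in order or in scattered order depending only on the contents of the index array.

```python
import random

def make_index(n, randomise, seed=1):
    """Index array holding 0..n-1, either in order or shuffled."""
    xi = list(range(n))
    if randomise:
        random.Random(seed).shuffle(xi)
    return xi

def indexed_read(xi):
    """OR together words fetched through the index array."""
    acc = 0
    for i in range(len(xi) - 2):
        acc |= xi[xi[i + 0]] | xi[xi[i + 2]]
    return acc

serial = indexed_read(make_index(1024, randomise=False))
scattered = indexed_read(make_index(1024, randomise=True))
# Both runs visit every word once via the index, so the results match.
```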

BenchNT.zip provides Whetstone, Dhrystone, Linpack and Livermore Loops Classic benchmarks, representing old code, much of it with a small number of instructions in loops. Benchmark codes are provided compiled with full optimisation and with no optimisation.

SSE3DNow (.zip) uses the same read and read/write functions as MemSpeed - C code - s=s+x[m]*y[m] and x[m]=x[m]+y[m]. The tests are run via the C code in Single Precision Floating Point with 16 unrolled statements in the inner loop (read 128 bytes, read/write 64/64 bytes). When available, the tests are also run using assembly instructions for SSE (SP), 3DNow (SP) and SSE2 (DP) handling the same amount of data in the inner loop.
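The two inner-loop statements quoted above can be sketched as follows. This is a plain Python rendering for illustration, without the 16-way unrolling or the SSE/3DNow assembly versions; the function names are hypothetical, and counting two floating point operations per element for the read test and one for the read/write test is an assumption about how MFLOPS would be derived.

```python
def read_test(x, y):
    """s = s + x[m] * y[m]: reads both arrays, writes nothing back."""
    s = 0.0
    for m in range(len(x)):
        s = s + x[m] * y[m]
    return s

def read_write_test(x, y):
    """x[m] = x[m] + y[m]: reads both arrays and writes x."""
    for m in range(len(x)):
        x[m] = x[m] + y[m]

def mflops(flops_per_element, elements, seconds):
    """Millions of floating point operations per second."""
    return flops_per_element * elements / seconds / 1e6

x = [1.0] * 16
y = [2.0] * 16
total = read_test(x, y)    # 16 multiply-adds
read_write_test(x, y)      # every element of x is now 3.0
```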

FFTGraf (.zip) calculates Fast Fourier Transforms using single and double precision floating point numbers. Version 1 uses optimised C code with later versions using assembly code instructions. Memory accessing is on a random/skipped sequential basis where performance suffers due to burst reading from RAM. Earlier systems read in 32 byte bursts with later ones using 64 bytes with potentially slower speed on this and RandMem benchmarks.
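One way to picture the burst reading penalty is the fraction of each burst that is actually used. The Python sketch below takes the worst case as an assumption, where a skipped access uses only one 4 byte single precision word from each burst; the function name is hypothetical.

```python
def useful_fraction(word_bytes, burst_bytes):
    """Fraction of a RAM burst used when only one word is wanted."""
    return word_bytes / burst_bytes

early = useful_fraction(4, 32)   # 32 byte bursts: 0.125
later = useful_fraction(4, 64)   # 64 byte bursts: 0.0625
```

On this assumption, doubling the burst size halves the useful fraction, which is consistent with the potentially slower speeds mentioned above.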

BusSpd2K, RandMem and SSE3DNow run tests with increasing memory demands to measure performance via caches and RAM. Similarly, FFTGraf uses increasing FFT sizes.
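The idea of stepping through increasing memory sizes can be sketched as below. This Python fragment is illustrative only: the doubling sequence of sizes is hypothetical, the cache sizes are the Pentium III figures from the characteristics table (16 KB L1, 512 KB L2), and real results do not change as sharply at the boundaries as this simple classification suggests.

```python
def memory_level(working_set_kb, l1_kb, l2_kb):
    """Name the level that can hold the whole working set."""
    if working_set_kb <= l1_kb:
        return "L1"
    if working_set_kb <= l2_kb:
        return "L2"
    return "RAM"

# Hypothetical test sizes, doubling from 4 KB to 8192 KB.
sizes_kb = [4 * 2 ** n for n in range(12)]
levels = [memory_level(kb, l1_kb=16, l2_kb=512) for kb in sizes_kb]
```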



4. CPU/L1 Cache Speed %MIPS/MHz - 32 bit integers


 CPUM is CPUID
 BUS1 and BUS2 are BusSpd2K ANDing to 1 and 2 registers
 RSRD and RSWR are RandMem serial read, read/write
 DOPT and DNOP are Dhrystone 2 Optimised and Non-optimised
 WHET is from Whetstone average integer MIPS
 MMX1 is CPUID and MMX2 BusSpd2K

 CPU         CPUM BUS1 BUS2 RSRD RSWR DOPT DNOP WHET MMX1 MMX2
       
 80486         90   45   47   53   45   53   19   44  N/A     
 Pentium      198   49   95  114   89  135   29   98  N/A     
 Pentium Pro  196   95   95  181   57  156   45  157  N/A     
 Pentium MMX  200   49   96  121   96  137   30  105  363  197
 Celeron      196   98   96  186   89  160   45  159  292  196
 Pentium II   196   97   97  186   89  160   45  159  293  196
 Pentium III  197   97   98  186   89  160   45  159  289  197
 Celeron M    196   97   99  193   90  176   50  156  292  199
 Pentium M    196   97   99  198   91  211   52  181  286  199
 Pentium 4    287   95   93  198   82  120   14   47  200  187
 Pentium 4E/D 275   94   93  178   69  118   19   63  200  190
 Core 2 Duo   293   99   99  204  146  269   52  180  363  197

 AMD K6X      197   94   96  130  109  144   22   99  197  197
 Duron        244   99  187  129   93  167   61  160  399  380
 Athlon       244   99  184  129   93  167   61  160  399  376
 Athlon XP    244  100  187  129   93  167   61  160  389  379
 Turion 64    292   99  191  137   97  197   61  169  395  395
 Athlon 64    293  100  193  126   92  200   62  170  400  397

 Cyrix MX           86  128   86   69  135   30   85       163

Note: assuming CPUs of the same MHz, Pentium 4 is faster than Athlon 64 on only one test, and AMD CPUs have a larger L1 cache. Earlier Pentiums could also be faster than P4s, probably due to shorter pipelines. Pentium 4E has even longer pipelines and its performance characteristics differ from earlier P4s, some tests producing slower results and others faster. Pentium M/Celeron M and Core 2 Duo CPUs show superior performance to the P4 range on many of these tests, influenced by shorter pipelines.


5. CPU/L1 Cache Speed %MFLOPS/MHz - 32/64 bit SP/DP floating point


 CPUF is CPUID where SP and DP results are usually the same
 SSRD and SSRW are SSE3DNow normal SP Read and Read/Write
 RSRD is RandMem serial read, usually same as random read
 WHET is Whetstone SP benchmark average
 LMAX and LAVG are Livermore Loops maximum and average     
 FFSP and FFDP are from FFTGraf version 1 for smallest FFTs
 FFS2 and FFD2 are from FFTGraf optimised version 2

 CPU         CPUF SSRD SSRW RSRD WHET LMAX LAVG FFSP FFDP FFS2 FFD2
                                                        
 80486         8     8    5    6    6    8    4    5    4    5    4
 Pentium      94    33    9   22   17   33   12   18   11   17   14
 Pen Pro     100    66   18   33   21   60   17   24   19   40   30
 Pent MMX     95    39   11   24   17   34   13   21   17   19   16
 Celeron     100    65   17   33   21   59   18   37   31   46   40
 PII         100    66   17   33   21   59   18   37   28   44   33
 PIII        100    66   18   33   21   59   18   37   32   46   42
 Celeron M    90    90   19   43   21   87   21   46   39   56   50
 Pentium M    90    90   19   43   21   88   22   45   39   55   49
 P4           80    32    9   28    9   60   11   26   19   38   32
 P4E/D        66    51   14   22    8   50   10   25   23   29   29
 Core 2 Duo  100    91   19   43   21   93   22   52   43   63   54

 AMD K6X      47    29    9   15   15   36   12   14   14
 Duron       100    87   23   34   17  113   20   44   38   60   59
 Athlon      100    86   22   34   17  113   20   44   38          
 Athlon XP   100    88   23   34   17  113   20   47   42   65   58
 Turion 64   100    84   22   35   17  114   21   46   40   64   57
 Athlon 64   100    88   23   34   17  116   21   49   44   64   59

 Cyrix MX                     11   10   18    8                    

AMD CPUs show superior performance to P4s of the same MHz on all these floating point tests, averaging 1.9 times faster. They also have a larger L1 cache. Pentium M/Celeron M and Core 2 Duo CPU scores are similar to AMD’s but L1 cache sizes are smaller.
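The average quoted above can be checked with a short calculation. The Python sketch below assumes that the comparison is between the Athlon 64 and P4 rows of the table in this section, which is not stated explicitly; the values are copied from those two rows.

```python
# Columns: CPUF SSRD SSRW RSRD WHET LMAX LAVG FFSP FFDP FFS2 FFD2
p4        = [80, 32, 9, 28, 9, 60, 11, 26, 19, 38, 32]
athlon_64 = [100, 88, 23, 34, 17, 116, 21, 49, 44, 64, 59]

# Per-test speed ratio at the same MHz, then the simple mean.
ratios = [a / p for a, p in zip(athlon_64, p4)]
average_ratio = sum(ratios) / len(ratios)   # about 1.9
```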


6. CPU/L1 Cache Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP


 xxSP is either 32 bit SSE or 3DNow
 xxDP is 64 bit SSE2
 CPxx is CPUID
 RDxx is SSE3DNow Read and RWxx Read/Write
 FFxx is FFTGraf version 3 single and double precision

 CPU         CPSP CPDP RDSP RDDP RWSP RWDP FFSP FFDP

 80486         -
 Pentium       -
 Pen Pro       -
 Pent MMX      -
 Celeron       -
 PII           -
 Cyrix MX      -

 PIII        184     -  160    -   76    -   54    -
 Celeron M   187    94  191   89   93   48   61   40
 Pentium M   189    94  193   90   97   49   60   40
 P4          200   100  122   61   65   33   38   32
 P4E/D       200   100  146   74   86   43   34   31
 Core 2 Duo  400   200  264  132  198   99   93   72

 AMD K6X     199     -   96    -   81    -   ??    -
 Duron       199     -  177    -  105    -   63    -
 Athlon      200     -  171    -  102    -   63    -
 Athlon XP*  200     -  196    -   99    -   63    -
 Turion 64+  200   100  187   90   96   48   63   44
 Athlon 64+  197   100  194   97   99   50   71   44

 - Extensions N/A                        
 * has SSE and 3DNow, similar performance
 + has SSE, 3DNow and SSE2               

Again AMD performance is superior to P4s but similar to Pentium M/Celeron M. Core 2 Duo scores are by far the best on these SSE/SSE2 tests.


7. CPU/L2 Cache Speed %MIPS/MHz - 32 bit integers


 BUS2 is BusSpd2K ANDing to 2 registers
 RSRD is RandMem serial read
 RSWR is RandMem serial read/write
 RRRD is RandMem random read
 MMX2 is BusSpd2K

 CPU         BUS2 RSRD RSWR RRRD MMX2

 80486         11   19    9   10
 Pentium       29   48   24   20
 Pentium Pro   57  135   49   62 
 Pentium MMX   29   54   25   24   33
 Celeron       54  122   63   75   91
 Celeron 2     61  147   81  123  106
 Pentium II    19   93   26   48   30
 Pentium III   19   93   26   48   30
 Pentium IIIE  61  148   81  123  107
 Celeron M     63  143   75   96  108
 Pentium M     63  146   76   98  108
 Pentium 4     69  150   42   83  106
 Pentium 4E/D  63  117   36   64  111
 Core 2 Duo    85  179  138   95  110

 AMD K62       21   62   20   13   21
 AMD K62+      63  107   79   29   82
 Duron         55   89   56   54   74
 Athlon        19   40   19   16   30
 Athlon TB     63   93   60   55   80
 Athlon XP     63   93   60   55   80
 Turion 64     72   78   50   70   80
 Athlon 64     78   91   57   81  121

 Cyrix MX           45   22   22   38

Pentium 4 is faster than AMD CPUs of the same MHz up to Athlon XP on these L2 cache tests. Athlon 64 shows improvements on these speeds. Core 2 Duo is better than both P4 and Athlon 64 on most tests and has a much larger L2 cache.


8. CPU/L2 Cache Speed %MFLOPS/MHz - 32/64b SP/DP floating point


 LINP is Linpack (Large L1 cache helps)
 SSRD and SSRW are SSE3DNow normal SP Read and Read/Write     
 RSRD is RandMem serial read, usually same as random read     
 FFSP and FFDP are from FFTGraf version 1 for FFTs that use L2
 FFS2 and FFD2 are from FFTGraf optimised version 2           

 CPU         LINP SSRD SSRW RSRD FFSP FFDP FFS2 FFD2
                                                                
 80486          4    5    2    3    5    4    5    4
 Pentium       13   17    6    7   12    9   16   13
 Pentium Pro   24   60   13   26   20   19   39   30
 Pentium MMX   12   23    7    9   12   11   16   13
 Celeron       27   52   16   17   25   22   40   33
 Celeron 2     31   56   17   17   30   27          
 Pentium II    15   40   11   11   16   13   26   18
 Pentium III   15   40   11   11   16   13   26   18
 Pentium IIIE  31   56   17   18   30   27   44   39
 Celeron M     42   70   16   29   31   29   50   45
 Pentium M     45   71   16   31   31   29   50   45
 Pentium 4     28   31    9   30   23   18   39   32
 Pentium 4E/D  21   40   12   21   22   20   21   22
 Core 2 Duo    59   87   21   37   33   28   52   48

 AMD K62       12              8    7    5          
 AMD K63       14   23    7   11                    
 Duron         38   40   12   14   16   15   26   16
 Athlon        36              8                    
 Athlon TB     38   40   12   20   16   15   28   28
 Athlon XP     38   40   12   20   16   15   37   29
 Turion 64     37   43   12   21   16   15   42   28
 Athlon 64     38   48   12   22   19   19   45   32

 Cyrix MX      11              7                    

Latest AMD CPUs have similar average performance to P4s of the same MHz, the former benefiting from faster floating point and the latter from more efficient L2 cache. Pentium M/Celeron M are relatively better but top marks go to Core 2 Duo.


9. CPU/L2 Cache Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP


 xxSP is either 32 bit SSE or 3DNow, xxDP is 64 bit SSE2
 RDxx is SSE3DNow Read and RWxx Read/Write
 FFxx is FFTGraf version 3 single and double precision

 CPU          RDSP RDDP RWSP RWDP FFSP FFDP

 Pentium III          -         -         -
 Pentium IIIE  106    -   42    -   52    -
 Celeron M     110   55   42   21   51   36
 Pentium M     110   55   40   20   51   36
 Pentium 4     118   60   46   23   40   34
 Pentium 4E/D  139   69   54   27   36   30
 Core 2 Duo    178   87   86   43   79   61

 AMD K62              -         -         -
 AMD K63        62    -   36    -         -
 Duron          64    -   27    -   27    -
 Athlon               -         -         -
 Athlon TB      68    -   26    -   30    -
 Athlon XP      68    -   26    -   37    -
 Turion 64      73   35   21   11   43   25
 Athlon 64      94   45   25   12   47   28

                     - Extensions N/A

Performance of these SIMD streaming instructions is superior on the P4 versus AMD CPUs of the same MHz. Athlon 64 shows improvements over earlier AMD CPUs. Again, Core 2 Duo is best.


10. RAM Speed %MIPS/MHz - 32 bit integers

Performance via data in RAM may also depend on CPU MHz besides RAM/bus speeds. This can result in a reduction in %MIPS/MHz with higher speed CPUs. There can also be wide variations according to mainboard/chipset.
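The effect can be seen by converting the percentages back to absolute MIPS. The Python sketch below uses the RSRD figures from the two Celeron A rows of the table in this section (300 and 550 MHz, both with 66 MHz RAM); the function name is hypothetical.

```python
def absolute_mips(pct_per_mhz, cpu_mhz):
    """Convert a %MIPS/MHz table entry back to MIPS."""
    return pct_per_mhz * cpu_mhz / 100.0

slow = absolute_mips(42, 300)   # Celeron A 300 MHz: 126 MIPS
fast = absolute_mips(26, 550)   # Celeron A 550 MHz: 143 MIPS
```

The clock is 1.83 times faster but the RAM-bound speed rises by only about 13%, so the percentage falls.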


 BUS2 is BusSpd2K ANDing to 2 registers
 RSRD is RandMem serial read
 RSWR is RandMem serial read/write
 RRRD is RandMem random read
 MMX2 is BusSpd2K MMX instructions

 CPU            MHz    RAM BUS2 RSRD RSWR RRRD MMX2

 80486           66           9   18    6    5     
 Pentium        100          18   31   13    6     
 Pentium Pro    200          30   36   14    9     
 Pentium MMX    200          17   34   16    6   17
 Celeron A      300     66   23   42   14   11   37
 Celeron A      550     66   14   26   10    8   20
 Celeron 2      566     66   15   27   11    8   23
 Celeron 2      800  PC100   14   25    9    6   23
 Pentium II     450  PC100   18   37   12   11   27
 Pentium III    450  PC100   18   37   12   11   27
 Pentium III    800  PC100   15   34    9    9   24
 Pentium IIIEB  666  PC133   17   32   10    8   24
 Pentium IIIEB 1000  PC133   15   28    6    6   23
 Celeron M     1295               44   13    4     
 Pentium M     1862 PC2700D  31   62   16   10   34
 Pentium 4     1400  PC800   24   52   17    4   29
 Pentium 4     2400  PC800   16   34   11    3   20
 Pentium 4N    2533 PC1066   21   43   15    4   23
 Pentium 4N    1800 PC2100   20   43   13    3   23
 Pentium 4N    2533 PC2100   16   34    9    2   17
 Pentium 4N    3000 PC2700   18   37    8    2   19
 Pentium 4N    3300 PC3200D  23   49   19    3   26
 Pentium 4E/D  3000 PC3200D  30   65   21    3   32
 Core 2 Duo XX 2400 DD2533D  45   95    9    4   51
 Core 2 Duo    2400 DD2533D  45  102   29    9   54
 Core 2 Duo    2400 DD2800D  51  109   35   11   60

 AMD K6X        500  PC100   13   31    8    3   13
 Duron          600  PC133   18   23   12    3   38
 Duron         1000  PC133   10   15    7    2   26
 Athlon         800  PC100   11   16    8    2   31
 Athlon TB      750  PC133   17   18    9    3   31
 Athlon TB     1000  PC133   11   14    7    2   26
 Athlon XP     1200 PC2100   13   18    9    4   23
 Athlon XP     1733 PC2100   13   19    9    5   19
 Athlon XP     2167 PC2700D  12   22   13    5   18
 Turion 64     1900 DD2320D  33   50   25    7   35
 Athlon 64     2000 PC3200D  34   43   25    3   37
 Athlon 64     2210 PC3200D  32   54   24    4   33

 Cyrix MX       225     66        30   10    7     

 D = dual channel     DD2 = DDR2  
 XX - slow nForce 570 chipset on some tests


Comparing CPUs of the same MHz and the same memory speed, P4s show superior speed with data in RAM compared with AMD CPUs prior to Athlon 64. Core 2 Duo performance is outstanding on some of these tests but poor on others, perhaps due to using the nForce 570 chipset. Later results using the Intel 965 chipset confirmed this suspicion, with Core 2 Duo and DDR2 RAM taking the lead on performance.


11. RAM Speed %MFLOPS/MHz - 32/64b SP/DP floating point

Examples - see section 10 notes, but FP variations are less than with integers.


 SSRD and SSRW are SSE3DNow normal SP Read and Read/Write 
 RSRD and RRRD are RandMem serial read and random read    
 FFSP and FFDP are from FFTGraf version 1 for largest FFTs
 FFS2 and FFD2 are from FFTGraf optimised version 2       

 CPU               MHz SSRD SSRW RSRD RRRD FFSP FFDP FFS2 FFD2

 80486              66    5    2    2  1.0    3    3    4    3
 Pentium           100   10    4    4  1.6    6    6   12    9
 Pentium Pro       200   16    5    6  2.2    7    6   23   15
 Pentium MMX       200   14    5    5  1.3    7    6   13   11
 Celeron           300   19    5    4  2.2    8    6   22   12
 Pentium II        450   16    5    4  2.0    5    5   15   10
 Pentium III       450   16    5    4  2.0    5    5   15   10
 Pentium IIIEB     800   12    4    4  1.6    5    4   22   16
 Celeron M        1295   28    7    9  0.9    3    3   19   14
 Pentium M DCDDR  1862   33    9   11  2.8    6    5   26   21
 Pentium 4 RDRAM  2000   22    5    6  0.9    3    3   15   12
 Pentium 4 DDR    2533   17    3    5  0.4    2    2   10   10
 Pentium 4 DCDDR  3300   25    7    7  0.8                    
 Pentium 4E DCDDR 3000   27    9   10  0.9    4    3   13   12
 Core 2 Duo DDR2a 2400    8    3    5  1.2    3    3   26   17 XX
 Core 2 Duo DDR2a 2400   50   16   16  2.5    8    7   35   31
 Core 2 Duo DDR2c 2400   54   18   18  2.8   10    9   37   33

 AMD K63           400    6    2    4  0.7    3    3          
 Duron             950    8    3    4  0.5    3    2   13    8
 Athlon            550              5  0.8                    
 Athlon TB        1000              4  0.5    2    2   13    8
 Athlon XP        2167   15    6    5  1.3    3    3   16   12
 Turion 64 DDR2d  1900   32   10   10  1.9    5    5   22   17
 Athlon 64        2000   32    9   11  1.2                    
 Athlon 64 DCDDR  2210   33   10   11  1.5    5    5   23   18

 Cyrix MX          225              4  1.2                    

 XX - slow nForce 570 chipset on some tests
 DDR2a 533 MHz, DDR2c 800 MHz, DDR2d 320 MHz


Superior floating point performance with AMD CPUs helps to overcome inferior RAM speeds. Again, some Core 2 Duo scores with the nForce 570 chipset are disappointing.


12. RAM Speed %MFLOPS/MHz - SSE SP, 3DNow SP, SSE2 DP FP

Examples - see section 10 notes, but FP variations are less than with integers.


 xxSP is either 32 bit SSE or 3DNow, xxDP is 64 bit SSE2
 RDxx is SSE3DNow Read and RWxx Read/Write
 FFxx is FFTGraf version 3 single and double precision

 CPU               MHz RDSP RDDP RWSP RWDP FFSP FFDP

 Pentium IIIE      800   26    -    8    -   23    -
 Celeron M        1295   28   14    8    4   22   13
 Pentium M DCDDR  1862   31   16    9    5   34   20
 Pentium 4 DDR    2533   18    9    4    2   12    9
 Pentium 4 RDRAM  2411   24   12    7    4   14   11
 Pentium 4 DCDDR  2533   22   11    6    3   15   11
 Pentium 4E DCDDR 3000   31   15   10    5   22   16
 Core 2 Duo DDR2a 2400    7    4    3    2   26   17 XX
 Core 2 Duo DDR2a 2400   51   26   16    8   46   33
 Core 2 Duo DDR2c 2400   60   30   20   10   49   39

 AMD K63           400    7    -    2    -         -
 Duron             950   15    -    5    -   13    -
 Athlon Tb        1200    9    -    4    -   12    -
 Athlon XP        2167   19    -    7    -   17    -
 Turion 64        1900   38   19   14    7   25   17
 Athlon 64        2000   34   17   10    5   25   16
 Athlon 64 DCDDR  2210   39   20   12    6   26   17

 - Extensions N/A  XX - slow nForce 570 chipset on some tests
 DDR2a 533 MHz, DDR2c 800 MHz, DDR2d 320 MHz


These results could justify the claim that a 2 GHz Athlon 64 is equivalent to a 3 GHz P4 when running SIMD streaming instructions with data in RAM. Yet again, some Core 2 Duo results with the nForce 570 chipset are very poor.



More Historic Computer Speeds

Computer Speeds From Instruction Mixes pre-1960 to 1971
Whetstone Benchmark History and Results (1960's to 1990's, PCs to 2006)
Computer Speed Claims 1980 to 1996 - PCs and IBM Mainframes to 2004
Main Page Roy Longbottom's PC Benchmark Collection


Produced by Roy Longbottom December 2004
Updated December 2007