New SSE2 Instruction Set
Intel introduced the MMX (MMX for MultiMedia eXtensions) instructions set
back in
1996.
This was the first instruction addition to the x86
architecture since the i386
was released. Remember, the MMX instruction set was full of multimedia
dedicated features that were here to accelerate applications which used
them. MMX applications came a bit later on the market but most users enjoyed
MMX benefits especially in games since game developers adopted them quickly.
More recently Intel added SSE (Streaming SIMD Extension) to its Pentium III:
this was a set of
70
supplementary extensions that used the SIMD (Single Instruction Multiple
Data) technology just like the MMX or
3D
Now (From AMD). SIMD principle is simple: it should treat only one pass of
several data with only one instruction. Compared to a standard SISD x86
instruction (Single Instruction Single Data) where one instruction should
give one result the SSE can give up to
4
results in the same clock cycle.
The SSE
instruction set principally enhances audio and video compression processes
as shown by our tests: indeed compressing an audio file of
130
MB took
8.5
seconds against
10.3
seconds using a Pentium III
1GHz:
for this kind of task the Pentium
4
boosts performance up to
28%.
SSE
2
brings several enhancements dedicated to boost MPEG
2
encoding and file encrypting processes. First it adds
144
new instructions (oriented on memory and cache management) to the SSE & MMX
existing ones but it can now handle integers of
128-bit
numbers (1
per cycle), and double precision floating of
64-bit
(two per cycle). Just like the MMX the SSE
2
set of instructions is no use if you don’t have compliant applications that
take benefit from it: actually no applications manage it except the Direct X
8.0
API. But some compatible applications should be out very soon like the
Windows Media Encoder
2,
Dragon Naturally Speaking
4,
etc. We also ran another test showing the undeniable power the Pentium
4
brings to high demanding multimedia applications: compressing an Indeo video
of
15
MB into an MPEG2
one took
1.05
minutes against
1.38
minutes for the Pentium III
1
GHz, showing a difference of more than
42%.
The performance enhancements shown by the tests are due both to the higher
frequency of the CPU and its various MMX and SSE instructions.
Thermal Security
The Pentium
4
CPU features a security system called Thermal Monitor. This exclusive
function permanently monitors the temperature of the CPU in order to protect
the processor against long term reliability damages. If the temperature of
the CPU or the chassis is higher than what it’s recommended, the rotation
speed of the fans will be increased (if available) and the CPU performance
may drop below the peak level to prevent critical damages. If ever the CPU
reaches a high temperature of approximately
69°
Celsius or 155 fahrenheit (in case the fan is disconnected, broken, etc.), the Thermal Monitor
will instantly shut down the computer so your CPU will not burn.
Pentium 4 2.8GHz CPU Design
The Pentium 4 2.8 GHz uses the µPGA Socket478
package that was introduced by the Pentium
4 2
GHz. It is clearly the reference socket for future Pentium
4
processors. Even if this new socket permits the Pentium
4
chip to be much smaller, despite
55
newly added pins, it isn’t compatible with the existing Socket
423
pin motherboards. However there are already some CPU converters that
let you use Socket
478
Pentium on Socket
423
motherboards.
Intel Pentium 4 2.8GHz
After
having changed the packaging of Pentium
4
CPUs by making them smaller with the µPGA
478
pin, Intel has also changed their core. The first technical advance is the
use of a new Px60
process so NorthWood CPUs are now engraved in
0,13µ.
The engraving superiority is not all, since the connections used in the
processor are now copper instead of aluminium. Using copper is now essential
to miniaturize processors, since it’s a very good conductor with very good
heat dissipating capabilities. This new manufacturing process ensures the
CPU can reach higher frequencies while heating less. Not to say the power
consumption of the CPU has been reduced from
1.75
to
1.5
Volts. But there are other interests for Intel to use the
0.13µ
manufacturing process: Intel has developed new
300
mm
wafers that let them multiply their production volume by
2.25
(over previous
200
mm
wafers) implying an appreciable cost reduction of
25%.
The Northwood now includes
55
million transistors, instead of
42
million for P4
Willamette.
Pentium 4 0.13µ Wafer (click
to enlarge)
Since the
engraving process is finer and more accurate there’s more room available on
the silica plaque so Intel engineers have increased the level
2
cache memory to
512
KB. Initially the Pentium
4
was featuring a
256
KB L2
cache memory which was clearly limiting its performance. This L2
cache size increased, and now provides
10-15%
higher performance at equal frequency.
As usual
the top part of the CPU features a metal cover that was added by Intel to
protect the die unit from damage if the radiator was incorrectly mounted or
is too heavy as well as to ensure better thermal dissipation. The heat sink
has been totally revamped. Intel has released a new plastic retention
mechanism (which is much handier) to attach the cooler and heat sink to the
motherboard. Nonetheless you can still use old school Pentium
4
heat sink systems.
Pentium 4 Heatsink with
retention mechanism (click to enlarge)
Actually
the Pentium
4 2.8
GHz FSB 533 MHz uses a clock frequency multiplier of 21 and is officially compatible with two different chipsets: the
Intel i850e,
Intel i845GL
(and its various declensions) thus you can use it with Rambus or DDR memory.
|