Posted on 04/30/2008 1:04:12 PM PDT by ShadowAce
In a move that could have broad implications for the high-performance computing (HPC) market, Intel and Cray have announced a broad collaboration that will see engineers from the two companies work together on future products and projects.
With the first Intel-Cray products appearing in the 2010-2011 timeframe, it's clear that three Intel technologies have caught Cray's eye: the native 32nm Sandy Bridge microarchitecture, the QuickPath Interconnect (QPI) scheme, and the forthcoming discrete, x86-based graphics product, codenamed Larrabee. Cray will plug all of these components into its SeaStar interconnect fabric, and when combined with Cray Linux they'll make for an HPC and floating-point monster.
The first piece of the Intel-Cray puzzle is Sandy Bridge, which will debut in 2010 as Intel's first native 32nm processor. Unlike its predecessor, Westmere, which is a 32nm die shrink of the 45nm Nehalem, Sandy Bridge will be designed from the ground up for the 32nm node. One of Sandy Bridge's most widely talked-about features is a brand-new set of 256-bit vector extensions called Advanced Vector Extensions (AVX). In addition to vector registers that are double the size of the current 128-bit SSE registers, AVX will also introduce a nondestructive three-operand format for the first time in the x86 ISA's history. (For more on the operand format issue, see either Chapter 8 of my book.)
In all, AVX's vector length and new operand format will massively boost the peak theoretical floating-point performance of Intel's microarchitecture, making it a vector beast that can eat up as much bandwidth as Cray can throw at it. (Note that AltiVec coders are turning their nose up at AVX, but it still seems like it has to be an improvement. I hope to take up this issue and others in an AVX vs. Larrabee Vec-16 vs. SSE5 article at some point.)
Speaking of bandwidth, Intel's QuickPath Interconnect, with debuts with Nehalem later this year, will be firmly established by the time Sandy Bridge drops. QPI will give Cray the ability to do with Intel's processors what it already does (and will apparently continue to do) with AMD's, that is, plug them into the company's high-bandwidth interconnect fabric, SeaStar, via a bridge chip.
The graphic above is taken from a Cray XT4 brochure, and it shows two Opteron nodes attached via HyperTransport to Cray's 3D SeaStar interconnect fabric. An Intel-based Cray supercomputer will probably look similar, but with QPI swapped for HyperTransport and Sandy Bridge and/or Larrabee processors swapped for Opterons.
The HyperTransport interface in the SeaStar router node shown above would be swapped with a QPI interface, an effort that would require some engineering help from Intel and is likely to be the first place that the two companies collaborate.
Moving on to Intel's Larrabee, which is also slated for the 2010 timeframe, Cray CEO Peter Ungaro told TGDaily that Cray expects to integrate the many-core GPU product into its Intel-based supercomputers.
What's not clear is the method that Cray would use to integrate Larrabee into its products. The most obvious option would be to put Larrabee-based PCIe daughtercards into some of the Sandy Bridge nodes via a PCIe-to-QPI bridge. Another, possibly cheaper and more power-efficient option would be available if Larrabee supports QPI; in this case, Larrabee coprocessor sockets could be dropped right into the QPI fabric along with Sandy Bridge processors. Intel hasn't said, however, if Larrabee has a QPI interface or not; it's clear from an earlier leaked slide that it does have a PCIe interface, though.
The reason for including Larrabee in the new Cray supercomputers is pretty straightforward: each Larrabee core has a 512-bit vector floating-point unit that can grind through twice as many FLOPS as Sandy Bridge. The combination of Sandy Bridge's 256-bit vectors and Larrabee's 512-bit vectors will make for a very potent x86 vector processing machine, and it's the kind of thing that NVIDIA is already trying to get out in front of with its escalating verbal war against Intel CPUs.
One thing that worries some developers as they look at Larrabee and Sandy Bridge is that they'll have different vector extensions, and both of these will be different from AMD's forthcoming SSE5 extensions. There are no details yet on how Intel will get AVX and Larrabee's 512-bit extensions to work together, but I'm sure that whatever they have in mind involves this EXOCHI "accelerator exoskeleton with an IA 'look-n-feel'" that I haven't quite figured out yet.
For their part, NVIDIA and AMD aren't standing still in the HPC space by any means. Word recently got out that NVIDIA Tesla GPUs will play a prominent role in an upcoming French supercomputer, and by the time 2010 rolls around those Tesla GPUs will look a lot like Larrabee in that they'll have a fairly robust and flexible ISA that exposes a ton of vector floating-point resources.
AMD is also moving ahead with its HPC-oriented GPGPU plans, and the company already has a well-established place in Cray's XT3 and XT4 supercomputers that is likely to persist. Still, Sunnyvale can't be happy with Intel moving in this close on a piece of critical turf in the HPC arena that has been so kind to Opteron.
"Intel-based Cray" - that is just sooooo wrong...
BTT
But Can It Run Vista?
< |:)~
Not yet. Maybe SP2.
Now that would be a monstrous waste of resources.
But it would be as fast as Windows 2000.
Friends don’t let friends run Vista.
256-bit SIMD with non-destructive three-vector operands.
This sucker’s gonna scream.
Sorry, three-operand format. I’m tired, can’t see straight.
Oh I thought this was a post on Web Hubble and Hillary mating to produce the Chelseed!
Finally! A machine that will run Crysis at max video settings.
There seems to be a missing market segment in XPPCs, extremely powerful personal computers. Computers far more powerful than for existing software.
One concept is to have several computers bundled together, yet acting in a compartmentalized, yet parallel fashion, much like our brains.
To start with, an entire sub computer acts as the human interface. Not just with typical I/O, but with voice recognition and expression. It is in a continual learning mode to achieve full communications symbiosis with its user. You have a conversation with your computer.
Closely tied in is another sub computer singularly for artificial intelligence. Its purpose is both to command the rest of the computers, but also for “abbreviation”, so that instead of issuing a long linear progression of commands to the computer, a simple command is given, and the AI “knows” what to do. It is the brain behind the speech and hearing.
Since most computing is repetitive and habitual, this is not as difficult as it sounds, but it is far more complex than mere macro operations.
Another sub computer is a “research” computer. Also an AI, it performs “smart searches”. The value of doing this is obvious with even a 250Gb hard drive. If it can just predict half a dozen potential subdirectories to search, it will find things much faster than if it searches the whole hard drive. But extrapolate such searches to the Internet, and you have what amounts to a “Super Google”.
If you see any information that looks interesting, it is preserved and organized like bookmarks, but as data. Imagine if every interesting fact you saw while surfing was preserved in a recoverable manner, and could be quickly recovered. Every subject under the sun, and organized.
Sub computer after sub computer, with the whole being greater than the parts. When software comes along, it joins with the “brain”, to make a better brain.
Give it time--I'm sure they'll be able to shrink it all down someday.
Of course Seymour came from CDC. Designed the 6600 and 7600.
I admit that this one is way over my head. I do know that Cray is, or was, synonymous with “biggest and baddest”, though.
Once you're inside that package, on that single chip, then you're in the world of Intel and AMD and few others. System makers such as Cray no longer build their CPUs. They buy them, several to a single die package.
This is just the next step in an unstoppable evolution that has been ongoing in the computer business since World War II.
Imagine a Beowulf cluster of these.....
Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.