Posted on 08/31/2012 11:29:09 AM PDT by Ernest_at_the_Beach
Hot Chips An opportunity to define the future of server processing comes along once every decade or so, and Applied Micro Circuit, a company known for its networking chips and PowerPC-based embedded controllers, wants to move up into the big leagues to take on Intel, Advanced Micro Devices, and the handful of remaining etchers of RISC processors for the data center.
Applied Micro may be an upstart in the server chip racket and it may be facing long odds, but Paramesh Gopi, the company's president and CEO, and Gaurav Singh, its vice president of engineering for ARM and PowerPC processors (and who worked at AMD on Athlon and Opteron processor designs), are not playing with the X-Gene ARM processor that they have created for modern, hyperscale data center workloads.
If people were skeptical about the design and deliverability of the X-Gene processor when it was announced last October, they would have been more encouraged if they had attended the Hot Chips 24 conference in Cupertino this week.
Not only did Gopi and Singh show up to talk up the X-Gene chip, but Gopi whipped out a completed X-Gene server card, ready to plug into a PCI-Express 3.0 backplane and be put to work, when he was giving his presentation on the stage.
Gopi quickly tucked it into his coat pocket and left, leaving Singh to go over the details, but El Reg chased Gopi down in the parking lot and convinced him to disobey the lawyers and show off the finished board.
The smartest thing that Hewlett-Packard, IBM, or Dell could do right now is to buy Applied Micro before it gets too expensive. And maybe AMD or Intel should do it, too.
Applied Micro has $211.3m in trailing revenues for the past four quarters, and has been hemorrhaging cash, losing a staggering $110m in large part due to its radically increased research and development expenses related to the X-Gene processor.
With a market capitalization of $335.2m right now, Applied Micro is not going to get any cheaper than it is today. It's arguably worth at least $500m and maybe even more, given all the hard work it has done to come to market with the very first 64-bit ARMv8 processor design and one designed for profitable servers, at that.
AMCC's X-Gene server card--no longer a simulation running on an FPGAs
That said, it would be just fine for Applied Micro to enter the ranks of server chip makers all by itself, doing the designs and letting Taiwan Semiconductor Manufacturing Corp do the fabbing, just like AMD, Oracle, Nvidia, and Fujitsu, among others, do for GPUs and GPUs.
Applied Micro has given out some of the basic concepts embodied in the X-Gene ARM processor and its related processor and system interconnect, but has thus far been a bit vague about the core counts on the chip and many of the basic feeds and speeds of the processor.
Singh revealed a bunch of the specifics of the processor at Hot Chips, but plenty of the details are still to be revealed when the chip and related server boards are announced. Gopi tells El Reg that the X-Gene chip will sample to OEM customers sometime before the end of the year and that volume shipments will begin next year.
Die shot of the X-Gene ARM server processor
Calxeda. Marvell, and any other potential ARM server contenders don't have much time to come up with their own ARMv8 derivatives. Although a full-on ARM chip from Intel, IBM, or AMD would certainly cause a pause for Applied Micro.
The X-Gene chip, like the 32-bit ARM Cortex A9-derived EnergyCore processors from Calxeda, is a true system on a chip, and that server card above looks a lot smaller than it appears in the photo. (You really can put it in your sportcoat inside pocket, and El Reg was tempted to try.) That means everything but DDR3 main memory sticks and disk drives is on the chip, including a system interconnect that can lash multiple processors together over a server midplane.
A comparison with the Calxeda ECX-1000 is illustrative. Each Cortex-A9 core in the ECX-1000 chip has 32KB of L1 data cache and 32KB of L1 instruction cache each, plus a 4MB L2 cache that is shared across those four cores. These cores have an eight-stage pipeline and can do out-of-order execution as most modern processors do (even those powering IBM mainframes for the past several years).
The cores are all power gated, so they can be turned off if they are not in use. Each Cortex-A9 core runs at 1.1GHz or 1.4GHz and includes a scalar floating point unit that can do single-precision or double-precision operations, plus a NEON SIMD media processing unit that has 64-bit and 128-bit registers and that can also do floating point ops, too.
Calxeda has implemented a very fast distributed Layer 2 Ethernet switch at the heart of the EnergyCore ECX-1000 processors to let multiple sockets talk to each other as distinct server nodes. The EXC-1000s currently put up to four Cortex-A9 cores on a die, where they can share L2 cache and present a single image to an operating system as a four-threaded processor. The EnergyCore fabric switch can present Gigabit or 10 Gigabit Ethernet links to the outside world.
The ECX-1000 chips are implemented in a 40 nanometer process and baked by TSMC, just like the X-Gene chips from Applied Micro are. And like Calxeda, Applied Micro's next chips will be etched using TSMC's 28 nanometer process, which is being used by AMD and Nvidia for the current GPUs, and by AMD and Oracle for their impending CPUs.
**********************************SNIP**************************************
fyi
*******************************CONTINUED*****************************************
The X-Gene processor is quite a bit different from the ECX-1000 in a number of respects, starting with the fact that it is a 64-bit design from the ground up and is based on the 64-bit ARMv8 specification announced last fall.
"ARM has really done a fine job in implementing an architecture that makes a very fine server part," Singh explained.
The 64-bitness of ARMv8 spec means it has significantly more potential memory addressability than any 32-bit or 40-bit ARMv7 variant currently on the market, and it also means that code written for 64-bit x86 or RISC processors can be more easily ported to this chip. Cramming 64-bit code back into 32 bit memory spaces is not trivial, although there are plenty of webby workloads where a 32-bit operating system and memory space will do the job. But in the long run (meaning next year), 64-bit memory addressing will be the rule for the ARM server chips.
**********************************SNIP******************************
Hey Ernest, thanks for giving ARM a little love on FR.
If you havve youngun's get them on the path to a career in programming software....we have enough Lawyers.
Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.