Intel demos Knights Ferry CPU, 1 teraflop performance from a single chip

Shawn Knight

TechSpot Staff
Staff member
Intel has revealed a new processor with more than 50 cores at the SC11 supercomputing conference in Seattle. The groundbreaking chip was running in a test machine and is capable…

Read the whole story
 

TomSEA

TechSpot Chancellor
Now if they can just get software to keep up with hardware innovations and improvements.
 
G

Guest

2600K = 83.3 GFLOPOS
source: http://www.tomshardware.com/charts/desktop-cpu-charts-2010/Raw-Performance-SiSoftware-Sandra-2010-Pro-GFLOPS,2409.html

1 TF = 1000 GF

1000/83.3 = 12.0048... or 12X

so 12x Faster then a 2600k @ stock speed
 

Wagan8r

TS Evangelist
I remember about 3-4 years ago, Intel made an 80 core processor that reached the 1 teraflop barrier, and they said that it would come to market in 5 years. They've got about a yeart left, but I don't see it happening.
 

MarkHughes

TS Addict
I read about it as Knights Corner too, And it does say that in the screenshot....

Quite looking forward too seeing this in action, The Industry I work in could well benefit from something like this.
 
G

Guest

Chances are that many people already have a 1 TFLOPs processor in their machines called the GPU. The newest AMD HD 6990 can do 5.40 TFLOPS in single precision, and 1.37 TFLOPS in double precision. There are boards out there that will support four or MORE of these cards, meaning that a single DESKTOP can do 21.6 TFLOPS single precision and 5.48 TFLOPS in double precision. Given that these cards cost about $700 each, which means that for about $4000, you can have one mean HPC machine.

Yes, I do agree that having this type of processing on a single chip is definitely impressive, but the graphics card processing power can't be overlooked.
 

red1776

Omnipotent Ruler of the Universe
Chances are that many people already have a 1 TFLOPs processor in their machines called the GPU. The newest AMD HD 6990 can do 5.40 TFLOPS in single precision, and 1.37 TFLOPS in double precision. There are boards out there that will support four or MORE of these cards, meaning that a single DESKTOP can do 21.6 TFLOPS single precision and 5.48 TFLOPS in double precision. Given that these cards cost about $700 each, which means that for about $4000, you can have one mean HPC machine.

Yes, I do agree that having this type of processing on a single chip is definitely impressive, but the graphics card processing power can't be overlooked.
Not quite Guest. the 6990 ids a dual GPU card. crossfire and SLI both afford up to 4 GPU's so only two of the 6990 (crossfire), and two GTX 590 (SLI) is supported.
The maximum (before OC'ing) for Crossfire at this time is 10.8 TF. I know, this is what i am running (12.6 TF) with OC'ing
 

dividebyzero

trainee n00b
Not quite Guest. the 6990 ids a dual GPU card. crossfire and SLI both afford up to 4 GPU's so only two of the 6990 (crossfire), and two GTX 590 (SLI) is supported.
The maximum (before OC'ing) for Crossfire at this time is 10.8 TF. I know, this is what i am running (12.6 TF) with OC'ing
True enough to a degree...except that no one would use SLI or CFX for HPC, and the oft-quoted TF numbers bandied around are theoretical maximums for such reeeeeeaallly useful apps such as Linpack....so your flops are only as useful as the programming running on them

Fastra II with 6 x GTX295 + 1 x GTX 275 yields 11.7 TF single precision...but for anything other than pure number crunching, something like this 8.2 TF Tesla powered Colfax CXT8000 would leave it for dead.
 

red1776

Omnipotent Ruler of the Universe
dividebyzero said:
Not quite Guest. the 6990 ids a dual GPU card. crossfire and SLI both afford up to 4 GPU's so only two of the 6990 (crossfire), and two GTX 590 (SLI) is supported.
The maximum (before OC'ing) for Crossfire at this time is 10.8 TF. I know, this is what i am running (12.6 TF) with OC'ing
True enough to a degree...except that no one would use SLI or CFX for HPC, and the oft-quoted TF numbers bandied around are theoretical maximums for such reeeeeeaallly useful apps such as Linpack....so your flops are only as useful as the programming running on them

Fastra II with 6 x GTX295 + 1 x GTX 275 yields 11.7 TF single precision...but for anything other than pure number crunching, something like this 8.2 TF Tesla powered Colfax CXT8000 would leave it for dead.
I was speaking strictly as a matter of logistics. he said " you could have one mean desktop" and 4 is the limit. Radeons are higher rated in FLOPs than Nvidia so by the numbers, 10.6 is the max Flop rating at this time for a desktop .As far as the usefulness of FLOPS, certainly not the only metric to consider. Radeons outFLOP Nvidias offerings, in the case of the 6990 vs the GTX 590 by a 2:1 margin (5100Gflops to 2700Gflops) and performance doesn't quite reflect that discrepancy.
 

dividebyzero

trainee n00b
True enough on the numbers- although "doesn't quite reflect that discrepancy" might be a bit of an understatement unless the HD 6990 suddenly became twice as productive as a GTX 590.
I read the Guest comment as:
There are boards out there that will support four or MORE of these cards...Given that these cards cost about $700 each, which means that for about $4000
Assuming that the Guest isn't mathematically challenged, I interpreted the comment to mean 4000 divided by 700 - which is closer to 6 than four - and also (maybe) co-incidentally six is the maximum number of cards used for compute in a standard ATX (7 PCI-E slot) motherboard. The seventh often being employed primarily for video out.

I would tend to regard the relative FLOP count of a CPU or GPU as more a marketing bulletpoint for the most part- if it weren't then AMD would surely be talking up the numbers.
As far as I'm aware, Bitcoin and distributed computing (BOINC, Seti@Home etc.) are about the only enviroments that fully utilise the FLOPS's of AMD's GPU's. That isn't likely to change unless/until AMD put some serious resources into OpenCL. The HD6990's supposed FLOP superiority doesn't do a hell of a lot of good with regard to F@H for example

So, to my way of thinking, the HD 6990 could be a Bugatti Veyron -great stats, but if the software enviroment equates to an eight-year-old kid that can't reach to depress the gas pedal more than a quarter of the way to the floor....

I think that there are reasons that desktop HPC doesn't generally offer a great choice in AMD GPU's and those reasons likely stem from CUDA has been well supported in the HPC/WS enviroment- the fact that CUDA ports easily to both OpenCL and Linux, Nvidia's adoption and use of ECC, and their QA program.
I'd also wonder if AMD thought that raw floating-point calculation is the answer, why are they moving to a compute based GCN arch ? especially as VLIW4 is less than a year in the marketplace.

If I had to guess I'd probably point to Nvidia's HPC/WS marketshare. There's probably some degree of status appearing in desktop HPC systems like the Cray CX1, Colfax, Dell (look at the high margin/mission critical HPC options), Amax, SuperMicro,HP etc. ( Lenovo don't even offer an AMD card with their WS's - ironic no?).