If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 | |||
|
Senior Member
|
Here
Quote:
Quote:
Quote:
EDIT: Perhaps then, they are holding out to release it on 32 nm. |
|||
|
|
|
|
|
#2 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
It might be the other way around.
LRB new instructions are not compatible with the main line of x86 ISA extensions. Intel is possibly holding back from deploying Larrabee more widely to avoid fracturing the field with another x86 extension. If the third generation of Larrabee has cores that can be used in consumer CPUs, then it would probably happen after Larrabee and the x86 main lines hit some convergent ISA extension.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#3 |
|
Senior Member
|
Well, if they wanted to introduce LRBni to the desktop, surely, they would have thought of it before they forked the x86 ISA. While they can emulate LRBni using AVX2 or some such thing, they may be actually trying to emulate the legacy x86 crap.
|
|
|
|
|
|
#4 | |
|
Senior Member
|
Quote:
In the many-core era does not that problem get worse and worse? Say that feature X costs you 0.01% of your total core's budget and you deploy 32 cores on a single chip or more... that seemed small for a single core, but over 32+ cores that does add up. I like the idea of x86 compatibility, but I feel that technology wise ARM (Intel still has a license) could have helped make the chip smaller without sacrificing performance. What do you think?
__________________
"Any idea worth a damn is already patented... twice" -Mfa |
|
|
|
|
|
|
#5 | |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
Quote:
Its support of x86 past the original Pentium is zero, and other elements of Intel do not want to expose it as a general programming target. We'll also have to see where the opcode space is allocated LRBni. Given the weak coordination between the main line and the Larrabee team, we don't know what LRB encodings might already be taken up by existing x86 extensions.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
|
#6 | ||
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
Quote:
Quote:
Comparing a Pentium to a roughly contemporaneous RISC lead to an estimated 12-16% penalty in die area. Some things have come to light that make the guess less applicable, such as the fact that Intel has narrowed the standard x86 issue capability to 1 instruction, with a possible commensurate reduction in the amount of hardware on the x86 side of the core. edit: Other confounding factors are that at the time I didn't know if there would be texturing hardware and what proportion of the die woudl be taken up by other things besides cores, which reduces the proportion. It might be something like 10% in aggregate. What power penalty is something I don't have the data to calculate, and it would be dominated by the vector unit. We wouldn't really know without Nvidia or somebody springing an ARM Larrabee on us. edit edit: Although, the die shots show that the L2 is smaller than 1/4 of the core+cache tile area, which means with the vector unit taking up 1/3, the area that is x86 is actually somewhat bigger than what I guessed at.
__________________
Dreaming of a .065 micron etch-a-sketch. Last edited by 3dilettante; 09-Jun-2009 at 19:54. |
||
|
|
|
|
|
#7 |
|
Member
Join Date: Nov 2007
Location: Santa Clara, CA
Posts: 427
|
"One critical point we were told was that 1st and 2nd generation Larrabee GPUs will not be compatible with 3rd generation Larrabee."
I'm missing why (even if this is true) that one wouldn't expect something like this anyway. C++ with vector intrinsics isn't exactly designed to be ideal even within the same architecture over generations, got DX11/OpenGL/OpenCL for that (and even that isn't ideal either). Given AMD's entrance of 64bit extensions, different cacheline sizes, all the changes to SSE over the years, x86 hasn't ever been future safe. Even with x86's backwards compatibility, it has still been a good idea to recode performance critical assembly or intrinsics code for different generations of x86 arch, so who cares if the Larrabee ISA changes with different generations!
__________________
Timothy Farrar :: blog |
|
|
|
|
|
#8 |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
There are subtle difference between new and older x86 chips in various places that can cause problems, even with backwards compatibility.
To state that there is an actual break in compatibility is indicative of something more substantial than a tightened specification or differing instruction corner cases. It means something more significant might happen at that point for Larrabee, possibly more widespread use.
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
#9 |
|
Senior Member
|
It would be a bit depressing to see the Larrabee ISA hubbled with all the superfluous legacy of umpteen generations of x86.
|
|
|
|
|
|
#10 | ||
|
Senior Member
|
Quote:
I don't like the idea of x86 compatibility, but yes an ARM LRB would be great. Quote:
And it would probably need 300W to stay alive.
|
||
|
|
|
|
|
#11 | |
|
Senior Member
|
Quote:
Spaghetti, anyone? |
|
|
|
|
|
|
#12 | |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
Quote:
Shifting the Larrabee instruction set so that it can be used more widely might justify breaking the compatibility with the insular accelleration board variants. There hasn't been any comment about something frightfully wrong with the current instructions, so why remove compatibility on a whim?
__________________
Dreaming of a .065 micron etch-a-sketch. |
|
|
|
|
|
|
#13 |
|
Senior Member
|
They might go narrower
|
|
|
|
|
|
#14 |
|
Senior Member
Join Date: Jan 2003
Location: Ghent, Belgium
Posts: 1,332
|
|
|
|
|
|
|
#15 |
|
Junior Member
Join Date: Oct 2008
Location: Redmond area
Posts: 25
|
I'd take some of this Tom's reporting with a grain of salt, as I said here, a few of their latest articles contradict each other.
Until Intel states what the ISA compatibility story is from LRB1 to LRB2 and LRB2 to LRB3, this story is just speculation. |
|
|
|
|
|
#16 |
|
Senior Member
|
In a LRBni compatible way? I mean with all the masks and swizzles etc.?
|
|
|
|
|
|
#17 |
|
Senior Member
Join Date: Jun 2005
Location: California, USA
Posts: 195
|
Now, now, this is pure speculation. Let's wait for the actual hardware before declaring winners.
I personally would be very impressed if they get a Larrabee implementation matching GTX 285 - with completely software DX10 driver mind you. If this kind of performance can be achieved while emulating a hardware oriented API, I can't wait to see what direct low level access can get you in terms of performance, quality and features. |
|
|
|
|
|
#18 | |||
|
Senior Member
|
Quote:
Quote:
To be fair though, I am really impressed with the automatic load balancing and the multi-LRB scaling possibility. Quote:
|
|||
|
|
|
|
|
#19 |
|
Darlek ******
Join Date: Jun 2004
Posts: 5,987
|
A noob question if I may
I'd buy larrabee to replace my gpu / cpu or both ?
__________________
Guardian of the "Sacred Terabyte of Gaming Goodness™" |
|
|
|
|
|
#20 |
|
Senior Member
|
Good for you then. I'd prefer to have something which has better perf/price. I really don't care if my GPU can run MS Word.
|
|
|
|
|
|
#21 |
|
Senior Member
Join Date: Jan 2008
Posts: 268
|
RPG - lrb is about programmability. In theory, a pure accumulator architecture can run the same shaders as GT200 or LRB. In practice...
I see no reason to remove x86 compatibility from LRB, although it'd be a good idea to shrink the overhead as much as possible. Also, LRB has some pretty big handicaps compared to NV's GPU efforts: 1. First discrete GPU by Intel (in a long time) 2. First time the design team has worked together 3. First time the driver team has worked together etc. etc. Intel is coming at this from a big disadvantage and hopefully they can get close to NV. DK
__________________
www.realworldtech.com |
|
|
|
|
|
#22 | ||
|
Member
|
Quote:
Quote:
|
||
|
|
|
|
|
#23 | |
|
Senior Member
Join Date: Sep 2003
Location: Well within 3d
Posts: 2,700
|
Quote:
Now that we see die shots of Larrabee, a lot of data about what is taken up by the cores, cache, and other hardware is now known, and this does dilute the x86 penalty further. The dominant die penalty as far as current graphics applications are concerned might not be x86, so much as Intel's insistence on using a fully featured CPU core 32 times over. x86 might have made the solution in the ballpark of 10% larger than it otherwise would have been--although there are more modern examples of even slimmer cores than the earlier RISC I used as a baseline (and some really tiny embedded cores). Until more exotic methods of rendering become available for a good comparison, we can see from the Larrabee rasterizer description that a decent chunk of the instructions and much of the general capability of the x86 cores is not used. A general-purpose RISC core would still have this functionality, and while it might be slimmer, it would still not be used and multiplied 32 times over. I cannot really figure on what power penalties there might be, as there is no good way to compare without physical implementations, and too much can vary between manufacturers and processes. The vector units themselves will likely swamp the TDP calculations during peak loads.
__________________
Dreaming of a .065 micron etch-a-sketch. Last edited by 3dilettante; 10-Jun-2009 at 13:47. |
|
|
|
|
|
|
#24 | |
|
Senior Member
Join Date: Jan 2003
Location: Ghent, Belgium
Posts: 1,332
|
Quote:
On the other hand, AVX does support vectors of small integers. But with 512-bit registers I don't think that actually matters much anyway. |
|
|
|
|
|
|
#25 |
|
Junior Member
Join Date: Oct 2008
Location: Redmond area
Posts: 25
|
LRB1 will be a discrete add in card, eg GPU.
|
|
|
|
![]() |
| Bookmarks |
| Thread Tools | |
| Display Modes | |
|
|