If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 |
|
Senior Member
|
There is a thread going on, but since SB is already out, and IB will be competing with Trinity in the market, I thought it makes more sense to compare those.
Known stuff IB has 33% more EU's. Trinity has BD cores IB will have finfets Trinity will have NI cores IB will be DX11, ~3 years late. Expected stuff Trinity will have 10 vliw4 simd's. Trinity will have better, more integrated turbo. Wanted stuff Trinity should have better cache level integration. IB should integrate the gpu deeply into it's coherency protocol. Trinity should have quick sync hw. DK's speculations are here. |
|
|
|
|
|
#2 | ||
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Can't see why it would be more than 8.
Quote:
Quote:
|
||
|
|
|
|
|
#3 |
|
Senior Member
Join Date: Jul 2008
Posts: 2,146
|
How much would the NI architecture benefit from using L3 cache anyways?
It's not like the GPU was designed to take advantage of it, afaik. IMO, what AMD needs to worry in Trinity is increasing the APU's memory bandwidth, either through special sideport channels for the GPU, more memory channels or faster memory. It's Llano's main bottleneck, for the moment. |
|
|
|
|
|
#4 | |
|
Senior Member
|
IIRC, they said trinity would be >50% bump.
Quote:
|
|
|
|
|
|
|
#5 | |
|
Senior Member
|
Quote:
As per TR's benches, Llano with dual mem channels performs just like a single channel i5. |
|
|
|
|
|
|
#6 | |
|
Member
Join Date: Jan 2010
Posts: 416
|
Quote:
|
|
|
|
|
|
|
#7 | ||
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Quote:
I think it's a much cheaper way to increase "bandwidth" than your other suggestions (well if you factor in that it's useful for the cpu too). 50%. 8 simds with a slightly higher clock is enough, if that figure was even peak flops (for all we know they could have been talking texture filtering rate...). Quote:
|
||
|
|
|
|
|
#8 | ||
|
Senior Member
|
Quote:
Quote:
|
||
|
|
|
|
|
#9 | |
|
Senior Member
Join Date: Jul 2008
Posts: 2,146
|
Quote:
Getting a big chunk of cache inside the APU (that usually takes a sizeable amount of die area) is "much cheaper" than creating i.e. a 64-bit sideport GDDR5 channel for the IGP?! |
|
|
|
|
|
|
#10 | |
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Quote:
Granted it's a bit a theoretical view without knowing how much performance you could get from using L3 cache. Faster memory OTOH is a good option it's just not really available (apart from some minimal incremental increase). More memory channels aren't viable neither I think. Now if the L3 would only help the GPU it would probably be too expensive but considering it helps the cpu too it looks quite cheap to me. |
|
|
|
|
|
|
#11 | ||
|
Member
Join Date: Jan 2010
Location: Hamburg, Germany
Posts: 985
|
Quote:
But a large L3 could serve as some kind of increased ROP cache holding far more framebuffer tiles than the color/Z caches within the ROPs itself, a bit like the eDRAM in some consoles. Or it could serve as a 3rd level texture cache (Sandybridge bypasses the L3 for texture reads, if DK's article is correct; obviously intel decided it's not worth it). Quote:
__________________
x: RCP_sat R2.x, R1.y y: RCP_sat ____, R1.y z: RCP_sat ____, R1.y |
||
|
|
|
|
|
#12 | |||
|
Senior Member
Join Date: Jul 2008
Posts: 2,146
|
Quote:
I don't think it would be much harder to implement a driver-enabled "high performance mode" with the Sideport enabled and the rest of the time just use the UMA. Quote:
Bloomfield (3-channel) has 200 more "pins" than Lynnfield (2-channel), and Lynnfield actually has 40M transistors more because of integrated PCI-Express and DMA. I don't really understand what you mean by that. AMD has been using motherboards using Sideport+UMA combinations for several years, increasing the IGP's performance. What's so different here? Quote:
Increased memory bandwidth has shown to drastically change Llano's results (25% more gaming performance with 33% higher bandwidth). Of course, UMA is the future.. Given Llano's results, I think a high-performance Sideport could be a good temporary option, untill DDR4 is ready for market. |
|||
|
|
|
|
|
#13 | ||
|
Member
Join Date: Mar 2010
Posts: 331
|
Quote:
Quote:
And essentially you're proposing that all motherboards should come with GDDR5 built in (say 512 MB if you're proposing a 64 bit channel for the GPU). Thats not cheap and i would imagine it isnt power efficient either |
||
|
|
|
|
|
#14 | |||||
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
Quote:
Quote:
Quote:
Quote:
I don't know how much performance you can really gain with L3, but I find the sandy bridge results with 1 memory channel (also in that techreport article) quite amazing on that front, it only loses about 20% of the performance for half the memory bandwidth. Sure part of that is because the GPU isn't all that fast compared to Llano (hence it needs less memory bandwidth), but still I think part of that is the usage of L3 cache for the GPU. I don't have any proof for that though (some comparisons with Arrendale could be interesting maybe, unfortunately you can't switch off the L3 cache AFAIK...). Quote:
|
|||||
|
|
|
|
|
#15 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#16 |
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
|
|
|
|
|
|
#17 | |
|
Senior Member
Join Date: Jul 2008
Posts: 2,146
|
Quote:
I've also heard that in some cases it's only a 16-bit bus, but I'm pretty sure the 780G in my Ferrari One is using a 32bit Sideport with 384MB. The access to UMA is blocked through the bios, though Last edited by ToTTenTranz; 29-Jun-2011 at 18:31. |
|
|
|
|
|
|
#18 |
|
yes, i'm drunk
|
How is 8 VLIW4 SIMDs 50% increase over 5 VLIW5 SIMDs, even if you bump the clocks slightly?
__________________
I'm nothing but a shattered soul... Been ravaged by the chaotic beauty... Ruined by the unreal temptations... I was betrayed by my own beliefs... |
|
|
|
|
|
#19 |
|
Senior Member
Join Date: Oct 2002
Posts: 2,433
|
You "only" need a 17% clock increase to achieve that 50% increase for 8 vliw4 simds over 5 vliew5 ones. I don't know if that's realistic or not (though compared to discrete parts the clocks certainly wouldn't be extraordinary high, and overclocking attempts also suggest it's doable). But in any case it would be a very substantial increase in graphic power (more than those 50%!).
|
|
|
|
|
|
#20 |
|
Member
Join Date: Nov 2007
Location: 'Zona
Posts: 514
|
What happened to Trinity having a vliw4 GPU based on 6850?
|
|
|
|
|
|
#21 |
|
Entirely Suboptimal
Join Date: Mar 2003
Location: WI, USA
Posts: 6,845
|
Only Cayman (6950) is VLIW4. Trinity is indeed VLIW4 as well according to reports/rumors. Maybe one day we'll actually have some use for VLIW4 (GPGPU). But then AMD did just show us how they want to leave it behind too.
|
|
|
|
|
|
#22 | ||
|
Member
Join Date: Nov 2007
Location: 'Zona
Posts: 514
|
Quote:
Quote:
|
||
|
|
|
|
|
#23 |
|
Gamerscore Wh...
Join Date: Jan 2002
Posts: 12,946
|
That was a confusion. He thought that the 6800 was based on VLIW4. He meant to say that the architecture is based on the 6900 series.
|
|
|
|
|
|
#24 |
|
Member
Join Date: Nov 2007
Location: 'Zona
Posts: 514
|
|
|
|
|
|
|
#25 |
|
yes, i'm drunk
|
Since it's now "the past", was 6800-series meant to be VLIW4, but 32-40nm case forced it to be VLIW5?
__________________
I'm nothing but a shattered soul... Been ravaged by the chaotic beauty... Ruined by the unreal temptations... I was betrayed by my own beliefs... |
|
|
|
![]() |
| Tags |
| amd, fusion, intel, ivy bridge, trinity |
| Thread Tools | |
| Display Modes | |
|
|