Welcome, Unregistered.

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

Reply
Old 23-Feb-2010, 17:02   #251
3dilettante
Senior Member
 
Join Date: Sep 2003
Location: Well within 3d
Posts: 4,109
Default

Quote:
Originally Posted by rpg.314 View Post
>4 years to make a new uarch? WTH are they trying to do with BD? re-implement x86 from scratch?
CPUs, particularly new designs that have the complexity of a multicore high-performance OoO x86, do take that long.
It's not clear just how long elements of Bulldozer have been in development, as various x86 projects to succeed K8 have been cancelled, and the internal project code-named Bulldozer may have been restarted or had its codename transfered at some point.

However, it's not exactly like GPUs are as fleet of foot as some may think. The upper bounds of GPU gestation periods overlap pretty well with the lower bounds of CPU design periods as of late.
__________________
Dreaming of a .065 micron etch-a-sketch.
3dilettante is offline   Reply With Quote
Old 23-Feb-2010, 17:19   #252
Jawed
Regular
 
Join Date: Oct 2004
Location: London
Posts: 9,863
Send a message via Skype™ to Jawed
Default

Quote:
Originally Posted by rpg.314 View Post
>4 years to make a new uarch? WTH are they trying to do with BD? re-implement x86 from scratch?
I really don't care much about x86, it's boring (except Larrabee, but that's only because it hasn't arrived, once it's here it'll be even more boring). But hasn't AMD junked at least two designs since K8?

Quote:
What troubles do you foresee with this kind of coupling?
The GPU would munch through L2, starving the core, which has other things to do. Better to stream through L2 into L3 for the GPU-SIMD to use, I suspect.

Also, with module-local, you can only maximise GPU-SIMD usage by pushing work through all the modules, enforcing fine-grained tasks when only a coarse-grained (single host thread) task is required.

Quote:
The idea is that by putting a simd engine next to a full blown x86 core, the gpu kernels can be programmed to call back x86 code when it is finished, and have it's return data siting right there. Or vice versa, with the x86 setting up launch parameters in the L2 and then issuing kernel calls. IOW, reduce data latency between cpu and gpu.
Going through L3 seems better to me. The difference in latency won't break your balls. These are still OpenCL kernels, they're not a few in-thread SSE instructions you're trying to speed-up.

Quote:
Personally, I am looking out for kernel launch granularity to reduce from a bunch of work groups to just one work group.
Going to be hard to avoid having at least 1 workgroup per SIMD.

Quote:
With recursive block calls, ie work groups calling work groups (same size of course) and with ability to launch x86 functions at the end of workgroups, it will lead to better synergy between cpu and gpu code.
Do things need to be more tangled than the fairly clean OpenCL event model?

Quote:
Power management is another plus. With increasingly finer granularity of core level power gating, the same infrastructure can be reused for gpu power gating.
I don't see how that's affected by the tightness of coupling.

Jawed
__________________
Can it play WoW?
Jawed is offline   Reply With Quote
Old 23-Feb-2010, 23:44   #253
entity279
Member
 
Join Date: May 2008
Location: Romania
Posts: 340
Send a message via Yahoo to entity279
Default

Quote:
Originally Posted by rpg.314 View Post
>4 years to make a new uarch? WTH are they trying to do with BD? re-implement x86 from scratch?

Any way, with >4 years in the oven, and a new gpu being built along side, it seems reasonable to assume that there has been a lot of influence both ways.
.
If i recall correctly, I can remember Bulldozer being talked about even at the beginnings of Intel Core era. It has been pushed back ever since.

Back in the days, AMD had also delayed the Hammer architecture for quite a while.

Edit: oups, 3dilettante already "produced" a better alternative to my post; yeah BD may have been just a name
entity279 is offline   Reply With Quote
Old 24-Feb-2010, 01:01   #254
Jawed
Regular
 
Join Date: Oct 2004
Location: London
Posts: 9,863
Send a message via Skype™ to Jawed
Default Charlie says NI is 28nm

http://www.semiaccurate.com/forums/s...5&postcount=20
__________________
Can it play WoW?
Jawed is offline   Reply With Quote
Old 24-Feb-2010, 01:10   #255
nutball
Senior Member
 
Join Date: Jan 2003
Location: en.gb.uk
Posts: 1,550
Default

Who's doing 28nm? and when?

*whistles*
nutball is offline   Reply With Quote
Old 24-Feb-2010, 01:24   #256
Squilliam
Beyond3d isn't defined yet
 
Join Date: Jan 2008
Location: New Zealand
Posts: 3,041
Default

Quote:
Originally Posted by nutball View Post
Who's doing 28nm? and when?

*whistles*
Global Foundries 28nm??? He does say that in his lil snippet.

Now if we go by the piece in the Industry forums, whose party are the going to try and show up for on time? Could this be the first ever time they release the parts for laptops before they get the mainline desktop parts out the door? Noone sneezes at a 140mm^2 Juniper replacement with better power/performance in the laptop space and thats a pretty good pipe cleaner as well.

Does anyone see any reason why they shouldn't release from the middle and work up/downwards? Say 66xx part first followed by 64xx then 67xx and finally 68xx/69xx and bringing up the rear the IGP and Fusion solutions? Laptops are bigger now than the desktop markets and time to market for this segement is paramount given the lead time for manufacturing? A 66xx would be an excellent first OEM part, IMO!
Squilliam is offline   Reply With Quote
Old 24-Feb-2010, 01:56   #257
digitalwanderer
Dangerously Mirthful
 
Join Date: Feb 2002
Location: Winfield, IN USA
Posts: 15,292
Default

Quote:
Originally Posted by nutball View Post
Who's doing 28nm? and when?

*whistles*
digitalwanderer is offline   Reply With Quote
Old 24-Feb-2010, 02:46   #258
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Default

Quote:
Originally Posted by Jawed View Post
  • A new architecture
  • 28 nm
  • 2H10

My head is spinning.

PS: All of it assumes, of course, that GF's 28 nm has an easier time (with sensible yields and reasonable volume) taking birth than TSMC's 40nm, which compared favorably with a greek tragedy.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 24-Feb-2010, 04:42   #259
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Icon Arrow

Quote:
Originally Posted by Jawed View Post
I really don't care much about x86, it's boring (except Larrabee, but that's only because it hasn't arrived, once it's here it'll be even more boring). But hasn't AMD junked at least two designs since K8?
I don't care about x86 either. I have no idea about the mess AMD has made internally. I always thought BD began after Barcelona was finished.


Quote:
The GPU would munch through L2, starving the core, which has other things to do. Better to stream through L2 into L3 for the GPU-SIMD to use, I suspect.
Yeah, the gpu will eat up the L2. But if you share via L3, it'll eat up L3 too. The only way out seems to be to put an upper limit on the amount of cache the gpu can hog. And if you are gonna cap L3 usage, might as well put it together with a module and cap L2 uage as well.

Quote:
Also, with module-local, you can only maximise GPU-SIMD usage by pushing work through all the modules, enforcing fine-grained tasks when only a coarse-grained (single host thread) task is required.
Good point, this approach will need some kind of across module load balancer to throw work groups at other, free simd's. Probably better off implmented in sw with some hw support.

Quote:
Going through L3 seems better to me. The difference in latency won't break your balls. These are still OpenCL kernels, they're not a few in-thread SSE instructions you're trying to speed-up.


Going to be hard to avoid having at least 1 workgroup per SIMD.


Do things need to be more tangled than the fairly clean OpenCL event model?


I don't see how that's affected by the tightness of coupling.

Jawed
Sharing via L3 means it is no better (architecturally at least) compared to Llano though they may very well g down that road. The overall direction is to make it system level programmable, instead of using the gpu as a device visible to apps. I feel they'll take their cues (both regarding what to do and what not do) from Cell's execution model (definitely not it's programming model though ) which is amenable to providing a JIT level interface and allows flexible communication/signal patterns between the cpu and gpu cores.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 24-Feb-2010, 05:17   #260
tannat
Junior Member
 
Join Date: Dec 2009
Location: Malmö
Posts: 61
Default

"Lurkermode off"

What if Ati has been quite happy with being first to the table with HD5xxx and 40 nm in spite of the problems of TSCM. They are not that afraid of (and quite good at) being early adopters to a smaller process node.

It has several advantages:

1) They can be agressive while playing safe by double up with GF and TSCM making two layouts for the chip. Minimizing risks of a repeat of the problematic 40 nm adoption.

2)Whatever first fab ready with 28 nm out makes launch date. If glofo is delayed TSCM may come through and vv

3) If TSCM and glofo are ready quite simultaneously there will be less constraints of cards at launch since two 28 nm lines are used to satisfy demand. Even if early cypress yields wer low, say in 30-40% Ati may have realized that one line operating att 90% yield would not have been enough for early demand.

This could have been clear while still in negotiation with TSCM and glofo about NI. They just went for both.
tannat is offline   Reply With Quote
Old 24-Feb-2010, 05:27   #261
Shtal
Senior Member
 
Join Date: Jun 2005
Posts: 1,320
Default

Hi!

This might be true that ATI R9xx GPU possibly be released second half of 2010 on 28nm or 32nm, right here press release from http://www.globalfoundries.com/newsr...90930_GSA.aspx dated back September 30th, 2009 - it talks about that volume production of 32-nm super high performance process (SOI+HKMG) will begin in 2H 2010, while volume production of 28-nm high performance process (HKMG) will as well start in 2H 2010. Here is another http://www.globalfoundries.com/newsr.../20100215.aspx it says how high-end ARM designs will be out in 2H 2010 using both Global foundries' 28-nm high performance HKMG and 28-nm super low power HKMG process nodes. So, it looks like it is defiantly is entirely feasible for ATI to launch the "Northern Islands" generation on Global foundries 28-nm process this year; no doubt supplies will be constrained for first few months, but it wouldn't be any worse than what ATI went through last year with TSMC's 40-nm process.
__________________
What is the meaning of life? - Why I'm here, I know my past, because I return to the past but I'm going forward to see my future, to find the truth, meaning of the existence and purpose.
Shtal is offline   Reply With Quote
Old 24-Feb-2010, 05:56   #262
Tahir2
Itchy
 
Join Date: Feb 2002
Location: United Queendom
Posts: 2,858
Default

I thought GloFlo and TSMC are going straight to 28nm?
Tahir2 is offline   Reply With Quote
Old 24-Feb-2010, 07:22   #263
dizietsma
Naughty Boy!
 
Join Date: Mar 2004
Posts: 1,172
Default

I wonder if moving to GF is a result of the problems with the problems with 40nm from Taiwan fab? Mind you AMD when it owned the foundry didn't have massive success going to new process nodes, 65nm for instance....
dizietsma is offline   Reply With Quote
Old 24-Feb-2010, 07:39   #264
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Default

http://www.semiaccurate.com/forums/s...5&postcount=33


So sometime October, November then? I wonder how many chips are coming in 2010.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 24-Feb-2010, 07:43   #265
Shtal
Senior Member
 
Join Date: Jun 2005
Posts: 1,320
Default

Quote:
ATI's next gen DX11 chip is 40nm http://www.fudzilla.com/content/view/17782/1/
I don't believe Northern Islands will be on 40nm since it will increase die size and it is against their "sweet spot" policy.
__________________
What is the meaning of life? - Why I'm here, I know my past, because I return to the past but I'm going forward to see my future, to find the truth, meaning of the existence and purpose.
Shtal is offline   Reply With Quote
Old 24-Feb-2010, 07:49   #266
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Default

If fudo is right, then may be NI is Evergreen+2.

But then it flies in the face of what Charlie is claiming and what Anand said.

All in all, the only known is that some new chips are coming later this year.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 24-Feb-2010, 07:54   #267
Mindfury
Member
 
Join Date: Oct 2009
Posts: 232
Default

Maybe ATI will try new architecture on 40nm,make a chip slightly bigger than Cypress.When 28nm is ready,double everything to make a monster chip.
Mindfury is offline   Reply With Quote
Old 24-Feb-2010, 08:26   #268
Silent_Buddha
Regular
 
Join Date: Mar 2007
Posts: 8,979
Default

Quote:
Originally Posted by tannat View Post
"Lurkermode off"

What if Ati has been quite happy with being first to the table with HD5xxx and 40 nm in spite of the problems of TSCM. They are not that afraid of (and quite good at) being early adopters to a smaller process node.

It has several advantages:

1) They can be agressive while playing safe by double up with GF and TSCM making two layouts for the chip. Minimizing risks of a repeat of the problematic 40 nm adoption.

2)Whatever first fab ready with 28 nm out makes launch date. If glofo is delayed TSCM may come through and vv

3) If TSCM and glofo are ready quite simultaneously there will be less constraints of cards at launch since two 28 nm lines are used to satisfy demand. Even if early cypress yields wer low, say in 30-40% Ati may have realized that one line operating att 90% yield would not have been enough for early demand.

This could have been clear while still in negotiation with TSCM and glofo about NI. They just went for both.
Well HD 58xx (Cypress) benefitted from the experience they had on 40 nm with Rv740. That allowed them to avoid many of the problems that Rv740 ran into (going by Anandtech's article), thus allowing Cypress to have less problems than they might otherwise have had.

I'd believe the 28 nm for N.I. more if ATI releases something before then to "test" the process first to see if there's any unexpected problems they might run into, and thus allow them to better adjust N.I. for those potential problems.

Regards,
SB
Silent_Buddha is offline   Reply With Quote
Old 24-Feb-2010, 08:55   #269
Squilliam
Beyond3d isn't defined yet
 
Join Date: Jan 2008
Location: New Zealand
Posts: 3,041
Default

Quote:
Originally Posted by Silent_Buddha View Post
Well HD 58xx (Cypress) benefitted from the experience they had on 40 nm with Rv740. That allowed them to avoid many of the problems that Rv740 ran into (going by Anandtech's article), thus allowing Cypress to have less problems than they might otherwise have had.

I'd believe the 28 nm for N.I. more if ATI releases something before then to "test" the process first to see if there's any unexpected problems they might run into, and thus allow them to better adjust N.I. for those potential problems.

Regards,
SB
Wouldn't it be more practical to switch foundries with a new chip than it would be productive to simply port a chip from one to the other? Also theres the thing that they don't want to telegraph their intentions to Nvidia/TSMC whom both have a lot to lose in an effective transition. If they have a lot of leeway in terms of time it seems to make sense for them to jump in with both feet and work it out later if they know they have plenty of time before TSMC makes the switch (as they should as a client of both). Thats the flip side to what you wrote. Im a chronic contradictor, whats a GPU?
Squilliam is offline   Reply With Quote
Old 24-Feb-2010, 08:59   #270
eastmen
Senior Member
 
Join Date: Mar 2008
Posts: 4,932
Default

Quote:
Originally Posted by Silent_Buddha View Post
Well HD 58xx (Cypress) benefitted from the experience they had on 40 nm with Rv740. That allowed them to avoid many of the problems that Rv740 ran into (going by Anandtech's article), thus allowing Cypress to have less problems than they might otherwise have had.

I'd believe the 28 nm for N.I. more if ATI releases something before then to "test" the process first to see if there's any unexpected problems they might run into, and thus allow them to better adjust N.I. for those potential problems.

Regards,
SB
For all we know N.I may have already tapped out and they may be testing it already. I'm going to think they know alot more about GF's processes than TMSC's.

Mabye the first chip we will see is early summer cypress on 28nm to test the waters. They'd be able to up the clocks while lowing power , most likely be able to refresh the 5970 also. 1.2ghz on a 5890 and 1ghz on a 5990 would be a good refresh product .
eastmen is online now   Reply With Quote
Old 24-Feb-2010, 08:59   #271
Silent_Buddha
Regular
 
Join Date: Mar 2007
Posts: 8,979
Default

Quote:
Originally Posted by Squilliam View Post
Wouldn't it be more practical to switch foundries with a new chip than it would be productive to simply port a chip from one to the other? Also theres the thing that they don't want to telegraph their intentions to Nvidia/TSMC whom both have a lot to lose in an effective transition. If they have a lot of leeway in terms of time it seems to make sense for them to jump in with both feet and work it out later if they know they have plenty of time before TSMC makes the switch (as they should as a client of both). Thats the flip side to what you wrote. Im a chronic contradictor, whats a GPU?
Switching your entire new line would be extremely risky if you have nothing prior to indicate whether the transition will go well or go horribly or anything in between.

Add to that if 28 nm is delayed for some reason, you've basically blown an entire product cycle. The potential to have a longer delay than Fermi would be great.

Then again, with high risk there's also high reward. If it does go smoothly, 28 nm would be a huge advantage (in theory). Just like if 40nm had gone well with Fermi, it would have been a huge counter to Cypress.

But really, it's like an all or nothing bet without something to show its viability or expose any problems. Not to mention crossing your fingers and hoping the process node doesn't get delayed.

Regards,
SB
Silent_Buddha is offline   Reply With Quote
Old 24-Feb-2010, 09:25   #272
tannat
Junior Member
 
Join Date: Dec 2009
Location: Malmö
Posts: 61
Default

Quote:
Originally Posted by Silent_Buddha View Post
Well HD 58xx (Cypress) benefitted from the experience they had on 40 nm with Rv740. That allowed them to avoid many of the problems that Rv740 ran into (going by Anandtech's article), thus allowing Cypress to have less problems than they might otherwise have had.

I'd believe the 28 nm for N.I. more if ATI releases something before then to "test" the process first to see if there's any unexpected problems they might run into, and thus allow them to better adjust N.I. for those potential problems.

Regards,
SB

Yes, a Rv8xx chip on 28 nm would be a strong indication of 28 nm.

Going GloFo and TSMC in parallel may be additionally rewarding if they want to continue launch several die size versions within a short time frame as with Rv8xx.

It would also be an incentive for TSMC to be on the toes and not be sloppy with tool calibrations. Also, if Ati go through the effort of making separate layouts and prework on both companies 28 nm processes, then SoI on GloFo and bulk Si on TSMC may not be a huge additional step.

Rv8xx has been quite successful, this double approach may possibly mitigate most problems seen with Rv8xx launch with while enabling Ati to jump on the new node for next architecture safely.
tannat is offline   Reply With Quote
Old 24-Feb-2010, 09:29   #273
neliz
MSI Man
 
Join Date: Mar 2005
Location: In the know
Posts: 4,885
Send a message via ICQ to neliz Send a message via MSN to neliz
Default

the next RV8xx will still be on 40nm and the chances that RV9xx will go to GloFo seem to be slim.
__________________
I miss you CJ, 1976 - 2010
neliz is offline   Reply With Quote
Old 24-Feb-2010, 10:15   #274
Psycho
Member
 
Join Date: Jun 2008
Location: Copenhagen
Posts: 554
Default

"The next RV8xx"?
Psycho is offline   Reply With Quote
Old 24-Feb-2010, 10:23   #275
neliz
MSI Man
 
Join Date: Mar 2005
Location: In the know
Posts: 4,885
Send a message via ICQ to neliz Send a message via MSN to neliz
Default

Quote:
Originally Posted by Psycho View Post
"The next RV8xx"?
Hecatoncheires, the RV8xx refresh. i.e the part nv doesn't compare GF100 to when they claim "worlds fastest graphics solution."
__________________
I miss you CJ, 1976 - 2010
neliz is offline   Reply With Quote

Reply

Tags
Барт, Кайман

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 05:03.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.