If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#2426 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
He's referring to his own speculation here.
Quote:
__________________
What the deuce!? |
|
|
|
|
|
#2427 |
|
Regular
|
Trinibwoy ... it doesn't, but when you are looking into the future by 4 months and a lot can still go wrong it's a rather small margin of error.
|
|
|
|
|
#2428 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Sure, but Rys isn't necessarily referring to GT300 showing up at Newegg. There are lots of other tidbits that could leak or be released before that.
__________________
What the deuce!? |
|
|
|
|
#2429 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Regarding the MIMD rumour, it's quite unlikely that GT300 has an instruction decoder and scheduler (+ I$?) per SP. Perhaps it refers to be able to have some sort of task parallelism while running in compute mode.
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
#2430 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Ever since I read that dynamic warp formation paper, whenever I hear about MIMD on GPUs I always assume it's still SIMD hardware but MIMD from the view of the running program. I've never really understood why the data sent to a SIMD has to be from the same warp. The SIMD doesn't care does it?
Could they practically extend GT200's scoreboarding mechanism to simply collect bundles of "ready" threads and their associated operands from any/all running warps? It would probably require some sort of operand buffering mechanism and trickier prioritization but it doesn't sound much different from what they're doing now anyway.
__________________
What the deuce!? |
|
|
|
|
#2431 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
There are two different arguments mixing here. One thing is being able to re-converge your threads and pack in a warp only (or mostly..) threads that share the same IP. Another thing is being able to schedule instructions from diverged control flow or even different programs into the same warp. The latter is way more complex (and requires more instruction decoders)
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
#2432 |
|
Regular
|
|
|
|
|
|
#2433 |
|
Regular
|
BTW, what is already accumulated in warps at the moment? Are only vertices/fragments of a single drawcall combined? Or will it try to see what changes in between drawcalls?
|
|
|
|
|
#2434 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
__________________
What the deuce!? |
|
|
|
|
|
#2435 | |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Quote:
What's harder is to pack different instructions in the same warp, as you need to improve, among other things, your instructions decoding rate.
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#2436 | |
|
Member
Join Date: Apr 2004
Posts: 325
|
Quote:
Does this get easier if your instruction set is simplified? I don't recall the exact details, but it seems like the instruction set is already pretty sparse, with MADD being the odd outlier and operand types (int16 vs. int32 vs. float vs. double?) contributing. Do we gain much by splitting the MADD? At some cost to operand bandwidth, one could gain greater use of the two math units, and it is a simplification (no more triple operand fetches, fewer instructions to support). |
|
|
|
|
|
#2437 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
Edit: I'm assuming here that instructions and operand addresses are kept separate. If that's not the case then ignore me
__________________
What the deuce!? |
|
|
|
|
|
#2438 | |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Quote:
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#2439 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Ok, now I understand what you're saying.
__________________
What the deuce!? |
|
|
|
|
#2440 | |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Quote:
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#2441 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
__________________
What the deuce!? |
|
|
|
|
|
#2442 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Not that simple, there's a reason (actually more than one..) why NVIDIA hw logical SIMD width doesn't match physical SIMD width.
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
#2443 |
|
Regular
|
You could simply run more warps of the same program ... trade flexibility and branch granularity for less control circuitry. Are there any annotated die graphs to show how much area they could save in this way? It seems to me that even for NVIDIA it's not really an issue.
|
|
|
|
|
#2444 | |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Quote:
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#2445 |
|
Regular
|
They double pump the SIMD array so they can easier deal with latency of the instructions ... but that's a completely orthogonal issue (you'd still be doing that even if you ran multiple warps with the same instruction unit). As for why they aren't doing it already, I personally don't think it makes sense for them to do it ... as I said you lose flexibility (if you don't have enough WARPs to run part of the SIMD lies idle) and the branch granularity increases (branch paths get shared between warps).
I don't think the control circuitry necessary for 16 wide scalar SIMD vs. a wider SIMD is really what is keeping their flops/mm2 down, that's why I asked for the annotated die micrograph. |
|
|
|
|
#2446 | |
|
Senior Member
Join Date: Dec 2002
Location: Under a Crushing Burden
Posts: 4,290
|
Quote:
I still think you meant that physX becoming popular would be bad b/c it would become entrenched in a dominant position and lead to less innovation due to pressuring others out of GPU market. If that is not what you meant feel free to correct my understanding.
__________________
You bought horse armor didn't you? |
|
|
|
|
|
#2447 | |
|
Mord's imaginary friend
Join Date: Jan 2004
Location: PT, EU
Posts: 3,506
|
Quote:
Chalnoth: wrt the wrappers. I tried a couple, unfortunately EF2000 was one of the earliest titles using GLIDE and the game ran in a weird DOS/win32 mode which isn't compatible with today's Windows. You can run the game in DOS or Windows or this weird mode. GLIDE is only supported in this last one. Btw, I don't believe a wrapper will be really necessary as PhysX is more abandonment-proof (that's a mouthful) since it runs, however slowly, on X86.
__________________
The optimist proclaims that we live in the best of all possible worlds, and the pessimist fears this is true. - James Branch Cabell |
|
|
|
|
|
#2448 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
I'm not discounting any possibility. Who knows, maybe instruction issue now runs at the shader clock.
__________________
What the deuce!? |
|
|
|
|
#2449 |
|
Darlek ******
Join Date: Jun 2004
Posts: 9,489
|
@Richard EF2000 didnt work with a wrapper because the game talked directly to the card and didnt use a driver
__________________
Guardian of the Most holy Two Terabytes of Gaming Goodness™ |
|
|
|
|
#2450 |
|
Junior Member
Join Date: Sep 2009
Location: The Nederlands
Posts: 51
|
According to CJ the GT300 is faster then HD5870, but not faster then HD5870X2
Link: http://translate.google.nl/translate...age%2F32639927 |
|
|
![]() |
| Tags |
| nvidia, speculation |
| Thread Tools | |
| Display Modes | |
|
|