NVIDIA GF100 & Friends speculation

Discussion in 'Architecture and Products' started by Arty, Oct 1, 2009.

  1. Lonbjerg

    Newcomer

    Joined:
    Jan 8, 2010
    Messages:
    197
    Likes Received:
    0
    You mean like this?
    http://www.anandtech.com/showdoc.aspx?i=1683&p=6
    (9700 Pro)

    Or like this?
    http://www.anandtech.com/video/showdoc.aspx?i=2870
    (8800GTX)

    Or like this?
    http://www.anandtech.com/video/showdoc.aspx?i=2870
    (2900XT)

    Or like this:
    http://www.anandtech.com/video/showdoc.aspx?i=3643&p=17
    (5870)

    It's not unheard off...especially not when dumping a new architechture on the market.

    I'm not convinced it's not a fake benchmark thoguh...just saying the validity of dismissing benches due to one side having a large advantage in most games is a flawed stance.
     
  2. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    The benchmarks leaked last weekend (DiRT2 and Far Cry 2) were 100% real, with Far Cry 2 being the "best case scenario"
     
  3. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    My take on this,
    • 8 SM's
    • ~1.5G hot clock
    • 256bit gddr5, base clock close to gt200's

    Assuming it is kinda gt200 + dx11 perf wise, then @330mm2 it is going to have nice time competing against juniper on perf/(manufacturing cost).

    I have a feeling that shielding 480 from the evils of afr, embodied in that creation of satan called 5970, will be a part of epic battle between good and evil. :roll:
     
  4. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    Not likely considering the architectural inefficiencies. if GF100 is hot and hungry compared to cypress, GF104 will be hot and hungry compared to Juniper.

    Sounds healthy to me

    The Devil doesn't wear green.
     
  5. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Even if true, bear in mind that there are a wide variety of ways for benchmarks to be misleading. In this case, we could be talking about immature GF100 drivers, for instance.
     
  6. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    That's not necessarily the case, not if the GF100 turns out to significantly outperform, because then nVidia could significantly lower the clocks of their lower-end parts to get better perf/power ratios.
     
  7. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    The card being late doesn't mean the drivers had zero development in them.
     
  8. Picao84

    Veteran

    Joined:
    Feb 15, 2010
    Messages:
    2,109
    Likes Received:
    1,196
    Care to elaborate on this please? What architecture inneficiencies? That it requires more power may not 100% justify that affirmation.
     
  9. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    Of course not. But drivers need quite a lot of work to be ready for actual use in games. In any case, we should find out either tomorrow or Monday what the real situation is at launch. It will, of course, take a few months to see how the driver situation shapes out, but for now the benchmarks available within the next few days will give us a good idea of how well the cards will perform when they first reach peoples' hands.
     
  10. GZ007

    Regular

    Joined:
    Jan 22, 2010
    Messages:
    416
    Likes Received:
    0
    Multiple dies on substrate acting as a single gpu could solve all of this in the future.
    The nvidia results did contain SLI gtx480 numbers. And 3D Vision Surround will work only on SLI. So both companys count with mGPU.
     
  11. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
  12. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    And how do you solve the resulting heat problem?:grin:
     
  13. GZ007

    Regular

    Joined:
    Jan 22, 2010
    Messages:
    416
    Likes Received:
    0
    Maybe the monster gtx480 cooler would be enough to cool down 2 cypress dies at once :grin:
    Xenos daughter die had some core logic and that was some years ago. So its not just in realm of fantasy.
     
  14. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Given the partitioning into gpcs, that would mean 2 GPCs with 5 SMs each. If the GPCs otherwise remain unchanged, would mean though can "only" rasterize 16 pixels/clock, which if we're assuming 32 rops (if that's a 256bit bus) seems a bit low. What about 288 cores instead? 3 GPC with 3 SMs each? Not sure though what you could expect from such a card, with bandwidth like Cypress yet alu/tmu throughput below Juniper (unless for tmu you take Arun's 3 half-quad tmus per SM). Actually the ratios (except tmu) would be close to GT200...
     
  15. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    Bandwidth and latency are not the same thing, so systematic PCI Express latencies are real - see any GPGPU effort for the work-arounds required to minimise that impact.

    Additionally the API has some fairly fundamental, coarse, granularities in it. As I said to PeterT earlier, this is why D3D10 implements finer-grained update of state (e.g. constant buffers) and why D3D11 allows multi-threaded construction of state.

    All of these things conspire against old graphics engines that are built on out of date techniques with the DX9 API. The efficiency gains in D3D11 are enough that it's worth running the game/drivers in D3D11 mode even though the hardware is only capable of DX9.

    Still don't even have a decent answer why tessellation in Heaven 1.0 is so slow on ATI. Somehow I think we'll be waiting a long time.

    Sure, that would be a hardware limitation. It might be a fairly noticeable bandwidth-efficiency limitation (see the 8xMSAA performance in GT200) or it might simply be not enough bandwidth. Or setup rate. Just have to prove that the specific game is sensitive in that respect.

    Of course review sites that even bother to activate 8xMSAA or adaptive/transparency MSAA are pathetically few.

    If you're going to make a reasoned comparison of the gains in a replacement GPU, you've gotta take the mix into account: unit counts + bandwidth + serialisations. I don't expect to see many existing games benefit from the dramatically higher setup rate in GF100 - but the architecture's more finely-grained rasterisation (which is a result of the parallel setup architecture) may mean that those same games see "better than expected" scaling on GF100. If that proves to be the case, then it's another parameter to investigate in Cypress scaling - though it seems unlikely there'll be much of that done, either.

    That may not turn out to be due to finely-grained rasterisation. It might be to do with the way hardware threads are launched. etc.

    Why's that relevant? It's quite clear that a lot of game developers don't have the time/resources to implement a state of the art scalable and efficient engine. And if the API stands in the way...

    The bottom line is you have to prove your test is evaluating what you say it is, before you can say that the test indicates X about the test subject.

    I'm certainly not excluding the possibility of end-of-an-architecture problems in Cypress, where scaling has hit the end-stops due to something. I suspect some fixed-function stuff is out of its depth, but I say that because of the poor tessellation performance, not because of existing games.

    It's notable that we're (generally, presumably some did know) only now realising that Crysis 1920x1200 with 4xMSAA is limited by the capacity of video RAM. How long have reviewers been testing that game at that setting? Why's it taken so long to discover the RAM limitation? What folly it has been to say it doesn't scale, when the card is running out of memory. (Though I think the game is still meant to scale substantially with CPU at that kind of setting, too - honestly Crysis has long seemed like a red herring as analysis has been woeful.)

    Jawed
     
  16. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
  17. Vincent

    Newcomer

    Joined:
    May 28, 2007
    Messages:
    235
    Likes Received:
    0
    Location:
    London
    In terms of prototype level, gf104 does not exist right now.
     
  18. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    "Zero development" was never mentioned, and is an absurd assumption. It's simple common sense that a new card with a brand new architecture stands to gain more performance from the evolution of drivers than one that has been out for 6 months and is itself a revision of an existing architecture.
     
  19. neliz

    neliz GIGABYTE Man
    Veteran

    Joined:
    Mar 30, 2005
    Messages:
    4,904
    Likes Received:
    23
    Location:
    In the know
    And that's my point. with the original GF100 being expected about the same time as Cypress, driver development surely hasn't lagged for all that time.

    And with nV's driver team always seen as outperforming AMD's, who's to say that they weren't able to do just as much work over the past few months.
    This is, after all a very expensive and very important product, you don't want to ruin that launch because the drivers are teh suxx, assuming they've been sleeping and had little time to work on drivers is imho a wrong PoV.
     
  20. Pressure

    Veteran

    Joined:
    Mar 30, 2004
    Messages:
    1,655
    Likes Received:
    593
    Yeah, that worked out great for 3Dfx. Just check out the Voodoo5 6000.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...