NVIDIA Fermi: Architecture discussion

Discussion in 'Architecture and Products' started by Rys, Sep 30, 2009.

  1. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
    It has been possible since G80. There is no link between the ROPs/memory interface and the clusters/SMs.
    They did the same with the GTX 275 and GTX 295.
     
    #3301 Sontin, Jan 10, 2010
    Last edited by a moderator: Jan 10, 2010
  2. A.L.M.

    Newcomer

    Joined:
    Jun 2, 2008
    Messages:
    144
    Likes Received:
    0
    Location:
    Looking for a place to call home
    My fault. Anyway, there are other specs wrong in that list:

    L2 cache shared? No way, it's on chip.
    Dual-core GPU design? Come on, no one still believes that...
    Fermi should be doing texture filtering through the ALUs (though I'd defer to Rys on this), so I'm not sure that Fermi has "128TFU"...
     
  3. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    Wrong. Fermi has independent texture mapping units, probably 128; you can read Rys's piece at Tech Report.
     
  4. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
    He adds all the specifications together into one. AMD did the same with the 5970.
     
  5. A.L.M.

    Newcomer

    Joined:
    Jun 2, 2008
    Messages:
    144
    Likes Received:
    0
    Location:
    Looking for a place to call home
    TMUs consist of texture address units (TAUs) and texture filtering units (TFUs). There should certainly be 128 TAUs; I'm not sure that there are 128 TFUs.

    Nothing to say about the "shared L2" and dual core gpu? :lol:
     
  6. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    Yes, they do, but since GT200 the number of TFUs has been equal to the number of TAUs; that is the source of GT200's tremendous brute-force texturing power.
     
  7. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
    He is counting all the specifications together. NVIDIA held a press event for GF100, so it's possible that the guy got the information directly from NVIDIA.
     
  8. jimmyjames123

    Regular

    Joined:
    Apr 14, 2004
    Messages:
    810
    Likes Received:
    3
    #3308 jimmyjames123, Jan 10, 2010
    Last edited by a moderator: Jan 10, 2010
  9. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    Except for the fact that there are obvious inconsistencies, like the shared L2 and the dual-core GPU?

    Yeah, it could well be "dual core" by dedicating 1/3 of the memory bus to inter-GPU communication, assuming the address bus is R/W, but the L2 wouldn't be shared that way, and that wouldn't fit with the other "specs".
     
  10. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    805
    Likes Received:
    1,636
    Maybe you should look at this: http://www.freepatentsonline.com/7616206.pdf It describes an efficient private bus that uses a free MC to create a fast connection. There are also some interesting methods for tile-like interleaving of render targets, using cache to help hide the added latency. There are some newer patents about this link, but I'm too lazy to search for them :smile:
     
  11. MfA

    MfA
    Legend

    Joined:
    Feb 6, 2002
    Messages:
    7,610
    Likes Received:
    825
    Ugh, it's nice that NVIDIA is thinking of doing it ... but it really really doesn't deserve a patent.
     
  12. PSU-failure

    Newcomer

    Joined:
    May 3, 2007
    Messages:
    249
    Likes Received:
    0
    The theory is ok, but I hope you're not assuming they would use just 1 MC to communicate between GPUs...

    With this "NUMA-like" approach, inter-GPU bandwidth must equal local memory bandwidth to achieve optimal efficiency, and that would still imply quite a high latency.

    Even with 1/3 of the bus dedicated to inter-GPU communication, it would still be quite bad, and that would give a composite 512-bit bus, which is not in line with the "specs" given.
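
    The 512-bit figure above can be checked with a little arithmetic. This is only a rough sketch: the 384-bit per-GPU bus width is an assumption not stated in the thread, and `composite_bus_width` is a hypothetical helper written for this illustration.

```python
from fractions import Fraction

# Assumption (not stated in the thread): each GPU has a 384-bit memory bus.
BUS_WIDTH_BITS = 384


def composite_bus_width(bus_bits, link_fraction, gpus=2):
    """Return the combined local-memory bus width, in bits, across all GPUs
    when each GPU gives up `link_fraction` of its bus to the inter-GPU link."""
    # Exact rational arithmetic avoids float rounding on fractions like 1/3.
    local_bits = bus_bits * (1 - Fraction(link_fraction))
    return int(local_bits * gpus)


if __name__ == "__main__":
    width = composite_bus_width(BUS_WIDTH_BITS, Fraction(1, 3))
    print(f"composite bus width: {width} bits")
```

    Under that assumption, each GPU keeps 256 bits for local memory, so two GPUs together present 512 bits.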
     
  13. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
  14. Ninjaprime

    Regular

    Joined:
    Jun 8, 2008
    Messages:
    337
    Likes Received:
    1
    CES is over in a few hours, and we know nothing really new. I guess that Rahja dude was full of shit?
     
  15. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    805
    Likes Received:
    1,636
    One MC could possibly be enough for some types of SLI rendering. In the described method only the RT information is interleaved; geometry etc. is duplicated as usual in each GPU's local memory. But I'm not sure it applies to current hardware, because the patent refers to caches of 3Mb or more to hide latency.
     
  16. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,059
    Likes Received:
    3,119
    Location:
    New York
    Not that it isn't pure fantasy, but why would inter-GPU bandwidth need to be equal when the load on that path would be far lower than GPU<->Mem? I'm not sure what purpose it would serve anyway. Didn't both AMD and Nvidia claim that their current proprietary links have sufficient bandwidth for their purposes?
     
  17. jimmyjames123

    Regular

    Joined:
    Apr 14, 2004
    Messages:
    810
    Likes Received:
    3
    "Rahja" said that more info would be available after the "12th". According to Chris Ray, an NDA is expiring on that day, or at least changing to some extent, so that they will be allowed to pass new info on GF100. Chris said that we will get this information "very soon" (so presumably sometime within the next few days, or at least soon this month, based on his wording).
     
  18. Ninjaprime

    Regular

    Joined:
    Jun 8, 2008
    Messages:
    337
    Likes Received:
    1
    Waiting for Tuesday then, hopefully not another letdown like the "wait for CES!" one.
     
  19. hatter

    Newcomer

    Joined:
    Dec 26, 2009
    Messages:
    32
    Likes Received:
    0
    Anand on Fermi

    AnandTech finally says something on Fermi @ CES

    I think GTX265 is a typo for GTX285

    http://www.anandtech.com/tradeshows/showdoc.aspx?i=3719&p=3
     
  20. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0