NVIDIA Fermi: Architecture discussion

Discussion in 'Architecture and Products' started by Rys, Sep 30, 2009.

  1. air_ii

    Newcomer

    Joined:
    May 2, 2007
    Messages:
    134
    Likes Received:
    0
    Unless 285's bottleneck was shader throughput, which I think is unlikely (to that extent on average, at least)...
     
  2. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    I was scratching my head while reading Psycho's 21% bandwidth increase but then I went back to the techreport article and saw a 1300MHz memory clock for the GTX285, which is actually clocked at 1242MHz if memory serves well which equals to ~159GB/s.

    I can't know where Rys got his frequency estimates got from but it seems that his memory frequency for GF100 is quite pessimistic and his hot clock estimates way too optimistic considering his core frequency estimate. I continue to expect a roughly 2.2x ALU:TMU ratio and a memory frequency =/>1100MHz.
     
  3. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    I'm looking forward to see those... but in the meantime: G94 was the most balanced chip I've seen in years, when taking into account the whole package.
     
  4. Psycho

    Regular

    Joined:
    Jun 7, 2008
    Messages:
    746
    Likes Received:
    41
    Location:
    Copenhagen
    Yeah, some "clock reconfiguration" benchmarking for gtx285 would be interesting :)
    For instance test at 325/750/620 (half 285) vs 500/1800/750 (half 380) if your shaders can go that high. In some realistisc modern games and maybe also with/out some cpu downclocking.

    Also puzzled me a bit, as it would seem pretty memory limited (except that 5770/5870 isn't that badly limited after all), and faster memory is an easy option.
     
  5. XMAN26

    Banned

    Joined:
    Feb 17, 2003
    Messages:
    702
    Likes Received:
    1
    This is my plan of action.

    I'll load up Crysis, Sta;ler and Oblivion, the 3 most shader intensive games I have. I will use 1920x1080 my monitors native res and I'll post results of testing using those 3 games. Each game shal have multiple results. First result will be single GTX260 216 at 650/1512, then at 325/1512 and then at 650/756. Then after some reconfiguring, I'll post 3 more results of the same games with my pair of XFX 9600GTs Alpha Dogs in SLi their default clocks are 700/1700 wihch I will reduce to 650/1512, the same clocks at which my GTX260 will be running. AA will not be used as that is fillrate/bandwidth intensive and would severely affect the scores.

    My rig
    Q6600@3.0(I will try and tweek it this week for 3.6 with my water cooling setup)
    8GB DDR2 1066 ram
    160GB HDD
    780i motherboard
    Win7U 64bit
    Cosmos 1000

    Cards to be used
    2x XFX Alpha Dog 9600GTs 700/1700 GPU clocks
    1 of my 2 eVGA GTX260 216 OCs

    Games:
    Crysis
    Stalker:SoC
    Oblivion using a saved game file from an area that stresses the shaders very hard.

    I wish I had a GTX280 or GTX285 for a better comparison, hell a 9800GTX would be nice too.

    Now given the games selected are shader intensive, the 9600Gts in SLI shouldn't be within 40% of the GTX260 216. But I can tell you now, it will be within 10-15%. 9600GTS in SLI have SLI scaling, 128SPs and 16 ROPs, the GTX260 216 is 216SPs and I think 24 or 28 ROPs, dont remember which.
     
    #2105 XMAN26, Dec 15, 2009
    Last edited by a moderator: Dec 15, 2009
  6. Psycho

    Regular

    Joined:
    Jun 7, 2008
    Messages:
    746
    Likes Received:
    41
    Location:
    Copenhagen
    I consider crysis pretty bandwidth dependent, and considering it's age oblivion could be the same. Some bandwidth scaling would be nice too. But ofcourse within this thread I would especially like to see the proposed "fermi scaling" compared to the gt200.
    The dual 9600s would also have dual rasterizers and more bandwidth (similar with the dual 4890s clearly outperforming the 5870)
     
  7. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    Just because a game makes heavy use of shaders doesn't mean its entirely dependent upon shader performance. The bottleneck can change within areas of an individual frame, let alone from frame to frame or game to game.
     
  8. Groo The Wanderer

    Regular

    Joined:
    Jan 23, 2007
    Messages:
    334
    Likes Received:
    2
    When did GT240 ship? Why did it take that long? GDDR5 was a problem, and it looks to be fixed. What's the problem?

    Blink. What is your question? Hot lot time for TSMC 40nm is 6-8 weeks.

    See above, 8 weeks ~= 2 months.

    -Charlie
     
  9. Groo The Wanderer

    Regular

    Joined:
    Jan 23, 2007
    Messages:
    334
    Likes Received:
    2
    ATI took 6-7 months from Evergreen tapeout to cards on sale. _IF_ GF100 goes on sale in Feb, that is 8 months. That is not very fast.

    -Charlie
     
  10. Groo The Wanderer

    Regular

    Joined:
    Jan 23, 2007
    Messages:
    334
    Likes Received:
    2
    Wow, and I thought Sherbin was delusional, but you seem to be on quite another level. You are well suited to marketing, but I will give that spin 3/10 for effort.

    -Charlie
     
  11. Sontin

    Banned

    Joined:
    Dec 9, 2009
    Messages:
    399
    Likes Received:
    0
    7 Months because your tape-out date was mid-end july:

    http://www.semiaccurate.com/2009/07/29/miracles-happen-gt300-tapes-out/
     
  12. Razor1

    Veteran

    Joined:
    Jul 24, 2004
    Messages:
    4,232
    Likes Received:
    749
    Location:
    NY, NY
    You were talking about Fermi if I remember correctly and it was somewhere in this thread, so I don't know where you are coming from then.


    You do know hot lots aren't the same as tape out? :grin:


    I don't think you looked at what I quoted.


    Says the blind man with the blind dog :wink:
     
  13. itsmydamnation

    Veteran

    Joined:
    Apr 29, 2007
    Messages:
    1,349
    Likes Received:
    470
    Location:
    Australia
    said the blind man to his deaf dog..........




    i have nothing more to add.....
     
  14. digitalwanderer

    digitalwanderer Dangerously Mirthful
    Legend

    Joined:
    Feb 19, 2002
    Messages:
    18,992
    Likes Received:
    3,532
    Location:
    Winfield, IN USA
    "Said the blind man to his deaf son as he picked up his hammer and saw"

    Jeeze people, get it right! :razz:
     
  15. entity279

    Veteran Subscriber

    Joined:
    May 12, 2008
    Messages:
    1,332
    Likes Received:
    500
    Location:
    Romania
    When he talked about the memory controller, I recall it was only in relation to GT2xx GDDR5 parts. He possibly could have speculated that Fermi may be affected as well, but that's it.
     
  16. Vincent

    Newcomer

    Joined:
    May 28, 2007
    Messages:
    235
    Likes Received:
    0
    Location:
    London

    I want my Ginger Fermi this Christmas !
     
  17. Groo The Wanderer

    Regular

    Joined:
    Jan 23, 2007
    Messages:
    334
    Likes Received:
    2
    Yup, and it you notice other people comparing the GDDR5 controllers pixel for pixel in the 2xx and Fermi pics, you might see that it is very likely the same, or very similar controllers.

    -Charlie
     
  18. chavvdarrr

    Veteran

    Joined:
    Feb 25, 2003
    Messages:
    1,165
    Likes Received:
    34
    Location:
    Sofia, BG
    http://www.fudzilla.com/content/view/16873/1/
    ?
     
  19. argor

    Newcomer

    Joined:
    Nov 25, 2008
    Messages:
    96
    Likes Received:
    0
  20. Alexko

    Veteran Subscriber

    Joined:
    Aug 31, 2009
    Messages:
    4,541
    Likes Received:
    964
    Yet another event about computing with nothing about gaming... yay!

    Oh and since we were on the topic of GF100's die size, Dudler from the SemiAccurate forums dug this up: http://www.fudzilla.com/content/view/15782/34/

    Between Charlie's 23.x mm * 23.y mm (where x may be 8), Neliz's 24 mm * 24 mm, Fellix's die-shot matching and scaling and Fudo's... whatever that was, everything seems to indicate something between 550 and 576mm².
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...