G70 Core Clock Variance

Discussion in 'Architecture and Products' started by pharma, Jul 4, 2005.

  1. Arun

    Arun Unknown.
    Legend

    Joined:
    Aug 28, 2002
    Messages:
    5,023
    Likes Received:
    302
    Location:
    UK
    Well, the best thing to do at this point would be to test the triangle setup rate. This implies basically abusing the vertex cache to the maximum, so that the VS idles.
    I don't have a framework to do this efficiently right now, and I'm not sure VBO would be best to test this (since it tends to preprocess a bit; if the same triangles came up, perhaps the driver would detect it; but then again it's always possible to use different triangles but you're limited to n*n possibilities then). If nobody feels like bothering, I always could see what I can come up with though...
    Honestly, I can't believe NVIDIA saying they improved triangle setup/rasterisation efficiency. These parts must have been pretty much unchanged since the TNT2! Sure, they might have had a revamp or two since then, but there's no reason to redesign them completely for every new generation of GPUs. Which would point towards an asychronous clock rate being used in that area, at least imo.
    As for people asking "Why should part of the chip run at 470Mhz, and not all of it?" - from my verry limited understanding of hardware engineering (I'm sure some other members could give a much better answer ;)), different parts of all chips have different clocking potential. The NetBurst architecture, for example, has some of its ALUs at a much higher clock rate than the rest of the chip, because they were handtuned to support such a high frequency and because ALU design, if done properly, tends to be slightly more tolerant to higher clock rates.
    And let me reinsist that I'm talking of Triangle Setup performance here - "geometry tests" wouldn't highlight this, should it be those units that are running at a higher frequency.

    Uttar
     
  2. KimB

    Legend

    Joined:
    May 28, 2002
    Messages:
    12,928
    Likes Received:
    230
    Location:
    Seattle, WA
    It does make sense to change the triangle setup/rasterization for generating more general pixel positions for multisampling and supersampling. This is probably the change that was talked about for the 7800: it allows RGSS. You can, of course, hack RGSS/RGMS by having the triangle setup engine just output many more triangles (and have them recombined so they're rendered in an efficient order), but it'd be better to increase the efficiency of the triangle setup engine when dealing with attempting to set up a non-ordered grid.
     
  3. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    You can't or refuse to? ;)

    A claim does not automatically mean that it's the whole story.

    I understood that and to that I replied; see your PM.
     
  4. Marc

    Newcomer

    Joined:
    Mar 7, 2003
    Messages:
    74
    Likes Received:
    0
  5. digitalwanderer

    digitalwanderer Dangerously Mirthful
    Legend

    Joined:
    Feb 19, 2002
    Messages:
    18,992
    Likes Received:
    3,532
    Location:
    Winfield, IN USA
  6. Dave Baumann

    Dave Baumann Gamerscore Wh...
    Moderator Legend

    Joined:
    Jan 29, 2002
    Messages:
    14,090
    Likes Received:
    694
    Location:
    O Canada!
    You need hit the link to see which test is being used as the quote is out of context without it.
     
  7. CK

    CK
    Newcomer

    Joined:
    Jun 27, 2004
    Messages:
    15
    Likes Received:
    0
    Crap, I just thought I compared 6800U and 7800GTX on a "fair" basis using same clock and 16pp/6vs config this proves me wrong :(

    - Ohh well the result where surprisingly positive for the G70 chip compared to NV40.

    Great for the users, but not so great for reviewers or others trying to lurk some interesting result out of the chip.
     
  8. AndrewM

    Newcomer

    Joined:
    May 28, 2003
    Messages:
    219
    Likes Received:
    2
    Location:
    Brisbane, QLD, Australia
    Ah, yes sorry Dave, I must have read that part in haste. You dont have any other benchmarks for testing stencil performance?
     
  9. ben6

    Regular

    Joined:
    Feb 8, 2002
    Messages:
    864
    Likes Received:
    3
    Strangely, this ties in exactly with something interesting I observed. A card manufacturer came in with a um interesting clocked 7800GTX 490/1.3 GHz
    What's interesting about it is their default clocked 7800GTX is 450/1.25 AND installed in certain motherboards, it reads at 450 others, 490 curiouser and curiouser
     
  10. Kombatant

    Regular

    Joined:
    May 29, 2003
    Messages:
    639
    Likes Received:
    19
    Location:
    Milton Keynes, UK
    The XFX i have right now is at 450/1.25GHz
     
  11. Mariner

    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,288
    Likes Received:
    1,055
    I've just noticed that Overclockers.co.uk have started selling the XFX "Extreme Edition" with the 490/1300 clocks. I assume the name is a bit of marketing to link it with the Intel Pentium "Extremely Expensive" chips as the price of £460 including taxes is equivalent to a mere $824 for our US chums. :shock:
     
  12. Marc

    Newcomer

    Joined:
    Mar 7, 2003
    Messages:
    74
    Likes Received:
    0
    This as nothing to do with the graphic card, some motherboard manufacturer have an option in the bios wich overclock the graphic card (PEG Link for ASUSTeK). You have to disable this option!
     
  13. Unwinder

    Newcomer

    Joined:
    Jul 19, 2002
    Messages:
    48
    Likes Received:
    4
    Location:
    Russia, Taganrog
    Some updated info about investigation progress:

    1) G70 has 3 independently clocked domains with no doubts. I've already fully ripped clock detection logic from the driver, so new RT will replace single "Core clock" graph with 3 graphs, representing current clocks of each domain. I still have no strict info about the functional purpose of these domains, most likely they are geometric domain (that clock is currently read by publicly available RT and 3DMark), shader domain and ROP domain. So this naming scheme ("Core clock \ geometric domain", "Core clock \ shader domain" and "Core clock \ ROP domain") is currently used in beta of RT 15.7. However we're still waiting for NV's official comments on it, as well as performing own synthetic testing allowing us to determine domain functional purposes ourselves. So domain naming scheme will probably change in future.
    2) Shader/ROP domain are clocked with more primitive PLL comparing to geometric domain allowing per-1MHz clock frequency adjustment, and currently NVIDIA driver is able to adjust clocks of these domains with 27MHz (oscillator frequency) step only. For example, default 430MHz ROP clock is generated as 27MHz * 16, and the next ROP clock the driver is able to set is 459MHz (27MHz * 17). This results in rather interesting effect when overclocking G70: often overclocking will adjust geometric domain clock only, e.g. attempt to set 440MHz instead of default 430MHz will result in generating the same 432MHz for ROP domian clock, but 480MHz (440+40) for geometric domain clock. This effect is clearly visible on new RT's core clock monitor, G70 owners may also verify it with fillrate tests and see that there are no changes in fillrate for such example.
    3) Target 3D clock for shader/ROP domain (430MHz) and delta for geometric domain clock (40MHz) are explicitly specified in VGA BIOS in performance table. New RT is able to display this delta in "NVIDIA VGA BIOS information" section of the diagnostic report. The example of such info is shown below:

    $ffffffffff ---------------------------------------------------
    $ffffffffff NVIDIA VGA BIOS information
    $ffffffffff ---------------------------------------------------
    $1100000000 Title : GeForce 7800 GTX VGA BIOS
    $1100000002 Version : 5.70.02.11.01
    $1100000100 BIT version : 1.00
    $1100000200 Core clock : 275MHz
    $1100000201 Memory clock : 1200MHz
    $1100010000 Perf. level 0 : 275MHz/600MHz/1.20V/100%
    $1100010001 Perf. level 1 : 415(+35)MHz/600MHz/1.40V/100%
    $1100010002 Perf. level 2 : 430(+40)MHz/600MHz/1.40V/100%
    $1100020000 VID bitmask : 00000011b
    $1100020100 Voltage level 0 : 1.20V, VID 00000000b
    $1100020101 Voltage level 1 : 1.30V, VID 00000010b
    $1100020102 Voltage level 2 : 1.40V, VID 00000001b

    Please pay attention to performance level 1 descriptor. It is so called low power 3D perormance level, and the system uses this performance level as a temporary step when switching from 3D (performance level 2) to 2D (performance level 0). As you see, it has different "Geometric clock delta", and it perfectly matches with temporary 450MHz clock (415+35), which you can see on many clock graphs.
    During overclocking the driver uses this BIOS-defined delta and generates closest possible to desired clock for ROP/shader domain and closest possible to (desired clock + delta) for geometric domain. => We'll probably see some BIOS editors in the future, allowing us to alter this delta. Or even independent clock control sliders for each domain (of course, if NV will decide to provide such functionality in their driver).
    4) Digit-Life are currently preparing the review summing all this info.
    5) Unfortunately I'm leaving the city for vacation soon, so I'll be probably offline since 11th July till 8th of August. So new RT will be publicly launched when I'll return.

    Stay tuned ;)
     
  14. Geo

    Geo Mostly Harmless
    Legend

    Joined:
    Apr 22, 2002
    Messages:
    9,116
    Likes Received:
    215
    Location:
    Uffda-land
    Thanks for the update, Unwinder --I'm sure we all appreciate you taking the time to keep the denizens at B3D informed.

    Sounds like you have the situation well in hand and the mysteries on the way to resolution.
     
  15. digitalwanderer

    digitalwanderer Dangerously Mirthful
    Legend

    Joined:
    Feb 19, 2002
    Messages:
    18,992
    Likes Received:
    3,532
    Location:
    Winfield, IN USA
    Thank you, enjoy your vacation! :)
     
  16. ben6

    Regular

    Joined:
    Feb 8, 2002
    Messages:
    864
    Likes Received:
    3
    Nope not quite. The card is default clocked at 450/1.25 (according to Coolbits), but clocked at 490/1.3 according to Rivatuner and 3DMark, strange actually, as the PR person who dropped it off said it was supposed to be 490/1.3 on the AMD SLI board.

    On the Asus and Gigabyte Intel SLI boards, it's clocked at 450MHz strangely in all instances.
     
  17. Pete

    Pete Moderate Nuisance
    Moderator Legend

    Joined:
    Feb 7, 2002
    Messages:
    5,777
    Likes Received:
    1,814
    Yep, thanks for your efforts, Unwinder.

    So, Dave, is this news or old hat to ATI? Any traffic spikes from certain IP ranges?
     
  18. Rys

    Rys Graphics @ AMD
    Moderator Veteran Alpha

    Joined:
    Oct 9, 2003
    Messages:
    4,182
    Likes Received:
    1,579
    Location:
    Beyond3D HQ
    Just to back Unwinder up (not that he needs it!) and prove my earlier farting about wrong (always nice :lol: ), NVIDIA confirmed to me today that there are indeed discrete clock domains for G70.

    They'd only confirm that the pixel processing and output hardware uses the primary clock, so FPs and ROPs are at 430MHz for a reference 7800 GTX. Everything else is a 'no comment' and 'Unwinder will get the info :D'.
     
  19. digitalwanderer

    digitalwanderer Dangerously Mirthful
    Legend

    Joined:
    Feb 19, 2002
    Messages:
    18,992
    Likes Received:
    3,532
    Location:
    Winfield, IN USA
    I don't get that, they know Unwinder is gonna tell us about it. :roll:

    Lemme guess, that was either a BB or DP reply....right? ;)
     
  20. Geo

    Geo Mostly Harmless
    Legend

    Joined:
    Apr 22, 2002
    Messages:
    9,116
    Likes Received:
    215
    Location:
    Uffda-land
    Unless it was a prediction rather than a promise. They can read too --they probably know better than UW that the game is up. :)
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...