Nvidia Ampere Discussion [2020-05-14]

Discussion in 'Architecture and Products' started by Man from Atlantis, May 14, 2020.

  1. pharma

    Veteran

    Joined:
    Mar 29, 2004
    Messages:
    4,889
    Likes Received:
    4,536
    Aside from restricting the number sold, limiting sales to the country or region where the product is shipped might be another option.
     
  2. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
    Fun fact: The "8" in 3080 is upside down (verified on actual card, too)
    upload_2020-9-8_19-15-13.png
     
    Lightman, PSman1700 and Cyan like this.
  3. Rurouni

    Veteran

    Joined:
    Sep 30, 2008
    Messages:
    1,101
    Likes Received:
    432
    8 is correct. The other numbers and letters are upside down XD
    Tbf, some fonts do write 8 like that, but on 2080 it is a normal 8.
     
  4. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,244
    Likes Received:
    4,465
    Location:
    Finland
    The GeForce font doesn't, I double-checked (you can also see it the proper way up in the text on the product pages).
    Here's the same from an actual card, in a shot that can be shared:
    upload_2020-9-8_19-28-29.png
     
  5. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    801
    Likes Received:
    1,630
    Ampere is an evolutionary architecture over Turing. Such architectures are usually highly concentrated on perf/mm and perf/watt improvements.
    If something was included in Ampere, it likely boosts perf per area and watt. Does it matter whether it's balanced or not?
    IMO it doesn't. If 2x FP32 improves perf per mm - add it. People couldn't care less about the "balanced" metric; they care about perf per $, which is a derivative of perf/area and perf/watt.
     
  6. Geeforcer

    Geeforcer Harmlessly Evil
    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    2,320
    Likes Received:
    525
    So has the "world computer" actually accomplished anything of real, tangible value since its creation, or just bootstrapped a bunch of energy-burning shitcoins?
     
    Putas and swaaye like this.
  7. Cyan

    Cyan orange
    Legend

    Joined:
    Apr 24, 2007
    Messages:
    9,734
    Likes Received:
    3,460
  8. Rootax

    Veteran

    Joined:
    Jan 2, 2006
    Messages:
    2,401
    Likes Received:
    1,845
    Location:
    France
    Cyan and BRiT like this.
  9. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    It's more Flops because the trend in graphics is "moar compute": For a great many things, there's a more or less costly version available via compute shader. From micropolygons to post-FX.

    Kepler was bad in that regard, yes. But here, I think it's a sensible choice, even though Gaming-Ampere probably cannot keep all its units busy at the same time. But then - its power draw is high enough as it is, judging from TDP numbers.
     
    LeStoffer likes this.
  10. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,058
    Likes Received:
    3,116
    Location:
    New York
    For real, geez.
     
  11. Tarkin1977

    Newcomer

    Joined:
    Mar 10, 2018
    Messages:
    15
    Likes Received:
    15
    Lightman and BRiT like this.
  12. LeStoffer

    Veteran

    Joined:
    Feb 6, 2002
    Messages:
    1,262
    Likes Received:
    22
    Location:
    Land of the 25% VAT
    Indeed. Ampere looks to be an excellent compute GPU for a lot of things and a fairly great gaming card too. I'm looking at 3D rendering first for my use (Octane, Vray, maybe Redshift - all CUDA) and some other OpenCL compute, so I'm beyond tempted.

    Edit: as the Octane guys are saying: "Yes, we have Octane 2020.1.5 (our next) fully optimized for 3090/Ampere and the results are pretty crazy. I can’t share more until NVIDIA shares the OB scores themselves or the cards are public."
     
    Lightman, PSman1700 and BRiT like this.
  13. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,058
    Likes Received:
    3,116
    Location:
    New York
    Perf/area maybe in this case. It’s interesting that Nvidia didn’t showcase specific workloads or games that benefit from the change. They had a slide or 3 during the Turing launch showcasing the speed up from the separate INT pipeline.

    Strangely, Nvidia didn’t talk about raw shader flops much at all. You would think the first consumer GPUs to break 20 and 30 TFLOPS would be a big deal from a marketing standpoint. That leads me to believe that even Nvidia doesn’t think the inflated numbers are worth talking about.
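
For reference, the headline figures can be reproduced with simple arithmetic. This is a minimal sketch, assuming the usual convention of counting one fused multiply-add per FP32 lane per clock as 2 flops. The lane counts and boost clocks below are Nvidia's published launch specs, not numbers given in this thread, and `peak_fp32_tflops` is a hypothetical helper name.

```python
def peak_fp32_tflops(fp32_lanes, boost_ghz):
    """Theoretical peak FP32 throughput in TFLOPS.

    Assumes every lane retires one FMA (2 flops) per clock at boost
    frequency, which real workloads rarely sustain.
    """
    return 2 * fp32_lanes * boost_ghz / 1000.0


# (FP32 lanes, boost clock in GHz) from Nvidia's published launch specs.
# Assumption: these are not taken from the thread itself.
SPECS = {
    "RTX 3070": (5888, 1.725),
    "RTX 3080": (8704, 1.710),
    "RTX 3090": (10496, 1.695),
}

if __name__ == "__main__":
    for name, (lanes, clock) in SPECS.items():
        print(f"{name}: {peak_fp32_tflops(lanes, clock):.1f} TFLOPS")
```

These are peak numbers only. They count the second FP32 datapath as always available, which is exactly the part under debate here.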
     
    Lightman likes this.
  14. Man from Atlantis

    Regular

    Joined:
    Jul 31, 2010
    Messages:
    961
    Likes Received:
    855
    Ashes of the Singularity

     
    #1374 Man from Atlantis, Sep 8, 2020
    Last edited: Sep 8, 2020
    Lightman and BRiT like this.
  15. LiXiangyang

    Newcomer

    Joined:
    Mar 4, 2013
    Messages:
    87
    Likes Received:
    48
  16. BRiT

    BRiT (>• •)>⌐■-■ (⌐■-■)
    Moderator Legend Alpha

    Joined:
    Feb 7, 2002
    Messages:
    20,511
    Likes Received:
    24,410
    Wouldn't that be the task of the drivers and not the software application?
     
    Lightman, Krteq, CarstenS and 3 others like this.
  17. LiXiangyang

    Newcomer

    Joined:
    Mar 4, 2013
    Messages:
    87
    Likes Received:
    48
    Well, if Volta's TDP is any indication, the higher TDP may already take the extra FP32 unit into consideration.

    For instance, regardless of the compute load (large sgemm and Tensor-based hgemm included), the real power draw of my Volta rarely reaches more than 80% of its TDP unless you do FP64 extensively.
     
  18. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,058
    Likes Received:
    3,116
    Location:
    New York
    Yes, the application doesn't know anything about the hardware configuration of the ALUs. Also it's not like the application/warp/thread sees multiple FP32 ALUs anyway. It's just now the dispatcher has a 2nd FP32 slot to issue warps to each clock cycle.
     
    PSman1700, BRiT and pharma like this.
  19. LiXiangyang

    Newcomer

    Joined:
    Mar 4, 2013
    Messages:
    87
    Likes Received:
    48
    The application may not know the hardware configuration much (well, unless you are an informed programmer), but the compiler sure does...
     
  20. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,058
    Likes Received:
    3,116
    Location:
    New York
    Yeah, the compiler knows for sure but I'm not sure it matters. The compiler's job is to statically schedule math instructions within a warp. It can do so because it knows how many cycles each math operation will take and when the output of that operation will be available for input to the next math op. The dispatcher then has a bunch of ready warps to choose from each cycle based on hints from the compiler. Presumably, none of that changes with Ampere.

    The only thing that changes is that now there are more opportunities for a ready FP32 instruction to be issued each clock by the dispatcher. With Turing those instructions could be blocked because the lone FP32 pipeline was busy.
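
The point above can be illustrated with a toy issue model. This is a rough sketch under stated assumptions, not a description of the real hardware: one warp instruction is issued per clock at most, each pipe stays busy for 2 clocks per warp instruction (a simplification of 16-lane pipes serving 32-thread warps), and instructions issue strictly in order, whereas a real dispatcher picks among many ready warps. The names `cycles_to_issue`, `TURING_LIKE` and `AMPERE_LIKE` are hypothetical.

```python
def cycles_to_issue(instrs, pipes, occupancy=2):
    """Clocks a toy in-order dispatcher needs to issue every instruction.

    instrs:    sequence of op kinds, e.g. "FP32" or "INT32"
    pipes:     list of sets; each set names the op kinds a pipe accepts
    occupancy: clocks a pipe stays busy after accepting an instruction
    """
    free_at = [0] * len(pipes)  # first clock at which each pipe is free
    cycle = 0                   # next clock with an open issue slot
    for op in instrs:
        candidates = [i for i, kinds in enumerate(pipes) if op in kinds]
        if not candidates:
            raise ValueError(f"no pipe accepts {op!r}")
        # Use whichever compatible pipe frees up first.
        pipe = min(candidates, key=lambda i: free_at[i])
        issue = max(cycle, free_at[pipe])  # stall while the pipe is busy
        free_at[pipe] = issue + occupancy
        cycle = issue + 1                  # at most one issue per clock
    return cycle


# Assumed pipe layouts per scheduler partition (toy model only).
TURING_LIKE = [{"FP32"}, {"INT32"}]
AMPERE_LIKE = [{"FP32"}, {"FP32", "INT32"}]

if __name__ == "__main__":
    pure_fp = ["FP32"] * 8
    mixed = ["FP32", "INT32"] * 4
    for name, pipes in (("Turing-like", TURING_LIKE), ("Ampere-like", AMPERE_LIKE)):
        print(name, cycles_to_issue(pure_fp, pipes), cycles_to_issue(mixed, pipes))
```

In this model a pure FP32 stream issues almost twice as fast on the Ampere-like layout, while an evenly mixed FP32/INT32 stream sees no change. The benefit depends entirely on the instruction mix.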
     
    Lightman, PSman1700 and pharma like this.
  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.