NVIDIA Tegra Architecture

Discussion in 'Mobile Graphics Architectures and IP' started by french toast, Jan 17, 2012.

Tags:
  1. mboeller

    Regular

    Joined:
    Feb 7, 2002
    Messages:
    923
    Likes Received:
    3
    Location:
    Germany

    What? 100GFlops?

    quoting Anandtech:
    Disappointing!

    No wonder they showed no GLB2.5 scores.
     
  2. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    Theoretical peak FLOPs have what exactly to do with pixel shader precision? I would had bet on at least USC ALUs; as it stands I'm betting 4*Vec4 FP20 PS ALUs and 2*Vec4 VS ALUs, 1 TMU/Vec4 ALU and of course coverage sampling AA :lol:

    Peak theoretical GFLOPs might even be as high IF frequency is as high as he claims it to be; if performance now should not break even with an iPad4 I for one won't be the one that will have expected too much ;)
     
  3. mboeller

    Regular

    Joined:
    Feb 7, 2002
    Messages:
    923
    Likes Received:
    3
    Location:
    Germany
    16bit Flops then :sad:
     
  4. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    A large portion FP20 and a smaller on FP32 for the VS ALUs; however since applications don't require strictly only FP32 everywhere: floating point is still floating point irrelevant of precision.

    OGL_ES is quite specific where lowp, mediump and highp should be used; as long as their vertex shaders are inevitably FP32 they've got the majority of highp recommendations covered. The question now is what the OGL_ES3.0 requirements exactly will be; by the sound of Tegra4 I'm willing to bet that the minimum won't be FP32 :p
     
  5. Xmas

    Xmas Porous
    Veteran Subscriber

    Joined:
    Feb 6, 2002
    Messages:
    3,344
    Likes Received:
    176
    Location:
    On the path to wisdom
    The spec is public. In GLSL ES 3.00 highp support is required in both vertex and fragment shaders and must be FP32, while mediump must be at least FP16.
     
  6. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    Meaning that the ULP GF in Tegra4 won't reach OGL_ES3.0?
     
  7. Nebuchadnezzar

    Legend

    Joined:
    Feb 10, 2002
    Messages:
    1,061
    Likes Received:
    328
    Location:
    Luxembourg
    That already seems out of question due to lack of unified shaders.
     
  8. Ailuros

    Ailuros Epsilon plus three
    Legend Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    9,511
    Likes Received:
    224
    Location:
    Chania
    Great :roll: Well is there anything really interesting besides the i500 to the whole Tegra4 enchilada or am I the only one here that seems bored to death? Oh wait there's project shield...:shock: arggghhh :roll:
     
  9. Xmas

    Xmas Porous
    Veteran Subscriber

    Joined:
    Feb 6, 2002
    Messages:
    3,344
    Likes Received:
    176
    Location:
    On the path to wisdom
    How so? No API feature requires unified shaders.
     
  10. Can OpenCL be done without unified shaders?
     
  11. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    OCL can run on large variety of hardware, as long as there's proper vendor run-time support.
     
  12. ams

    ams
    Regular

    Joined:
    Jul 14, 2012
    Messages:
    914
    Likes Received:
    0
    Yes, this is surprising (and one has to wonder if NVIDIA is saving their more modern and more powerful GPU architecture to go up against next gen Rogue/Mali/Adreno/Radeon, etc.), but at the end of the day it now makes sense where the 6x GPU performance improvement comes from when comparing Tegra 4 vs. Tegra 3. Compared to Tegra 3, it appears (but has not yet been confirmed) that Tegra 4 has 6x more pixel shader execution units (ie. 48 pixel shader execution units vs. 8 pixel shader execution units) and 6x more vertex shader execution units (ie. 24 vertex shader execution units vs. 4 vertex shader execution units), for a grand total of 72 pixel/vertex shader execution units in Tegra 4 vs. a grand total of 12 pixel/vertex shader execution units in Tegra 3.

    Up to 6x GPU performance improvement in Tegra 4 vs. Tegra 3 is quite significant, and should be good enough to put NVIDIA at or near the top of the heap with respect to GPU performance compared to the highest performance mobile/handheld SoC's currently on the market today. Top that off with what is arguably the highest CPU performance in a mobile/handheld SoC, in addition to some of the new features introduced on Tegra 4, and NVIDIA can legitimately claim that Tegra 4 is the world's fastest mobile processor.
     
  13. OlegSH

    Regular

    Joined:
    Jan 10, 2010
    Messages:
    798
    Likes Received:
    1,625
  14. ltcommander.data

    Regular

    Joined:
    Apr 4, 2010
    Messages:
    616
    Likes Received:
    15
  15. wishiknew

    Regular

    Joined:
    May 19, 2004
    Messages:
    341
    Likes Received:
    9
    Is this going to work power wise? I thought Anandtech's last article had the A15 in exynos sucking up a lot of juice.
     
  16. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    It's also a question what clock you're going to run this thing at, right? Is it a given that an A15 does worse in terms of perf/W and perf/mm2? I imagine that, with very high maximum clock speeds, you can save quite a bit of power by clocking down and lowering VDD as well.

    As long as the possibility is there to reasonably trade off performance vs. power consumption, it doesn't have to be a problem.
     
  17. 3dcgi

    Veteran Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    2,493
    Likes Received:
    474
    Yes, though you likely won't be able to use all of your FLOPS.
     
  18. Lazy8s

    Veteran

    Joined:
    Oct 3, 2002
    Messages:
    3,100
    Likes Received:
    19
    nVidia boasted that Tegra 3 enabled new-era PCs, equipped with Windows RT, that would never use a fan/heatsink like conventional PCs.

    Although Project Shield is a different product with different priorities and even though the fan/heatsink is mostly a non-issue, it's at least a little amusing that they'd be the company to release a mobile device needing one.
     
  19. NathansFortune

    Regular

    Joined:
    Mar 3, 2009
    Messages:
    559
    Likes Received:
    0
    Late to the discussion, but Tegra 4 doesn't have an integrated modem and it doesn't have a SM4 GPU. What the fuck are they playing at?!? What were they doing for the past year. How hard can it be to stuff 80 or so Kepler ALUs into this and integrate the baseband!

    Now comes the long wait for Tegra 5 where we just have to hope Nvidia become a real competitor to Qualcomm, because right now they aren't even turning up to the smartphone race with quad A15 and no integrated baseband.
     
  20. Laurent06

    Veteran

    Joined:
    Dec 14, 2007
    Messages:
    1,091
    Likes Received:
    489
    Do you think it's easy to integrate IP that comes from another company you just bought? Just ask Intel about integrating Infineon baseband on Atom SoC ;)
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...