nvidia mobile kepler more powerful than ps3 - New era of mobile games!

Discussion in 'Mobile Graphics Architectures and IP' started by arijoytunir, Jul 24, 2013.

  1. arijoytunir

    Regular

    Joined:
    Nov 13, 2012
    Messages:
    347
    Likes Received:
    12
  2. Rangers

    Legend

    Joined:
    Aug 4, 2006
    Messages:
    12,322
    Likes Received:
    1,120
    I doubt this will stay here as on topic, but yeah that was very interesting!

    They quote ~400 gflops at 1ghz, but Anand speculates in product clocks would be much lower (500 mhz, 200 gflops).

    It's a straight Kepler SIMD, 192 cuda cores.

    I'm trying to figure how this will compare to next gen...I think the issue would likely be it will fall well flat everywhere except FP. Such as bandwidth. In pure FP at that 1GHZ clock it is 1/3 as good as rumored Xbone.

    That jungle demo is very impressive, looks a bit like Crysis.

     
  3. arijoytunir

    Regular

    Joined:
    Nov 13, 2012
    Messages:
    347
    Likes Received:
    12
    thanx for the edit.
     
    #3 arijoytunir, Jul 24, 2013
    Last edited by a moderator: Jul 24, 2013
  4. ThePissartist

    Veteran Regular

    Joined:
    Jul 15, 2013
    Messages:
    1,494
    Likes Received:
    427
    That's fairly impressive. One more year until I have a mobile refresh, hopefully something like this will be available then. Maybe the Power VR6.

    I do wonder how long it'll take for mobile SOCs to catch up with the PS4/Xbox One.
     
  5. ninelven

    Veteran

    Joined:
    Dec 27, 2002
    Messages:
    1,702
    Likes Received:
    117
    Mobile devices won't have a chance until they have comparable bandwidth. PS4 has ~176 GB/s iirc. However, that could be sooner than you think depending on your perspective.

    Samsung is a member of the Hybrid Memory Cube consortium, and I would not be surprised to see them try to get it into their premier phone/tablet ASAP. I would guess possibly in 2016, maybe, if we are lucky, by 2015.

    Even the lowest end HMC could provide ~160 GB/sec of bandwidth. That would be "competitive" with the newest consoles.

    But the manufacturing process (and power consumption) for the SoCs probably will probably continue to be an issue. Still, you could easily see something "close" to XB One in 2016.
     
  6. swaaye

    swaaye Entirely Suboptimal
    Legend

    Joined:
    Mar 15, 2003
    Messages:
    8,457
    Likes Received:
    580
    Location:
    WI, USA
    Their demo video looks inferior to 2004's Far Cry. I'd guess this is actually well below 360 and PS3 except in some raw numbers that are impractical.
     
  7. Priyadarshi

    Newcomer

    Joined:
    Sep 22, 2012
    Messages:
    57
    Likes Received:
    0
    Location:
    USA
    I disagree. The lighting, shadows and water effects are much better than original FarCry. The HDR looks kinda off though. They also demoed UE4 running on Logan at Siggraph.
     
  8. ams

    ams
    Regular

    Joined:
    Jul 14, 2012
    Messages:
    914
    Likes Received:
    0
    I suggested that Mobile Kepler (ie. Kepler.M for Project Logan) could have 192 CUDA "cores" operating at up to ~ 1GHz several months ago. The surprising part is that Kepler.M reportedly has 3x better perf/watt compared to the Ipad 4 GPU, even though it is reportedly fabricated on a 28nm fabrication process and not a 20nm fabrication process.
     
  9. Alexko

    Veteran Subscriber

    Joined:
    Aug 31, 2009
    Messages:
    4,496
    Likes Received:
    910
    It is? Where?
     
  10. swaaye

    swaaye Entirely Suboptimal
    Legend

    Joined:
    Mar 15, 2003
    Messages:
    8,457
    Likes Received:
    580
    Location:
    WI, USA
    Rendering techniques have improved considerably and so have feature sets. But that jungle island is rather sparse don't you think? This will probably be a giant leap for ultra low power GPUs but compared to 8800GTX? 8800GTX has 86 GB/s memory bandwidth, 14 Gpix/sec, 37 Gtex/sec. 128 1.35 GHz G80 ALUs vs. 192 ~500MHz Kepler ALUs is an interesting question too.

    The other question is power consumption. The low end 384 ALU Kepler notebook parts are ~30W. How will they get 192 ALUs to work in Tegra 5 at ~2W or whatever.
     
  11. Priyadarshi

    Newcomer

    Joined:
    Sep 22, 2012
    Messages:
    57
    Likes Received:
    0
    Location:
    USA
    IIRC, This is GLBenchmark 2.7 where Logan achieved ~18 fps while Ipad4 was ~17 fps.

    [​IMG]
     
  12. Priyadarshi

    Newcomer

    Joined:
    Sep 22, 2012
    Messages:
    57
    Likes Received:
    0
    Location:
    USA
    I think it is achievable with improvements over the years in process technology and power efficiency. This is from the Kepler whitepaper :

    One of the notable papers presented by Nvidia this year at 2013 IEEE International Solid-State Circuits Conference (ISSCC) was a 20 Gbit/s serial die-to-die link made in 28-nm CMOS. The link runs on a 0.9 V supply and has power efficiency of 0.54pJ/b. This is probably the 'new low-power interconnect' they have been talking about.
     
  13. ams

    ams
    Regular

    Joined:
    Jul 14, 2012
    Messages:
    914
    Likes Received:
    0
    The other crazy thing is that Kepler.M "Logan" reportedly has ~ 5x higher performance (frames per second) than the ipad 4 GPU (presumably while still being useable in a thin fanless tablet).

    So Tegra 4 "Wayne" comes to market much later than expected, but Tegra 5 "Logan" will probably come to market much earlier than expected.
     
    #13 ams, Jul 28, 2013
    Last edited by a moderator: Jul 28, 2013
  14. swaaye

    swaaye Entirely Suboptimal
    Legend

    Joined:
    Mar 15, 2003
    Messages:
    8,457
    Likes Received:
    580
    Location:
    WI, USA
    I'm suspicious about the power consumption, especially when it's pushing that 5x ipad4 performance level.
     
  15. MfA

    MfA
    Legend

    Joined:
    Feb 6, 2002
    Messages:
    6,811
    Likes Received:
    478
    Performance/Watt can be misleading when performance/mm2 is substantially different (ie. if it's a much larger chip much lower clocked). Performance/watt/mm2 against PowerVR6 will be the big battle (assuming IMG and it's partners manage to get it out in time to compete).

    Still, mobile devices which can run relatively straight XBOX360/PS3 ports will be interesting.
     
  16. ltcommander.data

    Regular

    Joined:
    Apr 4, 2010
    Messages:
    614
    Likes Received:
    11
    nVidia is saying they can reach iPad 4 performance levels at 1/3rd the power and that Logan's peak theoretical performance is 5x the the iPad 4's. Assuming linear performance/watt scaling, if Logan is going full tilt at 5x the iPad 4's performance it's power consumption will be 66% higher than the iPad 4's GPU. A less than linear performance/watt scaling would make it worse of course. The iPad 4 is considered big, thick, and heavy so the direction of future tablets will be smaller. In such a case, is Logan's peak performance and accompanying power consumption and thermal load going to be achievable and sustainable within the confines of a reasonably thin tablet with reasonable power consumption?

    nVidia does claim that the Ira demo was done at less than 3W. If this is Logan operating at less than full tilt, the graphics that are achievable at this performance and power level are still impressive and it bodes well for the graphics that can be achieved at peak performance in a thicker tablet like a hybrid/convertible or a Shield 2. Whether they'll lead performance/watt we'll have to wait for Rogue and Adreno 4xx.

    Anand does point out that they're not sure what else might be hanging off the GPU power rail on the iPad 4 so it's best not to focus too much on the specific iPad 4 power number. Logan's 900 mW figure at iPad 4 performance levels seems like the only solid figure.


    Imagination seems to be addressing both options with small area G6x00 designs and large area G6x30 designs. It'll be interesting to see which option is more popular.
     
    #16 ltcommander.data, Jul 29, 2013
    Last edited by a moderator: Jul 29, 2013
  17. lik

    lik
    Newcomer

    Joined:
    Jun 30, 2006
    Messages:
    13
    Likes Received:
    1
    Perf/W doesn't scale linearly, due to the non-linear relationship between Vdd and power. 1GHz requires probably 1.1V or higher for 28nm, while 400~500MHz needs only 0.8V or lower. Simple voltage and freq scaling will get about 5x power difference. There are other differences that could push it to the 7.5x (2SMX@30W vs 1SMX@2W) such as the removal of FP64, slightly different processes from TSMC (although both in 28nm), notebook's board design, etc.
     
  18. GLX

    GLX
    Newcomer

    Joined:
    Jun 23, 2007
    Messages:
    21
    Likes Received:
    0
    That's a die - to - die (two chips) link.

    Logan is one chip.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...