NVIDIA Maxwell Speculation Thread

Discussion in 'Architecture and Products' started by Arun, Feb 9, 2011.

Tags:
  1. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,135
    Likes Received:
    2,935
    Location:
    Well within 3d
    The only descriptions I've seen of Larrabee's configuration were 32 512-bit vector registers for each hardware thread, the L1 data cache is separate and 32 KB, the L2 below it is 256 KB.

    I have not run across a description indicating the design had deviated from a reg file-> L1-> L2 standard CPU configuration.
     
  2. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    The "registers" for ~500 workitems cannot be stored in a register file with only 32 registers in all. They are allocated from cache, made resident, and then the sw runtime takes over, scheduling and switching workitems via load store instructions.
     
  3. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,135
    Likes Received:
    2,935
    Location:
    Well within 3d
    32 registers do not hold all the data needed, but that doesn't change the fact that the vector registers are physically separate and separately addressed pools of SRAM.
     
  4. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    Correct. I was referring to the logical registers as seen in OpenCL, not physical ones.
     
  5. Megadrive1988

    Veteran

    Joined:
    May 30, 2002
    Messages:
    4,642
    Likes Received:
    155
    Is it likely that Maxwell will be DX12?
     
  6. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,379
    Before this gets answered: what's in DX12?
     
  7. ninelven

    Veteran

    Joined:
    Dec 27, 2002
    Messages:
    1,709
    Likes Received:
    122
    Concurrent context support I hope...
     
  8. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    Better and more tightly coupled compute, I hope.
     
  9. jlippo

    Veteran Regular

    Joined:
    Oct 7, 2004
    Messages:
    1,352
    Likes Received:
    459
    Location:
    Finland
  10. Megadrive1988

    Veteran

    Joined:
    May 30, 2002
    Messages:
    4,642
    Likes Received:
    155
    Interesting, thanks. Seems to be for the Echelon project. I'd imagine Echelon will use a GPU architecture beyond that of Maxwell.

    I wonder if Nvidia works on 2 or 3 GPU architectures at a time, well, of course they do, concurrently.
     
  11. iMacmatician

    Regular

    Joined:
    Jul 24, 2010
    Messages:
    773
    Likes Received:
    200
    From SemiAccurate: "Nvidia’s Maxwell process choice."

    Key points:
    • Likely release timeframe Q1 2014
    • Internal aim of an 80% increase in performance/W [I'm assuming over some Kepler]
    • The first Maxwell parts will be on 28 nm, not 20 nm
    • The rest of the article is for subscribers or can be bought for $50
     
  12. homerdog

    homerdog donator of the year
    Legend Veteran Subscriber

    Joined:
    Jul 25, 2008
    Messages:
    6,173
    Likes Received:
    956
    Location:
    still camping with a mauler
    $50 what a deal!
     
  13. DuckThor Evil

    DuckThor Evil Anas platyrhynchos
    Legend Veteran

    Joined:
    Jul 9, 2004
    Messages:
    5,787
    Likes Received:
    796
    Location:
    Finland
    1000$ per year for day one access to Charlie's articles.

    Too good to be true. :)
     
  14. tviceman

    Newcomer

    Joined:
    Mar 6, 2012
    Messages:
    191
    Likes Received:
    0
    That has to be some kind of idiotic joke.
     
  15. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,379
    With pay per click rates being what they are and $50, he only needs a few idiots who are willing to pay for his 'analysis' to come out ahead. Kinda sad to miss out on the entertainment value though...
     
  16. A1xLLcqAgt0qc2RyMz0y

    Regular

    Joined:
    Feb 6, 2010
    Messages:
    993
    Likes Received:
    284
    Maxwell will be on TSMC's 20nm process when that is available.

    Seems reasonable. Kepler GK110 was 100% more efficient than Fermi GF110 in DP.

    Pure speculation from the V I.
     
  17. ninelven

    Veteran

    Joined:
    Dec 27, 2002
    Messages:
    1,709
    Likes Received:
    122
    Do you get a refund (with interest) every time he is wrong?
     
  18. Alexko

    Veteran Subscriber

    Joined:
    Aug 31, 2009
    Messages:
    4,496
    Likes Received:
    910
    Apparently, non-free articles become free after a month, so we'll just have to wait a bit. ;-)

    (Which, incidentally, only makes the $50 fee even crazier.)
     
  19. Malo

    Malo Yak Mechanicum
    Legend Veteran Subscriber

    Joined:
    Feb 9, 2002
    Messages:
    7,090
    Likes Received:
    3,161
    Location:
    Pennsylvania
    He sure puts a large value on his "insights" into the industry.
     
  20. iMacmatician

    Regular

    Joined:
    Jul 24, 2010
    Messages:
    773
    Likes Received:
    200
    Apparently not all of them will.

    And this report is under "Analysis."

    If this report is true, then I'm guessing the Maxwell lineup may be something like the G80, G92, and GT200.

    "Maxwell Prime": 28 nm, Q1 2014, 400-450 mm^2, 384-bit bus, game performance ≥ GeForce version of Tesla K20/K20X.
    "Maxwell Lite": 20 nm, Q4 2014 or whenever 20 nm realistically shows up, 250-300 mm^2, ≥256-bit bus, performance ≈ Maxwell Prime depending on bandwidth bottlenecks?
    "Big Maxwell": 20 nm, 2015, 500-550 mm^2, 512-bit bus, focusing on compute.

    I would also assume the presence of multiple lower-end chips for both process nodes, but given info and rumors about GK11x and GK20x lineups, I'm wondering if GK20x will coexist with 28 nm Maxwell, and not in a rebranding way (so GK11x = 2013 and GK20x = 2014?). However, I think that if the "80%" performance increase per watt is for 28 nm Maxwell, then NVIDIA would move to Maxwell for the other segments quickly (especially for laptops), maybe unless that move would significantly delay 20 nm Maxwell. If the performance increase is for 20 nm Maxwell, then I'd imagine they might take their time.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...