Llano IGP vs SNB IGP vs IVB IGP

Discussion in 'Architecture and Products' started by AnarchX, Oct 29, 2010.

  1. TKK

    TKK
    Newcomer

    Joined:
    Jan 12, 2010
    Messages:
    148
    Likes Received:
    0
    I agree with Alexko, highly unlikely.
    Not really feasible in mobile space were power efficiency is so important, and I expect the desktop version of Llano to be clocked quite a bit higher than the mobile version, so it wouldn't help much in desktop space, either.

    It will simply be like this:
    - People who know a bit and want the fastest CPU with a somewhat acceptable IGP (or don't care about the IGP at all) will buy SB.
    - People who know a bit and just want a fast enough CPU with decent IGP will buy Llano.
    - And those who don't know anything will base their decision either on brand recognition or price.


    Conclusion: Intel wouldn't gain anything by brute-forcing their IGPs to significantly higher clocks, so they won't do it.
     
  2. AnarchX

    Veteran

    Joined:
    Apr 19, 2007
    Messages:
    1,559
    Likes Received:
    34
  3. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    12 (EU) * 4 (physical width of EU) * 2 (mul + accumulate) * 1.3Ghz.
    Strange, I don't see where the quoted doubling (per clock) from previous generation comes. Maybe intel counts different for new gen.
     
  4. EduardoS

    Newcomer

    Joined:
    Nov 8, 2008
    Messages:
    131
    Likes Received:
    0
    Or is counting fps instead of flops...
     
  5. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    No, actually it looks like it was just specific to COD:MW2. Computerbase has their full review up for SB graphics (http://www.computerbase.de/artikel/grafikkarten/2011/test-sandy-bridge-grafik/12/) and indeed in every other title the AA hit is roughly comparable to HD 5450. Could be either driver or something the game is doing SB doesn't like, but generally the AA implementation seems fine.
     
  6. Tridam

    Regular Subscriber

    Joined:
    Apr 14, 2003
    Messages:
    541
    Likes Received:
    47
    Location:
    Louvain-la-Neuve, Belgium
    AFAIK previous Intel IGP can't do single cycle MAD.
     
  7. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Neither can the current one...
    Unless you count the accumulator (whose usefulness has improved, but nothing which would change the theoretical peak flop rate as far as I can tell).
    IIRC docs said you couldn't do back-to-back accumulator write followed by accumulator read without stalling (which might no longer be the case dunno), but for a constant stream of MACs this should not affect peak flop rate.
     
    #67 mczak, Jan 14, 2011
    Last edited by a moderator: Jan 14, 2011
  8. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    Regarding Ivy Bridge GPU, we know it will be DX11, so,

    a) How would intel implement local memory? Dedicated ram (a la cayman/fermi) or unified with the rest of the cache hierarchy like lrb? 64 KB of L1 seems doable on somewhat low freq (~1.1 GHz for full turbo).

    b) Considering the jumps Intel has made with SB, I would expect unified cpu/gpu address space.
     
  9. ltcommander.data

    Regular

    Joined:
    Apr 4, 2010
    Messages:
    616
    Likes Received:
    15
    So Intel's pretty clear on Sandy Bridge's IGP supporting Compute Shader 4.x, presumably meaning both CS4.0 and CS4.1, mentioning it on pg 14 and pg 16. Has anyone actually tested this capability yet, since I don't remember seeing it in any reviews?

    http://arstechnica.com/apple/news/2010/12/apple-may-drop-nvidia-for-sandy-bridges-igp-next-year.ars
    http://www.realworldtech.com/page.cfm?ArticleID=RWT120710035639

    And what does this say about Sandy Bridge IGP OpenCL support? Chris Foreman and David Kanter were pretty adamant that Sandy Bridge's IGP couldn't/wouldn't support OpenCL, with Foreman going so far as saying Sandy Bridge couldn't do any GPGPU at all, which now appears incorrect with CS4.x support. If I'm not mistaken all GPUs that have Compute Shader support, CS4.0 in the G80 and up and CS4.1 in the RV770 and up, have also supported OpenCL. Is this generally true, that CS4.x and OpenCL have enough overlap that Intel should be able to support OpenCL as well?
     
  10. Axel

    Newcomer

    Joined:
    Dec 2, 2009
    Messages:
    2
    Likes Received:
    0
    Location:
    France
  11. AnarchX

    Veteran

    Joined:
    Apr 19, 2007
    Messages:
    1,559
    Likes Received:
    34
  12. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    I've got some doubts they will support normal MADD but either way obviously intel counts it the same so it wouldn't improve flops.
    I was wondering though with the earlier rumors about 24 EUs and now 16 EUs, maybe both rumors are true? Some chips with 24 and some with 16, instead of the current 12 or 6?
     
  13. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    I don't know how accurate this is, but as a general rule of thumb - and paraphrasing Arun - people leak stuff to Fudzilla only when they want to deny it. :wink:
     
  14. DavidC

    Regular

    Joined:
    Sep 26, 2006
    Messages:
    348
    Likes Received:
    27
    Maybe not. Earlier rumors of G35 had EUs at 16, but it came with 8. They only increased EU count by 20-30% each generation so 16 for IVB isn't surprising.

    BTW, about throughput, I don't know how they got the flops number, but just for the claim of doubled throughput might have been with geometry processing(VS/T&L) performance. Read the developer guide for HD graphics Sandy Bridge.
     
  15. 3dcgi

    Veteran Subscriber

    Joined:
    Feb 7, 2002
    Messages:
    2,493
    Likes Received:
    474
    What is your distinction between MAC and MADD? To me they are two names for the same instruction.
     
  16. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    Sorta like FMA3 and FMA4, IMO.
     
    #77 rpg.314, Jan 29, 2011
    Last edited by a moderator: Jan 29, 2011
  17. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    Yes, though not quite the same - IIRC FMA3 has several versions so you can choose which of the source regs is also used as destination (as you can specify 3 operands in total). intel igp however will always use the accumulator reg for last source operand.
    The manual (from Ironlake) says:
    "The mac instruction takes component-wise multiplication of <src0> and <src1>, adds the results with the corresponding accumulator values, and then stores the final results in <dst>."
     
  18. rpg.314

    Veteran

    Joined:
    Jul 21, 2008
    Messages:
    4,298
    Likes Received:
    0
    Location:
    /
    That doesn't seem like much of a distinction. You can always change the order of your operands.
     
  19. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    How so? In a * b + c there's only one operand which gets added to the multiplication result.
    Also, you need to get the operand into the accumulator reg first - luckily you can do that "for free", that is instructions can implicitly update the accumulator reg (or they can do it explicitly). You don't need to do that with FMA3 (so for FMA3, the hw still has to be able to fetch 3 normal regs, not so for igp mac).
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...