AMD: R7xx Speculation

Discussion in 'Architecture and Products' started by Unknown Soldier, May 18, 2007.

Thread Status:
Not open for further replies.
  1. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    Just when I thought these R700/GT200 threads couldn't get more boring...

    (Your way of stereotyping doesn't require me to own a Mac to feel better and special.)
     
    #2201 silent_guy, May 29, 2008
    Last edited by a moderator: May 29, 2008
  2. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    Macs are closed architecture. It's up to Apple to request ATi (or NV) create a card with EFI ROM to support their systems. It's not like there's an aftermarket for Mac parts (comparatively speaking) so ATi has no incentive to do it on their own.

    I understand the function of the PLX chip, though some posting appear not to. The purpose of my previous post was to make it clear to these individuals that no bridge or arbiter is necessary for the sharing of data between the two GPUs.
     
  3. Pressure

    Veteran

    Joined:
    Mar 30, 2004
    Messages:
    1,655
    Likes Received:
    593
    The limitation of BIOS is really starting to show its ugly face and even Vista were supposed to go EFI.

    The .kext has support for several graphic cards, including the Radeon HD 3800 series. There were a rumor floating around earlier that ATI would release an EFI based card. Otherwise it just makes no sense that the device ID is found in the graphic .kext

    I sure did hit the mature audience here I see.

    No, I actually use my Mac Pro for professional work. I am earning money on this lovely platform and my creativity is through the roof. Did wish I had a bit more choice in the graphic card department though.

    Back to topic I presume.
     
    #2203 Pressure, May 29, 2008
    Last edited by a moderator: May 29, 2008
  4. Kaotik

    Kaotik Drunk Member
    Legend

    Joined:
    Apr 16, 2003
    Messages:
    10,246
    Likes Received:
    4,465
    Location:
    Finland
    At least x64 Vista has EFI support, and apparently at least some motherboard manufacturers are now bringing motherboards with normal BIOS, but which can be software updated to EFI instead later this year
     
  5. Razor1

    Veteran

    Joined:
    Jul 24, 2004
    Messages:
    4,232
    Likes Received:
    749
    Location:
    NY, NY
    hmm I have a mac, macs are cool, bootcamped there are EFI emulators out there, to get most graphics cards workin in the Mac OS :wink:
     
  6. ChronoReverse

    Newcomer

    Joined:
    Apr 14, 2004
    Messages:
    245
    Likes Received:
    1
    So a Mac somehow increases your creativity? This is one of the reasons why there's such a backlash against some of the Mac folks.
     
  7. XMAN26

    Banned

    Joined:
    Feb 17, 2003
    Messages:
    702
    Likes Received:
    1
    OT, apparently, MAC users think that somehow the you can get for it is better than th same software n a PC.
     
  8. Sound_Card

    Regular

    Joined:
    Nov 24, 2006
    Messages:
    936
    Likes Received:
    4
    Location:
    San Antonio, TX
    Wow, Mac bashing came out of no where. Back on topic yes?
     
  9. silent_guy

    Veteran Subscriber

    Joined:
    Mar 7, 2006
    Messages:
    3,754
    Likes Received:
    1,382
    Companies spend millions on nice buildings, nice furniture and a pleasant working environment. I've worked in the worst kind of conditions and the best, and it does make a difference in the overall attitude one has about the work place.

    Why would it be any different about the day to day in-your-face tool you're using?

    I'm not saying it's a necessary condition, but even if it helps just a few percent in making people feel better and more productive, it's worth it.
     
  10. Anarchist4000

    Veteran

    Joined:
    May 8, 2004
    Messages:
    1,439
    Likes Received:
    359
    I'm not sure we're suggesting that an external arbiter is needed but connecting what is effectively a 512bit bus doesn't seem plausible. Less pins and higher speeds between the chips seem to be the only solution to have a reasonable amount of bandwidth.

    If both chips are going to share the same pool then I'd assume half of whatever bandwidth would be consumed by texture fetches would have to utilize that connection with minimal latency.

    That's why we're suggesting some form of high speed interconnect and possibly offloading other features in the process. Ultimately it would come down to just how much space a feature consumed and whether or not it was even worth removing.
     
  11. IbaneZ

    Regular

    Joined:
    Apr 15, 2003
    Messages:
    743
    Likes Received:
    17
    No.

    This is pre-release hysteria, let the weird times roll. :lol:

    Just enjoy it, when we all have the bechmarks it'll be boring times again. :wink:
     
  12. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    What's the worst case scenario for texture bandwidth, assuming a 1GHz GPU with 16 TUs? Assuming fp16 texels, minified with no mipmap and bilinearly filtered, I think this comes out as 16 texels per pixel * 16 pixels per clock * 1GHz * 8 bytes per texel = 51.2GB/s.

    So each GPU in a pair could read that out of its local memory. Or each GPU could, on average, read half of that from the other GPU's memory. So that would require 51GB/s connecting the two GPUs.

    Is that reasonable as an upper bound on the bandwidth required to join two RV770s if they operate as a "shared memory" graphics card?

    Jawed
     
  13. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    This is what I'm talking about.

    Why must a dual RV770 SKU have any sort of connection between its GPUs?

    A shared memory pool utilizing DMA and existing memory interface infrastructure(s) are present on each GPU. No additional hardware nor separate traces need be run (beyond what is necessary to enable dual GPUs on a PCB, that is).
     
  14. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,716
    Likes Received:
    2,137
    Location:
    London
    Depends on the bandwidth required to get scaling out of a shared memory configuration, if they are, indeed, configured that way.

    Jawed
     
  15. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    Good thing they've switched to GDDR5 from GDDR3 then ;)

    I just see all these bits as separate pieces of the same puzzle. They just seem to fit together too well for the obvious case to be anything but true, but I've been wrong before...
     
  16. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,061
    Likes Received:
    3,119
    Location:
    New York
    All we know is that R700 is a dual-GPU card with GDDR5. What other bits are there?
     
  17. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    I believe the slides showing "shared memory in the R700 generation" has been linked to several times, the most recent is probably back a couple pages by now.

    Even if you believe this information to be outdated (as some have suggested), it is clear that ATi has the desire to simplify multi-GPU rendering, while also increasing efficiency resulting in greater performance.

    Maybe I just want to believe so hard that 4870 X2 is "something more" than yet another CF on a card solution.
     
  18. trinibwoy

    trinibwoy Meh
    Legend

    Joined:
    Mar 17, 2004
    Messages:
    12,061
    Likes Received:
    3,119
    Location:
    New York
    Yeah I saw those but weren't they made up by some Chinese website? ATi is obviously moving in this direction but I haven't seen anything indicating that we will see it in R700.

    There's one simple reason for that....such a high level of inter-die integration would probably require significant architectural change. R600 was definitely an attempt at single die supremacy so I'm not expecting anything along these lines until AMD's next architecture rolls out.
     
  19. ShaidarHaran

    ShaidarHaran hardware monkey
    Veteran

    Joined:
    Mar 31, 2007
    Messages:
    4,027
    Likes Received:
    90
    Re: integration of components - I know the slides showed an MCM, but I don't think it's absolutely necessary to achieve the desired effects. It does make trace-routing all that much more difficult, and of course PCB costs rise accordingly, though.

    I think the change to R7xx generation (as small as it may be) by no means precludes the possibility of the introduction of any necessary micro-architectural changes to facilitate the use of a shared memory pool for a dual GPU SKU.
     
    #2219 ShaidarHaran, May 29, 2008
    Last edited by a moderator: May 29, 2008
  20. mczak

    Veteran

    Joined:
    Oct 24, 2002
    Messages:
    3,022
    Likes Received:
    122
    I don't get your math. If I multiply the stuff you mentioned, I come up with 2TB/s... That said, I don't understand the calculation neither - why 16 texels per pixel? Shouldn't that be 4 for bilinear? In any case, I suspect even under somewhat bad conditions you'd usually only have 1 or so, bilinear (with mipmaps) tends to be perfect for texture caches. That still gives 128GB/s - meaning the chip doesn't have enough bandwidth for this anyway. Though DXT1 textures would only use 8GB/s, and DXT5 only 16GB/s...
    I suspect for really good performance you'd want half the memory bandwidth as aggregate link bandwidth, with all textures split up (with some tiling pattern) between the two chips - meaning each chip would still have the same memory bandwidth as a single chip configuration (aside from pathological cases where all texture accesses from a chip go to the memory of the other chip). Though if you assume texture fetch doesn't consume that much bandwidth (after all, your ROPs probably want some too, and as said with compressed formats it should be much lower) maybe something like one fourth the bandwidth instead of half could be enough...
     
    #2220 mczak, May 29, 2008
    Last edited by a moderator: May 29, 2008
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...