PlayStation 4 (codename Orbis) technical hardware investigation (news and rumours)

Discussion in 'Console Technology' started by Love_In_Rio, Jan 28, 2013.

Thread Status:
Not open for further replies.
  1. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    and writing between the eSRAM and DRAM uses the DRAM bandwidth.
     
  2. oldschoolnerd

    Newcomer

    Joined:
    Sep 13, 2013
    Messages:
    65
    Likes Received:
    8
    Of course, but reading in the PR Textures and copying a completed framebuffer out at the end of the frame render is a small % of the overall bandwidth consumed during the generation of a frame. It's pretty clear that ms have measured 150GB/s to their esram during real world usage, getting that read/write activity off the system ram bus has to result in less contention.
     
  3. itsmydamnation

    Veteran

    Joined:
    Apr 29, 2007
    Messages:
    1,349
    Likes Received:
    470
    Location:
    Australia
    that is an assumption i would not make. just because they hit 150gbps doesn't actually mean anything in terms of a contention comparison. its all about sample interval, if your 95th percentile average or something like that is 150gbps over the entire frame then it means something. Remember GPU can tolerate latency

    There will be a lot of data being read-write to the Esram, its to small not to. There will still be a lot of data being read-write from the GPU to the DRAM and there will still be data read-write from the CPU to the DRAM.

    Remember GPU's have caches its not like every read and write on the PS4 is going to Dram either.

    Personally i think Dev's will find the best way to use all resources on the boxes. So unless the Esram is going to buy them the ability to run algorithms on the shaders the PS4 can't i don't think the Esram is going to be any kind of performance multiplier as i expect shader utilization to be high on both platforms regardless.
     
  4. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    It may be a small percentage but it is not free, either bandwidth nor latency wise and as such it is important to keep in mind how often you use it.

    Just to throw out some numbers (based on the time it takes to copy the data).

    25.6GB/s in MB/s is 26214.4 MB/s
    26214.4MB/s / 30 is 874 MB/frame

    In other words to copy 874MB of data to/from the eSRAM would take a entire frame of time.

    you can do a full fill/read from the eSRAM 28 times / frame if its doing nothing else.
    This mean that each total fill/read (ie 32MB) takes 1/28th of a frame to complete.

    That doesn't seem so low to me.
     
    #3384 Betanumerical, Sep 25, 2013
    Last edited by a moderator: Sep 25, 2013
  5. taisui

    Regular

    Joined:
    Aug 29, 2013
    Messages:
    674
    Likes Received:
    0
    25.6 is from?
     
  6. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
  7. Jay

    Jay
    Veteran

    Joined:
    Aug 3, 2013
    Messages:
    4,033
    Likes Received:
    3,428
    Using a single DME, which would make it the slowest you could expect it to be then.
    Fastest being using all 4 together.
     
  8. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    Not unless your doing interleaved read/writes and the DME's are bad at that

     
  9. taisui

    Regular

    Joined:
    Aug 29, 2013
    Messages:
    674
    Likes Received:
    0
    but copying with GPU is just 2X faster, and wastes the cycles, besides how realistically is this scenario in real titles? Why would anyone just move data around for nothing?
     
  10. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    Its not for nothing, its for processing, im just pointing out that it takes more then a 'insignificant amount of time'
     
  11. taisui

    Regular

    Joined:
    Aug 29, 2013
    Messages:
    674
    Likes Received:
    0
    why would anyone need to copy a finished frame is beyond me. Also PRT don't need to be refreshed completely every frame, I think the point is that the DME frees up the GPU, 25.6G be, you can refresh the entire 32M 13 times @ 60fps, that's like 1M cycles when going through GPU is at 500K? Its pretty insignificant IMO
     
  12. oldschoolnerd

    Newcomer

    Joined:
    Sep 13, 2013
    Messages:
    65
    Likes Received:
    8
    I'm not sure that's the point I was trying to make. I only mentioned the low % of reads/writes to dram as opposed to esram in response the post that was saying that 32MB esram is so small it's irrelevant to the overall bandwidth picture.

    Clearly data is going to need to be moved between dram and esram, but there is going to be a bigger proportion of bandwidth that is consumed by the GPU reading/writing intermediate data to esram. That has to be the point of having it in the first place.

    Getting back to my original question, it seems that in the absence of any evidence we can only guess at the real world performance of Orbis bus. I proffer that it will not do any better than the x1 at 75%. Happy to hear anybody's ideas about why it could me more or less than this.
     
  13. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    Im merely giving upper bounds on specific busses and the times that they take. The reason I gave numbers for the entire thing is that its easier to reason with thats all.

    Yeah you should be able to fill / empty it 13 times at 60FPS but thats using the entire peak bandwidth of the DME's and they hang off a bus that has other devices on it (although they do not use a great deal of bandwidth).

    Not to mention thats also using ~1/2 your DDR bandwidth.
     
  14. taisui

    Regular

    Joined:
    Aug 29, 2013
    Messages:
    674
    Likes Received:
    0
    well, to put it in perspective, PS4 at 176GBps and 8G Ram, allows 3G of data copying per frame @ 60fps, so it'll take 3f to copy the full 8G? I don't get what you are saying at all.
     
  15. Betanumerical

    Veteran

    Joined:
    Aug 20, 2007
    Messages:
    1,763
    Likes Received:
    280
    Location:
    In the land of the drop bears
    Im saying that there is a upper limit to the amount of bandwidth the DME's have and as such I don't really think that copying even the entire 32MB from eSRAM to DDR or vice versa takes 'insignificant' amount of time and as such it would require careful usage to get good performance out of.
     
  16. Pixel

    Veteran

    Joined:
    Sep 16, 2013
    Messages:
    1,008
    Likes Received:
    477
    My bad, not sure how I misread it.
     
  17. Love_In_Rio

    Veteran

    Joined:
    Apr 21, 2004
    Messages:
    1,627
    Likes Received:
    226
    #3397 Love_In_Rio, Sep 25, 2013
    Last edited by a moderator: Sep 25, 2013
  18. mrcorbo

    mrcorbo Foo Fighter
    Veteran

    Joined:
    Dec 8, 2004
    Messages:
    4,024
    Likes Received:
    2,851
    I don't think that use case (copying the entire contents of the ESRAM to DRAM or filling the ESRAM from DRAM data unmodified) is going to be very prevalent. Ideally, I would expect that since the GPU can read and write to both pools that you would want to accomplish the bulk of your data movement between them by tying it to GPU processing. At that point you're just using the DMEs for specific use cases and in ALU-bound situations to take advantage of unused bandwidth.
     
  19. szymku

    Regular

    Joined:
    Mar 2, 2007
    Messages:
    318
    Likes Received:
    180
    Do you think it will be possible for PS4 to emulate PS2 without additional hardware?
     
  20. Love_In_Rio

    Veteran

    Joined:
    Apr 21, 2004
    Messages:
    1,627
    Likes Received:
    226
    Yes, because unlike PS3 RSX, this time the GPU has enough bandwitdth to emulate graphics synthesizer without any problems. Sony could port any freeware PS2 PC emulator to PS4.
     
Loading...
Thread Status:
Not open for further replies.

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...