gpu

  1. F

    A simple ray tracing GPU on FPGA

    Last year, I wrote a simple ray tracing GPU on an FPGA to learn Verilog, but I didn't have time to document it until last month. Last weekend, I finally finished the documentation and uploaded all the source code to GitHub. It's a simple project but it should be educational for someone who is...
  2. CarstenS

    GPUOpen: All the Pipelines - Journey through the GPU

    A very informative video from GPUOpen by AMD's Lou I. Kramer: https://gpuopen.com/videos/graphics-pipeline/ Well worth the 55 minutes if you're a beginner- or intermediate-level B3D forum warrior, I'd say. :)
  3. D

    Diligent Engine 2.4.c - GLTF2.0, PBR, Shadows, Dear ImGui and more

    The latest release of Diligent Engine combines a number of recent updates (GLTF2.0 loader and PBR renderer, shadows), enables Vulkan on iOS, adds integration with the Dear ImGui UI library, and implements new samplers and tutorials. Check it out on GitHub.
  4. S

    Survey paper on Deep Learning on GPUs

    The rise of deep learning (DL) has been fuelled by improvements in accelerators. The GPU remains the most widely used accelerator for DL applications. We present a survey of architecture and system-level techniques for optimizing DL applications on GPUs. We review 75+ techniques...
  5. S

    Survey on Deep Learning on NVIDIA's Jetson Platform

    Design of hardware accelerators for neural network (NN) applications involves walking a tight rope amidst the constraints of low-power, high accuracy and throughput. NVIDIA's Jetson is a promising platform for embedded machine learning which seeks to achieve a balance between the above...
  6. S

    Survey on security techniques for GPU

    The graphics processing unit (GPU), although a powerful performance booster, also has many security vulnerabilities. Because of these, the GPU can act as a safe haven for stealthy malware and as the weakest ‘link’ in the security ‘chain’. We present a survey of GPU vulnerabilities shown by researchers...
  7. S

    A Survey of Techniques for Architecting TLBs

    A translation lookaside buffer (TLB) caches virtual-to-physical address translation information and is used in systems ranging from embedded devices to high-end servers. Since the TLB is accessed very frequently and a TLB miss is extremely costly, prudent management of the TLB is important. We present a...
  8. D

    Tile-based Rasterization in Nvidia GPUs

    http://www.realworldtech.com/tile-based-rasterization-nvidia-gpus/ Starting with the Maxwell GM20x architecture, Nvidia high-performance GPUs have borrowed techniques from low-power mobile graphics architectures. Specifically, Maxwell and Pascal use tile-based immediate-mode rasterizers that...
  9. S

    A Survey On Cache Bypassing Techniques for CPUs, GPUs and CPU-GPU systems

    Available at https://www.academia.edu/24842555/A_Survey_of_Cache_Bypassing_Techniques. Accepted in JLPEA 2016; reviews ~90 papers. Part of the abstract: With increasing core-count, the cache demand of modern processors has also increased. However, due to strict area/power budgets and presence...
  10. B

    Visual Designer - Rendering & Animation Studio

    Dear community members, I am dropping you a line to let you know that we have released a new version of our GPU/CPU rendering & animation studio. Visit us at: http://www.vd-3d.com Follow us at: http://www.vd-3d.com/blog/ BTW, any constructive feedback is welcome, we are looking for...
  11. S

    A Survey of Techniques for Managing GPU Register File

    A Survey of Techniques for Architecting and Managing GPU Register File. Accepted in IEEE TPDS 2016. Part of the abstract: To support their massively multithreaded architecture, GPUs use a very large register file (RF), which has a capacity higher than even L1 and L2 caches. In total contrast...
  12. S

    A Survey Of Techniques for Approximate Computing and Storage

    A Survey Of Techniques for Approximate Computing. Accepted in ACM Computing Surveys 2016; reviews ~85 papers. Covers: * Approximate computing in CPU, GPU and FPGA and various processor components (e.g. cache, main memory, secondary storage) * Approximate storage in SRAM, DRAM/eDRAM, non-volatile...
  13. S

    Gfx benchmarking for photo editing

    Hello, I am a new member here at Beyond3D. Pardon me if this topic was already discussed before. I am a photo enthusiast and was looking to compare photo editing results on various devices (say PC vs phone vs tablet). My idea was to install Lightroom on each device and perform common...
  14. S

    A Survey on Techniques for Managing Process Variation

    A Survey Of Architectural Techniques for Managing Process Variation. Accepted in ACM Computing Surveys 2016. Part of the abstract: Process variation (deviation in parameters from their nominal specifications) threatens to slow down and even pause technological scaling and mitigation of it is...