Nvidia BigK GK110 Kepler Speculation Thread

Our rendering servers delivers the highest throughput with 2 instances (of our application, ie 1 d3d9 context pr instance) per graphics card, and adding another card with it's own load can barely affects performance (ie it scales nicely).
This is Windows7 (for the particular machine with 2 cards, but 2008R2 works the same) and radeons.
I know I'm late to the party, and this was posted a while back. Can you share any more info on your app? I've been doing some tinkering with Microsoft Hyper-V and RemoteFX platforms, specifically around the "shared" 3D acceleration that RemoteFX can deliver across multiple simultaneous streams via any 3D accelerator card that has a WDDM driver available. Even an ATI 4850 provides an acceptable amount of performance for 'easy' games across three separate Win7 VM remote sessions.

I'm not entirely sure of how they do it, either.
 
So...
1/3rd rate DP
Access to texture cache
2x L2
4x Registers
2880 CCs
240 TUs

I wonder if it will be substantially faster than GK104 in games?
 
Another interesting bit:

Unlike Fermi, which did not permit double precision instructions to be paired with other instructions, Kepler GK110 allows double precision instructions to be paired with certain other instructions that have no register file reads, including load/store, texture, and some integer instructions.
 
I wonder if it will be substantially faster than GK104 in games?

Depends on clocks but it's almost guaranteed to be worse perf/w unless nVidia found additional architecture tweaks. Too bad they didn't announce a release date for consumer versions.
 
Depends on clocks but it's almost guaranteed to be worse perf/w unless nVidia found additional architecture tweaks. Too bad they didn't announce a release date for consumer versions.
I'm not sure it will be worse perf/w. Sure at the same clocks it would be. But chances are clocks (and hence more important voltage) will be lower which can improve things a lot. So might be something like 50% faster with 50% higher power draw in the end.
 
Does that mean DP units can be used as additional SP units? 2880 + 960(or even x2)
No. First, nV would have told us so and would advertize the higher speed. They explicitly say, that the co-issued instructions (to a DP instruction) cannot read from the register file. I imagine there can't be too many instructions of this type. :LOL:
And as a second point: the actual implementation will probably look differently than what nvidia sketched.
 
I wonder if it will be substantially faster than GK104 in games?

Well, unfortunately they will release GeForces with GK110 only in 2013.

GK110 will not appear in Quadro or GeForce products this year. It is expected that NVIDIA will fight AMD’s new Graphics Core Next based FirePro at SIGGRAPH 2012 (early August 2012), with GK107 and GK104 based parts, while professionals will probably have to wait for GK110-based Quadro until April 2013 (NAB 2013). GeForce gamers should get their hands on GeForce GTX 700 Series sooner, though. Did we just mention GTX 780? Dang...

Read more: http://vr-zone.com/articles/nvidia-...rce-and-quadro-cards/15884.html#ixzz1v6ePIBpA

But, does it mean we will see GK114 as GTX 760 while a little bit later GK110 as GTX 780?

2880 CUDA Cores
15 SMX Clusters
384-bit Memory Controller
Up to 24GB of GDDR5 memory
2nd Gen ECC
Hardware GPU Silicon Virtualization
Hyper-Q (Slashes CPU idle time by allowing multiple CPU cores to simultaneously utilize a single Kepler GPU, dramatically advancing programmability and efficiency)
Dynamic Parallelism (Simplifies GPU programming by allowing programmers to easily accelerate all parallel nested loops – resulting in a GPU dynamically spawning new threads on its own without going back to the CPU)
50-85% Double Precision Rate to Single Precision
At least 1.5 TFLOPS DP FP64
Target: 250 GB/s bandwidth
 
Back
Top