So is 640 CCs double confirmed now?
If it is, can we stop the babbling about GM107 being a Kepler derivative? Is the change in the number of CUDA cores per SMX enough to call it a Maxwell now?
So is 640 CCs double confirmed now?
First time we get to see the reference PCB without the cooler.
I think he's mocking the guy that claims 640, it doesn't really make much difference honestly, once the final product slides are released, we'll know the CUDA cores count.
If it is, can we stop the babbling about GM107 being a Kepler derivative? Is the change in the number of CUDA cores per SMX enough to call it a Maxwell now?
Probably not, after all, the GF100 and GF104 have different numbers of CCs per SM.If it is, can we stop the babbling about GM107 being a Kepler derivative? Is the change in the number of CUDA cores per SMX enough to call it a Maxwell now?
960 vs 640SPs don't make much difference?
Exactly why I consider a 640 config heaven sent; now we'll of course see theories in the wild that they cut out SPs last minute
Probably not, after all, the GF100 and GF104 have different numbers of CCs per SM.
Look carefully at the TK1 GPU specs and compare to the purported 750 Ti specs. The GPU clock operating frequencies and memory speeds are ~ 25% higher on the latter, which gives 5x more pixel fillrate and 5x more mem. bandwidth. The TK1 GPU has up to 952MHz clock operating frequency and up to 17 GB/s mem. bandwidth.
ams;1826766" said:Through and through" Maxwell would be unified virtual memory, improved IQ, built-in ARM cores, and 20nm or better fab. process, and 750 Ti is most definitely not that.
ams;1826766" said:The TK1 GPU is most certainly not stripped to the bone either, but rather is a true Kepler GPU in any way/shape/form (even if TMU count per SMX and ROP count per 32-bit mem. channel is halved), and unquestionably moves the bar forward with respect to perf. per watt vs. prior Kepler GPU's. GM107 obviously uses experiential, process, and architectural improvements over the last two years to significantly improve on GK107 (and make no mistake, using a mobile first approach is a significant architectural difference here).
NVIDIA did some work to make Kepler suitable for low power, but it's my understanding that the underlying architecture isn't vastly different from what we have in notebooks and desktops today. Mobile Kepler retains all of the graphics features as its bigger counterparts, although I'm guessing things like FP64 CUDA cores are gone.
GM107 and GM108 are no Keplers. If they have compute capability 5.0 there are probably some radical changes (4.x is missing) in the architecture.True, but then again all Kepler had this number constant throughout the family including GK208A. Why would they changed it just for one chip?
Also, PedantOne in the XS thread has posted some purported pictures including one showing Hynix H5GC4H24MFR-T2C memory. According to this data sheet the memory could be rated 6.0 Gbps or 5.0 Gbps, which doesn't really tell us much (but is good to know).
The "roadmap feature" of Kepler, dynamic parallelism, didn't even show up until GK110. I wouldn't be surprised if GM107/GM108 have "significantly" fewer features than the GM200 (the "through and through" Maxwell?) or even the other GM20x chips, but that doesn't make the GM10x chips "just" a Kepler refresh.Nvidia bifurcated Kepler more so than they did with Fermi. GK104 and it's derivatives were more graphics focused than GF104 and it's derivatives and stripped of more HPC-oriented functionality. Consequently, GK104 and GK106 had a die size reduction over their predecessors. GK110 brought new, exclusive compute features with it, and as a result increased in die size over it's predecessor. From what I'm deducing, Nvidia is continuing this strategy. I have no idea if Nvidia plans to implement ARM cores into anything other than the flagship Maxwell die, but I am fairly confident they will continue to bifurcate their product line, like they started doing with Fermi and even more so with Kepler.
GM107 and GM108 are no Keplers. If they have compute capability 5.0 there are probably some radical changes (4.x is missing) in the architecture.
LOL!
Of course 960 vs 640 SPs makes a difference... Given the alleged performance of these 640 SPs, faster than the 768 SPs GTX650Ti, they increased performance per SP. Although not by much? We still need to know how much power consumption it consumes exactly though...
It's just strange that those in the "know" so far didn't notice that there's something different.
Could have sworn that I've read from multiple reputable sites that TK1's GPU is "up to 1ghz." And after you move past the memory bus, which is 1/2 GM107 and not 1/5, memory bandwidth is entirely based on the power constraints of the ram. I think you're trying to create correlations to match up product features, when there really is no real correlation.
Okay it's not stripped to the bone, but it's definitely stripped down.
I'm not saying you are definitley wrong, I just don't think you are right. I'm standing by theory that Nvidia was ready with Maxwell significantly before TSMC was able to deliver 20nm at reasonable costs
the smx structure changes slightly. nv did some optimization that they can use now the dp alu also for sp, it supports now all sp instructions and can be used in parallel. it means an smx looks now to have 256 alu. technically, that reduces maxwell's dp rate to 1:4
Well looks like there is mounting evidence that the smx is considerably different, and configured with 128 CC's instead of 192. I am going with Maxwell.
Since Fermi, all GPUs have 64bit units, just enough to debug and run 64bit code. Since Nvidia wants to sell a lot of Teslas, they made it so you can run and debug 64bit code on GeForce with capped performance of course.
Note that each rearchitected CUDA core for 750 Ti would need to be capable of much more work (at least 50% more) than a Kepler CUDA core...