NVIDIA Tegra Architecture

AlexV · Oct 26, 2014

extrajudicial, I think you have the wrong forum. You need to improve your contributions massively, otherwise I fear that your time in these lands will be brief.

silent_guy · Oct 26, 2014

Here's one result I came up with Googling around for the register reuse cache: A compile-time managed multi-level register file hierarchy, by Nvidia's Bill Dally. It's only a citation, unfortunately, but it describes exactly the kind of configuration that the Maxas guy uncovered, and it was published in 2012, which should be around the time Maxwell was in its architecture definition stage.

The paper claims a reduced register file energy usage by up to 54%.

A1xLLcqAgt0qc2RyMz0y · Oct 26, 2014

silent_guy said:
Here's one result I came up with Googling around for the register reuse cache: A compile-time managed multi-level register file hierarchy, by Nvidia's Bill Dally. It's only a citation, unfortunately, but it describes exactly the kind of configuration that the Maxas guy uncovered, and it was published in 2012, which should be around the time Maxwell was in its architecture definition stage.

The paper claims a reduced register file energy usage by up to 54%.

The link you provided just times out.

Here is a link to Nvidia's paper on the subject:

https://research.nvidia.com/publication/compile-time-managed-multi-level-register-file-hierarchy

extrajudicial · Oct 26, 2014

ninelven said:
For no credible reason (you have yet to provide one anyway).

Indeed, it is not magic; it is quality engineering.

You have provided zero actual evidence that this is the case.

You have also provided zero evidence that Maxwell is less efficient for compute workloads.

I meant to say I'm NOT discounting its efficiency, why can't I edit my posts?

And I did provide evidence which you ignored. Please explain why compute loads make maxwell only marginally (>10%) more efficient during compute. If it's not power gating as anandtech said then what? You criticize my explanation and provide none of your own besides saying "it's in the architecture"

I'm appalled to see people putting their own personal favor and letting it trump evidence and common sense. If maxwell is so power efficient why can't the nexus 9 manage 10 hours battery life? That's hardly impressive.

ninelven · Oct 26, 2014

extrajudicial said:
Please explain why compute loads make maxwell only marginally (>10%) more efficient during compute

Again, you have not yet provided any evidence that this is the case. Furmark power consumption is not evidence. If you don't understand why it is not evidence, I am not sure myself or anyone else here can help you. Indeed, the only actual compute benchmarks in this thread, that I am aware of, show Maxwell to be considerably more efficient than Kepler in compute workloads, sometimes over 100%+.

extrajudicial said:
I'm appalled to see people putting their own personal favor and letting it trump evidence and common sense.

The only person here I see ignoring evidence is you.

extrajudicial said:
If maxwell is so power efficient why can't the nexus 9 manage 10 hours battery life?

The Tegra K1 in the Nexus 9 is based on Kepler, not Maxwell.

silent_guy · Oct 26, 2014

extrajudicial said:
If it's not power gating as anandtech said then what? You criticize my explanation and provide none of your own besides saying "it's in the architecture"

And again you ignore my post completely.

If maxwell is so power efficient why can't the nexus 9 manage 10 hours battery life? That's hardly impressive.

Because Nexus 9 uses Kepler, not Maxwell?

RecessionCone · Oct 26, 2014

extrajudicial said:
If maxwell is so power efficient why can't the nexus 9 manage 10 hours battery life? That's hardly impressive.

The battery life tests that show ~10 hours aren't using the GPU anyway. Those tests are measuring the efficiency of the screen and the ability of the SoC to shut itself off when nothing is happening.

silent_guy · Oct 26, 2014

A1xLLcqAgt0qc2RyMz0y said:
The link you provided just times out.

Here is a link to Nvidia's paper on the subject:

https://research.nvidia.com/publication/compile-time-managed-multi-level-register-file-hierarchy

Thanks! Very interesting paper.

sebbbi · Oct 26, 2014

ninelven said:
The Tegra K1 in the Nexus 9 is based on Kepler, not Maxwell.

Yes. K1 = Kepler, 1 SMX. There is no Maxwell based mobile SOC yet.

dogen · Oct 27, 2014

extrajudicial said:
Please explain why compute loads make maxwell only marginally (>10%) more efficient during compute. If it's not power gating as anandtech said then what?

Do you know what efficiency means?

How is it only "10% more efficient" if it's doing almost twice as much work?

ams · Oct 30, 2014

There are now many Geekbench 3 AArch32 results for Nexus 9:

http://browser.primatelabs.com/geekbench3/search?q=htc+nexus+9

The single-core score is as high as ~ 2000 while the dual-core score is as high as ~ 3300-3500! That is incredibly good considering these are 32-bit results.

swaaye · Oct 30, 2014

How do you guys think this chip fares against Baytrail on the CPU front?

Laurent06 · Oct 30, 2014

swaaye said:
How do you guys think this chip fares against Baytrail on the CPU front?

It's more than twice faster than BT on Geekbench: http://browser.primatelabs.com/geekbench3/compare/1142302?baseline=620722

BT is slower than many chips such as Cortex-A15 and A17.

Blazkowicz · Oct 30, 2014

I wonder if that thing is benchmarking the hardware encryption accelerator?

At that point in time I think I'm tired of micro-benchmarks, even if they "improved".

Year 2000 : dhrystones, raw MIPS and the like are dead, let's switch to application benchmarks instead!
Year 2014 : here's a multi-platform collection of ~40 synthetic benchmarks with autogenerated web pages.

It leaves me wondering if the bench is about what fits in L1, or L2.

Laurent06 · Oct 31, 2014

Blazkowicz said:
I wonder if that thing is benchmarking the hardware encryption accelerator?

There are indeed too many benchmarks that are broken due to dedicated instructions. But having studied Geekbench code, it's not that bad as long as you don't forget it's a smallish benchmark (though not really a micro-benchmark such as dhrystone or coremark).

At that point in time I think I'm tired of micro-benchmarks, even if they "improved".

Year 2000 : dhrystones, raw MIPS and the like are dead, let's switch to application benchmarks instead!
Year 2014 : here's a multi-platform collection of ~40 synthetic benchmarks with autogenerated web pages.

It leaves me wondering if the bench is about what fits in L1, or L2.

What is the alternative? SPEC has been mostly broken by compilers and autopar, and anyway it can't be run on most smaller devices (it requires 2GB). Javascript benchmarks are somewhat interesting but results vary a lot depending on the browser. So what non micro-benchmark do you propose that is available on many platforms?

Blazkowicz · Oct 31, 2014

Indeed I was venting a bit about the benchmarks, at least it's nice to have all these individual results though it's hard to know which ones are the more useful/balanced ones.

Running something under desktop linux (debian, Ubuntu etc.) or maybe some mobile linux like Mer ought to be a solution, at least for comparing Tegra K1 and Atom. Mullins, too.
Sure, it shifts the problem, you won't be able to run abritrary OS on that many devices, or it may come later while the device is "Android first".

I'll be curious about Android L, will it make easy to run a linux container on arbitrary (but rooted) phone/tablet hardware, with some simple linux distro in it? (even text mode allows to crunch numbers and run various stuff fine). That might be possible sometimes but I wonder what performance and features are there on arbitrary or vanilla Android L (going off-topic here. that may be something desirable to do on Denver and tablet x86, let's say)

dogen · Oct 31, 2014

ams said:
There are now many Geekbench 3 AArch32 results for Nexus 9:

http://browser.primatelabs.com/geekbench3/search?q=htc+nexus+9

The single-core score is as high as ~ 2000 while the dual-core score is as high as ~ 3300-3500! That is incredibly good considering these are 32-bit results.

Can't wait to see how it will handle dolphin.

ams · Oct 31, 2014

Google Nexus 9 is able to match the performance of Shield tablet in GFXBench 3.0 Manhattan Off-screen test, and is only 1 fps behind iPad Air 2 in this test: https://gfxbench.com/device.jsp?benchmark=gfx30&os=Android&api=gl&D=Google Nexus 9

Ailuros · Nov 1, 2014

While using a newer driver which sets TRex scores by quite a bit lower, while keeping Manhattan scores untouched. There's a reason why I keep saying that long time performance should include both TRex & Manhattan tests.

By the way 28 posts ago: http://forum.beyond3d.com/showpost.php?p=1883304&postcount=3092

Nebuchadnezzar · Nov 1, 2014

Seems Denver isn't exactly power efficient. Also 26W EDP on the N9.

NVIDIA Tegra Architecture

AlexV

Heteroscedasticitate

silent_guy

A1xLLcqAgt0qc2RyMz0y

extrajudicial

ninelven

PM

silent_guy

RecessionCone

silent_guy

sebbbi

dogen

ams

swaaye

Entirely Suboptimal

Laurent06

Blazkowicz

Laurent06

Blazkowicz

dogen

ams

Ailuros

Epsilon plus three

Nebuchadnezzar

Similar threads