Nvidia Shows Signs in [2022]

PSman1700 · Oct 5, 2022

DavidGraham said:
RTX 3060 now holds the number one spot in the Steam Survey (desktop + laptop variant).

Steam Hardware & Software Survey

Thats quite impressive, the 3060 is not a slouch at all. In special considering that thing sports 12GB framebuffer.

digitalwanderer · Oct 5, 2022

DavidGraham said:
RTX 3060 now holds the number one spot in the Steam Survey (desktop + laptop variant).

Steam Hardware & Software Survey

Wot? It's not one of their uber high end models?!?

PSman1700 · Oct 5, 2022

digitalwanderer said:
Wot? It's not one of their uber high end models?!?

Some are under the impression there only exists 3090Ti and up (4080 12G and up).

Kaotik · Oct 14, 2022

Since there's no really appropriate thread anymore (just 4090 review thread) I suppose this is the closest match

NVIDIA "unlaunches" RTX 4080 12 GB due the name

Unlaunching The 12GB 4080

16GB GeForce RTX 4080 on track to delight gamers everywhere November 16th.

www.nvidia.com

JoeJ · Oct 14, 2022

Kaotik said:
NVIDIA "unlaunches" RTX 4080 12 GB due the name

If it was due to the name, they would rename it, not cancel it?
So would could be the real reason?

Edit: I found it. The guy running DSOG called them out on phone and said he thinks the name is confusing, so they cancelled it because they are so nice. At least that's what he thinks.

Well, fine. So we have one option less.
And there is enough room for appropriate 3070 pricing :O

Seriously, maybe 3070 gets 16GB, but earlier was planned to have less. So 3080/12 is redundant.
Kudos to Intel! ; )

Kaotik · Oct 14, 2022

JoeJ said:
If it was due to the name, they would rename it, not cancel it?
So would could be the real reason?

It takes time to rename something when you've made tons of cards with old names. If it was just stickers it would be quick, but they need to reflash every card too.
At least I got the impression it will be relaunched under different model number later (likely 4070 or 4070 Ti)

digitalwanderer · Oct 15, 2022

Notice how the 12GB 4080 is the only card that nVidia isn't making a founders edition of? So this is only hitting AIBs hard on the chin, nVidia will only lose what it chooses to in "subsidizing the lost boxes" for the AIBs.

Total scum move.

DavidGraham · Oct 19, 2022

Oracle buys tens of thousands of A100s and H100s, in a deal that is worth hundreds of millions.

Oracle Buys Tens of Thousands of Nvidia A100, H100 GPUs

All of Nvidia's AI platforms are now available in Oracle Cloud.

www.tomshardware.com

dorf · Oct 20, 2022

Nvidia's phrasing is "tens of thousands NVIDIA GPUs including A100 and upcoming H100 accelerators". So unless Tom's hardware has some further information on this, it's not necessarily tens of thousands of A100 and H100 as some of the hardware could be something lower end.

NVIDIA, Oracle CEOs in Fireside Chat Light Pathways to Enterprise AI

In a fireside chat, Safra Catz and Jensen Huang discussed their expanding collaboration to speed adoption of enterprise AI.

blogs.nvidia.com

If it truly is tens of thousands of those highest end accelerators it would be a huge amount. For context:

Kaotik · Oct 20, 2022

dorf said:
Nvidia's phrasing is "tens of thousands NVIDIA GPUs including A100 and upcoming H100 accelerators". So unless Tom's hardware has some further information on this, it's not necessarily tens of thousands of A100 and H100 as some of the hardware could be something lower end.

NVIDIA, Oracle CEOs in Fireside Chat Light Pathways to Enterprise AI

In a fireside chat, Safra Catz and Jensen Huang discussed their expanding collaboration to speed adoption of enterprise AI.

blogs.nvidia.com

If it truly is tens of thousands of those highest end accelerators it would be a huge amount. For context:

Tom's article is just plain wrong. Both NVIDIA and Oracle say "tens of thousands GPUs including A100 and H100", not "tens of thousands of A100 and H100 GPUs"

https://www.oracle.com/news/announcement/ocw-oracle-and-nvidia-partner-to-speed-ai-adoption-2022-10-18/

Deleted member 2197 · Oct 20, 2022

https://www.hpcwire.com/2022/10/18/oracle-providing-a-ground-to-fuel-nvidias-subscription-revenue/

The partnership, which builds on earlier deployments, sets up Nvidia with the kind of infrastructure it requires to expand on a long-term goal to become a software powerhouse. It also gives Oracle’s cloud service the plug-and-play hardware capacity and software framework to easily deploy AI software.
...
The companies are “looking at the full stack so not just the GPUs and infrastructure but getting into the software layer, getting into the service layer,” Leung said.

Nvidia is known as a graphics chip company, but is betting its future on generating more revenue from software and services. The company is looking at a Netflix style subscription business model and charging customers when its software and hardware are used to create products.
...
The AI Enterprise offerings from Nvidia have so far been limited to a handful of virtual machine interfaces on Google Cloud, Microsoft Azure and Amazon Web Services, which have their own AI software offerings that are largely based on open-source tools. But Nvidia has found a full-stack partner in Oracle, which is willing to take on the graphics chip maker’s proprietary software stack for its cloud service.
...
Oracle customers can currently get clusters of 512 GPUs, and is adding tens of thousands of GPU capacity, Leung said. The GPUs and AI Enterprise software stack will sit on top of the core Oracle Cloud infrastructure, which includes bare metal compute, storage and networking hardware.
...
Nvidia’s aiming to provide software services as a subscription model, and the chipmaker declined to comment on whether it’ll get a cut from GPU instances on the Oracle Cloud.

DavidGraham · Oct 20, 2022

Kaotik said:
Tom's article is just plain wrong. Both NVIDIA and Oracle say "tens of thousands GPUs including A100 and H100", not "tens of thousands of A100 and H100 GPUs"

They are not that wrong, the only AI data center GPU missing from that statement is the A30, so the correct statement should be A30, A100 and H100. What a massive difference!

The other data center GPUs, the A40, A10, A16 and A2 are based on the GA102 and GA107 dies, and they are for visual computing and video processing not AI.

Kaotik · Oct 20, 2022

DavidGraham said:
They are not that wrong, the only AI data center GPU missing from that statement is the A30, so the correct statement should be A30, A100 and H100. What a massive difference!

The other data center GPUs, the A40, A10, A16 and A2 are based on the GA102 and GA107 dies, and they are for visual computing and video processing not AI.

It's not just for AI, it's "Accelerated computing and AI". Cloud services offer usually wide variety of different hardware configurations for their clients to serve different needs efficiently (even in this day of GPUs supporting several clients at once), why would Oracle be any different?

DavidGraham · Oct 20, 2022

Kaotik said:
It's not just for AI, it's "Accelerated computing and AI". Cloud services offer usually wide variety of different hardware configurations for their clients to serve different needs efficiently (even in this day of GPUs supporting several clients at once), why would Oracle be any different?

Their main press release focused on AI, they integrated the whole NVIDIA AI software stack, AI hardware platforms, the whole sha·bang, I don't see any mention of visual computing and video processing in this release, and even if it was, these products are low volume anyway. The bulk of GPUs is most definitely going to be the A30, A100 and H100, you know the products directly aimed at serving AI. They even stated they are using A100 for data processing.

With the full NVIDIA AI platforms available on OCI instances, the extended partnership is designed to accelerate AI-powered innovation for a broad range of industries to better serve customers

Data processing is one of the top cloud computing workloads. To support this demand, OCI Data Science plans to offer support for OCI bare metal shapes, including BM.GPU.GM4.8 with NVIDIA A100 Tensor Core GPUs across managed notebook sessions, jobs, and model deployment.

Deleted member 2197 · Oct 20, 2022

Oracle Takes The Whole Nvidia AI Stack For Its Cloud

The top hyperscalers and clouds are rich enough to build out infrastructure on a global scale and create just about any kind of platform they feel like.

www.nextplatform.com

Oracle is adding tens of thousands of Nvidia “Ampere” A100 and “Hopper” H100 GPU accelerators to its infrastructure and is also licensing the complete Nvidia AI Enterprise stack so its database and application customers – and there are a lot of them as you see above – can seamlessly access AI training and inference if they move their applications to OCI. At the moment OCI GPU clusters still top out at 512 GPUs, according to Leo Leung, who is vice president of products and strategy for OCI and who has very long experience in cloud stuff with Oracle, Oxygen Cloud, Scality, and EMC.

To us, it looks like Oracle is adding capacity, meaning more GPU clusters, not scaling out its GPU clusters to have thousands or tens of thousands of GPUs in a single instance for running absurdly large workloads with hundreds of billions of parameters. Leung says that the typical OCI customers are still only wrangling tens of billions of parameters, so the scale OCI is offering is probably sufficient.

Leung was mum about when – or if – the NeMo LLM large language model service that Nvidia just announced at the fall GTC 2022 conference last month might be integrated into OCI, but we reckon that Oracle would rather not have a service running on AWS or Google Cloud or Microsoft Azure (where presumably these cloud LLMs run) linked to services running on OCI. And that means Oracle will eventually have to have enough GPU cluster scale to run the NeMo Megatron 530B model internally. That right there is 10,000 or more GPUs. So Oracle saying it is adding “tens of thousands” more GPUs is, well, a good start.

DavidGraham · Oct 21, 2022

Facebook (Meta) to use H100 GPUs for it's Meta AI platforms.

The Iron That Will Drive AI At Meta Platforms

If there is one thing that is consistently true about HPC clusters for the past thirty years and for AI training systems for the past decade, it is as

www.nextplatform.com

JoeJ · Oct 27, 2022

Facebook is such a graveyard not even worth my jokes. What a waste of silicon.

I'm much more excited about the new 3060 models. That's the proper attitude. \

/

DegustatoR · Nov 16, 2022

Nvidia and Microsoft Join Forces to Build AI Supercomputer

The venture aims to make the future tech available for businesses.

www.cnet.com

tunafish · Nov 17, 2022

NVIDIA Announces Financial Results for Third Quarter Fiscal 2023

NVIDIA today reported revenue for the third quarter ended October 30, 2022, of $5.93 billion, down 17% from a year ago and down 12% from the previous quarter.

nvidianews.nvidia.com

Gaming revenue half of same quarter last year, inventory up 100% from last year to $4.5B. But they are saved from posting an outright terrible result by growing data center revenue 31% YoY, to $3.83B, or now 2.4 times as much as their entire gaming revenue. Total revenue is down 17% YoY, income down 72% YoY.

Their balance sheet is worrying, though, and appears to prove MLID right on at least something. They have both $4.5B of inventory, and a total of about ~$4B of pre-paid assets (... where they hid the non-current part under "other non-current assets". Not illegal, but given they have not done that before, also not a good look. Rather drawing the eye where they don't want you to look.) Given how inventory is on balance sheet on basis of cost, not value, they are probably sitting on more than a year's sales worth of inventory + pre-paid fab allocations. And that's at normal sales numbers. If you assume that past quarter was representative of gaming sales for the near future, they will probably still be selling GA102-based cards next summer.

dobwal · Nov 17, 2022

Good. Hopefully I can pick up a 3090 at some insanely cheap price sometime in the future.

Nvidia Shows Signs in [2022]

PSman1700

digitalwanderer

PSman1700

Kaotik

Drunk Member

Unlaunching The 12GB 4080

JoeJ

Kaotik

Drunk Member

digitalwanderer

DavidGraham

Oracle Buys Tens of Thousands of Nvidia A100, H100 GPUs

dorf

NVIDIA, Oracle CEOs in Fireside Chat Light Pathways to Enterprise AI

Kaotik

Drunk Member

NVIDIA, Oracle CEOs in Fireside Chat Light Pathways to Enterprise AI

Deleted member 2197

Guest

DavidGraham

Kaotik

Drunk Member

DavidGraham

Deleted member 2197

Guest

Oracle Takes The Whole Nvidia AI Stack For Its Cloud

DavidGraham

The Iron That Will Drive AI At Meta Platforms

JoeJ

DegustatoR

Nvidia and Microsoft Join Forces to Build AI Supercomputer

tunafish

NVIDIA Announces Financial Results for Third Quarter Fiscal 2023

dobwal

Similar threads