Looking to solidify its position as the dominant global supplier of chips that support generative AI workloads, Nvidia announced new GPUs and servers, as well as a range of new software offerings, at the SIGGRAPH conference in Los Angeles this week.
On the hardware side, Nvidia announced a new line of servers, the OVX series, designed to use up to eight of the company's L40S GPUs. The GPUs are based on the company's Ada Lovelace architecture, which succeeded Ampere as the microarchitecture used in its main line of graphics cards. Each L40S packs 48GB of memory and is designed with complex AI workloads in mind, boasting 1.45 petaflops of tensor processing power.
It's an approach similar to the one Nvidia has taken in the past with its consumer graphics card designs: the company plans to sell some OVX servers directly and offer others as reference designs, with outside manufacturers (in this case, Dell, ASUS, Gigabyte, HPE, Lenovo, QCT and Supermicro) serving as global system builders. The L40S will become available in the fall, and the company said that OVX systems will go on sale soon after.
As part of an upgrade to its AI Enterprise software line, Nvidia also released a new product called AI Workbench, which is designed to be a sort of self-assembly kit for AI developers. The system comes with pretrained models and an array of tools that can be used to customise them, with the idea of saving considerable development time.
Nvidia also announced numerous features designed to add generative AI capabilities to its other product lines, including an AI developer “co-pilot” for its Omniverse 3D imaging software.
How Nvidia targets different sets of users
Many of the company’s newest AI-related releases are targeted at different users — including cloud service providers, developers, and server makers. That’s a key part of Nvidia’s strategy, according to Shane Rau, research vice president at IDC.
“If the end customer’s a cloud service provider, they may just want, say, a server GPU board,” he said. “Some customers would like to buy the Nvidia silicon but also buy the whole system around it — LVX, OVX, and so on. Then maybe the next level is you buy the hardware but maybe you also need some training.”
Another important strategic point, according to Rau, is Nvidia's flexibility. That flexibility dates back as far as 2012, when the company released its first server GPUs alongside the CUDA developer environment, which allowed them to be reprogrammed and optimised for different tasks, and it has continued with the various AI-related pieces of software Nvidia has released since. The only place, in fact, where the company tends to stop offering solutions is where it would encroach directly on an end user's own domain.
“AI can be very end-user specific,” Rau said. “Usually the end user brings in their own expertise — agriculture, financial analysis, and so on. So Nvidia wants to bring the level of solution that you’re willing to invest in all the way up to your specific domain, but you provide the specific expertise.”
It’s been a highly successful strategy for the company in the AI market, Rau added, given that Nvidia is the largest provider of silicon for AI use by some distance.
“I’d say this was always in the cards for them,” he said.