Monday, March 2, 2026

NVIDIA and AWS Broaden Full-Stack Partnership

At AWS re:Invent, NVIDIA and Amazon Net Companies expanded their strategic collaboration with new know-how integrations throughout interconnect know-how, cloud infrastructure, open fashions and bodily AI.

As a part of this enlargement, AWS will assist NVIDIA NVLink Fusion — a platform for {custom} AI infrastructure — for deploying its custom-designed silicon, together with next-generation Trainium4 chips for inference and agentic AI mannequin coaching, Graviton CPUs for a broad vary of workloads and the Nitro System virtualization infrastructure.

Utilizing NVIDIA NVLink Fusion, AWS will mix NVIDIA NVLink scale-up interconnect and the NVIDIA MGX rack structure with AWS {custom} silicon to extend efficiency and speed up time to marketplace for its next-generation cloud-scale AI capabilities.

AWS is designing Trainium4 to combine with NVLink and NVIDIA MGX, the primary of a multigenerational collaboration between NVIDIA and AWS for NVLink Fusion.

AWS has already deployed MGX racks at scale with NVIDIA GPUs. Integrating NVLink Fusion will permit AWS to additional simplify deployment and techniques administration throughout its platforms.

AWS also can harness the NVLink Fusion provider ecosystem, which supplies all of the elements required for full rack-scale deployment, from the rack and chassis, to power-delivery and cooling techniques.

By supporting AWS’s Elastic Cloth Adapter and Nitro System, the NVIDIA Vera Rubin structure on AWS will give prospects strong networking decisions whereas sustaining full compatibility with AWS’s cloud infrastructure and accelerating new AI service rollout.

“GPU compute demand is skyrocketing — extra compute makes smarter AI, smarter AI drives broader use and broader use creates demand for much more compute. The virtuous cycle of AI has arrived,” stated Jensen Huang, founder and CEO of NVIDIA. “With NVIDIA NVLink Fusion coming to AWS Trainium4, we’re unifying our scale-up structure with AWS’s {custom} silicon to construct a brand new era of accelerated platforms. Collectively, NVIDIA and AWS are creating the compute material for the AI industrial revolution — bringing superior AI to each firm, in each nation, and accelerating the world’s path to intelligence.”

“AWS and NVIDIA have labored facet by facet for greater than 15 years, and at present marks a brand new milestone in that journey,” stated Matt Garman, CEO of AWS. “With NVIDIA, we’re advancing our large-scale AI infrastructure to ship prospects the best efficiency, effectivity and scalability. The upcoming assist of NVIDIA NVLink Fusion in AWS Trainium4, Graviton and the Nitro System will carry new capabilities to prospects to allow them to innovate sooner than ever earlier than.”

Convergence of Scale and Sovereignty

AWS has expanded its accelerated computing portfolio with the NVIDIA Blackwell structure, together with NVIDIA HGX B300 and NVIDIA GB300 NVL72 GPUs, giving prospects rapid entry to the {industry}’s most superior GPUs for coaching and inference. Availability of NVIDIA RTX PRO 6000 Blackwell Server Version GPUs, designed for visible functions, on AWS is anticipated within the coming weeks.

These GPUs kind a part of the AWS infrastructure spine powering AWS AI Factories, a brand new AI cloud providing that can present prospects all over the world with the devoted infrastructure they should harness superior AI companies and capabilities in their very own information facilities, operated by AWS, whereas additionally letting prospects keep management of their information and adjust to native laws.

NVIDIA and AWS are committing to deploy sovereign AI clouds globally and convey the perfect of AI innovation to the world. With the launch of AWS AI Factories, the businesses are offering safe, sovereign AI infrastructure to ship unprecedented computing capabilities for organizations all over the world whereas assembly more and more rigorous sovereign AI necessities.

For public sector organizations, AWS AI Factories will rework the federal supercomputing and AI panorama. AWS AI Factories prospects will be capable to seamlessly combine AWS’s industry-leading cloud infrastructure and companies — identified for its reliability, safety and scalability — with NVIDIA Blackwell GPUs and the full-stack NVIDIA accelerated computing platform, together with NVIDIA Spectrum-X Ethernet switches.

The unified structure will guarantee prospects can entry superior AI companies and capabilities, in addition to prepare and deploy huge fashions, whereas sustaining absolute management of proprietary information and full compliance with native regulatory frameworks.

NVIDIA Nemotron Integration With Amazon Bedrock Expands Software program Optimizations 

Past {hardware}, the partnership expands integration of NVIDIA’s software program stack with the AWS AI ecosystem. NVIDIA Nemotron open fashions at the moment are built-in with Amazon Bedrock, enabling prospects to construct generative AI functions and brokers at manufacturing scale. Builders can entry Nemotron Nano 2 and Nemotron Nano 2 VL to construct specialised agentic AI functions that course of textual content, code, photos and video with excessive effectivity and accuracy.

The mixing makes high-performance, open NVIDIA fashions immediately accessible by way of Amazon Bedrock’s serverless platform the place prospects can depend on confirmed scalability and nil infrastructure administration. Business leaders CrowdStrike and BridgeWise are the primary to make use of the service to deploy specialised AI brokers.

NVIDIA Software program on AWS Simplifies Developer Expertise

NVIDIA and AWS are additionally co-engineering on the software program layer to speed up the information spine of each enterprise. Amazon OpenSearch Service now gives serverless GPU acceleration for vector index constructing, powered by NVIDIA cuVS, an open-source library for GPU-accelerated vector search and information clustering. This milestone represents a elementary shift to utilizing GPUs for unstructured information processing, with early adopters seeing as much as 10x sooner vector indexing at 1 / 4 of the associated fee.

These dramatic positive factors cut back search latency, speed up writes and unlock sooner productiveness for dynamic AI strategies like retrieval-augmented era by delivering the correct quantity of GPU energy exactly when it’s wanted. AWS is the primary main cloud supplier to supply serverless vector indexing with NVIDIA GPUs.

Manufacturing-ready AI brokers require efficiency visibility, optimization and scalable infrastructure. By combining Strands Brokers for agent improvement and orchestration, the NVIDIA NeMo Agent Toolkit for deep profiling and efficiency tuning, and Amazon Bedrock AgentCore for safe, scalable agent infrastructure, organizations can empower builders with a whole, predictable path from prototype to manufacturing.

This expanded assist builds on AWS’s present integrations with NVIDIA applied sciences — together with NVIDIA NIM microservices and frameworks like NVIDIA Riva and NVIDIA BioNeMo, in addition to mannequin improvement instruments built-in with Amazon SageMaker and Amazon Bedrock — that allow organizations to deploy agentic AI, speech AI and scientific functions sooner than ever.

Accelerating Bodily AI With AWS

Creating bodily AI calls for high-quality and various datasets for coaching robotic fashions, in addition to frameworks for testing and validation in simulation earlier than real-world deployment.

NVIDIA Cosmos world basis fashions (WFMs) at the moment are accessible as NVIDIA NIM microservices on Amazon EKS, enabling real-time robotics management and simulation workloads with seamless reliability and cloud-native effectivity. For batch-based duties and offline workloads equivalent to large-scale artificial information era, Cosmos WFMs are additionally accessible on AWS Batch as containers.

Cosmos-generated world states can then be used to coach and validate robots utilizing open-source simulation and studying frameworks equivalent to NVIDIA Isaac Sim and Isaac Lab.

Main robotics corporations equivalent to Agility Robotics, Agile Robots, ANYbotics, Diligent Robotics, Dyna Robotics, Subject AI, Haply Robotics, Lightwheel, RIVR and Skild AI are utilizing the NVIDIA Isaac platform with AWS to be used circumstances starting from amassing, storing and processing robot-generated information to coaching and simulation for scaling robotics improvement.

Sustained Collaboration

Underscoring years of continued collaboration, NVIDIA earned the AWS World GenAI Infrastructure and Information Companion of the Yr award, which acknowledges high know-how companions with the Generative AI Competency that assist vector embeddings, information storage and administration or artificial information era in a number of varieties and codecs.

Study extra about NVIDIA and AWS’s collaboration and be a part of periods at AWS re:Invent, operating via Friday, Dec. 5, in Las Vegas.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles