AWS offers more flexible access to Nvidia GPUs for short-duration AI workloads

Running AI and ML workloads got a whole lot cheaper, for some

When you purchase through links on our site, we may earn an affiliate commission.Here’s how it works.

AWS, an already popularcloud computing servicefor developers looking to access the best-performing hardware for AI workloads, has announced a more flexible scheme for shorter-term requirements.

AmazonElastic Compute Cloud (EC2) Capacity Blocks for ML is what Amazon is calling an industry-first, and will allow customers to access GPUs on a consumption-based model.

The Seattle-based cloud giant hopes that more affordable options will provide smaller organizations with greater opportunities, helping to make for a more diverse landscape.

AWS launches short-term consumption-based GPU renting

AWS launches short-term consumption-based GPU renting

In apress release, the company said: “With EC2 Capacity Blocks, customers can reserve hundreds ofNvidiaGPUs colocated in Amazon EC2 UltraClusters designed for high-performance ML workloads.”

Customers can get access to the latest Nvidia H100 Tensor Core GPUs, which are suited to training foundation models and large language models, by specifying cluster size and duration, meaning they only pay for what they need.

Amazon noted that demand for GPUs is fast outpacing supply as more businesses get to grips with generative AI, and many will either find themselves paying for an excessive service or having GPUs sitting dormant when they’re not in use – or worse still, both.

AWS users can reserve EC2 UltraClusters of P5 instances for between 1-14 days, and up to eight weeks in advance. They can pick flexible cluster size options, ranging from 1-64 instances, or a maximum of 512 GPUs.

Are you a pro? Subscribe to our newsletter

Are you a pro? Subscribe to our newsletter

Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

AWS Compute and Networking VP David Brown commented: “With Amazon EC2 Capacity Blocks, we are adding a new way for enterprises and startups to predictably acquire Nvidia GPU capacity to build, train, and deploy their generative AI applications – without making long-term capital commitments. It’s one of the latest ways AWS is innovating to broaden access to generative AI capabilities.”

Pricing for the service can be found on theAWS website, where prospective users can also sign up to use the short-term, affordable option.

More from TechRadar Pro

With several years’ experience freelancing in tech and automotive circles, Craig’s specific interests lie in technology that is designed to better our lives, including AI and ML, productivity aids, and smart fitness. He is also passionate about cars and the decarbonisation of personal transportation. As an avid bargain-hunter, you can be sure that any deal Craig finds is top value!

Cisco issues patch to fix serious flaw allowing possible industrial systems takeover

7 myths about email security everyone should stop believing

Another reason to avoid edge-lit 4K TVs: they may fail faster than others, according to this report