Amazon Web Services (AWS) has launched a new Amazon EC2 P5 instance powered by NVIDIA H100 Tensor Core GPUs, allowing users to scale generative AI, high performance computing (HPC) and other applications with a click from a browser. The NVIDIA H100 GPU offers supercomputing-class performance with fourth-generation Tensor Cores, a new Transformer Engine for accelerating LLMs, and the latest NVLink technology. P5 instances are ideal for training and running inference for increasingly complex LLMs and computer vision models, and can be deployed in EC2 UltraClusters for petabit-scale non-blocking networks. NVIDIA AI Enterprise helps users make the most of P5 instances with a full-stack suite of tools.
