Google Cloud has announced the launch of new AI-optimized virtual machines, an updated Google Distributed Cloud offering, and an enterprise-grade edition of Google Kubernetes Engine. The new Cloud TPU v5e is said to be the most cost-efficient, versatile and scalable cloud TPU ever devised, providing integration with GKE, Vertex AI and various leading AI frameworks. It is designed for medium and large-scale AI training and inference applications, delivering up to two times faster training performance per dollar and up to 2.5 times the inference performance per dollar for LLMs and generative AI models.
