Inference
-
Virtualization
AWS Trainium2-powered Amazon EC2 Trn2 Instancess and Trn2 UltraServers | Amazon Web Services
AWS Trainium2-powered Amazon EC2 Trn2 Instances, are the most powerful EC2 instances for deep learning and generative AI and provide up to 30-40% better price performance than the current generation of GPU-based EC2 P5e and P5en instances. For the most demanding, state-of-the-art models that need more compute and memory than a single instance can deliver, Amazon EC2 Trn2 UltraServers are…
Read More » -
Virtualization
Extreme Performance Series 2024: Automated Testing with Virtualized GPUs for ML/AI Workloads
Mark Achtemichuk talks with Venu Tiwari about testing automation put into place at VMware for ML and AI workloads with virtualized GPUs and some the key insights gained from these tests. Links to additional resources: Boost Throughput With Scaling VMs While Keeping the GPUs to a Minimum – VMware vSphere 8 Performance is in the Goldilocks Zone for AI/ML Training…
Read More »