AWS HPC Blog
Tag: ML
Automotive component design at Nifco using generative AI and diffusion models
Combining generative AI with AWS services, Nifco USA is exploring new frontiers in structural design. See how they’re using diffusion models, SageMaker, and Batch to create game-changing lightweight auto parts.
Use Terraform to deploy a complete AWS Batch environment on Amazon EKS
Harness the power of AWS Batch on Amazon EKS with this new Terraform blueprint. It provides a complete template to create robust batch processing in the cloud. An easy button you shouldn’t miss.
LLMs: the new frontier in generative agent-based simulation
How can LLMs take agent-based simulation to the next level? Check out our new post on leveraging large language models’ capabilities for more realistic modeling.
Harnessing the power of agent-based modeling for equity market simulation and strategy testing
Financial professionals: Simulate realistic market conditions with Simudyne’s agent-based modeling on AWS and Red Hat OpenShift. Learn how HKEX leverages these insights.
AWS Batch enables near-real-time energy production forecasts using NVIDIA Earth-2
Using AWS Batch and NVIDIA Earth-2, we built a scalable workflow that explores millions of scenarios at a fraction of the cost of traditional methods. This innovative approach not only provides rapid energy calculations, but also shows the potential of AI-driven meteorology.
Whisper audio transcription powered by AWS Batch and AWS Inferentia
Transcribe audio files at scale for really low cost using Whisper and AWS Batch with Inferentia. Check out this post to deploy a cost-effective solution in minutes!
Guided multi-objective generative AI for drug design
Transforming computer-aided drug design: see how SandboxAQ leverages AWS and generative AI to explore chemical space, more intelligently generating drug candidates.
Implementing e-mail and SMS notifications in AWS ParallelCluster with Slurm
Learn how to configure email and SMS alerts for job events to stay on top of your HPC workloads with AWS ParallelCluster using Slurm.
Gang scheduling pods on Amazon EKS using AWS Batch multi-node processing jobs
AWS Batch multi-node parallel jobs can now run on Amazon EKS to provide gang scheduling of pods across nodes for large scale distributed computing like ML model training. More details here.
Large scale training with NVIDIA NeMo Megatron on AWS ParallelCluster using P5 instances
Launching distributed GPT training? See how AWS ParallelCluster sets up a fast shared filesystem, SSH keys, host files, and more between nodes. Our guide has the details for creating a Slurm-managed cluster to train NeMo Megatron at scale.