Machine Learning | AWS HPC Blog

A guide to identity management in Research and Engineering Studio on AWS

Check out this new post to learn about identity options for Research and Engineering Studio on AWS. Understanding choices for SAML IdPs and Active Directory will help you plan secure VDI access.

LLMs: the new frontier in generative agent-based simulation

How can LLMs take agent-based simulation to the next level? Check out our new post on leveraging large language models’ capabilities for more realistic modeling.

Harnessing the power of agent-based modeling for equity market simulation and strategy testing

Financial professionals: Simulate realistic market conditions with Simudyne’s agent-based modeling on AWS and Red Hat OpenShift. Learn how HKEX leverages these insights.

Recent improvement to Open MPI AllReduce and the impact to application performance

Our team engineered some Open MPI optimizations for EFA to enhance performance of HPC codes running in the cloud. By improving MPI_AllReduce they improved scaling – matching commercial MPIs. Tests show gains for apps including Code Saturne and OpenFOAM on both Arm64 and x86 instances. Check out how these tweaks can speed up your HPC workloads in the cloud.

Near-real-time energy production forecasts with NVIDIA Earth-2 and AWS Batch

Using AWS Batch and NVIDIA Earth-2, we built a scalable workflow that explores millions of scenarios at a fraction of the cost of traditional methods. This innovative approach not only provides rapid energy calculations, but also shows the potential of AI-driven meteorology.

Whisper audio transcription powered by AWS Batch and AWS Inferentia

Transcribe audio files at scale for really low cost using Whisper and AWS Batch with Inferentia. Check out this post to deploy a cost-effective solution in minutes!

Guided multi-objective generative AI for drug design

Transforming computer-aided drug design: see how SandboxAQ leverages AWS and generative AI to explore chemical space, more intelligently generating drug candidates.

Deploying generative AI applications with NVIDIA NIMs on Amazon EKS

Learn how to deploy AI models at scale with @AWS using NVIDIA’s NIM and Amazon EKS! This step-by-step guide shows you how to create a GPU cluster for inference. Don’t miss part 1 of this 2-part blog series!

Implementing e-mail and SMS notifications in AWS ParallelCluster with Slurm

Learn how to configure email and SMS alerts for job events to stay on top of your HPC workloads with AWS ParallelCluster using Slurm.

Gang scheduling pods on Amazon EKS using AWS Batch multi-node processing jobs

AWS Batch multi-node parallel jobs can now run on Amazon EKS to provide gang scheduling of pods across nodes for large scale distributed computing like ML model training. More details here.

Tag: Machine Learning