AWS Cloud Operations Blog
Tag: Amazon Elastic Container Service
Gain operational insights for NVIDIA GPU workloads using Amazon CloudWatch Container Insights
As machine learning models grow more advanced, they require extensive computing power to train efficiently. Many organizations are turning to GPU-accelerated Kubernetes clusters for both model training and online inference. However, properly monitoring GPU usage is critical for machine learning engineers and cluster administrators to understand model performance and to optimize infrastructure utilization. Without visibility […]
Accelerate End-to-End Application Modernization with AWS App2Container and AWS Migration Hub Refactor Spaces
This blog post was written with contributions from Gaurav Parashar who is prior AWS Customers often have challenges accelerating the modernization of their applications. The complexity of refactoring a monolith application often provides hurdles in depth of expertise, time and effort. In this blog, we will explore two mechanisms that can help you accelerate your […]
Application configuration deployment to container workloads using AWS AppConfig
UPDATE (15 Dec 22): AWS AppConfig released an Agent for containers (EKS, ECS, Docker, Kubernetes) in December 2022, which makes calling AppConfig much simpler from containerized applications. We recommend using the AppConfig Agent for containers instead of the method below. Read the Agent documentation. AWS AppConfig is a capability of AWS Systems Manager that you […]
Introducing Amazon CloudWatch Container Insights for Amazon ECS
Amazon Elastic Container Service (Amazon ECS) lets you monitor resources using Amazon CloudWatch, a service that provides metrics for CPU and memory reservation and cluster and services utilization. In the past, you had to enable custom monitoring of services and tasks. Now, you can monitor, troubleshoot, and set alarms for all your Amazon ECS resources using […]