AWS Cloud Operations Blog
Know Before You Go – AWS re:Invent 2024 Monitoring and Observability
Planning to join us in Las Vegas from Dec 2 to Dec 6 at AWS re:Invent 2024 and looking to learn more about monitoring and observability? If so, this post highlights the Cloud Operations sessions that focus on monitoring and observability at re:Invent 2024!
Monitoring and observability allow you to understand the health of your applications and infrastructure, so you can diagnose issues and optimize your workloads and applications for performance, availability, and security. AWS offers an end-to-end monitoring and observability solution, giving you the ability to understand what is happening across all the layers of your environment, from end-user sessions and application transactions to the underlying infrastructure. This end-to-end solution spans the collection of data, including traces, metrics, and logs, through the processing, contextualizing, and understanding of that data. It also includes artificial intelligence and machine learning to help you predict, prevent, and respond to potential problems. AWS Monitoring and Observability transforms data into actionable insights to help you detect, investigate, and remediate problems faster. This frees up time for your team to dedicate to other workloads and projects!
At re:Invent 2024, we’re excited to bring you a comprehensive lineup of monitoring and observability sessions that will equip you with the latest best practices and innovative capabilities to gain deeper, end-to-end visibility into your applications, modern workloads and environments. Through hands-on workshops and expert-led discussions, you’ll discover how to leverage AWS and open source observability tools and techniques to optimize performance, streamline operations, and unlock powerful insights that drive innovation and elevate customer experiences.
AWS re:Invent offers learning sessions in a variety of formats and levels so that you can expand your knowledge and grow your skills at a pace that is right for you. Levels are indicated by the first digit of the number in the session ID; for example, COP324 is a 300-level session. Learn more about re:Invent session types.
Author’s Pick:
We’ve seen how open source tooling can provide companies with many benefits, such as cost savings, a collaborative community, and portability. As a result, open source tools have become more widely used, including in the observability space. A common question we hear from the customers we work with is, “How do we use open source tooling for observability in AWS?” We’re highlighting this breakout session, which will cover exactly how that can be done in AWS and which services can be used for the setup.
COP324 | Observability, the open source way – Breakout Session
Interested in using open source tooling for observability? Setting up an effective observability solution with open source tools can be challenging due to their rapid pace of development. AWS offers the flexibility to implement observability with AWS managed versions of open source tools like Prometheus, Grafana, and OpenTelemetry. This session shows how managed open source services enable a standardized approach to instrumentation, collection, and analysis. Discover recommended architecture patterns for each observability stage, including instrumentation with OpenTelemetry, ingestion with Amazon Managed Service for Prometheus and Amazon OpenSearch Service, and insights with Amazon Managed Grafana.
Additional monitoring and observability sessions:
COP315 | Accelerate Innovation with AI-Powered Operations – Breakout Session
Tired of spending countless hours firefighting operational issues? Do you wish you had more time to innovate? Discover how AIOps (Artificial Intelligence for IT Operations) can revolutionize your workflows. In this session, explore how AIOps empowers you to streamline operations, automate repetitive tasks, and gain invaluable insights from the chaos. Dive into AWS best practices with Amazon CloudWatch and Amazon DevOps Guru, and learn how to leverage these powerful tools to reclaim your time for driving innovation. Embrace the future of operations and unlock your team’s full potential.
COP320 | Best practices for end-to-end digital experience monitoring – Breakout Session
Gain full-stack visibility into application performance from the internet down to your services. In this breakout session, learn how AWS Digital Experience Monitoring combines network and internet monitoring with user experience tracking to provide an outside-in view across all touchpoints. Understand how monitoring real user interactions, synthetic traffic, internet service provider data, and backend infrastructure metrics reveals a complete picture of frontend performance, user behavior, API efficiency, and potential failure points. See how correlating these insights allows you to turn end-to-end digital experience into actionable KPIs like release velocity, adoption rates, conversions, and availability for enhancing customer experiences.
COP404 | Best practices for generative AI observability – Breakout Session
As generative AI adoption grows, comprehensive observability is crucial for ensuring reliability, transparency, and optimization. We’ll begin by delving into the unique observability challenges of different generative AI patterns, including large language models, retrieval-augmented generation (RAG) architectures, and other emerging approaches. Attendees will learn how to leverage Amazon CloudWatch with a wide range of metrics, logs, and distributed tracing to gain deep visibility into the end-to-end lifecycle of generative AI workloads. Additionally, we’ll discuss the role of LangChain, a powerful framework for building generative AI applications, and how it can be leveraged in conjunction with Amazon Bedrock and Amazon SageMaker to enhance observability across the entire development and deployment pipeline.
COP406 | Byte to insight: Maximize value from your logs with Amazon CloudWatch – Breakout Session
Businesses often struggle with too many logs and not enough actionable insights. In this session, we’ll explore the full log data lifecycle, from ingestion to insights, and uncover concrete strategies to extract maximum value using the latest Amazon CloudWatch Logs capabilities. Learn practical techniques to optimize log data for cost-effectiveness and business impact. This isn’t the CloudWatch of yesteryear – it’s a transformed, customer-centric observability platform that can revolutionize how you manage and leverage log data.
COP409 | Implementing application performance monitoring feat. PBS – Breakout Session
Application performance monitoring (APM) empowers organizations to proactively identify and resolve performance issues and to provide optimal user experiences. In this session, learn how Amazon CloudWatch provides complete visibility from end-user experiences to databases across traditional EC2 instances, containers, and serverless compute. Discover how to achieve strong correlation and flexibility to query the system state at any given time, enabling faster time to remediation. Explore how effective APM leverages all observability signals efficiently, enabling you to swiftly detect issues and maintain maximum application uptime.
COP345 | Building an effective observability strategy – Chalk Talk
Observability is crucial for understanding and optimizing modern cloud workloads, yet many organizations struggle to implement an effective strategy. In this session, learn how to evaluate your observability maturity and set up a comprehensive approach that measures the right signals. Gain insights into efficiently driving observability for diverse workloads, including containers and serverless applications. Through demos and practical examples, discover best practices for building a strategy that empowers you to unlock powerful insights from your observability data.
COP352 | Effortless observability for modern workloads – Chalk Talk
Gain deep visibility into your workloads without complex instrumentation. In this chalk talk, explore Amazon CloudWatch features that enable easy collection, aggregation, and visualization of metrics, logs, and traces. Discover how CloudWatch Container Insights and Lambda Insights provide summarized insights into your containers and serverless environments using metrics and logs. Learn how Application Signals collects metrics and traces from your applications and, with minimal setup, displays key metrics like call volume, availability, latency, faults, and errors in seconds, from front end to database.
COP411 | Monitoring event flows: observability for event-driven architectures – Chalk Talk
Event-driven architectures (EDAs) offer scalability and promote loose coupling between components of a system, leading to greater agility. However, EDAs also bring observability challenges. Gain visibility into complex event flows in an asynchronous architecture that involves messaging services. In this interactive chalk talk, explore tracing with OpenTelemetry to view the full event journey. Also, learn how to assemble metrics and logs for insights into sharding, performance, and root cause analysis, along with best practices for popular EDA patterns.
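To make the idea of following an event across an asynchronous hop more concrete, here is a minimal sketch of trace context propagation through a message. It is not taken from the session and does not use the OpenTelemetry API; it hand-rolls identifiers in the W3C Trace Context `traceparent` format using only the Python standard library. The function names (`new_traceparent`, `child_traceparent`, `publish`, `consume`) and the message envelope shape are hypothetical, chosen only for illustration.

```python
import secrets


def new_traceparent() -> str:
    """Start a new trace: version-traceid-spanid-flags (W3C traceparent layout)."""
    trace_id = secrets.token_hex(16)  # 16 random bytes -> 32 hex characters
    span_id = secrets.token_hex(8)    # 8 random bytes -> 16 hex characters
    return f"00-{trace_id}-{span_id}-01"  # flags 01 marks the trace as sampled


def child_traceparent(parent: str) -> str:
    """Keep the parent's trace ID but mint a fresh span ID for the next hop."""
    version, trace_id, _parent_span_id, flags = parent.split("-")
    return f"{version}-{trace_id}-{secrets.token_hex(8)}-{flags}"


def publish(payload: dict, traceparent: str) -> dict:
    """Hypothetical producer: carry the trace context in message attributes."""
    return {"attributes": {"traceparent": traceparent}, "body": payload}


def consume(message: dict) -> str:
    """Hypothetical consumer: continue the trace found on the incoming message."""
    incoming = message["attributes"]["traceparent"]
    return child_traceparent(incoming)


producer_ctx = new_traceparent()
message = publish({"order_id": 42}, producer_ctx)
consumer_ctx = consume(message)

print("producer:", producer_ctx)
print("consumer:", consumer_ctx)
```

Because the consumer reuses the trace ID it received and only generates a new span ID, a tracing backend can stitch the producer and consumer spans into one event journey even though the two sides never call each other directly. In practice, an instrumentation library would typically handle this injection and extraction for you.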
COP305 | Using observability for effective incident response – Workshop
Effective incident management is crucial for business continuity. In this hands-on workshop, you work through a simulated incident and discover how to collect, analyze, and correlate data from various sources to gain a holistic understanding of your system’s behavior. Explore techniques for setting up effective alerting and automated workflows to proactively identify and respond to incidents using Amazon CloudWatch and AWS Systems Manager. You must bring your laptop to participate.
Conclusion
In this post, we highlighted some of the monitoring and observability sessions that we think you will find interesting. We look forward to seeing you at these sessions. Visit the Observability kiosk at AWS Village in the Expo if you have more questions or want to dive deeper into what you’ve heard at the sessions. To learn more about other monitoring and observability sessions, please visit the re:Invent sessions catalog.