AWS Blog
Featured posts
Breaking Through Bureaucracy: A Leader’s Guide to Establishing Your First Autonomous Team
Traditional organizational operating models built for stability are hindering enterprise innovation and agility in rapidly changing markets, but implementing autonomous teams—starting with small, customer-facing initiatives led by empowered single-threaded leaders—can help organizations balance necessary control with the speed and adaptability required to remain competitive.
Introducing Amazon Nova Sonic: Human-like voice conversations for generative AI applications
Amazon Nova Sonic is a new foundation model on Amazon Bedrock that streamlines speech-enabled applications by offering unified speech recognition and generation capabilities, enabling natural conversations with contextual understanding while eliminating the need for multiple fragmented models.
Amazon Bedrock Guardrails enhances generative AI application safety with new capabilities
Amazon Bedrock Guardrails introduces enhanced capabilities to help enterprises implement responsible AI at scale, including multimodal toxicity detection, PII protection, IAM policy enforcement, selective policy application, and policy analysis features that customers like Grab, Remitly, and KONE are leveraging to standardize safeguards across generative AI applications.
Category
Filter
Newest posts
Total results: 10000
-
Adam Richter, Bowen Wang, 04/22/2025As we continue our five-part series on optimizing costs for generative AI workloads on AWS, our third blog shifts our focus to Amazon Bedrock. In our previous posts, we explored general Cloud Financial Management principles on generative AI adoption and strategies for custom model development using Amazon EC2 and Amazon SageMaker AI. Today, we’ll guide you through cost optimization techniques for Amazon Bedrock, AWS’s fully managed service that provides access to leading foundation models. We’ll explore making informed decisions about pricing options, model selection, knowledge base optimization, prompt caching, and automated reasoning. Whether you’re just starting with foundation models or looking to optimize your existing Amazon Bedrock implementation, these techniques will help you balance capability and cost while leveraging the convenience of managed AI models.
-
Samuel Selvan, Jagadish Kumar, 04/22/2025Today, we’re launching a new visual interface for OpenSearch Ingestion that makes it simple to create and manage your data pipelines from the AWS Management Console. With this new feature, you can build pipelines in minutes without writing complex configurations manually. In this post, we walk through how these new features work and how you can use them to accelerate your data ingestion projects.
-
Atticus Wong, Tuhin Mukherjee, 04/22/2025Businesses use Amazon Elastic Block Store (Amazon EBS) snapshots to capture point-in-time copies of application data volumes that can serve as baseline standards when creating new volumes. With snapshot copy, users are enabled to launch application workloads in different AWS Regions or meet data protection and disaster recovery requirements. Security and regulatory compliance remain top [...]
-
CJ Sturgess, Adam Sandman, Jessica Moore, Bhavye Sharma, 04/22/2025In today’s fast-paced digital landscape, the integration of AI in DevOps is changing how organizations balance innovation with compliance requirements. Generative AI tools have enhanced developer productivity, with estimates suggesting up to 45% improvement in efficiency, while creating new challenges for quality control and regulatory compliance. Through solutions like SpiraTeam from Inflectra, organizations can now leverage AI to modernize their quality and compliance processes while maintaining rigorous safety standards, particularly crucial for regulated industries like healthcare, finance, and utilities.
-
ramadit, 04/22/2025This post was jointly authored by Alex Kestner (Sr. Product Manager, Amazon EKS), Ratnopam Chakrabarti (Sr. SA, Containers & OSS), Shivam Dubey (Specialist SA, Containers), and Suket Sharma (Sr. SDE, Amazon EKS). Introduction Amazon Elastic Kubernetes Service (Amazon EKS) now offers node monitoring and auto repair capabilities. This new feature enables automatic detection and remediation [...]
-
Vivek Gangasani, Banu Nagasundaram, Dmitry Soldatkin, Felipe Lopez, Siddharth Venkatesan, 04/22/2025Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This release introduces significant performance improvements, expanded model compatibility with multimodality (that is, the ability to understand and analyze text-to-text, images-to-text, and text-to-images data), and provides built-in integration with vLLM to help you seamlessly deploy and serve large language models (LLMs) with the highest performance at scale.
-
Rui Cardoso, Ricardo Aldao, Amit Gupta, Julia Hu, Neil Desai, 04/22/2025In the first post of this series, we introduced a comprehensive evaluation framework for Amazon Q Business, a fully managed Retrieval Augmented Generation (RAG) solution that uses your company’s proprietary data without the complexity of managing large language models (LLMs). The first post focused on selecting appropriate use cases, preparing data, and implementing metrics to [...]
-
Shreyas Subramanian, Haibo Ding, Balasubramaniam Srinivasan, Yun Zhou, 04/22/2025Today, we’re happy to announce the general availability of Amazon Bedrock Intelligent Prompt Routing. In this blog post, we detail various highlights from our internal testing, how you can get started, and point out some caveats and best practices. We encourage you to incorporate Amazon Bedrock Intelligent Prompt Routing into your new and existing generative AI applications.
-
Aparajithan Vaidyanathan, Saibal Samaddar, Tanushree Halder, Lokesh Joshi, Maheshwaran G, 04/22/2025In this post, we explore how Infosys developed Infosys Event AI to unlock the insights generated from events and conferences. Through its suite of features—including real-time transcription, intelligent summaries, and an interactive chat assistant—Infosys Event AI makes event knowledge accessible and provides an immersive engagement solution for the attendees, during and after the event.
-
Dan Gehred, Annie Chung, 04/22/2025Community is at the heart of live shopping destinations like the wildly popular Whatnot. The online marketplace surpassed $3 billion in gross merchandise volume in 2024 and is on track to top that achievement in 2025. People from around the world converge on the platform daily to forge connections and purchase items that they love, [...]