AWS Storage Blog

Category: Thought Leadership

How WarpStream enables cost-effective low-latency streaming with Amazon S3 Express One Zone

WarpStream, an AWS Partner, is a drop-in replacement for Apache Kafka. WarpStream’s cloud-native architecture makes it as easy to deploy and manage as a stateless web server like NGINX. WarpStream clusters can scale up to handle multiple GiB-per-second workloads as quickly as compute resources are assigned and then scale back down to zero after the […]

Optimizing enterprise MLOps in the cloud with Domino Data Lab and Amazon Elastic File System

Domino Data Lab is an AWS Partner Network (APN) partner that provides a central system of record for data science activity across an organization. The Domino solution delivers orchestration for all data science artifacts, including AWS infrastructure, data and services. As part of the solution, Domino’s platform leverages the scale, security, reliability, and cost-effectiveness of […]

AWS Backup 2021 blog image

Optimizing AWS Backup costs

The threat of ransomware has placed data protection front and center as a top priority for all businesses. The Sophos State of Ransomware Report 2023 reported that 66% of organizations were impacted in 2022 with a median ransomware payout of $400,000 (average pay out of $1.54 million). With the median recovery cost of using backups […]

Amazon S3 featured image 2023

Designing a resilient and cost-effective backup strategy for Amazon S3

Many organizations are protecting important business data against disasters like fires, floods or ransomware events. Proper backup and disaster recovery strategies can help safeguard critical data and ensure business continuity in a disaster scenario. Maintaining normal operations in a disaster recovery situation can save time and money. AWS services like Amazon S3 and AWS Backup […]

S3 cost optimization

Enhance savings for read-heavy workloads with Amazon S3 Bucket Keys

Organizations continue to grow their data lakes in the cloud as they build out new and innovative analytics, machine-learning, and generative AI workloads. At the same time, these workloads often access data that requires compliance with stringent data security and privacy standards. These compliance frameworks typically specify additional requirements for encryption at-rest, which leads customers […]

Amazon S3 Batch Operations featured image

Streamline data management at scale by automating the creation of Amazon S3 Batch Operations jobs

Over time, Enterprises may need to undertake operations or make modifications to their data as part of general data management, to address changing business needs, or to comply with evolving data-management regulations and best practices. As datasets being generated, stored, and analyzed continue to grow exponentially, the need for simplified, scalable, and reproduceable data management […]

Amazon S3 Express One Zone delivers cost and performance gains for ChaosSearch customers

ChaosSearch is an Amazon S3-native database built on a serverless, stateless compute architecture within AWS that delivers live search, SQL, and Generative AI analytics. At ChaosSearch, the speed and performance of our architecture is important to us and our customers because time to results is the difference between success and failure, and we rely on […]

Akridata accelerates processing of unstructured data with Amazon S3 Express One Zone

Deep learning processes often need to read full datasets, which are usually hundreds of gigabytes in size, before they can perform intelligent data processing. High data retrieval speed and low latency from storage are crucial for enterprises running these performance-critical workloads. Akridata, an AWS independent software vendor (ISV) partner, helps make artificial intelligence (AI)-assisted unstructured-data […]

lakeFS and Amazon S3 Express One Zone: Highly performant data version control for ML/AI

Machine learning presents a number of new challenges to data teams, calling for technology solutions that can support training and fine-tuning performance-critical workloads with high performance. Data version control is one of the facets of high-performing ML pipelines, as it allows efficient experimentation and full ML pipeline reproducibility at scale. lakeFS by Treeverse, an AWS […]

ClickHouse Cloud & Amazon S3 Express One Zone: Making a blazing fast analytical database even faster

ClickHouse is a columnar database management system (DBMS) designed for blazing-fast real-time analytics. It was built to address the needs of interactive analytical applications requiring up-to-the-second analytics. To do that, it must support real-time data ingestion at the rate of hundreds of millions of events per second and run complex analytical queries, such as filtering, […]