AWS Storage Blog
Tag: Amazon Simple Storage Service (Amazon S3)
Best practices for data lake protection with AWS Backup
Data lakes, powered by Amazon Simple Storage Service (Amazon S3), provide organizations with the availability, agility, and flexibility required for modern analytics approaches to gain deeper insights. Protecting sensitive or business-critical information stored in these S3 buckets is a high priority for organizations. AWS Backup for Amazon S3 makes it easier to centrally automate the […]
How Kurtosys quickly completed a massive cross-Region data migration using Amazon S3 Batch Replication
Kurtosys helps investment managers accurately deliver pain-free data and insights to their clients at scale. From client reporting and investor portals to global investment website solutions, their capabilities are focused on making marketing and client reporting teams’ lives easier. Their secure, cloud-based platform offers a set of easy-to-use tools designed to save clients time and […]
Reducing AWS Key Management Service costs by up to 99% with Amazon S3 Bucket Keys
Customers across many industries face increasingly stringent audit and compliance requirements on data security and privacy. Certain compliance frameworks, such as FISMA, FedRAMP, PCI DSS, and SOC 2, have specific regulatory standards for validating the security of systems. A common requirement for these compliance frameworks is more rigorous encryption standards for data at rest, where organizations must […]
Extending Java applications to directly access files in Amazon S3 without recompiling
The Java programming language has been among the most used languages in software development for many years, and a vast number of Java applications exist today. Almost all applications interact with files in some way, yet most of these have been written to interact with a file system based on block storage and cannot directly […]
Enforcing encryption in transit with TLS1.2 or higher with Amazon S3
Update April 8, 2024: As of February 27, 2024, all AWS service API endpoints (including those for Amazon S3) require a minimum of TLS version 1.2. Therefore, the S3 bucket and S3 Access Point policy examples in this post that enforce a minimum of TLS version 1.2 are no longer necessary as this is the default […]
Accelerating GPT large language model training with AWS services
GPT, or Generative Pre-trained Transformer, is a language model that has shown remarkable progress in various vertical industries. This technology has been used to generate human-like text in fields such as finance, healthcare, legal, marketing, and many others. In finance, GPT is being used to analyze financial data, generate reports, and assist with decision-making. In […]
How Goldman Sachs leverages AWS PrivateLink for Amazon S3
As a multinational investment bank and financial services company, Goldman Sachs (GS) stores diverse datasets at scale that must always be accessible whilst remaining secure and compliant with regulations and requirements. As a part of its process, Goldman Sachs leverages Amazon Virtual Private Clouds (VPC) to provide secure environments for deployment of resources within AWS, […]
How Canva saves over $3 million annually in Amazon S3 costs
Canva is an online design tool that empowers users worldwide to design, edit, and publish anything they can dream up. Canva runs most of its production workloads on AWS, using several core services, including Amazon S3, Amazon ECS, Amazon RDS, and Amazon DynamoDB. Running on AWS has helped Canva move fast and keep up with […]
Migrating mixed file sizes with the snow-transfer-tool on AWS Snowball Edge devices
When moving your applications and business infrastructure to AWS, you will likely need to migrate your existing data as well. This data often comes from file share environments and contains a variety of file sizes. If the data contains more than a single-digit percentage of files under 1 MB, your migration performance […]
Creating an ETL pipeline trigger for existing AWS DataSync tasks
Organizations look for ways to leverage the compute power of the cloud to analyze their data and produce reports to help drive business decisions. They want to load their data sets into extract-transform-load (ETL) pipelines for data processing. Once the data is processed, business decision makers at these organizations rely on accurate report generation to […]