Alluxio Enterprise Edition - Caching for data analytics
Alluxio Enterprise Edition - Caching for data analytics
Product Overview
Alluxio is Data Orchestration for the cloud and enables compute frameworks to leverage data from anywhere, an S3 data lake or remote Hadoop environments. It enables speeding up of frameworks like Apache Spark, Presto, Hive & Tensorflow by caching data and also enables hybrid cloud environments when data is remote. Alluxio moves data closer to compute from where it is stored across zones, regions or countries, creating better data locality and accessibility. Data orchestration is to data like container orchestration is to containers. This Alluxio AMI is best when used with AWS EMR for caching metadata and data to improve performance of Spark, Presto and Hive services within AWS EMR. It can also be used to create a standalone cluster of Alluxio. Learn more about Alluxio Data Orchestration here: https://www.alluxio.io/data-orchestration/. Find tutorials for Alluxio on AWS here: https://www.alluxio.io/products/aws/
Version
Video
Categories
Operating System
Linux/Unix, Amazon Linux 2018_03
Delivery Methods