Accelerate Spark and Hive Jobs on AWS S3 by 10x with Alluxio as a Tiered Storage Solution
by @bin-fan



Too Long; Didn't Read

Bazaarvoice uses Alluxio as a caching tier on top of AWS S3 to maximize performance and minimize operating costs when running big data analytics on AWS EC2. The company is a software-as-a-service provider that lets retailers and brands curate, manage, and understand user-generated content, such as reviews of their products. Its big data platform relies entirely on the open source Hadoop ecosystem, with tools such as Apache Hive and Spark for ETL, Kafka, and Elasticsearch and HBase for durable data storage.


@bin-fan

Bin Fan

VP of Open Source and Founding Member @Alluxio

