Building High-Performance Data Lake Using Apache Hudi and Alluxio
by @bin-fan

Too Long; Didn't Read

T3Go is China’s first platform for smart travel based on the Internet of Vehicles. Trevor Zhang and Vino Yang describe the evolution of its data lake architecture, which is built on cloud-native or open-source technologies including Alibaba OSS, Apache Hudi, and Alluxio. The data lake stores petabytes of data and supports hundreds of pipelines and tens of thousands of tasks daily. The architecture lets the team store data as-is, without having to structure it first, and run different types of analytics to guide better decisions.


Bin Fan


HackerNoon HQ - PO Box 2206, Edwards, Colorado 81632, USA