paint-brush
Building High-Performance Data Lake Using Apache Hudi and Alluxio by@bin-fan
489 reads
489 reads

Building High-Performance Data Lake Using Apache Hudi and Alluxio

by Bin Fan6mAugust 27th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

T3Go is China’s first platform for smart travel based on the Internet of Vehicles. Trevor Zhang and Vino Yang describe the evolution of their data lake architecture, built on cloud-native or open-source technologies, including Alibaba OSS, Apache Hudi, and Alluxio. Their data lake stores petabytes of data, supporting hundreds of pipelines and tens of thousands of tasks daily. The architecture allows us to store the data as-is without having to first structure the data and run different types of analytics to guide better decisions.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Building High-Performance Data Lake Using Apache Hudi and Alluxio
Bin Fan HackerNoon profile picture
Bin Fan

Bin Fan

@bin-fan

VP of Open Source and Founding Member @Alluxio

About @bin-fan
LEARN MORE ABOUT @BIN-FAN'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Bin Fan HackerNoon profile picture
Bin Fan@bin-fan
VP of Open Source and Founding Member @Alluxio

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite