paint-brush
Lambda Architecture Batch Layer: Visualizing All Time Taxi Data [Part 3]by@srivassid
188 reads

Lambda Architecture Batch Layer: Visualizing All Time Taxi Data [Part 3]

by Siddharth5mDecember 20th, 2020
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

In this part i would be talking about the batch layer of the Lambda Architecture. Batch layer is computed by applying a function to the whole historical dataset, to answer some high level questions which cannot be answered by either speed layer or serving layer. The computations typically take hours or days to run, and the results are stored usually in a distributed file system (although this is not a requirement). For example, the queries that might need to be answered would range from the beginning of the dataset to now, in our case, till date how many cabs have served how many passengers, or what is the total distance driven by all the cabs. In this article i would try to answer questions like these based on the dataset that i have. The code for the article can be found here.

Company Mentioned

Mention Thumbnail
featured image - Lambda Architecture Batch Layer: Visualizing All Time Taxi Data [Part 3]
Siddharth HackerNoon profile picture
Siddharth

Siddharth

@srivassid

Data Engineer

About @srivassid
LEARN MORE ABOUT @SRIVASSID'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Siddharth HackerNoon profile picture
Siddharth@srivassid
Data Engineer

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Nixnet
Digimarket
Nius