Performance Benchmark: Apache Spark on DataProc Vs. Google BigQueryby@Raghavendra_Singh
3,088 reads

Performance Benchmark: Apache Spark on DataProc Vs. Google BigQuery

tldt arrow
Read on Terminal Reader🖨️

Too Long; Didn't Read

Research undertaken to provide interactive business intelligence reports and visualisations for thousands of end users. We need to design a system that can analyse billions of data points in real time. The solution took into consideration following 3 main characteristics of the desired system of desired system: Analysing and classifying expected user queries and their frequency. Developing various pre-aggregations and projections to reduce data churn while serving various classes of user queries. Serving up to 60 concurrent queries to the platform users with a combination of aggregated datasets. Developing state of the art ‘Query Rewrite Algorithm’ to serve the user queries using a combination.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail

Coin Mentioned

Mention Thumbnail
featured image - Performance Benchmark: Apache Spark on DataProc Vs. Google BigQuery
Raghavendra Pratap Singh HackerNoon profile picture

@Raghavendra_Singh

Raghavendra Pratap Singh

Learn More
LEARN MORE ABOUT @RAGHAVENDRA_SINGH'S EXPERTISE AND PLACE ON THE INTERNET.
react to story with heart

RELATED STORIES

L O A D I N G
. . . comments & more!
Hackernoon hq - po box 2206, edwards, colorado 81632, usa