Containerization of Spark Python Using Kubernetes

by Raghavendra Pratap Singh | 7m | August 3rd, 2020

Too Long; Didn't Read

Spark is a general-purpose distributed data processing engine designed for fast computation. Its main feature is in-memory cluster computing, which increases the processing speed of an application. Kubernetes is a container orchestration engine that helps keep resources highly available. When Spark runs on Kubernetes, it uses the Kubernetes API server (kube-apiserver) as its cluster manager to handle execution. Spark's standalone mode, by contrast, has no default high-availability mode, so it needs additional components such as ZooKeeper installed and configured. With Kubernetes as the cluster manager, spark-submit can be used directly to submit a Spark application to the cluster.
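
As a rough illustration of the spark-submit step, here is a minimal Python sketch that assembles the command line for submitting a PySpark application with Kubernetes as the cluster manager. The helper name `build_spark_submit_args`, the API server URL, and the image name are hypothetical placeholders, not values from the article. The flags and configuration keys (`--master k8s://...`, `--deploy-mode cluster`, `spark.executor.instances`, `spark.kubernetes.container.image`) are the ones documented for Spark on Kubernetes. The sketch only builds the argument list and does not run anything.

```python
from typing import List


def build_spark_submit_args(
    api_server: str,
    image: str,
    app_path: str,
    app_name: str = "pyspark-on-k8s",
    executors: int = 2,
) -> List[str]:
    """Build the argv for a cluster-mode spark-submit against Kubernetes.

    Hypothetical helper for illustration; it only assembles arguments.
    """
    return [
        "spark-submit",
        # The k8s:// prefix tells Spark to use the Kubernetes API server
        # as the cluster manager.
        "--master", f"k8s://{api_server}",
        # In cluster mode the driver itself runs in a pod.
        "--deploy-mode", "cluster",
        "--name", app_name,
        "--conf", f"spark.executor.instances={executors}",
        # Container image used for the driver and executor pods.
        "--conf", f"spark.kubernetes.container.image={image}",
        # Path of the application as seen inside the container image.
        app_path,
    ]


if __name__ == "__main__":
    args = build_spark_submit_args(
        api_server="https://kubernetes.example.com:6443",  # assumed URL
        image="registry.example.com/spark-py:latest",  # assumed image
        app_path="local:///opt/spark/examples/src/main/python/pi.py",
    )
    print(" ".join(args))
```

The resulting list could be passed to `subprocess.run` on a machine that has a Spark distribution and cluster credentials available. Keeping it as a list rather than a single string avoids shell-quoting problems.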

Raghavendra Pratap Singh

@Raghavendra_Singh

Raghavendra works for Sigmoid.
