
Python & Data Engineering: Under the Hood of Join Operators

by Veronika Vasileva · 6m · September 1st, 2021

Too Long; Didn't Read

An estimated 2.5 quintillion bytes of data are generated each day, which makes it difficult to comb through the essential pieces, process them, and extract insights. To optimize your queries against big data, you need a solid understanding of how join algorithms work under the hood. In this post, I discuss the nested loop join, hash join, and merge join algorithms in Python. Nested loop joins support only four logical join operators: inner join, left outer join, left semi join, and left anti semi join. Merge join is touted as the most effective of all operators.
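As a rough illustration of the three algorithms named above, here is a minimal sketch of an inner equi-join written three ways. The function names, the list-of-dicts row format, and the sample `customers` and `orders` data are assumptions made for this example; they are not taken from the article.

```python
from collections import defaultdict


def nested_loop_join(left, right, key):
    """Compare every left row with every right row: O(len(left) * len(right))."""
    return [(lrow, rrow) for lrow in left for rrow in right
            if lrow[key] == rrow[key]]


def hash_join(left, right, key):
    """Build a hash table on one input, then probe it with the other."""
    buckets = defaultdict(list)
    for rrow in right:  # build phase
        buckets[rrow[key]].append(rrow)
    # Probe phase: look up each left row's key in the hash table.
    return [(lrow, rrow) for lrow in left for rrow in buckets.get(lrow[key], [])]


def merge_join(left, right, key):
    """Sort both inputs by the join key, then advance two cursors in step."""
    left = sorted(left, key=lambda row: row[key])
    right = sorted(right, key=lambda row: row[key])
    result, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lkey, rkey = left[i][key], right[j][key]
        if lkey < rkey:
            i += 1
        elif lkey > rkey:
            j += 1
        else:
            # Pair the current left row with the whole run of equal right keys.
            k = j
            while k < len(right) and right[k][key] == lkey:
                result.append((left[i], right[k]))
                k += 1
            i += 1
    return result


# Hypothetical sample data for the sketch.
customers = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}, {"id": 3, "name": "Cy"}]
orders = [{"id": 2, "item": "pen"}, {"id": 1, "item": "ink"}, {"id": 2, "item": "pad"}]

for join in (nested_loop_join, hash_join, merge_join):
    print(join.__name__, len(join(customers, orders, "id")))
```

On this sample, all three functions find the same three matching pairs. They differ in the work they do to get there: the nested loop compares every pair of rows, the hash join pays for one pass to build its table, and the merge join pays for sorting both inputs (which costs nothing extra if they arrive already sorted).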

Veronika Vasileva


@nikagolubeva

Data engineer, python teacher
