paint-brush
Probabilistic Data Structures And Algorithms In Big Databy@physboy
956 reads
956 reads

Probabilistic Data Structures And Algorithms In Big Data

by Nikita Vasilev5mNovember 27th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Probabilistic data structures are a great fit for modern Big Data applications. They use hash functions to randomize items and keep the size constant. The Bloom filter is an implementation of a probability set, invented by Burton Bloom in 1970. The most prominent examples of operations may include identifying some unique or frequent items. The higher the number of hash functions is, the more accurate determination you get. Bloom filters have this powerful combo of simplicity and multi-purpose nature. In layman terms, they support operations similar to the hash tables but use less space. Cassandra, Cassandra, SSTache, and others use these structures to storage massive amounts of information.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Probabilistic Data Structures And Algorithms In Big Data
Nikita Vasilev HackerNoon profile picture
Nikita Vasilev

Nikita Vasilev

@physboy

Data engineer, python tutor

About @physboy
LEARN MORE ABOUT @PHYSBOY'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Nikita Vasilev HackerNoon profile picture
Nikita Vasilev@physboy
Data engineer, python tutor

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite