paint-brush
Hadoop Data Storage Explainedby@artemg
828 reads
828 reads

Hadoop Data Storage Explained

by Artem GoginJune 9th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. To take care of these issues, we need to move towards splitting calculations into a few machines. The objective is to store data, utilizing as many resources of each machine as possible at the same time. We can simply divide all input records by 4 (number of our nodes) so each machine would accept ¼ of input files.
featured image - Hadoop Data Storage Explained
Artem Gogin HackerNoon profile picture
Artem Gogin

Artem Gogin

@artemg

L O A D I N G
. . . comments & more!

About Author

Artem Gogin HackerNoon profile picture
Artem Gogin@artemg

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite