Too Long; Didn't Read
<a href="https://hackernoon.com/tagged/big-data" target="_blank">Big data</a> not only refers to machine learning, artificial intelligence, statistics and <a href="https://hackernoon.com/tagged/programming" target="_blank">programming</a>. It also refers to the computing infrastructure and software frameworks that make things happen. Regardless of the positive and negative aspects of any technology, Apache Hadoop is the state of the art in big data analytics. One of its several features consists in relying on off-the-shelf computers, connected in a networked environment and equipped with very minimal hardware. Unreliability, crashes, reboots and failures are not really a problem for Hadoop, provided a number of nodes is up and running and a good amount of redundancy by replication has been guaranteed by the administrator.