The Hitchhiker's Guide to pySpark DataFrames

Written by mlwhiz | Published 2020/11/11
Tech Story Tags: big-data | machine-learning | data-science | apache-spark | pyspark | python3 | artificial-intelligence | hackernoon-top-story | web-monetization

TLDR The Hitchhiker's Guide to pySpark DataFrames. Spark is one of the most used tools when it comes to working with Big Data. Spark has now provided a DataFrame API for us Data Scientists to work with. I will be working with the Data Science for COVID-19 in South Korea, which is a detailed dataset on the most detailed datasets on the COVID dataset for the South Korean Open Data Science conference. This post is going to be one of my longest posts on medium, so go on and pick up a Coffee.com coffee.via the TL;DR App

no story

Written by mlwhiz | Data Scientist @Facebook. Data science communicator at mlwhiz and TDS. Connect on Twitter @mlwhiz
Published by HackerNoon on 2020/11/11