Introduction to PySpark

I’ve spent a load of time learning about both the internals of Spark and PySpark for analytics. I still need to collect my thoughts, so for now this post is more of a placeholder.