Background
A regular pandas DataFrame has limited scalability because it can only use the resources of a single server. With a Spark cluster, the computation is distributed across several servers.
Objectives
To execute Spark commands remotely on a Spark cluster.
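One common way to run a job on a remote cluster is the spark-submit command-line tool, pointing its --master option at the cluster. The sketch below only assembles that command and prints it, so it can be inspected without a cluster being available. The helper name build_spark_submit_command, the host spark-master.example.com, the port 7077, and the files job.py and data.csv are all hypothetical placeholders, not details from this brief.

```python
import shlex
from typing import List, Sequence


def build_spark_submit_command(
    master_url: str,
    app_path: str,
    app_args: Sequence[str] = (),
    deploy_mode: str = "client",
) -> List[str]:
    """Return the argv list for submitting app_path to a remote Spark master.

    Hypothetical helper for illustration; it does not contact the cluster.
    """
    # spark-submit accepts "client" or "cluster" for --deploy-mode.
    if deploy_mode not in ("client", "cluster"):
        raise ValueError(f"unsupported deploy mode: {deploy_mode!r}")
    return [
        "spark-submit",
        "--master", master_url,        # URL of the remote cluster master
        "--deploy-mode", deploy_mode,  # where the driver process runs
        app_path,                      # application to execute
        *app_args,                     # arguments passed through to the application
    ]


# Placeholder values (assumptions): replace with the real master URL and job file.
command = build_spark_submit_command(
    "spark://spark-master.example.com:7077",
    "job.py",
    ["--input", "data.csv"],
)

# Print the shell-quoted command instead of running it.
print(shlex.join(command))
```

To actually submit the job, the same list could be passed to subprocess.run(command, check=True) on a machine where spark-submit is installed and the master is reachable.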
Deliverables
Article & Illustration