Background
apache spark is one of the popular unfied query engine which can read several data files such as csv, parquet, avro, etc and it compatible with lakehouse architecture. But can it run with only 2 gigs of VM ?
Objectives
to understand how to spin up a low specs virtual machine and install it with apache spark (including the JVM)
Deliverables
article & illustration