Background
                            
                            
                                apache spark is one of the popular unfied query engine which can read several data files such as csv, parquet, avro, etc and it compatible with lakehouse architecture. But can it run with only 2 gigs of VM ?
                            
                            
                                Objectives
                            
                            
                                to understand how to spin up a low specs virtual machine and install it with apache spark (including the JVM)
                            
                            
                               Deliverables
                            
                            
                                article & illustration