Background
Website is composed by group of HTML syntax from div, span, table, image, list, etc. Those HTML element can be actually deconstructed and get only the text/information using a technique called parsing.
Objectives
To understand how to process html file and parse the content to get insightful information.
Deliverables
Article