Data is a commodity, but without ways to process it, its value is questionable. Data science is a multidisciplinary field whose goal is to extract value from data in all it's forms. This ebook explores the field of data science through data and its structure as well as the high-level process that you can use to transform data into value. Data science is a process. That's not to say it's mechanical and void of creativity. But, when you dig into the stages of processing data, from munging data sources and data cleansing to machine learning and eventually visualisation, you see that unique steps are involved in transforming raw data into insight. The steps that you use can also vary. In exploratory data analysis, you might have a cleansed data set that's ready to import into R, and you visualise your result but don't deploy the model in a production environment. In another environment, you might be dealing with real-world data and require a process of data merging and cleansing in addition to data scaling and preparation before you can train your machine learning model.