// data exploration & preparation · application
Python is a powerful tool for data preparation. Libraries such as NumPy and pandas handle tasks such as consolidating datasets and treating missing values and outliers. Several hypothetical datasets are used to demonstrate how Python performs these tasks.
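As a rough illustration of these three tasks, the sketch below uses two small made-up tables. The column names, the values, the median fill and the 1.5 × IQR capping rule are all assumptions chosen for the example, not the datasets or methods used later in this section.

```python
import numpy as np
import pandas as pd

# Hypothetical datasets, invented purely for illustration.
customers = pd.DataFrame({
    "customer_id": [1, 2, 3, 4, 5],
    "age": [25, 32, np.nan, 41, 29],          # one missing value
})
orders = pd.DataFrame({
    "customer_id": [1, 2, 3, 4, 5],
    "amount": [120.0, 95.0, 110.0, 105.0, 5000.0],  # one extreme value
})

# Consolidation: join the two tables on their shared key.
df = customers.merge(orders, on="customer_id", how="inner")

# Missing value treatment: fill the gap with the column median
# (one common choice among several).
df["age"] = df["age"].fillna(df["age"].median())

# Outlier treatment: cap values that fall outside the 1.5 * IQR fences.
q1 = df["amount"].quantile(0.25)
q3 = df["amount"].quantile(0.75)
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
df["amount_capped"] = df["amount"].clip(lower=lower, upper=upper)

print(df)
```

Capping keeps every row in the table; dropping the outlying rows instead is an equally valid choice when the extreme values are known to be errors.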
The next major topic under data preparation is feature engineering. Here scikit-learn (sklearn), one of Python's most widely used machine learning libraries, is used for the more complex task of feature extraction. Python is also used for feature selection, transformation and scaling. Some of the feature engineering tasks are demonstrated on the Boston dataset.
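A minimal sketch of the four feature engineering steps follows. It runs on synthetic data rather than the Boston dataset, which recent scikit-learn releases no longer ship. The array shapes, the log transform, the choice of `f_regression` as the scoring function and PCA as the extraction method are assumptions made for the example.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.preprocessing import StandardScaler

# Synthetic, right-skewed features and a target that depends on feature 0.
# Both are hypothetical and stand in for a real dataset.
rng = np.random.default_rng(0)
X = rng.lognormal(size=(100, 5))
y = 3.0 * np.log1p(X[:, 0]) + rng.normal(scale=0.1, size=100)

# Transformation: log1p reduces the right skew of each feature.
X_log = np.log1p(X)

# Scaling: standardize each feature to zero mean and unit variance.
X_scaled = StandardScaler().fit_transform(X_log)

# Feature selection: keep the 2 features with the highest F-scores
# against the target.
selector = SelectKBest(score_func=f_regression, k=2)
X_selected = selector.fit_transform(X_scaled, y)

# Feature extraction: project the features onto 2 principal components.
X_pca = PCA(n_components=2).fit_transform(X_scaled)

print("selected feature mask:", selector.get_support())
print("shapes:", X_selected.shape, X_pca.shape)
```

Selection keeps a subset of the original columns, so the result stays interpretable, whereas extraction builds new columns as combinations of all of them.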