home/data exploration & preparation/application/python

// data exploration & preparation · application

Data Exploration & Preparation in Python

Python is a very powerful tool for Data Preparation. Libraries such as Numpy and Pandas can take care of various tasks such as Consolidation of datasets, Missing value and Outlier Treatment. To demonstrate the working of Python for performing all such tasks, various hypothetical datasets have been used.

Python logo

The next big topic under Data Preparation is of Feature Engineering. Here, one of the most powerful and famous library of python, sklearn is put to use to perform the complex tasks of Feature Extraction. Python is also used for Feature Selection, Transformation and Scaling. For performing some of the tasks under Feature Engineering, the Boston dataset has been used.

Miscellaneous Methods

NumPyPandasOutlier & Missing Value Treatment

Feature Engineering

sklearnFeature ExtractionFeature Selection
ESC
100 pages indexed · Esc to close