Advanced Analytics and Deep Learning Models. Group of authors

Title: Advanced Analytics and Deep Learning Models

Author: Group of authors

Publisher: John Wiley & Sons Limited

Genre: Software

isbn: 9781119792413

multi-line retouches include an increase in the measurement of the parcel of land, number of bedrooms, year of construction, and more descriptive.

      2.3.1 Methodology

      2.3.2 Work Flow

Schematic illustration of flow of work.

      Figure 2.1 Flow of work.

      2.3.3 The Dataset

Column name         Description
Area type           The kind of area the flat/plot is in.
Availability        Whether the land is currently available.
Location            Location of the land/plot.
Size                Number of bedrooms, hall, and kitchen in the flat (e.g., 3 BHK).
Society             Name of the cooperative society.
Total square feet   Area of the plot in square feet.
Bath                Number of bathrooms in the flat.
Balcony             Number of balconies in the flat.
Price               Price of the plot/flat.

      2.3.4 Data Handling

       2.3.4.1 Missing Values and Data Cleaning

      In the Size column, values are written in different forms, such as 3 BHK and 3 BK; to generalize, we create a new column, BHK. To fill it, we apply a function that tokenizes each entry, keeps the number, and discards the other words. In the Total square feet column, some entries give a range rather than an exact number; in this case, we replace the entry with the average of the two numbers.
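
      The two cleaning steps above can be sketched roughly as follows, assuming the raw values are plain strings. The function names `extract_bhk` and `sqft_to_number` and the sample range are hypothetical and not taken from the text; with pandas, each function could be applied to a column through `Series.apply`.

```python
from typing import Optional


def extract_bhk(size: Optional[str]) -> Optional[int]:
    """Tokenize a Size entry and keep only the number, e.g. '3 BHK' -> 3."""
    if not isinstance(size, str):
        return None  # missing value; left for the cleaning step to handle
    for token in size.split():
        if token.isdigit():
            return int(token)
    return None  # no numeric token found


def sqft_to_number(value: str) -> Optional[float]:
    """Convert a Total square feet entry to a float.

    A range written as 'low - high' is replaced by the average of the two
    numbers. Entries that cannot be parsed return None (an assumption; the
    text does not say how such entries are treated).
    """
    parts = [p.strip() for p in str(value).split("-")]
    try:
        if len(parts) == 1:
            return float(parts[0])
        if len(parts) == 2:
            return (float(parts[0]) + float(parts[1])) / 2
    except ValueError:
        pass
    return None


if __name__ == "__main__":
    print(extract_bhk("3 BHK"), extract_bhk("3 BK"))
    print(sqft_to_number("1000 - 1200"))  # hypothetical range entry
```

      Returning None for unparseable entries keeps them visible as missing values instead of silently guessing a number.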

Graph depicting missing values visualized using a heatmap.

       2.3.4.2 Feature Engineering

       2.3.4.3 Removing Outliers

      Outliers are data points, sometimes errors, that represent extreme variations in our dataset. There are several techniques for detecting outliers; one of them is visualization. We can draw a box plot or a scatter plot and draw inferences from the patterns.
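
      As a numeric counterpart to reading a box plot, the sketch below flags values that fall beyond 1.5 times the interquartile range from the quartiles, which is where box-plot whiskers are conventionally drawn. This rule is an illustrative assumption and is not stated in the text; the function name `iqr_outliers` and the multiplier `k` are hypothetical.

```python
import statistics
from typing import List, Sequence


def iqr_outliers(values: Sequence[float], k: float = 1.5) -> List[float]:
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # Quartiles computed with the inclusive method (data treated as the
    # full population, including its minimum and maximum).
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]
```

      A larger `k` flags fewer points; the choice depends on how aggressively extreme values should be treated as suspect.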

      In the BHK column, there are some flats whose average area per room is unusually large, and in some instances the number of bathrooms is larger than the number of rooms in the house; such records affect the result.
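
      A minimal sketch of filtering out the two kinds of records described above, assuming each record is a dictionary with the hypothetical keys `total_sqft`, `bhk`, and `bath`. The cutoff `MAX_SQFT_PER_ROOM` is an assumed value for illustration only; the text does not give a threshold.

```python
from typing import Dict, List

# Hypothetical cutoff for "unusually large" area per room; not from the text.
MAX_SQFT_PER_ROOM = 1000.0


def remove_outliers(
    rows: List[Dict[str, float]],
    max_sqft_per_room: float = MAX_SQFT_PER_ROOM,
) -> List[Dict[str, float]]:
    """Keep only records that pass both sanity checks."""
    kept = []
    for row in rows:
        if row["bhk"] <= 0:
            continue  # area per room cannot be computed
        if row["total_sqft"] / row["bhk"] > max_sqft_per_room:
            continue  # average area of one room is unusually large
        if row["bath"] > row["bhk"]:
            continue  # more bathrooms than rooms
        kept.append(row)
    return kept
```

      Keeping the cutoff as a parameter makes it easy to inspect how many records each threshold would discard before committing to one.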