Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city

Main Author: García-Magariño, Iván
Other Authors: Medrano, Carlos, Delgado, Jorge
Format: Dataset
Terbitan: Mendeley , 2017
Subjects:
Online Access: https:/data.mendeley.com/datasets/mxpgf54czz
ctrlnum 0.17632-mxpgf54czz.2
fullrecord <?xml version="1.0"?> <dc><creator>Garc&#xED;a-Magari&#xF1;o, Iv&#xE1;n</creator><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><publisher>Mendeley</publisher><description>This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and &#x201C;Ensanche&#x201D;. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the &#x201C;.py&#x201D; extension. The main program can be executed from the &#x201C;main.py&#x201D; file. The &#x201C;boxplotErrors.eps&#x201D; is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the &#x201C;data&#x201D; folder. The input raw data of the house prices are in the &#x201C;dataRaw.csv&#x201D; file. These were shuffled into the &#x201C;dataShuffled.csv&#x201D; file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the &#x201C;data&#x201D; folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation. </description><subject>Software</subject><subject>Machine Learning</subject><subject>Dimensionality Reduction</subject><subject>Software Agent</subject><subject>Big Data</subject><subject>Agent-Based Modeling</subject><subject>Multi-Agent Systems</subject><subject>Housing Market</subject><contributor>Medrano, Carlos</contributor><contributor>Delgado, Jorge </contributor><type>Other:Dataset</type><identifier>10.17632/mxpgf54czz.2</identifier><rights>Creative Commons Attribution 4.0 International</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><relation>https:/data.mendeley.com/datasets/mxpgf54czz</relation><date>2017-12-12T12:32:34Z</date><recordID>0.17632-mxpgf54czz.2</recordID></dc>
format Other:Dataset
Other
author García-Magariño, Iván
author2 Medrano, Carlos
Delgado, Jorge
title Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city
publisher Mendeley
publishDate 2017
topic Software
Machine Learning
Dimensionality Reduction
Software Agent
Big Data
Agent-Based Modeling
Multi-Agent Systems
Housing Market
url https:/data.mendeley.com/datasets/mxpgf54czz
contents This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.
id IOS7969.0.17632-mxpgf54czz.2
institution Universitas Islam Indragiri
affiliation onesearch.perpusnas.go.id
institution_id 804
institution_type library:university
library
library Teknologi Pangan UNISI
library_id 2816
collection Artikel mulono
repository_id 7969
city INDRAGIRI HILIR
province RIAU
shared_to_ipusnas_str 1
repoId IOS7969
first_indexed 2020-04-08T08:32:15Z
last_indexed 2020-04-08T08:32:15Z
recordtype dc
_version_ 1686587767157424128
score 17.538404