Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city
Main Author: | García-Magariño, Iván |
---|---|
Other Authors: | Medrano, Carlos, Delgado, Jorge |
Format: | Dataset |
Terbitan: |
Mendeley
, 2017
|
Subjects: | |
Online Access: |
https:/data.mendeley.com/datasets/mxpgf54czz |
ctrlnum |
0.17632-mxpgf54czz.2 |
---|---|
fullrecord |
<?xml version="1.0"?>
<dc><creator>García-Magariño, Iván</creator><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><publisher>Mendeley</publisher><description>This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”.
This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal.
The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods.
The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.
</description><subject>Software</subject><subject>Machine Learning</subject><subject>Dimensionality Reduction</subject><subject>Software Agent</subject><subject>Big Data</subject><subject>Agent-Based Modeling</subject><subject>Multi-Agent Systems</subject><subject>Housing Market</subject><contributor>Medrano, Carlos</contributor><contributor>Delgado, Jorge </contributor><type>Other:Dataset</type><identifier>10.17632/mxpgf54czz.2</identifier><rights>Creative Commons Attribution 4.0 International</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><relation>https:/data.mendeley.com/datasets/mxpgf54czz</relation><date>2017-12-12T12:32:34Z</date><recordID>0.17632-mxpgf54czz.2</recordID></dc>
|
format |
Other:Dataset Other |
author |
García-Magariño, Iván |
author2 |
Medrano, Carlos Delgado, Jorge |
title |
Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city |
publisher |
Mendeley |
publishDate |
2017 |
topic |
Software Machine Learning Dimensionality Reduction Software Agent Big Data Agent-Based Modeling Multi-Agent Systems Housing Market |
url |
https:/data.mendeley.com/datasets/mxpgf54czz |
contents |
This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”.
This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal.
The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods.
The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.
|
id |
IOS7969.0.17632-mxpgf54czz.2 |
institution |
Universitas Islam Indragiri |
affiliation |
onesearch.perpusnas.go.id |
institution_id |
804 |
institution_type |
library:university library |
library |
Teknologi Pangan UNISI |
library_id |
2816 |
collection |
Artikel mulono |
repository_id |
7969 |
city |
INDRAGIRI HILIR |
province |
RIAU |
shared_to_ipusnas_str |
1 |
repoId |
IOS7969 |
first_indexed |
2020-04-08T08:32:15Z |
last_indexed |
2020-04-08T08:32:15Z |
recordtype |
dc |
_version_ |
1686587767157424128 |
score |
17.538404 |