Indonesian News Corpus

Main Author: RAHUTOMO, FAISAL
Other Authors: MIQDAD MUADZ MUZAD, AAD
Format: Dataset
Terbitan: Mendeley , 2018
Subjects:
Online Access: https:/data.mendeley.com/datasets/2zpbjs22k3
ctrlnum 0.17632-2zpbjs22k3.1
fullrecord <?xml version="1.0"?> <dc><creator>RAHUTOMO, FAISAL</creator><title>Indonesian News Corpus</title><publisher>Mendeley</publisher><description>This corpus contains 150,466 news articles, which is derived from several freely accessible Indonesian news website. The corpus is designated for research purpose only. The news websites are: &#x2022; kompas.com is a registered trademark of PT. Kompas Cyber Media. https://inside.kompas.com/about-us &#x2022; tempo.co is a registered trademark of PT INFO MEDIA DIGITAL. https://www.tempo.co/about &#x2022; merdeka.com is a registered trademark of PT KAPAN LAGI DOT COM NETWORKS. https://www.merdeka.com/company/tentang-kami.html &#x2022; republika.co.id is a registered trademark of PT Republika Media Mandiri. https://www.republika.co.id/page/about &#x2022; viva.co.id is a registered trademark of PT. Viva Media Baru. https://www.viva.co.id/tentang-kami &#x2022; tribunnews.com is a registered trademark of PT Tribun Digital Online. http://www.tribunnews.com/about-us The corpus is a part of bachelor thesis work of Aad Miqdad Muadz Muzad under the supervision of Faisal Rahutomo. We crawled several categories of the websites for 6 months from July 2015 until December 2015. </description><subject>Information Retrieval</subject><subject>Indonesian Language</subject><contributor>MIQDAD MUADZ MUZAD, AAD</contributor><type>Other:Dataset</type><identifier>10.17632/2zpbjs22k3.1</identifier><rights>Creative Commons Attribution 4.0 International</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><relation>https:/data.mendeley.com/datasets/2zpbjs22k3</relation><date>2018-08-30T08:33:11Z</date><recordID>0.17632-2zpbjs22k3.1</recordID></dc>
format Other:Dataset
Other
author RAHUTOMO, FAISAL
author2 MIQDAD MUADZ MUZAD, AAD
title Indonesian News Corpus
publisher Mendeley
publishDate 2018
topic Information Retrieval
Indonesian Language
url https:/data.mendeley.com/datasets/2zpbjs22k3
contents This corpus contains 150,466 news articles, which is derived from several freely accessible Indonesian news website. The corpus is designated for research purpose only. The news websites are: • kompas.com is a registered trademark of PT. Kompas Cyber Media. https://inside.kompas.com/about-us • tempo.co is a registered trademark of PT INFO MEDIA DIGITAL. https://www.tempo.co/about • merdeka.com is a registered trademark of PT KAPAN LAGI DOT COM NETWORKS. https://www.merdeka.com/company/tentang-kami.html • republika.co.id is a registered trademark of PT Republika Media Mandiri. https://www.republika.co.id/page/about • viva.co.id is a registered trademark of PT. Viva Media Baru. https://www.viva.co.id/tentang-kami • tribunnews.com is a registered trademark of PT Tribun Digital Online. http://www.tribunnews.com/about-us The corpus is a part of bachelor thesis work of Aad Miqdad Muadz Muzad under the supervision of Faisal Rahutomo. We crawled several categories of the websites for 6 months from July 2015 until December 2015.
id IOS7969.0.17632-2zpbjs22k3.1
institution Universitas Islam Indragiri
affiliation onesearch.perpusnas.go.id
institution_id 804
institution_type library:university
library
library Teknologi Pangan UNISI
library_id 2816
collection Artikel mulono
repository_id 7969
city INDRAGIRI HILIR
province RIAU
shared_to_ipusnas_str 1
repoId IOS7969
first_indexed 2020-04-08T08:28:38Z
last_indexed 2020-04-08T08:28:38Z
recordtype dc
_version_ 1686587746698657792
score 17.538404