Indonesian News Corpus
Main Author: | RAHUTOMO, FAISAL |
---|---|
Other Authors: | MIQDAD MUADZ MUZAD, AAD |
Format: | Dataset |
Terbitan: |
Mendeley
, 2018
|
Subjects: | |
Online Access: |
https:/data.mendeley.com/datasets/2zpbjs22k3 |
ctrlnum |
0.17632-2zpbjs22k3.1 |
---|---|
fullrecord |
<?xml version="1.0"?>
<dc><creator>RAHUTOMO, FAISAL</creator><title>Indonesian News Corpus</title><publisher>Mendeley</publisher><description>This corpus contains 150,466 news articles, which is derived from several freely accessible Indonesian news website. The corpus is designated for research purpose only. The news websites are:
• kompas.com is a registered trademark of PT. Kompas Cyber Media. https://inside.kompas.com/about-us
• tempo.co is a registered trademark of PT INFO MEDIA DIGITAL. https://www.tempo.co/about
• merdeka.com is a registered trademark of PT KAPAN LAGI DOT COM NETWORKS. https://www.merdeka.com/company/tentang-kami.html
• republika.co.id is a registered trademark of PT Republika Media Mandiri. https://www.republika.co.id/page/about
• viva.co.id is a registered trademark of PT. Viva Media Baru. https://www.viva.co.id/tentang-kami
• tribunnews.com is a registered trademark of PT Tribun Digital Online. http://www.tribunnews.com/about-us
The corpus is a part of bachelor thesis work of Aad Miqdad Muadz Muzad under the supervision of Faisal Rahutomo. We crawled several categories of the websites for 6 months from July 2015 until December 2015. </description><subject>Information Retrieval</subject><subject>Indonesian Language</subject><contributor>MIQDAD MUADZ MUZAD, AAD</contributor><type>Other:Dataset</type><identifier>10.17632/2zpbjs22k3.1</identifier><rights>Creative Commons Attribution 4.0 International</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><relation>https:/data.mendeley.com/datasets/2zpbjs22k3</relation><date>2018-08-30T08:33:11Z</date><recordID>0.17632-2zpbjs22k3.1</recordID></dc>
|
format |
Other:Dataset Other |
author |
RAHUTOMO, FAISAL |
author2 |
MIQDAD MUADZ MUZAD, AAD |
title |
Indonesian News Corpus |
publisher |
Mendeley |
publishDate |
2018 |
topic |
Information Retrieval Indonesian Language |
url |
https:/data.mendeley.com/datasets/2zpbjs22k3 |
contents |
This corpus contains 150,466 news articles, which is derived from several freely accessible Indonesian news website. The corpus is designated for research purpose only. The news websites are:
• kompas.com is a registered trademark of PT. Kompas Cyber Media. https://inside.kompas.com/about-us
• tempo.co is a registered trademark of PT INFO MEDIA DIGITAL. https://www.tempo.co/about
• merdeka.com is a registered trademark of PT KAPAN LAGI DOT COM NETWORKS. https://www.merdeka.com/company/tentang-kami.html
• republika.co.id is a registered trademark of PT Republika Media Mandiri. https://www.republika.co.id/page/about
• viva.co.id is a registered trademark of PT. Viva Media Baru. https://www.viva.co.id/tentang-kami
• tribunnews.com is a registered trademark of PT Tribun Digital Online. http://www.tribunnews.com/about-us
The corpus is a part of bachelor thesis work of Aad Miqdad Muadz Muzad under the supervision of Faisal Rahutomo. We crawled several categories of the websites for 6 months from July 2015 until December 2015. |
id |
IOS7969.0.17632-2zpbjs22k3.1 |
institution |
Universitas Islam Indragiri |
affiliation |
onesearch.perpusnas.go.id |
institution_id |
804 |
institution_type |
library:university library |
library |
Teknologi Pangan UNISI |
library_id |
2816 |
collection |
Artikel mulono |
repository_id |
7969 |
city |
INDRAGIRI HILIR |
province |
RIAU |
shared_to_ipusnas_str |
1 |
repoId |
IOS7969 |
first_indexed |
2020-04-08T08:28:38Z |
last_indexed |
2020-04-08T08:28:38Z |
recordtype |
dc |
_version_ |
1686587746698657792 |
score |
17.538404 |