Online Newspaper Clustering in Aceh using the Agglomerative Hierarchical Clustering Method

Tjut Adek, Rizal and Dinata, Rozzi Kesuma and Ditha, Ananda (2022) Online Newspaper Clustering in Aceh using the Agglomerative Hierarchical Clustering Method. International Journal of Engineering, Science & InformationTechnology (IJESTY), 2 (1). pp. 70-75. ISSN 2775-2674

[img]
Preview
Text
206-417-1-PB (2).pdf

Download (368kB) | Preview
Official URL: https://ijesty.org/index.php/ijesty/index

Abstract

The rapid progress in the field of information technology, especially the internet, has given birth to a lot of information. The ease of publishing an article on a website causes an explosion of news pages which will certainly confuse readers. The diversity and the increasing number of news articles make it increasingly difficult for internet users to find news and large piles of news data on online newspaper sites in Aceh. The grouping of text documents is needed to classify news in online newspapers in Aceh based on the content contained in news articles. In this study, the process of grouping online news in Aceh was tried using the Agglomerative Hierarchical Clustering method. News is grouped with a Bottom-Up design strategy that starts with placing each object as a cluster then combined into a larger cluster based on the similarity of keywords in each news, then the cluster results are compared and put into each news category. The research design was carried out in a structured manner using data flow diagrams in forming the research framework. The study was conducted by taking online news text data on 10 online news websites in Aceh from July 2016 to March 2017 with 1000 randomly generated documents. The process of crawling news data is done using a php script which will only take text files from the news on the website. News grouping is done based on religion, politics, law, sports, tourism, education, culture, economy and technology. The results of the grouping performance of the Agglomerative Hierarchical Clustering method in this study have an average accuracy of 89.84%.

Item Type: Article
Subjects: T Technology & Engineering > TI Informatics, Information System
Divisions: Faculty of Engineering > Department of Informatics
Depositing User: Mr. Rizal Tjut Adek
Date Deposited: 28 Apr 2022 04:02
Last Modified: 28 Apr 2022 04:02
URI: http://repository.unimal.ac.id/id/eprint/7071

Actions (login required)

View Item View Item