F-IDF ALGORITHM FOR WEIGHTING IN DETERMINING THE SIMILARITY OF TEXT IN DOCUMENTS

bustami, bustami (2018) F-IDF ALGORITHM FOR WEIGHTING IN DETERMINING THE SIMILARITY OF TEXT IN DOCUMENTS. 1st International Conference on Multidisciplinary Engineering (ICoMdEn) Advancing Engineering for Human Prosperity and Environment Sustainability October 23-24, 2018, Lhokseumwe - Aceh, Indonesia.. pp. 177-182. ISSN 2656-7520

Preview

Text
ICOMDEN1.pdf
Download (454kB) | Preview

Abstract

The grouping of research documents is needed to facilitate information retrieval. Sometimes we have to read one by one the contents of a document to be able to group it or know the existing information. This research attempts to help in finding information that exists in documents quickly. The information searching in documents by calculating the Term Frequency (TF) and Inverse Document Frequency (IDF) values on each token (word) in each document. The TF-IDF algorithm is an algorithm to calculate the weight of each word that is most commonly used in information retrieval. This algorithm is also known to be efficient, easy and accurate to get results. The accuracy of this algorithm in finding the information in a document reaches above 83,3%. KEY WORDS: Text mining (Information retrieval), Term Frequency-Inverse Document Frequency (TF-IDF)

Item Type:	Article
Subjects:	T Technology & Engineering > TI Informatics, Information System
Divisions:	Faculty of Engineering > Department of Informatics
Depositing User:	Mr. Bustami S.Si, M.Si
Date Deposited:	23 Dec 2019 13:20
Last Modified:	23 Dec 2019 13:20
URI:	http://repository.unimal.ac.id/id/eprint/5053

Actions (login required)

View Item