A Tourism Knowledge Model through Topic Modeling from Online Reviews

Hananto, Valentinus Roby ORCID: https://orcid.org/0000-0003-1988-3168, Serdült, Uwe and Kryssanov, Victor (2021) A Tourism Knowledge Model through Topic Modeling from Online Reviews. In: International Conference on Computing and Data Engineering (ICCDE). International Conference Proceeding Series (ICPS), 7 . Association for Computing Machinery, Phuket, Thailand, pp. 87-93. ISBN 978-1-4503-8845-0

[img] Text
18-2021-ROBY-ICCDE-ACM.pdf - Published Version
Restricted to Registered users only

Download (3MB)

Abstract

Ontologies and knowledge models have gained more recognition because of their extensive use in recommender systems. The lack of automatic approaches in ontology engineering, however, becomes a challenge to fulfill increasing needs for such knowledge models in the field of tourism. In this study, a system for building tourism knowledge models from online reviews is proposed. The main contribution of the study is the application of topic modeling to build a knowledge model that, in turn, allows for an automated labeling process to train classifiers. Given a collection of unlabeled tourism online reviews, Latent Dirichlet Allocation (LDA) is applied to automatically label each document. Each topic discovered by LDA is labeled with one specific category, representing its semantic meaning based on an existing general ontology as a reference. These automatically labeled documents are used for classification, and the result is compared with manual annotation. Experiments on Indonesian tourism datasets showed that the automatic labeling approach using LDA provides for a precision score of 70%. In classification tasks, this approach can achieve comparable or even better classification performance than the manual labeling. The results obtained suggest that the developed system is capable of building a tourism knowledge model and providing acceptable-quality training data for the development of tourism recommender systems.


Export Record



Statistic

IRStats Detail StatisticView more statistics

Item Type: Book Section
Additional Information: https://doi.org/10.1145/3456172.3456211
Uncontrolled Keywords: tourism knowledge model, topic modeling, recommender systems
Dewey Decimal Classification: 000 - Computer science, information & general works > 000 Computer science, knowledge & systems > 005 Computer programming, programs & data
Divisions: Perpustakaan > Prosiding/Call for Papers
Depositing User: Agung P. W.
Date Deposited: 26 Jul 2022 15:57
Last Modified: 26 Jul 2022 15:57
URI: http://repository.dinamika.ac.id/id/eprint/6526

Actions (login required)

View Item View Item