METHOD OF SEMANTIC DATA ANALYSIS FOR DETERMINING MARKER WORDS IN THE PROCESSING OF VISITORS' EVALUATION RESULTS IN INTERACTIVE ART
DOI:
https://doi.org/10.26906/SUNZ.2024.1.141Keywords:
semantic data analysis, natural language processing, latent Dirichlet distribution, bidirectional coded representations from transformers, interactive art, emotional response analysisAbstract
The subject of the study is in-depth semantic data analysis based on the integration of the methodologies of latent Dirichlet distribution (LDA) and bidirectional encoding representation from transformers (BERT). This research focuses on processing textual data, in particular, visitors' evaluations of interactive art, to identify marker words that highlight key emotional and thematic elements. The goal is to deepen the understanding of visitors' experiences and perceptions of interactive art installations by identifying significant marker words using a combined LDA and BERT approach. This combination aims to capture both general thematic content and the nuanced context of feedback. Objectives: collection and preprocessing of textual data - visitor ratings, consisting of tokenization, normalization and lemmatization steps with the implementation of LDA to extract common themes from the collected data, providing insights into the main themes present in visitor feedback; integration of BERT to analyze contextual nuances and extract deeper meanings from individual words in the feedback; combining the results of LDA and BERT to create a comprehensive understanding of the textual data, focusing on identifying the most significant marker words. The following results were achieved successful extraction of key themes from visitors' ratings using LDA, which allowed us to identify broad thematic categories present in the reviews; a deep learning approach BERT was proposed, which provided nuanced contextual embeddings, emphasizing specific emotions and sentiments expressed by visitors; the results of LDA and BERT were integrated, which provided a rich set of marker words that effectively reflect the essence of the experience and perception of visitors to interactive art; the accuracy and depth of analysis in identifying key emotional and thematic elements was improved, as evidenced by the consistency and relevance of marker words in relation to visitors' ratings. Conclusions: The integration of LDA and BERT for semantic data analysis in interactive art contexts demonstrates a powerful approach for understanding complex visitor feedback. This method provides a two-level analysis, where LDA offers insights into general themes and BERT contributes to detailed contextual understanding. The study successfully identifies specific marker words that effectively capture the essence of visitors' impressions and ratings. This methodology can be useful for artists, curators, and researchers in measuring public reception and improving interactive art experiences. The adaptability of the methodology creates real prospects for its application in other areas that require a detailed semantic analysis of textual feedback.Downloads
References
O. Golembovska, V. Kharchenko, I. Shostak, M. Danova, and O. Feoktystova. Assessing the Perception of Abstract Paintings with Elements of Augmented Reality, 11th IEEE DESSERT, Ukraine, 2020. DOI: 10.1109/DESSERT50317.2020.9125014.
O. Golembovska, V. Kharchenko, I. Shostak, M. Danova, O. Feoktystova, and V. Plietnov, Augmented Reality for the Abstract Paintings: Application Scenarios, Semantic Similarity Analysis and Case Study, 2019 10th IEEE Int. Conf. on IDAACS.: Technology and Applications, Metz, France, 2019, pp. 1007-1011, DOI: 10.1109/IDAACS.2019.8924411.
N. Basov, R. Breiger, I. Hellsten. Socio-semantic and other dualities. Poetics. 2020. p.101433, DOI: 10.1016/j.poetic.2020.101433.
Pollux: Interactive Cluster-First Projections of High-Dimensional Data [Текст] / John E. Wenskovitch, C. North // 2019 IEEE Visualization in Data Science (VDS) - 2019. – pp.38-47, DOI: 10.1109/VDS48975.2019.8973381.
Visual-Textual Sentiment Analysis Enhanced by Hierarchical Cross-Modality Interaction [Текст] / Tao Zhou, Jiuxin Cao, Xueling Zhu, Bo Liu, Shancang Li // IEEE Systems Journal - 2021. – pp.4303-4314, DOI: 10.1109/jsyst.2020.3026879.
Neural Networks for Semantic Gaze Analysis in XR Settings / Lena Stubbemann, Dominik Dürrschnabel, R. Refflinghaus // ACM Symposium on Eye Tracking Research and Applications - 2021, DOI: 10.1145/3448017.3457380.
Multilingual Sequence Labeling Approach to solve Lexical Normalization / Divesh R. Kubal, Apurva Nagvenkar // 2021 The 7th Workshop on Noisy User-generated Text (W-NUT) - 2021. – p.457-464, DOI: 10.18653/v1/2021.wnut-1.51.
Proposed Natural Language Processing Preprocessing Procedures for Enhancing Arabic Text Summarization / Reda Elbarougy, G. M. Behery, Akram el Khatib // 2019. – p.39-57, DOI: 10.1007/978-3-030-34614-0_3.
The influence of preprocessing on text classification using a bag-of-words representation / Yaakov Hacohen-Kerner, Daniel Miller, Yair Yigal // PLoS ONE - 2020. – DOI: 10.1371/journal.pone.0232525.
Italian Text Categorization with Lemmatization and Support Vector Machines / F. Camastra, Gennaro Razi // 2020. – p.47-54, DOI: 10.1007/978-981-13-8950-4_5.
From Words to Numbers: Getting Started with Text Analysis for Applied Social Scientists [Текст] / Hyun Woo Kim, Hyejung Chang // BCRP (Business Communication Research and Practice) - 2020. – p.122-129, DOI: 10.22682/BCRP.2020.3.2.122.
A guided latent Dirichlet allocation approach to investigate real-time latent topics of Twitter data during Hurricane Laura / S. Zhou, P. Kan, Qunying Huang, J. Silbernagel // Journal of Information Science - 2021. DOI: 10.1177/01655515211007724.
Neural Topic Models for Short Text Using Pretrained Word Embeddings and Its Application To Real Data / R. Murakami, B. Chakraborty. 2021 IEEE 4th Int Conf on Knowledge Innovation and Invention (ICKII) - 2021. – p.146-150, DOI:10.1109/ICKII51822.2021.9574752.
Babeshko, I.; Leontiiev, K.; Kharchenko, V.; Kovalenko, A.; Brezhniev, E. Application of Assumption Modes and Effects Analysis to XMECA. In Theory and Engineering of Dependable Computer Systems and Networks; DepCoS-RELCOMEX 2021. Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2021; Volume 1389. DOI: 10.1007/978- 3-030-76773-0_1.