METHODS OF DATA MINING USING MACHINE LEARNING
DOI:
https://doi.org/10.26906/SUNZ.2025.4.056Keywords:
intelligent data analysis, machine learning, classification, clustering, regression, deep learning, neural networks, analytical tools, Data Mining, CRISP-DM, AutoML, Big Data, PythonAbstract
Relevance . In the context of the continuous growth of information volumes across various fields of the economy, science, and technology, the problem of effective processing and analysis of large-scale data has become increasingly urgent. Traditional analytical methods are no longer capable of providing fast and accurate extraction of useful knowledge and patterns from massive information flows. The response to this challenge lies in the methods of data mining, which are based on modern machine learning technologies. These methods enable the automatic discovery of hidden patterns, the generation of accurate predictions, and the support of data-driven decision-making in real time. Given the rapid development of digitalization, artificial intelligence, and the need for prompt decision-making in a competitive environment, the relevance of developing and improving data mining methods is growing steadily. The object of research is process of data mining using machine learning methods, namely the set of algorithms, models, tools, and approaches that ensure the detection of hidden patterns, anomalies, and structures in large volumes of heterogeneous information. Purpose of the article. This study explores contemporary approaches to intelligent data analysis based on machine learning techniques and assesses their effectiveness across a range of application domains. The article aims to provide a structured overview of state-of-the-art algorithms and to evaluate their respective advantages and limitations in processing large-scale and highdimensional datasets. Research results. A systematic analysis of key data mining methods based on machine learning algorithms was carried out. It was found that the most effective approaches for processing large and heterogeneous datasets include classification, clustering, regression analysis, and dimensionality reduction techniques. Deep neural networks demonstrated effectiveness when applied to unstructured data such as text, images, and time series. The study revealed that the appropriate choice of algorithm depends not only on the data type but also on the specific nature of the task. A comparative assessment of tools showed that the Python ecosystem offers the greatest flexibility, while AutoML platforms simplify model deployment for users with limited programming experience. The research also included a review of recent publications that confirm the practical value of machine learning in real-world use cases. Overall, the findings indicate that machine learning is a driving force behind the evolution of data mining methods, enabling accurate, scalable, and adaptive data processing in the context of modern digital transformation. Conclusions. Machine learning has significantly expanded the capabilities of intelligent data analysis by enabling the automatic detection of patterns, forecasting, and decisionmaking based on large volumes of information. The study demonstrates the effectiveness of various algorithms in tasks such as classification, clustering, regression, and deep learning. Python-based tools and cloud platforms have been identified as the most convenient environments for implementing analytical models. A promising direction lies in the development of explainable AI and hybrid approaches that combine algorithmic precision with domain-specific expertise.Downloads
References
1. Flach P. A. Machine Learning: The Art and Science of Algoritms that Makes Sense of Data. Cambridge: Cambridge University Press, 2012. 291 p. https://doi.org/10.1017/CBO9780511973000 .
2. Phua, C. et al. A Comprehensive Survey of Data Mining-Based Fraud Detection Research. Artificial Intelligence Review, 55, 2021. P. 1985-2033.
3. C. Romero, S. Ventura. Educational data mining and learning analytics: An updated survey. Cornell University. Computer Science,2024. 30 p. https://doi.org/10.48550/arXiv.2402.07956 .
4. Q. Wang. Research on the Application of Machine Learning in Financial Anomaly Detection. iBusiness, 16, 2024. 173-183. https://doi.org/10.4236/ib.2024.164012 .
5. О.Д. Земляний, О.Г. Байбуз. Огляд методів аналізу даних та методів машинного навчання при прогнозуванні ішемічної хвороби серця. Актуальні проблеми автоматизації та інформаційних технологій, т.27, 2023. С. 109-129.
6. E. Abdallah, Esraa A. Elsoud, A. Abdallah. A Survey of Data Mining Techniques for Digital Forensic Analysis. Procedia Computer Science, Volume 257, 2025. P. 731-736. https://doi.org/10.1016/j.procs.2025.03.094
7. A.Rotty, T.Dewayana, A.Habyba. Cross-Industry Standard Process for Data Mining (CRISP-DM) Approach in Determining the Most Significant Employee Engagement Drivers to Sales at X Car Dealership. IEOM Society International, 2023. P. 3368-2279. https://doi.org/10.46254/AP03.20220552 .
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Dmytro Diachenko, Mykhailo Prokopchyk, Vladyslav Rovenchak, Andriy Frolov

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.