Issue |
ITM Web Conf.
Volume 64, 2024
2nd International Conference on Applied Computing & Smart Cities (ICACS24)
|
|
---|---|---|
Article Number | 01019 | |
Number of page(s) | 17 | |
DOI | https://doi.org/10.1051/itmconf/20246401019 | |
Published online | 05 July 2024 |
Statistical Data Mining Methods in Predicting Happiness and Habits
1 College of Engineering, Department of Computer engineering, Knowledge university, Erbil, Iraq
2 Computer Science Department, Bayan University, Erbil, Kurdistan, Iraq
3 Artificial Intelligence Engineering Department, College of Engineering, Al-Ayen University, ThiQar, Iraq
* Corresponding author: sazan.sulaiman@knu.edu.iq
The objective of this study is to employ statistical data mining methods and con-duct a survey among young individuals to construct a model capable of forecasting overall happiness. This model will consider over a hundred characteristics, including lifestyle choices and musical tastes. We utilized boosting trees, subset se-lection, and GAM (Generalized Additive Models) techniques. In addition, we created actual test data to validate the model. All available approaches have found many lifestyle variables, including as energy levels, loneliness, desire to alter the past, eating properly, and spending time with friends, as significant determinants of happiness. We generated authentic test data to verify the model, utilizing rigorous testing protocols to evaluate its predicted precision and applicability across various demographics. Based on our investigation, the use of the gradient boost technique resulted in improved picture projections. The evaluation of the technique using a confusion matrix revealed an accuracy of 97.1% for training and a perfect accuracy of 100% for validation. The training phase achieved an accuracy of 62.5%, as shown by the confusion matrix, while the overall confusion matrix demonstrated a 92.0% accuracy in predicting happiness. The support vector machine, trained incrementally, demonstrated encouraging prospects for future investigation.
Key words: Data Mining / Happiness / GAM / Boosting-Tree / Subset Selection / Lifestyle
© The Authors, published by EDP Sciences, 2024
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.