Department of Information Systems & Computer Science Faculty Publications

Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Jelly P. Aureus, Ateneo de Manila University
Ma. Regina Justina E. Estuar, Ateneo de Manila UniversityFollow
Dorothy C. Mapua, Ateneo de Manila University
Roland P. Abao, Ateneo de Manila University
Anna Angeline M. Cataluña, Ateneo de Manila University

Document Type

Conference Proceeding

Publication Date

12-2021

Abstract

Distorted thoughts may signify underlying mental illness, and when detected early, may serve as preventive measure to a more serious condition. A significant shift to more pronounced negative sentiments has been observed in the Social Media Platform, Reddit, during the onset of the COVID-19 Pandemic. Individuals who engage in these platforms post and comment to express thoughts and feelings. This study aims to determine features that can help detect the presence of distorted thoughts, known as cognitive distortions, in a COVID-19 pandemic-related texts. Texts were extracted from a COVID-19 Support Group in Reddit and verified through annotation for presence or absence of cognitive distortions. Linguistic features were extracted using R and LIWC to determine the best set of features that can distinguish distorted from non-distorted texts. Results showed that cognitive distortions have distinguishable features in COVID-19 Pandemic-related texts. Specifically, results of Independent Samples T-test showed that distorted texts had significantly higher scores on: word count, sentiment score, authenticity, and usage of the following words: function words, pronouns in general, first-person singular pronoun, impersonal pronouns, verbs, interrogatives, positive emotions, cognitive processes on insights, discrepancy, and certainty, present-tense verbs, future-tense verbs and swear words. Further tests using Naive Bayes and Linear SVM machine learning model showed that some of these significant features can indeed help detect whether a sentence is distorted or not. Results from this study can be used to develop detection models on cognitive distortions.

Recommended Citation

Aureus, J. P., Estuar, Ma. R. J. E., Mapua, D. C., Abao, R. P., & Cataluña, A. A. M. (2021). Determining linguistic markers in cognitive distortions from COVID-19 pandemic-related Reddit texts. 2021 1st International Conference in Information and Computing Research (ICORE), 56–61. https://doi.org/10.1109/iCORE54267.2021.00029

Link to Full Text

COinS

Department of Information Systems & Computer Science Faculty Publications

Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Document Type

Publication Date

Abstract

Recommended Citation

Browse

Author Corner

About Archium

Department of Information Systems & Computer Science Faculty Publications

Determining Linguistic Markers in Cognitive Distortions from COVID-19 Pandemic-Related Reddit Texts

Authors

Document Type

Publication Date

Abstract

Recommended Citation

Share

Browse

Author Corner

About Archium