TRANSFORMING BINARY DEPRESSIVE DATA INTO SEVERITY INSIGHTS THROUGH AUTOMATIC CLUSTERING AND LABELLING

ICTACT Journal on Soft Computing ( Volume: 16 , Issue: 3 )

Abstract

Depression affects individuals globally, necessitating efficient diagnostic tools. This study introduces an advanced unsupervised hybrid approach, that automatically converts binary-labelled depression datasets into multi-class datasets by integrating a rule-based system with Large Language Models (LLMs). The rule-based system employs the Beck Depression Inventory-II tool that can be used to classify depression levels based on predefined scoring rules, and these rules are segregated into clusters based on score ranges from 0-3. LLMs employ fine-tuned large language Model Meta AI2 (LLaMa2) to generate domain-specific embedding from social media posts. By harnessing LLMs’ contextual understanding, both BDI rules and social media posts are embedded, thereafter cosine similarity is applied to calculate semantic similarities. Based on the similarity score, each post is assigned to the most similar BDI cluster, with the highest similarity score, creating a refined multiclass depression cluster. To evaluate clustering effectiveness, the silhouette score was computed, yielding an average score of 0.45, indicating moderate clustering quality. Additionally, 30% of the binary depression dataset was manually labelled by clinical experts. The Normalized Mutual Information (NMI) score of 0.53 further validated the method, showing strong alignment between the generated clusters and expert-labelled data. This approach enhances depression severity classification, providing a scalable, efficient, and accurate tool for researchers and practitioners.

Authors

Divya Dewangan1, Smita Selot2, Sreejit Panicker3
Shri Shankaracharya Technical Campus, India1,2, Techment Technology, India 3

Keywords

Large Language Model, Multiclass Depression Dataset, Beck Depression Inventory Tool, Semantic Similarity

Published By
ICTACT
Published In
ICTACT Journal on Soft Computing
( Volume: 16 , Issue: 3 )
Date of Publication
October 2025
Pages
4010 - 4017
Page Views
31
Full Text Views
2

ICT Academy is an initiative of the Government of India in collaboration with the state Governments and Industries. ICT Academy is a not-for-profit society, the first of its kind pioneer venture under the Public-Private-Partnership (PPP) model

Contact Us

ICT Academy
Module No E6 -03, 6th floor Block - E
IIT Madras Research Park
Kanagam Road, Taramani,
Chennai 600 113,
Tamil Nadu, India

For Journal Subscription: journalsales@ictacademy.in

For further Queries and Assistance, write to us at: ictacademy.journal@ictacademy.in