RSPC Benchmark: Modeling Relationship Stress in Digital Intimate Relationships with Psychiatrist-Annotated Data

Mental health modeling in NLP often treats individuals in isolation, neglecting interpersonal context. This paper introduces the Relationship Stress and Psychiatric Corpus (RSPC), comprising 1,799 Reddit posts about long-distance relationships annotated by psychiatrists with diagnostic categories (e.g., anxiety, depression), relationship stressor triggers, and relational stage. The authors benchmark seven fine-tuned Transformer models and five large language models across three tasks: multi-label disorder classification, relationship trigger detection, and temporal stage prediction. Results reveal substantial model variability, with Claude-3-Haiku achieving the best performance in disorder classification (Macro-F1=0.538) and GPT-4o leading in trigger detection (Macro-F1=0.519). The study also uncovers a strong association between anxiety disorders and chronic relationship uncertainty. RSPC establishes a new benchmark for relationship-aware NLP, shifting mental health modeling from an individual-centric to a socially and temporally contextualized paradigm.

Background and Context

The application of natural language processing (NLP) to mental health modeling has historically suffered from a critical methodological limitation: the isolation of psychological distress from its interpersonal context. Traditional approaches often treat mental health conditions as purely individual phenomena, analyzing text data to identify symptoms such as anxiety or depression without accounting for the complex social dynamics that frequently precipitate or exacerbate these states. This individual-centric perspective, while effective for basic symptom identification, fails to capture the relational triggers and temporal evolutions that are central to understanding psychological well-being in social contexts. To address this gap, researchers have introduced the Relationship Stress and Psychiatric Corpus (RSPC), a novel dataset designed to bridge the divide between clinical psychiatry and computational linguistics by embedding relational context into mental health modeling.

The RSPC dataset is constructed from 1,799 Reddit posts specifically focused on long-distance relationships, a domain characterized by unique stressors such as physical separation, communication latency, and uncertainty. Unlike general mental health corpora, RSPC is distinguished by its high-quality annotations provided by licensed psychiatrists. These annotations cover three distinct dimensions: diagnostic categories (primarily anxiety and depression), relationship stressor triggers (specific events or states causing distress), and relational stages (the temporal phase of the relationship). This multi-dimensional annotation scheme allows for a nuanced analysis that goes beyond binary classification of mental illness, enabling models to understand the interplay between psychological states and relational dynamics. The creation of this corpus represents a significant shift towards ecologically valid mental health NLP, where the social environment is treated as a primary variable rather than background noise.

Deep Analysis

To evaluate the capabilities of current AI architectures in understanding relational context, the study conducted a comprehensive benchmarking exercise involving seven fine-tuned Transformer models and five large language models (LLMs). The evaluation framework was structured around three core tasks: multi-label disorder classification, relationship trigger detection, and temporal stage prediction. This multi-task approach was essential for assessing not only the models' ability to recognize clinical symptoms but also their capacity to interpret subtle social cues and temporal progression within narratives. The results revealed substantial variability in model performance, highlighting that different architectures possess distinct strengths when dealing with complex, context-rich data.

In the multi-label disorder classification task, Claude-3-Haiku emerged as the top-performing model, achieving a Macro-F1 score of 0.538. This result suggests that Claude-3-Haiku has a particular aptitude for identifying complex combinations of psychological symptoms within text, likely due to its training on diverse clinical and conversational data. Conversely, in the relationship trigger detection task, which requires a deeper understanding of causal social interactions, GPT-4o led the field with a Macro-F1 score of 0.519. This indicates that GPT-4o possesses superior capabilities in parsing fine-grained social dynamics and identifying specific relational events that act as stressors. The divergence in performance between these models underscores the importance of selecting appropriate architectures based on the specific clinical or social task at hand.

Beyond performance metrics, the analysis uncovered clinically significant patterns within the data. A strong statistical association was found between anxiety disorders and chronic relationship uncertainty, a finding that aligns with existing psychiatric literature but is now quantifiable through computational means. Error analysis and ablation studies further revealed that current models still struggle with distinguishing between mild relational stress and clinical-level anxiety, particularly when the text contains implicit or ambiguous social cues. These findings highlight the limitations of existing models in handling the subtleties of human relationships and point to areas for future architectural improvements, such as enhanced context window management and better integration of social reasoning modules.

Industry Impact

The introduction of the RSPC benchmark has profound implications for the NLP community, mental health research, and the development of digital health technologies. For researchers, RSPC provides a standardized and high-quality evaluation platform that encourages the development of models capable of understanding social and temporal contexts. This shifts the focus of NLP research from purely linguistic accuracy to social computation and clinical auxiliary diagnosis, fostering interdisciplinary collaboration between computer scientists and mental health professionals. The availability of this annotated dataset fills a critical void in the field, enabling more rigorous testing of hypotheses regarding the relationship between social dynamics and psychological well-being.

In the industrial sector, the insights derived from RSPC can inform the development of more empathetic and effective mental health support chatbots and digital therapeutics. Current AI-driven mental health tools often fail to address the root causes of distress, which are frequently relational. By leveraging models trained on or benchmarked against RSPC, developers can create systems that not only identify symptoms but also recognize the underlying relational stressors, such as communication breakdowns or uncertainty in long-distance partnerships. This capability can lead to more personalized and effective interventions, where recommendations are tailored to the specific social context of the user, thereby improving engagement and outcomes.

Furthermore, RSPC supports a paradigm shift in mental health modeling from an individual-centric to a socially and temporally contextualized approach. This shift is crucial for developing a more holistic understanding of mental health, where the individual is viewed as part of a dynamic social system. By emphasizing the role of social and temporal dynamics, RSPC encourages the development of models that can predict how mental health states evolve over time in response to relational changes. This has significant potential for early intervention strategies, where changes in relational dynamics can be detected and addressed before they escalate into clinical disorders.

Outlook

Looking forward, the RSPC benchmark serves as a foundation for several promising avenues of research and development. One key direction is the exploration of multimodal data fusion, where text-based relational data is combined with other forms of data, such as voice tone, facial expressions, or physiological signals, to provide a more comprehensive picture of mental health. Another important area is longitudinal relationship dynamic modeling, which involves tracking changes in relational stress and mental health over extended periods. This can help in understanding the long-term impacts of relationship stressors and in developing predictive models that can anticipate mental health crises based on relational trends.

Additionally, the insights from RSPC can be used to develop personalized intervention strategies that are sensitive to the specific relational context of each user. This could involve creating adaptive systems that adjust their responses based on the detected relational stage and stressors, providing more relevant and timely support. As AI technologies continue to advance, the integration of relational context into mental health modeling will become increasingly important, enabling more nuanced and effective digital mental health solutions. The RSPC benchmark, with its rigorous annotations and comprehensive evaluation framework, is poised to play a central role in this evolution, driving innovation and improving the quality of care in the digital age.

Finally, the study highlights the need for continued collaboration between the technical and clinical communities to ensure that AI models are not only technically sophisticated but also clinically valid and ethically sound. As models become more capable of understanding and predicting relational dynamics, it is essential to address issues of privacy, consent, and bias. The RSPC dataset, with its careful annotation by psychiatrists, sets a high standard for ethical data usage and clinical relevance, providing a model for future research in this sensitive and impactful domain. By building on this foundation, the field can move towards a future where AI serves as a powerful tool for enhancing mental health and well-being in the context of human relationships.

Sources