Detail publikačního výsledku

MultiFlipFormer: A Multimodal Transformer for Emotion Flip Reasoning and Instigator Detection in Therapeutic Conversations

SHARMA, A.; FATIMA, N.; SIKORA, P.; MYŠKA, V.; FROLKA, J.; DUTTA, M.

Originální název

MultiFlipFormer: A Multimodal Transformer for Emotion Flip Reasoning and Instigator Detection in Therapeutic Conversations

Anglický název

MultiFlipFormer: A Multimodal Transformer for Emotion Flip Reasoning and Instigator Detection in Therapeutic Conversations

Druh

Stať ve sborníku v databázi WoS či Scopus

Originální abstrakt

Understanding the dynamics of emotional change within therapeutic conversations is a key challenge in computational mental health. We present MultiFlipFormer, a multimodal transformer-based architecture designed to model emotion flip reasoning and instigator detection. Unlike prior models that focus on static emotional state classification, MultiFlipFormer captures temporal emotional transitions across dialogue turns and identifies multimodal cues responsible for these shifts. The architecture comprises four core components: (1) emotion transition analysis to model evolving emotional trajectories (e.g., sadness → anger → neutral), (2) multimodal instigator detection that integrates visual cues, textual content, and conversational strategies to uncover emotion flip triggers, (3) cross-modal attention networks to fuse text and visual information, and (4) trajectory prediction to anticipate future emotional states. Evaluated on a large-scale MESC dataset of 23,126 therapeutic samples spanning 7 emotions, 41 flip types, 15 scenarios, and 10 intervention strategies, MultiFlipFormer achieves a final weighted F1-score of 0.828 across tasks. This framework provides a clear and reliable approach that can genuinely support therapy and help in planning better interventions.

Anglický abstrakt

Understanding the dynamics of emotional change within therapeutic conversations is a key challenge in computational mental health. We present MultiFlipFormer, a multimodal transformer-based architecture designed to model emotion flip reasoning and instigator detection. Unlike prior models that focus on static emotional state classification, MultiFlipFormer captures temporal emotional transitions across dialogue turns and identifies multimodal cues responsible for these shifts. The architecture comprises four core components: (1) emotion transition analysis to model evolving emotional trajectories (e.g., sadness → anger → neutral), (2) multimodal instigator detection that integrates visual cues, textual content, and conversational strategies to uncover emotion flip triggers, (3) cross-modal attention networks to fuse text and visual information, and (4) trajectory prediction to anticipate future emotional states. Evaluated on a large-scale MESC dataset of 23,126 therapeutic samples spanning 7 emotions, 41 flip types, 15 scenarios, and 10 intervention strategies, MultiFlipFormer achieves a final weighted F1-score of 0.828 across tasks. This framework provides a clear and reliable approach that can genuinely support therapy and help in planning better interventions.

Klíčová slova

Emotion Flip Reasoning, Therapeutic Dialogue Modelling, Emotional Trajectory Prediction, Transformer

Klíčová slova v angličtině

Emotion Flip Reasoning, Therapeutic Dialogue Modelling, Emotional Trajectory Prediction, Transformer

Autoři

SHARMA, A.; FATIMA, N.; SIKORA, P.; MYŠKA, V.; FROLKA, J.; DUTTA, M.

Rok RIV

2026

Vydáno

05.11.2025

Nakladatel

IEEE

Místo

Florence, Italy

ISBN

979-8-3315-7675-2

Kniha

2025 17th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT)

Periodikum

International Congress on Ultra Modern Telecommunications and Workshops

Stát

Spojené státy americké

Strany od

187

Strany do

192

Strany počet

6

URL

BibTex

@inproceedings{BUT200029,
  author="Akshara {Sharma} and  {} and Vojtěch {Myška} and Pavel {Sikora} and Jakub {Frolka} and Malay Kishore {Dutta}",
  title="MultiFlipFormer: A Multimodal Transformer for Emotion Flip Reasoning and Instigator Detection in Therapeutic Conversations",
  booktitle="2025 17th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT)",
  year="2025",
  journal="International Congress on Ultra Modern Telecommunications and Workshops",
  pages="187--192",
  publisher="IEEE",
  address="Florence, Italy",
  doi="10.1109/ICUMT67815.2025.11268690",
  isbn="979-8-3315-7675-2",
  url="https://ieeexplore.ieee.org/document/11268690/keywords#keywords"
}