Using discursive information to disentangle French language chat - LINA - Equipe Traitement Automatique du Langage Naturel
Conference Papers Year : 2015

Using discursive information to disentangle French language chat

Abstract

In internet chatrooms, multiple conversations may occur simultaneously. The task of identifying which conversation each message belongs to is called disentanglement. In this paper, we first try to adapt the publicly available system of Elsner and Charniak (2010) to a French corpus extracted from the Ubuntu platform. We then experiment with the discursive annotation of utterances. We find that disentanglement performance can vary significantly depending on corpus characteristics. We also find that discursive information, in the form of functional and rhetorical relations between messages, is valuable for this task.
Main file
nlp4cmc2015.pdf (77.95 KB)
Origin : Files produced by the author(s)

Dates and versions

hal-01698147 , version 1 (21-02-2018)

Identifiers

  • HAL Id : hal-01698147 , version 1

Cite

Matthieu Riou, Soufian Salim, Nicolas Hernandez. Using discursive information to disentangle French language chat. 2nd Workshop on Natural Language Processing for Computer-Mediated Communication (NLP4CMC 2015) / Social Media at GSCL Conference 2015, Sep 2015, Essen, Germany. pp.23-27. ⟨hal-01698147⟩