The SpeDial datasets: datasets for spoken dialogue systems analytics

Lopes José David Aguas, Chorianopoulou Arodami, Palogiannidi Elisavet, Moniz Helena, Abad Alberto, Iosif Ilias, Potamianos Alexandros

Full record

URI:

http://purl.tuc.gr/dl/dias/E6FE8E29-4C35-40AC-9C45-B2C7297F298F

Year

2016

Type of Item

Conference Full Paper

License

Details

Bibliographic Citation

J. Lopes, A. Chorianopoulou, E. Palogiannidi, H. Moniz, A. Abad, K. Louka, E. Iosif and A. Potamianos, "The SpeDial datasets: datasets for spoken dialogue systems analytics," in 10th International Conference on Language Resources and Evaluation, 2016, pp. 104-110.

Appears in Collections

Conference Publications in Community School of Electrical and Computer Engineering

Summary

The SpeDial consortium is sharing two datasets that were used during the SpeDial project. By sharing them with the community we are providing a resource to reduce the duration of cycle of development of new Spoken Dialogue Systems (SDSs). The datasets include audios and several manual annotations, i.e., miscommunication, anger, satisfaction, repetition, gender and task success. The datasets were created with data from real users and cover two different languages: English and Greek. Detectors for miscommunication, anger and gender were trained for both systems. The detectors were particularly accurate in tasks where humans have high annotator agreement such as miscommunication and gender. As expected due to the subjectivity of the task, the anger detector had a less satisfactory performance. Nevertheless, we proved that the automatic detection of situations that can lead to problems in SDSs is possible and can be a promising direction to reduce the duration of SDS's development cycle.

Search

Browse

My Space

The SpeDial datasets: datasets for spoken dialogue systems analytics

Lopes José David Aguas, Chorianopoulou Arodami, Palogiannidi Elisavet, Moniz Helena, Abad Alberto, Iosif Ilias, Potamianos Alexandros

Summary

Services

Export

Share

Statistics

Metadata & Content in a METS Package:

Metadata in Format: