The SUMS Corpus

 
  Photo: Laura Tiedtke.

The "SUMS" (Speech Under Multiple Stressors) corpus allows investigating the effects of multiple stressors to speech. This corpus addresses the questions on which basis and in which way all the different stress types shine through and combine in the acoustic speech signal. We used several stressors that form a taxonomy of stress factors according to Murray et al. (1996). Pink noise served as an external stressor. A further stress factor, cognitive load, was created by asking quiz questions. Physiological stressors were induced by training on an ergometer and the application of a respirator mask (full face mask). The speech signals produced by German native speakers while answering the quiz and further reference questions. The results of our acoustic analysis allow drawing conclusions on if and how stress factors can be distinguished from each other, interfere with each other, and/or add up in the speech signal. Furthermore, we touch upon the issue whether measurable stress can increase ad infinitum or whether there is an upper limit for the manifestation of stress in speech.

The corpus will be annotated on altogether 9 levels:

  • Level 1: Orthographic annotation on the sentence level
  • Level 2: ProsodyPro-label on the sentence level
  • Level 3: Classification in intonation phrases
  • Level 4: ProsodyPro-label of the target IP
  • Level 5: Target word
  • Level 6: ProsodyPro-label target word
  • Level 7: Target phon
  • Level 8: ProsodyPro-label target phon
  • Level 9: Discontinuities

 

35,10,0,50,1
25,600,60,2,3000,5000,25,800
90,150,1,50,12,30,50,1,70,12,1,50,1,1,1,5000
0,2,1,0,2,46,15,5,2,1,0,20,0,1
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus

Some impressions (by Laura Tiedtke)

 

To find out whether individual parameters behave differently, they were first fed alone. Subsequently, several were induced together until all of the parameters mentioned interacted. In order to avoid a serialization effect, if it were present, the recordings were made in two different versions.

Group 1   Explanation   Group 2
Condition 1   without sound, without physical exertion, without mask   Condition 8
Condition 2   with sound, without physical exertion, without mask   Condition 7
Condition 3   without sound, with physical exertion, without mask   Condition 6
Condition 4   with noise, with physical exertion, without mask   Condition 5
Condition 5   with noise, with physical exertion, with mask   Condition 4
Condition 6   without sound, with physical exertion, with mask   Condition 3
Condition 7   with sound, without physical exertion, with mask   Condition 2
Condition 8   without sound, without physical exertion, with mask   Condition 1

 

Key Features

The circumference of the corpus is approximately 30 minutes pure semi-spontaneous speech recordings of the subjects during the quiz paradigm. The individual durations vary depending on how long the question was and to what extent the test person has responded. In all, the corpus had 6 speakers of the standard German, of which one person is a woman. Five of the six speakers are North Germans, while one comes from the south of Germany. However, this speaker has been living in North Germany for many years. The age of the subjects was between 21 and 49 years. The average age was 35.14 years. Whereby the oldes participant had tob e taken out of the corpus due to technical problems. Thus the age oft he subjects was 21 to 42 years and the average age was 32.83 years.

 

State of the Corpus

You can find the current state of the corpus here.

 

Download

The corpus can be used for non-commercial research purposes. Details can be found here.

 

Examples

 
Example 1.   Example 2.

 

Creators of the Data Base

The data base was created as a joint work between Kiel University (CAU) and the University of Southern Denmark (SDU, Mads Clausen Institute). Involved researchers are:

  • Carina Marquard (CAU)
  • Oliver Niebuhr (SDU)
  • Gerhard Schmidt (CAU)

 

Corresponding Publications

C. Marquard, C. Baasch, M. Brodersen, O. Niebuhr, and G. Schmidt: Speech, Think, Act: A Phonetic Analysis of the Combinatorial Effects of Respiratory Mask, Physical and Cognitive Stress on Phonation and Articulation, Proc. DAGA, Kiel, Germany, 2017

Website News

13.08.2017: New Gas e.V. sections (e.g. pictures or prices) added.

05.08.2017: The first "slide carousel" added.

03.08.2017: Started with the RED project. Will be ready in a few years ...

30.07.2017: List of PhD theses updated and extended.

Recent Publications

P. Durdaut, J. Reermann, S. Zabel, Ch. Kirchhof, E. Quandt, F. Faupel, G. Schmidt, R. Knöchel, and M. Höft: Modeling and Analysis of Noise Sources for Thin-Film Magnetoelectric Sensors Based on the Delta-E Effect, IEEE Transactions on Instrumentation and Measurement, published online, 2017

P. Durdaut, S. Salzer, J. Reermann, V. Röbisch, J. McCord, D. Meyners, E. Quandt, G. Schmidt, R. Knöchel, and M. Höft: Improved Magnetic Frequency Conversion Approach for Magnetoelectric Sensors, IEEE Sensors Letters, published online, 2017

 

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Jens Reermann Defended his Dissertation with Distinction

On Friday, 21st of June, Jens Reermann defended his research on signals processing for magnetoelectric sensor systems very successfully. After 90 minutes of talk and question time he finished his PhD with distinction. Congratulations, Jens, from the entire DSS team.

Jens worked for about three and a half years - as part of the collaborative research center (SFB) 1261 - on all kinds of signal ...


Read more ...