The SUMS Corpus

 
  Photo: Laura Tiedtke.

The "SUMS" (Speech Under Multiple Stressors) corpus allows investigating the effects of multiple stressors to speech. This corpus addresses the questions on which basis and in which way all the different stress types shine through and combine in the acoustic speech signal. We used several stressors that form a taxonomy of stress factors according to Murray et al. (1996). Pink noise served as an external stressor. A further stress factor, cognitive load, was created by asking quiz questions. Physiological stressors were induced by training on an ergometer and the application of a respirator mask (full face mask). The speech signals produced by German native speakers while answering the quiz and further reference questions. The results of our acoustic analysis allow drawing conclusions on if and how stress factors can be distinguished from each other, interfere with each other, and/or add up in the speech signal. Furthermore, we touch upon the issue whether measurable stress can increase ad infinitum or whether there is an upper limit for the manifestation of stress in speech.

The corpus will be annotated on altogether 9 levels:

  • Level 1: Orthographic annotation on the sentence level
  • Level 2: ProsodyPro-label on the sentence level
  • Level 3: Classification in intonation phrases
  • Level 4: ProsodyPro-label of the target IP
  • Level 5: Target word
  • Level 6: ProsodyPro-label target word
  • Level 7: Target phon
  • Level 8: ProsodyPro-label target phon
  • Level 9: Discontinuities

 

35,10,0,50,1
25,600,60,2,3000,5000,25,800
90,150,1,50,12,30,50,1,70,12,1,50,1,1,1,5000
0,2,1,0,2,46,15,5,2,1,0,20,0,1
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus
SUMS corpus

Some impressions (by Laura Tiedtke)

 

To find out whether individual parameters behave differently, they were first fed alone. Subsequently, several were induced together until all of the parameters mentioned interacted. In order to avoid a serialization effect, if it were present, the recordings were made in two different versions.

Group 1   Explanation   Group 2
Condition 1   without sound, without physical exertion, without mask   Condition 8
Condition 2   with sound, without physical exertion, without mask   Condition 7
Condition 3   without sound, with physical exertion, without mask   Condition 6
Condition 4   with noise, with physical exertion, without mask   Condition 5
Condition 5   with noise, with physical exertion, with mask   Condition 4
Condition 6   without sound, with physical exertion, with mask   Condition 3
Condition 7   with sound, without physical exertion, with mask   Condition 2
Condition 8   without sound, without physical exertion, with mask   Condition 1

 

Key Features

The circumference of the corpus is approximately 30 minutes pure semi-spontaneous speech recordings of the subjects during the quiz paradigm. The individual durations vary depending on how long the question was and to what extent the test person has responded. In all, the corpus had 6 speakers of the standard German, of which one person is a woman. Five of the six speakers are North Germans, while one comes from the south of Germany. However, this speaker has been living in North Germany for many years. The age of the subjects was between 21 and 49 years. The average age was 35.14 years. Whereby the oldes participant had tob e taken out of the corpus due to technical problems. Thus the age oft he subjects was 21 to 42 years and the average age was 32.83 years.

 

State of the Corpus

You can find the current state of the corpus here.

 

Download

The corpus can be used for non-commercial research purposes. Details can be found here.

 

Examples

 
Example 1.   Example 2.

 

Creators of the Data Base

The data base was created as a joint work between Kiel University (CAU) and the University of Southern Denmark (SDU, Mads Clausen Institute). Involved researchers are:

  • Carina Marquard (CAU)
  • Oliver Niebuhr (SDU)
  • Gerhard Schmidt (CAU)

 

Corresponding Publications

C. Marquard, C. Baasch, M. Brodersen, O. Niebuhr, and G. Schmidt: Speech, Think, Act: A Phonetic Analysis of the Combinatorial Effects of Respiratory Mask, Physical and Cognitive Stress on Phonation and Articulation, Proc. DAGA, Kiel, Germany, 2017

Recent Publications

P. Durdaut, J. Reermann, S. Zabel, Ch. Kirchhof, E. Quandt, F. Faupel, G. Schmidt, R. Knöchel, and M. Höft: Modeling and Analysis of Noise Sources for Thin-Film Magnetoelectric Sensors Based on the Delta-E Effect, IEEE Transactions on Instrumentation and Measurement, published online, 2017

P. Durdaut, S. Salzer, J. Reermann, V. Röbisch, J. McCord, D. Meyners, E. Quandt, G. Schmidt, R. Knöchel, and M. Höft: Improved Magnetic Frequency Conversion Approach for Magnetoelectric Sensors, IEEE Sensors Letters, published online, 2017

 

Website News

18.06.2017: Page about KiRAT news added (also visible in KiRAT).

31.05.2017: Some pictures added.

23.04.2017: Time line for the lecture "Adaptive Filters" added.

13.04.2017: List of PhD theses added.

Contact

Prof. Dr.-Ing. Gerhard Schmidt

E-Mail: gus@tf.uni-kiel.de

Christian-Albrechts-Universität zu Kiel
Faculty of Engineering
Institute for Electrical Engineering and Information Engineering
Digital Signal Processing and System Theory

Kaiserstr. 2
24143 Kiel, Germany

Recent News

Alexej Namenas - A New Guy in the Team

In June Alexej Namenas started in the DSS Team. He will work on real-time tracking algorithms for SONAR applications. Alexej has done both theses (Bachelor and Master) with us. The Bachelor thesis in audio processing (beamforming) and the Master thesis in the medical field (real-time electro- and magnetocardiography). In addition, he has intership erperience in SONAR processing.

We are pretty ...


Read more ...