ACM SIGMM Interest List


Wed, 16 Mar 2011 00:49:33 +0000
Bjoern Schuller <[log in to unmask]>
ACM SIGMM Interest List <[log in to unmask]>
Dear List,

The INTERSPEECH 2011 Speaker State Challenge is now accepting result uploads.
If you are interested in participating, you can register for the data download by downloading, signing, and sending the corpus agreements:

License agreement for the ALC corpus dataset download (Intoxication Sub-Challenge):

License agreement for the SLC corpus dataset download (Sleepiness Sub-Challenge):

You can find baselines and a description of the Challenge in the (preliminary) paper on the Challenge:

Results can be uploaded (after registration) at:


Call for Participation

INTERSPEECH 2011 Speaker State Challenge 
Intoxication and Sleepiness


Björn Schuller (TUM, Germany)
Stefan Steidl (ICSI, USA)
Anton Batliner (FAU Erlangen-Nuremberg, Germany)
Florian Schiel (BASSS/LMU, Germany) 
Jarek Krajewski (University of Wuppertal, Germany)

The Challenge

While the first open comparative challenges in the field of paralinguistics targeted more "conventional" phenomena such as emotion, age, and gender, many highly relevant speaker states and traits have not yet been covered. The INTERSPEECH 2011 Speaker State Challenge therefore broadens the scope by addressing two less researched speaker states, with a focus on the crucial application domain of security and safety: the computational analysis of intoxication and sleepiness in speech. Apart from intelligent and socially competent future agents and robots, the main applications are in the medical domain and in surveillance in high-risk environments such as steering or controlling.

For these Challenge tasks, the ALCOHOL LANGUAGE CORPUS (ALC) and the SLEEPY LANGUAGE CORPUS (SLC), both containing genuine intoxicated and sleepy speech, will be provided by the organisers. The first consists of 39 hours of speech from 154 gender-balanced speakers and will serve to evaluate features and algorithms for the estimation of speaker intoxication. The second features 10 hours of speech recordings from 99 subjects, annotated on 10 levels of sleepiness. The verbal material varies in complexity, ranging from sustained vowel phonation to natural communication. The corpora further feature detailed speaker meta-data, orthographic and phonemic transcripts, segmentation, and multiple annotation tracks. Both come with fixed test, development, and training partitions that are speaker independent, as needed in most real-life settings. Benchmark results will be provided.
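
As an illustration of what speaker independence of the partitions means in practice, the following minimal sketch checks that no speaker occurs in more than one partition. The partition names and speaker IDs are hypothetical and are not taken from the ALC or SLC file lists; the function name is likewise only illustrative.

```python
def shared_speakers(partitions):
    """Return speakers that occur in more than one partition.

    partitions: dict mapping a partition name to an iterable of
    speaker IDs (one ID per recording; repeats are allowed).
    """
    seen = {}      # speaker ID -> first partition it was found in
    overlaps = {}  # speaker ID -> set of partitions it occurs in
    for name, speakers in partitions.items():
        for spk in set(speakers):
            if spk in seen:
                overlaps.setdefault(spk, {seen[spk]}).add(name)
            else:
                seen[spk] = name
    return {spk: sorted(parts) for spk, parts in overlaps.items()}


# Hypothetical speaker IDs, for illustration only.
partitions = {
    "train": ["s01", "s01", "s02"],
    "devel": ["s03"],
    "test": ["s04", "s02"],
}
shared = shared_speakers(partitions)
print(shared)  # speaker "s02" appears in both "train" and "test"
```

An empty result indicates that the partitions are speaker independent in the sense described above.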

In these respects, the INTERSPEECH 2011 Speaker State Challenge shall help bridge the gap between excellent research on paralinguistic information in spoken language and the low compatibility of results.

Two Sub-Challenges are addressed:

- In the Intoxication Sub-Challenge, the alcoholisation of speakers has to be determined as a two-class classification task: alcoholised, for a BAC exceeding 0.5, or non-alcoholised. The Challenge measure will be the unweighted average recall of these two classes, to better compensate for the imbalance between classes.
- In the Sleepiness Sub-Challenge, the sleepiness of speakers has to be determined by a suitable algorithm and acoustic features. Two classes have to be recognised: sleepiness or non-sleepiness. Again, the Challenge measure is the unweighted average recall of the two classes.
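
To illustrate the Challenge measure, the sketch below computes the unweighted average recall as the mean of the per-class recalls, so that each class counts equally regardless of its size. This is a minimal example and not the organisers' scoring script; the function name and the labels "A" (alcoholised) and "NA" (non-alcoholised) are assumptions made for illustration.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of the per-class recalls over all classes present in y_true."""
    total = defaultdict(int)    # instances per true class
    correct = defaultdict(int)  # correctly recognised instances per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced example: 8 non-alcoholised, 2 alcoholised.
y_true = ["NA"] * 8 + ["A"] * 2
y_pred = ["NA"] * 10  # always predicts the majority class

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(uar, accuracy)
```

In this example the majority-class predictor reaches 80% accuracy but only 50% unweighted average recall (recall 1.0 for "NA" and 0.0 for "A"), which is why the measure compensates for class imbalance.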

Both Sub-Challenges allow contributors to use their own features and their own classification algorithm. However, a standard feature set that may be used will be provided per corpus. Participants may report results obtained on the development set, but are limited to five uploads of results on the test set, whose labels are unknown to them. Each participation has to be accompanied by a paper presenting the results, which undergoes peer review. Only contributions with an accepted paper will be eligible for Challenge participation. The organisers reserve the right to re-evaluate the findings, but will not participate in the Challenge themselves. Participants are encouraged to compete in both Sub-Challenges. Overall, contributions using the provided or equivalent databases are sought in (but not limited to) the following areas:

- Participation in the Intoxication Sub-Challenge
- Participation in the Sleepiness Sub-Challenge
- Novel features and algorithms for the analysis of speaker state
- Cross-corpus and cross-task feature genericity analysis
- Exploitation of speaker trait meta-information in speaker state analysis

The results of the Challenge will be presented at Interspeech 2011 in Florence, Italy. Prizes will be awarded to the Sub-Challenge winners. If you are interested and planning to participate in the Speaker State Challenge, or if you want to be kept informed about the Challenge, please send the organisers an e-mail to indicate your interest and visit the homepage:


Please excuse cross-postings,

Thank you very much and all the best,

Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski, and Bjoern Schuller

 Dr. Björn Schuller
 Senior Researcher and Lecturer
 Technische Universität München
 Institute for Human-Machine Communication
 D-80333 München
 [log in to unmask]