zur Hauptnavigation springen zum Inhaltsbereich springen

BayWISS-Kolleg Gesundheit www.baywiss.de

Projekte im Verbundkolleg Gesundheit

© eliola, Pixabay

Non obstructive monitoring of speech and voice disorders with a special regard to privacy

Stuttering is a speech disorder which is characterized by blocks, prolongations, and repetitions of words, and syllables. It has a prevalence of about 1% of the population. The condition is treatable but not curable. Good results in therapy can be achieved by teaching a 1technique called “Fluency Shaping”. One institute in Germany that oers this kind of therapy is our project partner Kasseler Stottertherapie (KST).
The research project is based in the area of machine learning and pattern recognition. Recognizing stuttering events is a hard medical problem. To recognize such events, methods from pattern recognition and machine learning are being used, colloquially labeled as artificial intelligence, to enable automatic detection. To achieve this medical expert knowledge as well as deep learning methods are combined to dierentiate between fluent speech, repetitions, blocks and unnatural prolongations as well as detecting if a client is using the speech technique learned during therapy.
As speech data is highly sensible personal data it is of utmost importance to guarantee peoples privacy. To ensure this all algorithms developed must be adapted to run on smartphones
or smart-health devices. This is supposed to guarantee data privacy by not sending speech data into the cloud, as it is usually done for speech analysis. To enable this algorithms must
be adapted to be performance and energy eicient on devices with limited resources.
A main goal of this research is to be able to successfully classify stuttering events on devices with limited resources. My research is supposed to lay the foundation for technical aids that
can be used in stuttering therapy, but can in part be transferable to other speech disorders and pathologies. This shall enable a lasting control of therapy success.



Betreuer Technische Hochschule Nürnberg:

Prof. Dr.-Ing. Korbinian Riedhammer



Betreutes Projekt:
Non obstructive monitoring of speech and voice disorders with a special regard to privacy


Regelmäßiger BLOG:

Bayerl S. P., Tammewar A., Riedhammer K. und G. Riccardi (2021):
Detecting emotion carriers bycombining acoustic and lexical representations.
In: Proc. ASRU 2021, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).

Klumpp P., Bocklet T., Arias-Vergara T., Vasquez-Correa J., Perez-Toro P., Bayerl S. P., Orozco-Arroyave J. und E. Nöth (2021):
 The Phonetic Footprint of Covid-19? In: Proc. Interspeech 2021, S. 441–445.

Perez-Toro P., Bayerl S. P., Arias-Vergara T., Vasquez-Correa J., Klumpp P., Schuster M., Nöth E., Orozco-Arroyave J. und K. Riedhammer (2021):
Influence of the interviewer on the automatic assessment of alzheimer’s disease in the context of the ADReSSo challenge. In: Proc. Interspeech 2021, S. 3785–3789.

Bayerl S. P., Hönig F., Reister J. und K. Riedhammer (2020):
Towards Automated Assessment of Stuttering and Stuttering Therapy, in International Conference on Text,
Speech, and Dialogue, [Online]. Available at: https://arxiv.org/abs/2006.09222.

Bayerl S. P., et al. (2020):
Offline Model Guard: Secure and Private ML on Mobile Devices,
in 23. Design, Automation and Test in Europe Conference (DATE ’20).

Orozco-Arroyave J.R., et al. (2020):
Apkinson: the smartphone application for telemonitoring Parkinson’s patients through speech, gait and hands movement. In: Neurodegenerative Disease Management, 10 (3), S. 137-157

Bayerl S. P., Riedhammer K. (2019):
A Comparison of Hybrid and End-to-End Models
for Syllable Recognition, in International Conference on Text, Speech, and Dialogue, pp. 352–360 [Online]. Available at: https://arxiv.org/abs/1909.1223.

Wenninger M., Bayerl S. P., Schmidt J., and K. Riedhammer (2019):
Timage–A Robust. Time Series Classification Pipeline, in International Conference on Artificial Neural
Networks, pp. 450–461 [Online]. Available at: https://arxiv.org/abs/1909.09149.

Vásquez-Correa J. C. et al. (2019):
Apkinson: A Mobile Solution for Multimodal Assessment
of Patients with Parkinson’s Disease, Proc. Interspeech 2019, pp. 964–965.

Vorträge und Präsentationen

Bayerl S. P., Wenninger M., Schmidt J., von Gudenberg J. W. und K. Riedhammer (2021):
STAN: A stuttering therapy analysis helper, IEEE Spoken Language Technology Workshop (SLT).


Bayerl S. P., et al. (2019):
Privacy-preserving speech processing via STPC and TEEs.

Sebastian Bayerl

Sebastian Bayerl

Technische Hochschule Nürnberg


Treten Sie mit uns in Kontakt. Wir freuen uns auf Ihre Fragen und Anregungen zum Verbundkolleg Gesundheit.

Dr. Sabine Fütterer-Akili

Dr. Sabine Fütterer-Akili

Koordinatorin BayWISS-Verbundkolleg Gesundheit und BayWISS-Verbundkolleg Economics and Business

Universität Regensburg
Zentrum zur Förderung des wissenschaftlichen Nachwuchses
Universitätsstraße 31
93053 Regensburg

Telefon: +49 941 9435548