[1] Jörn Anemüller, Maximization of Component Disjointness: A Criterion for Blind Source Separation, in Independent Component Analysis and Signal Separation, 7th International Conference, ICA 2007, M.E. Davies, C.C. James, S.A. Abdallah, and M.D. Plumbley, Eds., vol. 4666 of Lecture Notes in Computer Science, pp. 325-332. Springer, Berlin, 2007. [ bib ]
[2] Jens-E. Appell, Volker Hohmann, A. Schulz, and Andreas Hein, Hearing at home, in Fortschritte der Akustik - DAGA 2007, pp. 629-630. DEGA e.V., Berlin, 2007. [ bib ]
[3] Rainer Beutelmann and Thomas Brand, Der Einfluss von Raumakustik auf die binaurale Sprachverständlichkeit in moduliertem Störgeräusch - Messungen und Modellvorhersagen, in Fortschritte der Akustik - DAGA 2007, pp. 97-98. DEGA e.V., Berlin, 2007. [ bib ]
[4] Thomas Brand, Loudness scaling, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[5] Thomas Brand and Tim Jürgens, Sprachaudiometrie in der Forschung, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[6] Mathias Dietz, Stephan D. Ewert, and Volker Hohmann, Phase difference representation of interaural timing disparities, in Fortschritte der Akustik - DAGA 2007, pp. 107-108. DEGA e.V., Berlin, 2007. [ bib ]
[7] Bastian Epp and Jesko L. Verhey, Messungen und Simulationen zur Kombination monaurlaer und binaurlaer Effekte, in Fortschritte der Akustik - DAGA 2007, pp. 373-374. DEGA e.V., 2007, 2007. [ bib ]
[8] Stephan M.A. Ernst and Stefan Uppenkamp, fMRI evidence for spatial dissociation of changes of overall level and signal-to-noie ration in auditroy cortex for tones in noise maskers, in Topics in Advanced Imaging, Manfred Herrmann and Christiane M. Thiel, Eds., Hanse-Studien, pp. 113-116. BIS Verlag, Oldenburg, 2007. [ bib ]
[9] Stephan M.A. Ernst and Jesko L. Verhey, Frequenzübergreifende nichtlineare Prozesse in Nach- und SImultanverdeckungsexperimenten, in Fortschritte der Akustik - DAGA 2007, pp. 849-850. DEGA e.V., Berlin, 2007. [ bib ]
[10] Stephan D. Ewert, Ole Hau, and Torsten Dau, Forward masking: temporal integration or adaptation?, in Hearing: from sensory processing to perception - 14th International Symposium on Hearing, B Kollmeier, GM Klump, V Hohmann, U Langemann, M Mauermann, S Uppenkamp, and JL Verhey, Eds., pp. 165-174. Springer, Berlin, 2007. [ bib ]
[11] Volker Hohmann, Compensation of hearing deficiencies in the inner ear, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[12] Volker Hohmann and Birger Kollmeier, A nonlinear auditory filterbank controlled by sub-band instantaneous frequency estimates, in Hearing: from sensory processing to perception - 14th International Symposium on Hearing, B Kollmeier, GM Klump, V Hohmann, U Langemann, M Mauermann, S Uppenkamp, and JL Verhey, Eds., pp. 11-18. Springer, Berlin, 2007. [ bib ]
[13] Tim Jürgens, Thomas Brand, and Birger Kollmeier, Modellierung der Sprachverständlichkeit mit einem auditorischen Perzeptionsmodell, in Fortschritte der Akustik - DAGA 2007, pp. 717-718. DEGA e.V., Berlin, 2007. [ bib ]
[14] Tim Jürgens, Thomas Brand, and Birger Kollmeier, Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model, in Proceedings of the 8th Annual Conference of the International Speech Communication Association - Interspeech 2007, pp. 410-413. Antwerpen, 2007. [ bib ]
[15] Tim Jürgens, Thomas Brand, and Birger Kollmeier, Modelling the Human-machine Gap in Speech Reception: Microscopic Speech Intelligibility Prediction for Normal-hearing Subjects with an Auditory Model, in Interspeech Conference 2007, Antwerp, BELGIUM, 2007, pp. 1605-1608. [ bib ]
In this study speech intelligibility in noise for normal-hearing subjects is predicted by a model that consists of an auditory preprocessing and a speech recognizer. Using a highly systematic speech corpus of phoneme combinations (logatomes) allows the analysis of response rates and confusions of single phonemes. The predicted data is validated by listening tests using the same nonsense speech material. If testing utterances that are not identical to those in training material are used, the psychometric function in noise is predicted with an offset of 13 dB to higher signal-to-noise-ratios (SNR). This is consistent with the man-machine performance gap between human speech recognition (HSR) and automatic speech recognition (ASR) [1]. However, this offset reduces to 4 dB in a second model design with identical recordings for training and testing. Furthermore predicted confusion matrices are compared to those of normal-hearing subjects with the second model design.

[16] Birger Kollmeier, Speech recognition, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[17] Helge Lüddemann, Helmut Riedel, and B Kollmeier, Logarithmic scaling of interaural cross correlation: a model based on evidence from psychophysics and EEG, in Hearing: from sensory processing to perception - 14th International Symposium on Hearing, B Kollmeier, GM Klump, V Hohmann, U Langemann, M Mauermann, S Uppenkamp, and JL Verhey, Eds., pp. 379-388. Springer, Berlin, 2007. [ bib ]
[18] Bernd T. Meyer, Thomas Brand, and Birger Kollmeier, Phonemverwechslungen bei menschlicher und automatischer Spracherkennung, in Fortschritte der Akustik - DAGA 2007, pp. 79-80. DEGA e.V., Berlin, 2007. [ bib ]
[19] Bernd T. Meyer, M. Wachter, Thomas Brand, and Birger Kollmeier, Phoneme Confusions in Human and Automatic Speech Recognition, in Interspeech Conference 2007, Antwerp, BELGIUM, 2007, pp. 2740-2743. [ bib ]
A comparison between automatic speech recognition (ASR) and human speech recognition (HSR) is performed as prerequisite for identifying sources of errors and improving feature extraction in ASR. HSR and ASR experiments are carried out with the same logatome database which consists of nonsense syllables. Two different kinds of signals are presented to human listeners: First, noisy speech samples are converted to Mel-frequency cepstral coefficients which are resynthesized to speech, with information about voicing and fundamental frequency being discarded. Second, the original signals with added noise are presented, which is used to evaluate the loss of information caused by the process of resynthesis. The analysis also covers the degradation of ASR caused by dialect or accent and shows that different error patterns emerge for ASR and HSR. The information loss induced by the calculation of ASR features has the same effect as a deteriation of the SNR by 10 dB.

[20] Ralf M. Meyer, Thomas Brand, and Birger Kollmeier, Predicting speech intelligibility in fluctuating noise, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[21] Marc Nitschmann and Jesko L. Verhey, Experimente und Modellrechnungen zur binauralen Selektivität, in Fortschritte der Akustik - DAGA 2007, pp. 371-372. DEGA e.V., Berlin, 2007. [ bib ]
[22] Jan Rennies, Stephan M.A. Ernst, and Jesko L. Verhey, Einfluss von Einhüllendenstatistiken auf Signaldetektion, in Fortschritte der Akustik - DAGA 2007, pp. 851-852. DEGA e.V., Berlin, 2007. [ bib ]
[23] Thomas Rohdenburg, Volker Hohmann, and Birger Kollmeier, Robustness analysis für multi-channel hearing aid algorithms with binaural output by means of objective perceptual quality measures, in Fortschritte der Akustik - DAGA 2007, pp. 365-366. DEGA e.V., Berlin, 2007. [ bib ]
[24] Denny Schmidt and Jörn Anemüller, Acoustic Feature Selection for Speech Detection Based on Amplitude Modulation Spectrograms (AMS), in Fortschritte der Akustik - DAGA 2007, pp. 347-348. DEGA e.V., Berlin, 2007. [ bib ]
[25] Helga Sukowski, Thomas Brand, Kirsten Wagener, and Birger Kollmeier, The relationship between tone- and speech-audiometry based assessments of hearing loss, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]
[26] Helga Sukowski, Thomas Brand, Kirsten Wagener, and Birger Kollmeier, Sprachverständlichkeitstests in Ruhe: Gibt es alternative Verfahren zum Freiburger Sprachtest in der Begutachtung bei (Lärm-) Schwerhörigkeit?, in Fortschritte der Akustik - DAGA 2007, pp. 715-716. DEGA e.V., Berlin, 2007. [ bib ]
[27] Stefan Uppenkamp, Functional imaging of pitch processing, in Topics in Advanced Imaging, Manfred Herrmann and Christiane M. Thiel, Eds., Hanse-Studien, pp. 109-112. BIS Verlag, Oldenburg, 2007. [ bib ]
[28] Stefan Uppenkamp and Stephan M.A. Ernst, Räumliche Trennung der Repräsentation von Pegel und Signal-Rauschverhältnis im auditorischen Kortex, in Fortschritte der Akustik - DAGA 2007, pp. 569-570. DEGA e.V., Berlin, 2007. [ bib ]
[29] Jesko L. Verhey and Stephan M.A. Ernst, Role of peripheral nonlinearities in comodulation masking release, in Hearing: from sensory processing to perception - 14th International Symposium on Hearing, B Kollmeier, GM Klump, V Hohmann, U Langemann, M Mauermann, S Uppenkamp, and JL Verhey, Eds., pp. 117-124. Springer, Berlin, 2007. [ bib ]
[30] Jesko L. Verhey and Michael Uhlemann, Spektrale Lautheitssummation von pulsierenden Geräuschen, in Fortschritte der Akustik - DAGA 2007, pp. 847-848. DEGA e.V., Berlin, 2007. [ bib ]
[31] Kirsten Wagener, Thomas Brand, and Birger Kollmeier, International cross-validation of sentence intelligibility tests, in 8th EFAS Congress / 10th Congress of the German Society of Audiology, pp. CD-Rom. Deutsche Gesellschaft für Adiologie e.V., Heidelberg, 2007. [ bib ]