TCTS Lab Research Groups
 
 

Publications



List of publications related to ASR and signal processing



2016

G. PIRONKOV, S. DUPONT, T. DUTOIT, 2016, "Multi-Task Learning for Speech Recognition: An Overview", in European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), pp. 189-194, Bruges, Belgium, April 27-29.
G. PIRONKOV, S. DUPONT, T. DUTOIT, 2016, "Speaker-Aware Long Short-Term Memory Multi-Task Learning for Speech Recognition", in the 24th European Signal Processing Conference (EUSIPCO), pp. 1911-1915, Budapest, Hungary, August 29 - September 2.
2015
K. EL HADDAD, S. DUPONT, H. ÇAKMAK, T. DUTOIT, 2015, "Towards a Level Assessment System of Amusement in Speech Signals: Amused Speech Components Classification", International Symposium on Signal Processing and Information Technology (ISSPIT 2015), pp. 12-17, Abu Dhabi, UAE, December 7-10.
B. PICART, S. BROGNAUX, S. DUPONT, 2015, "Analysis and Automatic Recognition of Human Beatbox Sounds: a Comparative Study", Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015), pp. 4255-4259, Brisbane, Australia, April 19-24.
K. EL HADDAD, S. DUPONT, H. ÇAKMAK, T. DUTOIT, 2015, "Shaking and Speech-smile Vowels Classification: An Attempt at Amusement Arousal Estimation from Speech Signals", Proceedings of the 3rd IEEE Global Conference on Signal and Information Processing (GlobalSIP 2015), Orlando, FL, US, December 14-16.
L. DEVILLERS, S. ROSSET, G. DUBUISSON DUPLESSIS, M. A. SEHILI, L. BÉCHADE, A. DELABORDE, C. GOSSART, V. LETARD, F. YANG, Y. YEMEZ, B. B. TÜRKER, M. SEZGIN, K. EL HADDAD, S. DUPONT, D. LUZZATI, Y. ESTÈVE, E. GILMARTIN, N. CAMPBELL, 2015, "Multimodal Data Collection of Human-Robot Humorous Interactions in the JOKER Project", Proceedings of the 6th International Conference on Affective Computing and Intelligent Interaction (ACII 2015), Xi’an, China, September 21-24.
G. PIRONKOV, S. DUPONT, T. DUTOIT, 2015, "Investigating Sparse Deep Neural Networks for Speech Recognition", Proceedings of the 14th IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2015), pp. 124-129, Scottsdale, AZ, US, December 13-17.
2014
J. URBAIN, 2014, "Acoustic Laughter Processing", PhD thesis supervised by Prof. T. Dutoit, May 2014.
2013
E. LOWEIMI, S.M. AHADI, T. DRUGMAN, 2013, "A New Phase-based Feature Representation for Robust Speech Recognition", Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), Vancouver, Canada, May 26-31.
J. URBAIN, H. ÇAKMAK, T. DUTOIT, 2013, "Automatic Phonetic Transcription of Laughter and its Application to Laughter Synthesis", Proceedings of the fifth biannual Humaine Association Conference on Affective Computing and Intelligent Interaction (ACII2013), pp. 153-158, Geneva, Switzerland, 2-5 September [Best Student Paper Award].
S. BROGNAUX, B. PICART, T. DRUGMAN, 2013, "A New Prosody Annotation Protocol for Live Sports Commentaries", Proceedings of the 14th Conference of the International Speech Communication Association (Interspeech 2013), pp. 1554-1558, Lyon, France, August 25-29.
M. MANCINI, L. ACH, E. BANTEGNIE, T. BAUR, N. BERTHOUZE, D. DATTA, Y. DING, S. DUPONT, H. GRIFFIN, F. LINGENFELSER, R. NIEWIADOMSKI, C. PELACHAUD, O. PIETQUIN, B. PIOT, J. URBAIN, G. VOLPE, J. WAGNER, 2013, "Laugh When You're Winning", Proceedings of the 9th International Summer Workshop on Multimodal Interfaces - eNTERFACE'13, in Innovative and Creative Developments in Multimodal Interaction Systems - IFIP Advances in Information and Communication Technology (IFIP AICT), Volume 425, pp. 50-79, Lisbon, Portugal, July 15 - August 9, doi:10.1007/978-3-642-55143-7_3.
E. LOWEIMI, S.M. AHADI, T. DRUGMAN, S. LOVEYMI, 2013, "On the Importance of Pre-emphasis and Window Shape in Phase-based Speech Recognition", Lecture Notes in Computer Science, Advances in Non-Linear Speech Processing, vol. 7911, pp. 160-167.
A. CULLEN, J. KANE, T. DRUGMAN, N. HARTE, 2013, "Creaky Voice and the Classification of Affect", Workshop on Affective Social Speech Signals (WASSS13), Grenoble, France, August 22-23.
2011
T. DRUGMAN, 2011, "Advances in Glottal Analysis and its Applications", PhD thesis supervised by Prof. T. Dutoit.
2010
T. DRUGMAN, T. DUTOIT, 2010, "Reconnaissance du Locuteur basée sur des Signatures Glottiques", XXVIIIe Journées d'Etude sur la Parole, pp. 45-48, 25-28 mai, 2010, Mons, Belgium.
T. DRUGMAN, T. DUTOIT, 2010, "On the Potential of Glottal Signatures for Speaker Recognition", Proceedings of Interspeech 2010, pp. 2106-2109, September 26-30, Makuhari, Chiba, Japan.
M. DUVINAGE, J.Y. PARFAIT, 2010, "Reconnaissance vocale basée sur les phonèmes voisés", Actes des 28emes Journées d'Etude sur la Parole (JEP 2010), pages 257-260, 25-28 mai, 2010, Mons, Belgique.
T. DRUGMAN, T. DUTOIT, 2010, "On the Glottal Flow Estimation and its Usefulness in Speech Processing", EuroDocInfo 2010, Valenciennes, France.
T. DRUGMAN, B. BOZKURT, T. DUTOIT, 2010, "Glottal Source Estimation Using an Automatic Chirp Decomposition", Lecture Notes in Computer Science, Advances in Nonlinear Speech Processing, Volume 5933/2010, pp. 35-42.
A. ASAEI, B. PICART, H. BOURLARD, 2010, "Analysis of Phone Posterior Feature Space Exploiting Class-Specific Sparsity and MLP-based Similarity Measure", Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), pp. 4886-4889, 14th - 19th of March 2010, Dallas, Texas, USA.
2009
S. AL MOUBAYEB, M. BAKLOUTI, M. CHETOUANI, T. DUTOIT, A. MAHDHAOUI, J.-C. MARTIN, S. ONDAS, C. PELACHAUD, M. YILMAZ, J. URBAIN, 2009, "Generating Robot/Agent Backchannels During a Storytelling Experiment", Proceedings of 2009 IEEE International Conference on Robotics and Automation, May 12 - 17, 2009, Kobe, Japan.
M. JOTTRAND, A. MOINET, L. COUVREUR, T. DUTOIT, 2009, "A Semi-Automatic Indexing Tool for Daily News, based on Text and Audio Keyword Spotting", 7th International Workshop on Content-Based Multimedia Indexing (CBMI), 3-5 June, 2009.
S.W. GILROY, M. CAVAZZA, M. NIRANEN, E. ANDRE, T. VOGT, J. URBAIN, M. BENAYOUN, H. SEICHTER, M. BILLINGHURST, 2009, "PAD-based multimodal affective fusion", In 2009 International Conference on Affective Computing and Intelligent Interaction, Amsterdam, The Netherlands, September 10-12, IEEE.
J. URBAIN, S. DUPONT, R. NIEWIADOMSKI, T. DUTOIT, C. PELACHAUD, 2009, "Towards a virtual agent using similarity-based laughter production", in Proc. of Interdisciplinary Workshop on Laughter and other Interactional Vocalisations in Speech, Berlin, February 27-28, 2009.
2008
S.W. GILROY, M. CAVAZZA, R. CHAIGNON, S.M. MAKELA, M. NIRANEN, E. ANDRE, T. VOGT, J. URBAIN, M. BILLINGHURST, H. SEICHTER, M. BENAYOUN, 2008, "E-tree: emotionally driven augmented reality art", In Proc. ACM Multimedia, pages 945-948, Vancouver, BC, Canada.
C. D'ALESSANDRO, B. BOZKURT, B. DOVAL, T. DUTOIT, N. HENRICH, T. VU NGOC, N. STURMEL, 2008, "Phase-Based Methods for Voice Source Analysis", Mohamed Chetouani, Amir Hussain, Bruno Gas, Maurice Milgram, Jean-Luc Zarader (Eds.), Advances in Nonlinear Speech Processing, Springer, ISBN : 978-3-540-77346-7.
M. GURBAN, T. DRUGMAN, T. DUTOIT, J. THIRAN, 2008, "Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition", 10th IEEE International Conference on Multimodal Interfaces (ICMI08), Chania, Greece, October 20-22, 2008.
S. AL MOUBAYEB, M. BAKLOUTI, M. CHETOUANI, T. DUTOIT, A. MAHDHAOUI, J.-C. MARTIN, S. ONDAS, C. PELACHAUD, J. URBAIN, M. YILMAZ, 2008, "Multimodal Feedback from Robots and Agents in a Storytelling Experiment", Proc. eNTERFACE08, Paris, pp. 43-55.
2007
J. THIRAN, T. DRUGMAN, M. GURBAN, A. VALLES, 2007, "Definition et selection d'attributs visuels pour la reconnaissance audio-visuelle de la parole", Traitement et Analyse de l'Information : Méthodes et Applications (TAIMA07), Hammamet, Tunisia.
F. CHARLES, S. LEMERCIER, T. VOGT, N. BEE, M. MANCINI, J. URBAIN, M. PRICE, E. ANDRE, C. PELACHAUD, M. CAVAZZA, 2007, "Affective Interactive Narrative in the CALLAS Project", ICVS2007, Saint-Malo, France, 5-7 Dec 2007.
T. DUBUISSON, T. DUTOIT, 2007, "Improvement of Source-Tract Decomposition of Speech using analogy with LF Model for Glottal Source and Tube Model for Vocal Tract", MAVEBA 2007, Proceedings of the 5th International Workshop on Models Analysis of Vocal Emissions in Biomedical Applications, pp. 119-122, Florence, Italy.
T. DRUGMAN, M. GURBAN, J. THIRAN, 2007, "Relevant Feature Selection for Audio-Visual Speech Recognition", IEEE MMSP 2007, International Workshop on Multimedia Signal Processing.
2006
B. BOZKURT, 2006, "New Spectral Methods for the Analysis of Source/Filter Characteristics of Speech Signals", Presses Universitaires de Louvain, SIMILAR Collection, ISBN 2-87463-013-6.
T. DUTOIT, L. NIGAY, M. SCHNAIDER, 2006, "Editorial of the Special Issue on Multimodal Human-Computer interfaces", Signal Processing, in press.
S. AZAR, C. BOULANGER, L. COUVREUR, V. DELFOSSE, B. JASPART, 2006, "An Agent-Based Multimodal Interface for Sketch Interpretation", Proceedings of IEEE Workshop on Multimedia Signal Processing (MMSP-2006), Victoria, Canada, October 2006.
K. MOUSTAKAS, M. G. STRINTZIS, D. TZOVARAS, S. CARBINI, J. E. VIALLET, S. RAIDT, M. MANCAS, M. DIMICCOLI, E. YAGCI, S. BALCI, E. I. LEON, 2006, "Masterpiece: Physical Interaction and 3D Content-Based Search in VR Applications", IEEE MultiMedia Magazine, pp. 92-100.
O. PIETQUIN, 2006, "Machine Learning for Spoken Dialogue Management: an Experiment with Speech-Based Database Querying", Proceedings of the 12th International Conference on Artificial Intelligence: Methodology, Systems, Applications (AIMSA 2006), Varna (Bulgaria), published in Lecture Notes in Computer Science, Springer-Verlag.
2005
Y. STYLIANOU, R. BONAL, Y. PANTAZIS, F. CALDERERO, P. LARROY, S. SCHIMKE, F. SEVERIN, F. MATTA, A. VALSAMAKIS, 2005, "GMM-Based Multimodal Biometric Verification", eNTERFACE'05 Summer Workshop on Multimodal Interfaces, Mons, Belgium.
O. PIETQUIN, T. DUTOIT, 2005, "A Probabilistic Framework for Dialog Simulation and Optimal Strategy Learning", IEEE Transactions on Audio, Speech and Language Processing, Volume 14, Issue 2 (2006) 589-599.
T. DUTOIT, 2005, "Proceedings of the eNTERFACE'05 Summer Workshop on Multimodal Interfaces", ed., Presses Universitaires de Louvain, 2005, ISBN : 2-87463-003-9.
B. BOZKURT, L. COUVREUR, 2005, "On the use of phase information for speech recognition", Proc. EUSIPCO'05, Antalya, Turkey.
S. DUPONT, C. RIS, L. COUVREUR, J.M. BOITE, 2005, "A study of implicit and explicit modeling of coarticulation and pronunciation variation", proc. of Interspeech 2005, Lisboa, Sept. 2005.
B. BOZKURT, T. DUTOIT, B. DOVAL, C. D'ALESSANDRO, 2005, "Method for estimating resonance frequencies", PCT patent WO 2005/031702 A1.
Z. HAMMAL, B. BOZKURT, A. CAPLIER, L. COUVREUR, T. DUTOIT, D. UNAY, 2005, "Passive versus active: vocal classification system", Proc. of EUSIPCO'05, Turkey.
O. PIETQUIN, 2005, "Réseau Bayésien pour un Modèle d'Utilisateur et un Module de Compréhension pour l'Optimisation des Systèmes de Dialogue", Conférence Francophone sur le Traitement du Langage Naturel (TALN 2005), Dourdan (France), juin 2005.
S. DUPONT, C. RIS, O. DEROO, S. POITOUX, 2005, "Feature Extraction and Acoustic Modeling: an Approach for Improved Generalization across Languages and Accents", proc. of ASRU 2005, San Juan, Puerto Rico, Dec. 2005.
O. PIETQUIN, R. BEAUFORT, 2005, "Comparing ASR Modeling Methods for Spoken Dialogue Simulation and Optimal Strategy Learning", Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech/Eurospeech 2005), Lisbon (Portugal), September 2005.
M. E. SARGIN, F. OFLI, Y. YASINNIK, O. ARAN, A. KARPOV, S. WILSON, Y. YEMEZ, E. ERZIN, A. M. TEKALP, 2005, "Combined Gesture-Speech Analysis and Synthesis", eNTERFACE'05 Summer Workshop on Multimodal Interfaces, Mons, Belgium.
F. SEVERIN, B. BOZKURT, T. DUTOIT, 2005, "HNR extraction in voiced speech, oriented towards voice quality analysis", Proc. EUSIPCO'05, Antalya, Turkey.
E. MENGUSOGLU, C. RIS, 2005, "Use of acoustic prior information for confidence measure in ASR (automatic speech recognition) applications", in Acoustics Research Letters Online (ARLO), Vol. 6, Issue 2, pp. 92-98, April 2005.
J. HAMAIDE, 2005, "Introduction to Single-Speaker Speech Recognition", Game Programming Gems 5, K.Pallister, ed., Charles River Media, ISBN 1-58450-352-1, pp 741-750.
Z. HAMMAL, B. BOZKURT, L. COUVREUR, D. UNAY, A. CAPLIER, T. DUTOIT, 2005, "Classification d'Expressions Vocales Passives Versus Actives", Proc. of the GRETSI Conference, Louvain la Neuve, Belgium.
L. COUVREUR, J.M. BOITE, S. DUPONT, C. RIS, 2005, "Confidence Measure Normalization for Robust Selection of ASR Agents", Proceedings of the International Conference on Speech and Computer (SPECOM-2005), vol. 1, pp. 369-372, Patras, Greece, October 2005.
B. BOZKURT, C. D'ALESSANDRO, B. DOVAL, T. DUTOIT, 2005, "Zeros of Z-Transform Representation With Application to Source-Filter Separation in Speech", IEEE Signal Processing Letters, vol. 12, no. 4, pp. 344-347.
O. PIETQUIN, 2005, "A Probabilistic Description of Man-Machine Spoken Communication", in Proceedings of the 5th IEEE International Conference on Multimedia and Expo (ICME 2005), Amsterdam (The Netherlands), July 2005.
2004
C. RIS, L. COUVREUR, 2004, "Improving ASR performance on PDA by contamination of training data", proc. of Specom 2004, 9-th International Conference on Speech and Computer, St. Petersburg, Sept. 2004.
L. COUVREUR, C. COUVREUR, 2004, "Blind Model Selection for Automatic Speech Recognition in Reverberant Environments", Journal of VLSI Signal Processing, Special Issue on Real World Speech Processing, vol. 36, no. 2-3, pp. 189-203, February-March 2004.
O. PIETQUIN, 2004, "Une Description Probabiliste de la Communication Parlée entre Homme et Machine", Actes de la 16ème Conférence Francophone sur l'Interaction Homme-Machine (IHM 2004), Namur (Belgique), Aout-Septembre 2004.
O. PIETQUIN, 2004, "A Framework for Unsupervised Learning of Dialogue Strategies", Presses Universitaires de Louvain, SIMILAR Collection, ISBN 2-930344-63-6.
L. COUVREUR, M. LANIRAY, 2004, "Automatic Noise Recognition in Urban Environments Based on Artificial Neural Networks and Hidden Markov Models", Proceedings of InterNoise, Prague, Czech Republic.
O. PIETQUIN, 2004, "A Framework for Unsupervised Learning of Dialogue Strategies", PhD thesis supervised by Prof. T. Dutoit.
S. DUPONT, C. RIS, 2004, "Combined use of close-talk and throat microphones for improved speech recognition under non-stationary background noise", proc. of Robust 2004 (Workshop (ITRW) on Robustness Issues in Conversational Interaction), Norwich, Aug. 2004.
E. MENGUSOGLU, 2004, "Confidence Measures for Speech/Speaker Recognition and Applications on Turkish LVCSR", PhD thesis supervised by Prof. H. Leich.
2003
O. PIETQUIN, T. DUTOIT, 2003, "Aided Design of Finite-State Dialogue Management Systems", Proc. of the IEEE International Conference on Multimedia & Expo (ICME 2003), Baltimore, July 2003.
S. DUPONT, 2003, "Subband speech processing with neural networks", Patent EP1152399.
S. DUPONT, C. RIS, 2003, "Robust Feature Extraction and Acoustic Modeling at Multitel: Experiments on the Aurora Databases", proc. Eurospeech 2003, Genève, pp. 1789-1792.
M. BAGEIN, O. PIETQUIN, C. RIS, G. WILFART, 2003, "An Architecture for Voice-Enabled Interfaces over Local Wireless Networks", Proc. of the 7th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2003), Orlando (Florida, USA) July 2003.
F. MALFRERE, O. DEROO, T. DUTOIT, C. RIS, 2003, "Phonetic alignment: speech-synthesis-based versus Viterbi-based", Speech Communication, vol. 40, n°4, pp. 503-517.
O. PIETQUIN, 2003, "Environnement Virtuel pour la Simulation et l'Apprentissage de Stratégies de Dialogue", Actes de la 15ème Conférence Francophone sur l'Interaction Homme-Machine (IHM 2003), Caen (France), Novembre 2003.
S. DUPONT, 2003, "Robust parameters for noisy speech recognition", Patent US2003182114.
M. HADIM, M. BAGEIN, P. MANNEBACK, P. MAON, 2003, "Load Balancing Voice Applications with Piranha", The 2003 International Conference on Parallel and Distributed, Las Vegas, Nevada, June 23 - 26, 2003.
E. MENGUSOGLU, 2003, "Confidence Measure Based Model Adaptation for Speaker Verification", Proc. of the 2nd IASTED International Conference on Communications, Internet and, 17-19 November 2003, Scottsdale, AZ, USA.
M. BAGEIN, O. PIETQUIN, C. RIS, G. WILFART, 2003, "Enabling Speech Based Access to Information Management Systems over Wireless Network", Proc. of the 3rd workshop on Applications and Services in Wireless Networks (ASWN 2003), Berne (Switzerland), July 2003.
2002
T. DUTOIT, L. COUVREUR, C. RIS, F. MALFRERE, V. PAGEL, 2002, "Synthèse Vocale et Reconnaissance de la Parole : Droites Gauches et Mondes Parallèles", Actes du 6è Congrès Français d'Acoustique, Lille, 8-11 avril 2002.
O. PIETQUIN, S. RENALS, 2002, "ASR System Modeling For Automatic Evaluation And Optimization of Dialogue Systems", Proceedings of the International Conference on Acoustics Speech and Signal Processing, ICASSP 2002, Orlando, May 2002.
L. COUVREUR, C. RIS, 2002, "Model-based Independent Component Analysis for Robust Multi-Microphone Automatic Speech Recognition", Proceedings of the International Conference on Spoken Language Processing (ICSLP'02), vol. 3, pp. 2189-2192, Denver, USA, September 2002.
O. PIETQUIN, T. DUTOIT, 2002, "Modélisation d'un Système de Reconnaissance dans le Cadre de l'Evaluation et l'Optimisation Automatique des Systèmes de Dialogue", Actes des Journées d'Etude de la Parole, JEP 2002, Nancy (France) juin 2002.
2001
E. MENGUSOGLU, O. DEROO, 2001, "Turkish LVCSR: Database Preparation and Language Modeling for an Agglutinative Language", Proc. ICASSP 2001 Student Forum, Salt Lake City, May 2001.
L. COUVREUR, J.M. BOITE, S. DUPONT, C. RIS, C. COUVREUR, 2001, "Fast Adaptation for Robust Speech Recognition in Reverberant Environments", Proceedings of International Workshop on Adaptation Methods for Speech Recognition (ITRW-2001), pp. 85-88, Sophia-Antipolis, France, August.
S. DEKETELAERE, T. DUTOIT, O. DEROO, 2001, "Speech Processing for Communications : what's new?", Revue HF, March 2001, pp. 5-24.
E. MENGUSOGLU, C. RIS, 2001, "Use of Acoustic Prior Information for Confidence Measure in ASR Applications", Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH-2001), Vol. 4, pp 2557-2561, Aalborg, September 2001.
L. COUVREUR, C. COUVREUR, 2001, "Robust Automatic Speech Recognition in Reverberant Environments by Model Selection", Proc. International Workshop on Hands-Free Speech Communication (HSC'2001), pp. 147-150, Kyoto, Japan, April 2001.
T. DUTOIT, 2001, ""Je parle, donc je suis ?" Un bilan des développements récents en traitement automatique de la parole", Revue de la Société des Arts, Sciences et Lettres du Hainaut, to appear.
L. COUVREUR, C. RIS, C. COUVREUR, 2001, "Model-based Blind Estimation of Reverberation Time: Application to Robust ASR in Reverberant Environments", Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH-2001), vol. 4, pp. 2631-2634, Aalborg, Denmark, September 2001.
O. PIETQUIN, L. COUVREUR, P. COUVREUR, 2001, "Applied Clustering for Automatic Speaker-Based Segmentation of Audio Material", Belgian Journal of Operations Research, Statistics and Computer Science (JORBEL) Special Issue : OR and Statistics in the Universities of Mons, volume 41, n° 1-2,01.
S. DUPONT, C. RIS, 2001, "Multiband with Contaminated Training Data", Proc. CRAC Workshop (EUROSPEECH 2001 satellite event), Aalborg, Denmark, Sept. 2001.
2000
S. DUPONT, J. LUETTIN, 2000, "Audio-Visual Speech Modeling for Continuous Speech Recognition", IEEE Transactions on Multimedia.
S. DUPONT, 2000, "Etude et développement d'architectures multi-bandes et multi-modales pour la reconnaissance robuste de la parole".
S. DUPONT, L. CHEBOUB, 2000, "Fast Speaker Adaptation of Artificial Neural Networks for Automatic Speech Recognition", Proceedings of ICASSP'2000, Istanbul, Turkey, June 2000.
L. COUVREUR, C. COUVREUR, 2000, "On the Use of Artificial Reverberation for ASR in Highly Reverberant Environments", Proc. IEEE Benelux Signal Processing Symposium (SPS'2000), Hilvarenbeek, The Netherlands, March 2000.
L. COUVREUR, C. RIS, C. COUVREUR, 2000, "A Corpus-Based Approach for Robust ASR in Reverberant Environments", Proc. International Conference on Spoken Language Processing (ICSLP'2000), Beijing, China, October 2000.
E. MENGUSOGLU, O. DEROO, 2000, "Confidence Measures in HMM/MLP Hybrid Speech Recognition for Turkish Language", Proc. ProRISC'2000, Veldhoven, December 2000.
C. RIS, S. DUPONT, 2000, "Assessing Local Noise Level Estimation Methods: Application to Noise Robust ASR", Speech Communication.
S. DUPONT, L. CHEBOUB, 2000, "Fast Speaker Adaptation of HMM/ANN automatic speech recognition systems", IEEE Signal Processing Symposium, Hilvarenbeek, The Netherlands, March 2000.
R. BOITE, T. DUTOIT, J. HANCQ, H. LEICH, H. BOURLARD, 2000, "Traitement de la Parole", 2nd Edition, 488 pp., Presses Polytechniques Universitaires Romandes, Lausanne, ISBN 2-88074-388-5.
L. COUVREUR, C. COUVREUR, 2000, "Wavelet-Based Non-Parametric HMM's: Theory and Applications", Proc. IEEE Benelux Signal Processing Symposium (SPS'2000), Hilvarenbeek, The Netherlands, March 2000.
L. COUVREUR, C. COUVREUR, 2000, "Wavelet-Based Non-Parametric HMM's: Theory and Applications", Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP'2000), Istanbul, Turkey, vol. 1, pp. 604-607, June 2000.
O. DEROO, C. RIS, S. GIELEN, J. VANPARYS, 2000, "Automatic Detection of Mispronounced Phonemes for Language Learning Tools", Proc. International Conference on Spoken Language Processing (ICSLP'2000), Beijing, China, paper #01732, October 2000.
L. COUVREUR, C. COUVREUR, 2000, "Wavelet-Based Method for Non-Parametric Estimation of HMMs", IEEE Signal Processing Letters, vol. 7, no. 2, pp. 25-27.
1999
G. DUROU, 1999, "Multilingual Text-Independent Speaker Identification", Proceedings of MIST'99 Workshop, Leusden, Netherlands.
L. COUVREUR, C. COUVREUR, 1999, "Estimation non-paramétrique de modèles de Markov cachés par ondelettes", (invited paper), Seminar on Applications of Stochastic Process (FNRS), Mons, Belgium, June 1999.
C. COUVREUR, L. COUVREUR, 1999, "Towards Non-Parametric HMMs : A Wavelet-based Approach", (invited paper), Proceedings of 1999 IEEE Information Theory Workshop on Detection, Estimation, Classification and Imaging (DECI), pp. 49-52, Santa Fe, NM, USA, February 1999.
I. MAGRIN-CHAGNOLLEAU, G. DUROU, 1999, "Time-Frequency Principal Components of Speech : Application to Speaker Recognition", Proceedings of EUROSPEECH'99, Budapest, Hungary.
L. COUVREUR, P. COUVREUR, 1999, "Application de Méthodes de Classification pour la Segmentation Automatique de Programmes Radio/TV en Fonction du Locuteur", Proceedings of 1999 XXXIèmes Journées Françaises de Statistique (SFDS), pp. 423-426, Grenoble, France, May 1999.
L. COUVREUR, J.M. BOITE, 1999, "Speaker Tracking in Broadcast Audio Material in the Framework of the THISL Project", Proceedings of 1999 Workshop on Accessing Information in Spoken Audio (ESCA-ETRW), pp. 84-89, Cambridge, UK, April 1999.
S. DUPONT, C. RIS, 1999, "Assessing Local Noise Level Estimation Methods", Proc. Workshop on Robust Methods For Speech Recognition in Adverse Conditions (Nokia, COST249, IEEE), Tampere, Finland.
1998
F. JAUCQUET, 1998, "Application de la Reconnaissance Automatique du Locuteur dans les Liaisons par Vocodeurs".
O. DEROO, T. DUTOIT, F. MALFRERE, 1998, "Comparison of two different alignment systems: speech synthesis vs. Hybrid HMM/ANN", Proc. European Conference on Signal Processing (EUSIPCO'98), Rhodes, Greece, pp. 1161-1164.
S. DUPONT, 1998, "Missing Data Reconstruction for Robust Automatic Speech Recognition in the Framework of Hybrid HMM/ANN Systems", Proc. ICSLP'98, Sydney.
S. DUPONT, 1998, "Reconstruction de Données Manquantes pour la Reconnaissance Robuste de la Parole dans le Cadre des Systèmes Hybrides HMM/ANN", Proc. XXIIèmes Journées d'Etudes sur la Parole, Martigny, pp. 405-408.
S. DUPONT, J. LUETTIN, 1998, "Continuous Audio-Visual Speech Recognition", Proc. of Fifth European Conference on Computer Vision, Freiburg, Germany.
O. DEROO, 1998, "Modèle dépendant du contexte et fusion de données appliqués à la reconnaissance de la parole par modèle hybride HMM/MLP".
G. DUROU, F. JAUCQUET, 1998, "Cross-Language Text-Independent Speaker Identification", Proc. European Conference on Signal Processing (EUSIPCO'98), Rhodes, Greece.
S. DUPONT, J. LUETTIN, 1998, "Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database", Proc. ICSLP'98, Sydney.
F. MALFRERE, T. DUTOIT, O. DEROO, 1998, "Phonetic Alignment : Speech Synthesis Based Vs. Hybrid HMM/ANN", Proc. International Conference on Speech and Language Processing, Sydney, Australia, pp. 1571-1574.
O. DEROO, T. DUTOIT, C. RIS, F. MALFRERE, 1998, "Modeles Hybrides et reconnaissance de la parole continue independante du locuteur en francais", Proc. XXIIèmes Journées d'Etudes sur la Parole, Martigny, pp. 401-404.
1997
S. DUPONT, J.M. BOITE, H. BOURLARD, O. DEROO, V. FONTAINE, 1997, "Hybrid HMM/ANN Systems for Training Independent Tasks : Experiments on Phonebook and Related Improvements", Proc. ICASSP'97, Munich, pp. 1767-1770.
J. HENNEBERT, C. RIS, H. BOURLARD, N. MORGAN, S. RENALS, 1997, "Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems", Proc. Eurospeech'97, Rhodes, pp. 1951-1954.
H. BOURLARD, S. DUPONT, 1997, "Subband-based Speech Recognition", Proc. ICASSP'97, Munich, pp. 1251-1254.
S. DUPONT, 1997, "Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database", FPMS-TCTS technical reports, Mons Belgium, July 1997.
S. DUPONT, J.M. BOITE, C. RIS, O. DEROO, V. FONTAINE, L. ZANONI, 1997, "Context Independent and Context Dependent Hybrid HMM/ANN Systems for Training Independent Tasks", Proc. of EUROSPEECH'97, Rhodes, Greece, pp. 1947-1950.
V. FONTAINE, H. BOURLARD, 1997, "Speaker Dependent Speech Recognition based on Phone-Like Units Models", Proc. ICASSP'97, Munich.
S. DUPONT, C. RIS, H. BOURLARD, 1997, "Robust Speech Recognition Based on Multi-Stream Features", Proc. of ESCA/NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-à-Mousson, France, pp. 95-98.
V. FONTAINE, J.M. BOITE, C. RIS, 1997, "Nonlinear Discriminant Analysis for Improved Speech Recognition", Proc. of EUROSPEECH'97, Rhodes, Greece.
S. DUPONT, H. BOURLARD, 1997, "Using Multiple Time Scales in a Multi-Stream Recognition System", Proc. of EUROSPEECH'97, Rhodes, Greece, pp. 3-6.
J.M. BOITE, S. DUPONT, C. RIS, F. BATAILLE, O. DEROO, V. FONTAINE, L. ZANONI, 1997, "STRUT : Un logiciel complet pour l'entrainement et la reconnaissance de la parole", Proc. Premières Journées Scientifiques et Techniques FRANCIL, pp. 41-44, Avignon.
1996
V. FONTAINE, H. LEICH, C. RIS, 1996, "Maximum Mutual Information Codebook Mapping for Discrete Hidden Markov Models", Proceedings ICASSP 96, Atlanta, pp. 593-596.
H. BOURLARD, S. DUPONT, 1996, "A new ASR approach based on independent processing and recombination of partial frequency bands", Proc. ICSLP'96, Philadelphia, pp. 422-425.
O. DEROO, H. LEICH, J.M. BOITE, S. DUPONT, C. RIS, V. FONTAINE, 1996, "Hybrid HMM/ANN systems for Speaker Independent Continuous Speech Recognition In French", Proc. ProRisc 8th Annual Workshop on Circuits, System and Signal Processing, Mierlo, The Netherlands, pp. 137-141.
H. BOURLARD, C. RIS, Y. KONIG, N. MORGAN, 1996, "A New Training Algorithm for Hybrid HMM/ANN Speech Recognition Systems", Proceedings EUSIPCO'96, pp. 1583-1586, Trieste.
W. LIPING, 1996, "Incorporation Of Different Sources And Different Levels Of Segmental Constraints In Speech Recognition Using Hidden Markov Models".
V. FONTAINE, H. LEICH, C. RIS, 1996, "Nonlinear Discriminant Analysis with Neural Networks for Speech Recognition", Proc. EUSIPCO 96, pp. 1583-1586, Trieste.
C. RIS, V. FONTAINE, H. LEICH, 1996, "Comparison Between Two Hybrid HMM/MLP Approaches in Speech Recognition", Proceedings ICASSP 96, Atlanta, pp. 3362-3365.
S. DUPONT, H. BOURLARD, 1996, "Multiband approach for speech recognition", Proc. of ProRISC/IEEE Workshop on Circuits, Systems and Signal Processing, pp. 113-118, Mierlo, The Netherlands.
H. BOURLARD, S. DUPONT, H. HERMANSKY, N. MORGAN, 1996, "Towards sub-band-based speech recognition", Proc. of EUSIPCO'96, Trieste, Italy, pp. 1579-1582.
1995
Y. KONIG, H. BOURLARD, N. MORGAN, 1995, "REMAP Modelling for Connectionist Speech Recognition", Proceedings of the 15th Annual Speech Research Symposium Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland, pp. 95-102.
N. MORGAN, H. BOURLARD, 1995, "Continuous Speech Recognition : An Introduction to the Hybrid HMM/Connectionist Approach", IEEE Signal Processing Magazine, Invited Paper, v. 12, n°3, pp. 25-42.
H. BOURLARD, Y. KONIG, N. MORGAN, 1995, "REMAP : Recursive Estimation and Maximization of a Posteriori Probabilities in Connectionist Speech Recognition", Proc. EUROSPEECH'95, Madrid, pp. 1663-1666.
N. MORGAN, H. BOURLARD, 1995, "Neural Networks for Statistical Recognition of Continuous Speech", Proceedings of the IEEE, Invited Paper, v. 83, n°5, pp. 741-770.
N. MORGAN, H. BOURLARD, WU, 1995, "SPAM : Experiments with Digit Recognition", Proceedings of the 15th Annual Speech Research Symposium, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland, pp. 103-110.
H. JUN, H. LEICH, 1995, "A unified way in incorporating segmental feature and segmental model in HMM", Proceedings ICASSP-95, pp. 532-535.
N. MORGAN, H. BOURLARD, 1995, "Speech Recognition and Neural Networks : Pattern Matching", The Handbook of Brain Theory and Neural Networks, M.A. Arbib (Ed.), Bradford Books, The MIT Press.
H. JUN, 1995, "Incorporation Of Different Sources And Different Levels Of Segmental Constraints In Speech Recognition Using Hidden Markov Models".
N. MORGAN, H. BOURLARD, SU-LIN WU, 1995, "Digit Recognition with Stochastic Perceptual Speech Models", Proc. EUROSPEECH'95, Madrid, pp. 771-774.
1994
H. JUN, H. LEICH, 1994, "Combining stochastic trajectory model and discriminative feature in speech recognizer", Proceedings IEEE ICASSP 94, pp. II 681-684.
V. FONTAINE, H. LEICH, M. HASLER, J. HENNEBERT, 1994, "Influence of vector quantization on isolated word recognition", Proceedings of EUSIPCO-94, pp. 115-118.
H. LEICH, H. JUN, 1994, "Speech trajectory recognition in SOFM by using Bayes theorem", Proceedings ISSIPNN'94, pp. 109-112.
1992
H. LEICH, F. QUESNE, 1992, "Improving the performances of Hidden Markov models for text dependent speaker verification or identification", Proceedings of the Fourth Australian Int. Conf. on Speech Science and Technology, Brisbane, pp. 471-476.
H. BOURLARD, 1992, "Continuous speech recognition : from hidden Markov models to artificial neural networks".
H. LEICH, H. JUN, 1992, "A discriminative training algorithm for the speech recognizers based on neural prediction method", Proceedings EUSIPCO-92, pp. 423-426.
R. BOITE, A. AMRAOUI, 1992, "Isolated-word recognition by self-organizing maps", Proc. PRORISC IEEE Benelux Workshop on circuits, systems and signal processing, pp. 305-310.
L.P. WANG, H. LEICH, B. PASI, 1992, "Endpoint algorithm for speech recognition in noisy background", Proc. PRORISC IEEE Benelux Workshop on circuits, systems and signal processing, pp. 349-356.
1990
R. BOITE, 1990, "La reconnaissance de la Parole et la Vérification du locuteur", papier invité AGEN Communications, novembre 1990, pp. 5-15.


Ongoing projects

DiYSE
2009 - 2011
Do-it-Yourself Smart Experiences

COST 2102
2007 - 2011
COST 2102

Edutain
2004 - 2008
Edutain

STRUT
1996 - 2000
Speech Training and Recognition Unified Tool

Former projects

MAGE / pHTS
2010 - 2014
PhD Thesis Maria Astrinaki

MediaTIC
2008 - 2015
MediaTIC

KWS Predict
2007 - 2008
KWS Prediction

IRMA
2005 - 2008
Multimodal Search Interface for Audiovisual content

IC&C
2004 - 2006
Interface Créative & Conception

DOMINI
2004 - 2006
DOMINI

MAIS
2004 - 2007
Mobile Access Information System

MODIVOC
2002 - 2004
Systèmes MObiles et DIstribués à interface VOCale

COST 278
2001 - 2008
Spoken Language Interaction in Telecommunication

ARTHUR
2000 - 2003
ARchitecture de Télécommunication Hospitalière pour les services d'Urgence

DIALOGUE
2000 - 2004
PhD Thesis Olivier Pietquin

CONFIDENCE
2000 - 2004
PhD Thesis Erhan Mengusoglu

RESPITE
1999 - 2002
REcognition of Speech by Partial Information Techniques

DEMOSTHENES
1998 - 1999


THISL
1997 - 2000
THematic Indexing of Spoken Language

SPRACH
1995 - 1998
SPeech Recognition Algorithms for Connectionist Hybrids

COST 250
1995 - 2000
Automatic Speaker Recognition over the Telephone Network

COST 249
1994 - 2000
Continuous Speech Recognition Over the Telephone Network

OOBP
1994 - 2005
Object-Oriented Block Processing

HIMARNNET
1993 - 1995
HIdden MARkov models and Neural NETworks