Ongoing R&D projects
The DIGISTORM project (2016 - 2020)
The ROBIGame project (2014 - 2017)
The i-Treasures project (2013 - 2017)
i-Treasures (Intangible Treasures - Capturing the Intangible Cultural Heritage and Learning the Rare Know-How of Living Human Treasures FP7-ICT-2011-9-600676-i-Treasures) is an Integrated Project (IP) of the European Union's 7th Framework Programme 'ICT for Access to Cultural Resources'. The project started on February 1, 2013, and will last 48 months.
Cultural expression is not limited to architecture, monuments or collections of artifacts. It also includes fragile intangible live expressions, which involve knowledge and skills. Such expressions include music, dance, singing, theatre, human skills and craftsmanship. These manifestations of human intelligence and creativity constitute our Intangible Cultural Heritage (ICH). ICH is at the same time traditional, contemporary and living, because it refers not only to inherited knowledge but also to the renewal of contemporary cultural expressions. It refers to the past, to the present and, certainly, to the future, and is the mainspring of humanity's cultural diversity.
The main objective of i-Treasures is to develop an open and extendable platform to provide access to ICH resources, enable knowledge exchange between researchers and contribute to the transmission of rare know-how from Living Human Treasures to apprentices.
The JOKER project (2013 - 2016)
This project will build and develop JOKER, a generic intelligent user interface providing a multimodal dialogue system with social communication skills including humor, empathy, compassion, charm, and other informal socially-oriented behavior.
JOKER will emphasize the fusion of verbal and non-verbal channels for emotional and social behavior perception, interaction and generation capabilities. Our paradigm invokes two types of decision: intuitive (mainly based upon non-verbal multimodal cues) and cognitive (based upon fusion of semantic and contextual information with non-verbal multimodal cues). The intuitive type will be used dynamically in the interaction at the non-verbal level (empathic behavior: synchrony of expressions such as smiles and nods) but also at the verbal level for reflex small talk (politeness behavior: verbal synchrony with hello, how are you, thanks, etc.). Cognitive decisions will be used for reasoning on the strategy of the dialog and deciding more complex social behaviors (humor, compassion, white lies, etc.), taking into account the user profile and contextual information.
JOKER will react in real-time with a robust perception module (sensing user's facial expressions, gaze, voice, audio and speech style and content), a social interaction module modelling user and context, with long-term memories, and a generation and synthesis module for maintaining social engagement with the user.
The SonixTrip project (2013 - 2016)
The PREDATTOR project (2012 - 2015)
Based on neuro-psychological and human attention modelling research, the Predattor prototype is able to automatically compute an attention map of any image. This map shows where YOU will gaze when you see an image, with results very close to eye-tracking data. Predattor focuses on neuromarketing and helps you optimize your website or online ads. Because Predattor is cheap and fast, you can use it throughout the creative process until your key message is visible enough and you stand out from the competition.
The DiYSE project (2009 - 2011)
The Do-it-Yourself Smart Experiences project (DiYSE) aims at enabling ordinary people to easily create, set up and control applications in their smart living environments as well as in the public Internet-of-Things space, allowing them to leverage aware services and smart objects for obtaining highly personalised, social, interactive, flowing experiences at home and in the city.
The BIOFACT project (2009 - 2012)
In the field of orthoses and prostheses, products offered on a daily basis to patients suffering from paralysis or having had a limb amputated, little progress has been made despite the significant technological developments in the manufacturing industry and industrial robotics.
However, the development of neurophysiological knowledge and micro-electronic and IT technologies should allow for a better use of the biological signals available in a non-invasive manner such as electroencephalogram (EEG) and electromyogram (EMG) to supplement the deficient motor commands of the handicapped.
Researchers around the world agree on the idea that this century will be the one that will integrate human cerebral capacities with devices made from new materials.
Biomanufacturing, the production of biocompatible parts by additive manufacturing, covers four fields of application: medical devices, orthoses and prostheses, tissue engineering, and decision-support anatomical models. Although the last sector is already well represented, the latest EBM (Electron Beam Melting) developments open new perspectives for the direct manufacturing of biocompatible titanium and stainless steel parts.
Today, there is still no tool for the digital simulation of the EBM process. Developing one within the MORFEO digital simulation platform will add an essential link to the digital manufacturing chain for innovative products.
As biomanufacturing R&D projects multiply throughout Europe, it is time to turn results into services offered to businesses active in the sector. The creation of this biomanufacturing platform in Hainaut will help meet this growing demand from businesses.
The RECITE project (2007 - 2009)
RECITE aims to extend OCR for machine vision to objects with different surfaces (metal and so on) and widely varying characters. Close to natural scene text understanding, this project focuses on interactively configurable recognition software that is accessible to non-experts (in SMEs, for instance). Hence, the main goal is to enable the creation of dedicated recognizers for particular applications. Through smart dialogues between the computer and the end user, the particularities of the application and the degradations present in the images will be semi-automatically defined in order to build an efficient recognizer. The project also addresses challenges such as the extraction and recognition of engraved or embossed characters, which systems dealing with natural scene text handle poorly. In that context, a first example is used to make the model more versatile: the recognition of characters engraved on metallic, reflective surfaces in uncontrolled environments.
The TRANSLOGISTIC project (2007 - 2011)
TransLogisTIC is an ambitious research project financed by the Walloon Region (2.5 years, 14 million euros) built around a long-term strategy aimed at developing a complete and efficient multimodal transport system in Wallonia, as well as high-quality logistics services with high added value. Supported by internationally recognized Walloon actors (10 enterprises and 5 universities), the project will result in the creation of innovative and efficient products and services.
The COST 2102 project (2007 - 2011)
The main objective of the Action is to develop an advanced acoustical, perceptual and psychological analysis of verbal and non-verbal communication signals originating in spontaneous face-to-face interaction, in order to identify algorithms and automatic procedures capable of identifying human emotional states. Several key aspects will be considered, such as the integration of the developed algorithms and procedures for application in telecommunication, and for the recognition of emotional states, gestures, speech and facial expressions, in anticipation of the implementation of intelligent avatars and interactive dialogue systems that could be exploited to improve user access to future telecommunication services.
The IAP - DYSCO project (2007 - 2011)
The Interuniversity Attraction Poles programme is managed by the Belgian Science Policy Office (BELSPO). The programme was created in 1987 by Guy Verhofstadt, then Minister for Scientific Affairs. It is presently in its sixth phase. Full information about the programme can be found at the IAP website of BELSPO.
The aim of the programme is to fund basic research by promoting collaborative research, through networks of research teams in different Belgian universities. There are presently 44 networks in Belgium, covering all disciplines.
The year 2007 is the starting year for Phase VI of our IAP network "Dynamical systems, control and optimization" (DYSCO), which covers the period 2007 - 2011. Several of the teams of DYSCO have participated in previous IAP networks.
The PIST project (2004 - 2008)
The PIST project (for Safe and Intelligent Positioning for Transport) deals with the development of sensor fusion systems for self-positioning of vehicles (navigation). The PIST team will design the algorithms for use in applications where safety is crucial, such as railway signalling. The project combines aspects of signal processing, data fusion, system modelling and integrity assessment.
The Edutain project (2004 - 2008)
The R&D activities of TCTS Lab in the area of edutainment and speech communication have led to the development of real-time voice interfaces based on acoustic features. Such tools play an important part in the voice control of information systems, as studied in a multi-modal perspective by the SIMILAR European Network of Excellence.
The TTSBOX project (2004 - 2008)
TTSBOX performs the synthesis of Genglish (for "Generic English"), an imaginary language obtained by replacing English words by generic words. Genglish therefore has a rather limited lexicon, but its pronunciation maintains most of the problems encountered in natural languages. TTSBOX uses simple data-driven techniques (Bigrams, CARTs, NUUs) while trying to keep the code minimal, so as to keep it readable for students with reasonable MATLAB practice.
The MaxMBROLA project (2004 - 2008)
The main topics of this research project are:
- The development of a flexible external object for Max/MSP (4.5) encapsulating the main features of the MBROLA speech synthesizer and the adaptation of the MBROLA functions to the asynchronous request-based architecture of the Max/MSP environment.
- Discussion and Max/MSP development work on real-time control issues in the phonetic/prosodic content generation process. This research topic is a good first trial for the broader issues of real-time manipulation of concatenation-based signals.
- Proposals for various real-time concatenation-based applications (standalone, virtual instruments or Max/MSP patches) allowing performers to produce versatile voice with standard musical devices.
The LASEF project (2004 - 2008)
The aim of this project is to demonstrate a LIDAR (LIght Detection And Ranging) system for detecting turbulence and air flows, and to establish a theoretical model of it. The system will be based on detecting the motion of airborne particles through the Doppler effect. This technique involves a LASER emission source coupled to a device that detects the light backscattered by the particles.
The project will culminate in the demonstration of a LIDAR-type detection measurement using a reliable, field-transportable device. Indeed, landings and take-offs at airports are so frequent that it is essential to verify that the distance between aircraft is sufficient, particularly in the wake of large aircraft (notably the A380). The direct application sought is therefore the measurement of turbulence during the landing or take-off of an aircraft.
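As a rough illustration of the Doppler principle on which the system relies, the sketch below uses the standard relation for backscattered light, shift = 2 * v / wavelength, where v is the particle speed along the beam. The function names, the 1.55 µm wavelength and the 10 m/s speed are illustrative assumptions, not parameters of the LASEF system.

```python
def doppler_shift_hz(radial_speed: float, wavelength_m: float) -> float:
    """Frequency shift of light backscattered by a particle moving along the beam.

    radial_speed is in m/s. The factor 2 accounts for the round trip:
    the wave is shifted once on reaching the particle and once more
    on its way back to the detector.
    """
    if wavelength_m <= 0:
        raise ValueError("wavelength must be positive")
    return 2.0 * radial_speed / wavelength_m


def speed_from_shift(shift_hz: float, wavelength_m: float) -> float:
    """Invert the relation: line-of-sight speed (m/s) from a measured shift."""
    return shift_hz * wavelength_m / 2.0


# Assumed values for illustration only.
shift = doppler_shift_hz(10.0, 1.55e-6)
speed = speed_from_shift(shift, 1.55e-6)
```

With these assumed values, a line-of-sight speed of 10 m/s corresponds to a shift of roughly 12.9 MHz.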
The HCR-NN project (1998 - 2002)
Off-Line Handwritten Character Recognition using Neural Networks
The STRUT project (1996 - 2000)
The Speech Training and Recognition Unified Tool (STRUT) has been developed for research on speech recognition and for fast development and testing of related applications. The software performs speech analysis, model training and speech recognition. The tool consists of many "independent" small pieces of code, one for each identified module in the speech recognition process: sampling, feature extraction, clustering, probability estimation, and decoding.
It is now being extended (version 2.0) in collaboration with MULTITEL ASBL.
The MBROLA project (1995 - 1999)
The goal of the MBROLA project is to obtain a set of high-quality speech synthesizers for as many languages as possible, free for use in non-commercial applications. The ultimate goal is to boost academic research on speech synthesis, and particularly on prosody generation, known as one of the biggest challenges in Text-to-Speech Synthesis for the years to come. As of 2003, 26 languages are available, with more than 50 voices. Many other languages are in preparation. The software has been compiled on 21 machine/OS combinations.
Past R&D projects
The SLOWDIO project (2013 - 2015)
In this project we implement new methods for time-stretching of stereo audio signals, especially audio generated during sport events. This will enable viewers to watch slow-motion videos with synchronous time-stretched quality-preserved sound.
The HandSketch project (2012 - 2014)
Development of a new digital musical instrument that will give a musician the possibility to perform synthetic singing on stage.
The ILHAIRE project (2011 - 2014)
ILHAIRE is funded under the Future and Emerging Technologies (FET) chapter of the 7th Framework Programme for research of the European Union, a very competitive line of research funding where less than 6% of research proposals get funded. It intends to study the role of laughter during interactions between humans and machines and to develop new paradigms for natural man-machine interactions, including through anthropomorphic avatars that may play an important role in future digital media. The project focuses in particular on non-verbal social communication cues related to smile and laughter, within a framework that will consider laughter as part of dialogs, and using technologies for accurate multimodal capture of the different facets of social communication (voice, gestures, posture and facial expressions). The ILHAIRE consortium is composed of an interdisciplinary team of nine organizations.
The LinkedTV project (2011 - 2015)
LinkedTV is supported under the Networked Media and Search Systems strategic objective of the 7th Framework Programme for research of the European Union. The project aims to provide a novel practical approach to Networked Media based on four phases: annotation, interlinking, retrieval, and presentation. LinkedTV will make it possible to seamlessly connect multimedia content on the Web by integrating networked media analysis, personalization and presentation technologies within a coherent framework. UMONS is involved in novel approaches for gathering user preferences through behavior analysis, and for presentation interfaces facilitating content and video search. The LinkedTV consortium is composed of twelve organizations, and is led by Fraunhofer IAIS.
The COMPTOUX project (2010 - 2013)
The COST IC0903 project (2009 - 2013)
Knowledge Discovery from Moving Objects (MOVE)
The main objective of the Action is to develop improved methods for knowledge extraction from massive amounts of data regarding moving objects.
This Action aims to build a network for collaboration that leads to the improvement of ICT methods for knowledge extraction from massive amounts of data about moving objects. This knowledge is essential to substantiate decision making in public and private sectors. Moving object data typically include trajectories of concrete objects (e.g. humans, vehicles, animals, and goods), as well as trajectories of abstract concepts (e.g. spreading diseases). While movement records are nowadays generated in huge volumes, methods for extracting useful information are still immature, due to fragmentation of research and lack of comprehensiveness from monodisciplinary approaches. Overcoming these limitations calls for COST-like networking. In response to a strong expression of interest from the academic, industrial, and user communities, this Action will empower the development of substantial and widely applicable methods in mobility analysis, focusing on representation and analysis of movement, including spatio-temporal data mining, and visual analytics. Results will be demonstrated through showcases for decision makers. Researchers from various subdomains in computer and geographic information sciences will join domain specialists from a broad range of relevant applications, from courier services and transportation to ecology, and epidemiology, among others. This will make Europe a central stakeholder in an emerging key domain.
The EUCogII project (2009 - 2012)
EUCogII is a European network for researchers in artificial cognitive systems and related areas who want to connect to other researchers and reflect on the challenges and aims of the discipline. The network funds meetings, workshops, members' participation in academic events, faculty exchanges and other activities that further its aims. It continues and builds on the work of the FP6 euCognition network (2006-2008).
EUCogII is funded by the Information and Communication Technologies division of the European Commission, Cognitive Systems and Robotics unit, under the 7th Research Framework Programme (FP7-ICT-EUCogII-231281).
The MediaTIC project (2008 - 2015)
The MediaTIC portfolio was submitted in September 2007 in response to the first call for proposals of the ERDF and started on 1st July, 2008. This ambitious project falls within the scope of measure 2.2, dedicated to exploiting the potential of research centres. More concretely, the project's objective is to increase the competitiveness of innovative technology SMEs in Wallonia through collective projects driven by concrete industrial requests. It works as a cross-cutting action for innovation in the ICT component of each strategic line defined by the Walloon Marshall Plan.
To reach that goal, Multitel, as project leader, has gathered a consortium of academic entities and research centres spread across the Walloon territory. MediaTIC was submitted under both objectives of the 2007-2013 period of the European structural funds programme, namely "Convergence" and "Regional Competitiveness and Employment". The project draws on the know-how of laboratories such as the SEMI, TCTS and Telecommunications units of the Faculté polytechnique de Mons, the TELE laboratory of the Catholic University of Louvain-la-Neuve, the research units in microelectronics (Microsys) and signal & image processing (Intelsig) of the University of Liege, the Centexbel and SIRRIS research centres and, finally, the GIE MUWAC. By calling upon complementary partners, Multitel aimed to give MediaTIC the leverage typical of collaborative research and to focus the projects on common objectives.
MediaTIC is a portfolio of six integrated projects oriented towards specific industrial needs. Each one is run by a specialist from Multitel in the targeted field. These thematic platforms are Transmedia, Envimedia, Tracemedia, Intermedia, 3Dmedia and Optimedia.
The OLIMP project (2008 - 2013)
Live interactive multimedia (LIM) applications demand very high performance to meet requirements for processing quality and speed. This project focuses on the study and development of software and hardware tools to meet the needs of real-time multimedia users. In particular, we are currently working on exploiting graphics processors (GPUs) for compute-intensive image processing, both for medical applications (edge and motion detection) and for digital arts applications (motion analysis, detection and tracking, compositing of virtual elements into real images). For the latter, there is close collaboration with the Numediart Programme of Excellence (see http://www.numediart.org).
The CALLAS project (2007 - 2010)
CALLAS ("Conveying Affectiveness in Leading-Edge Living Adaptive Systems") is a European Integrated Project (FP6). It aims at designing and developing multimodal architectures giving a strong importance to emotions, for Arts and Entertainment.
The overall idea of the project is that new media, targeting recognition and production of emotions, can enhance users' (or spectators') experience and interaction. CALLAS is thus investigating how, at the input level, emotions can be detected and how, at the output level, these emotions can be processed to generate new audiovisual content enriching users' experience. The input modalities include both vocal and body language (recorded through video cameras and haptic devices). In order to improve the recognition of emotions, the problem of merging the information coming from these different modalities will also be examined. Applications range from digital theatre productions (playing audio or visual content in relation to the actors' and spectators' feelings) to real or virtual museum tours (taking the visitor's interest into account to reshape the exhibition and select the level of information the audioguide will give), as well as interactive television (modifying a scenario according to the spectator's emotions).
The NUMEDIART project (2007 - 2012)
Numediart is a long-term research programme centered on Digital Media Arts, funded by Région Wallonne, Belgium (grant N°716631). Its main goal is to foster the development of new media technologies through digital performances and installations, in connection with local companies and artists.
It is organized around three major R&D themes: HyFORGE - hypermedia navigation, COMEDIA - body and media, COPI - digital instrument making. It is carried out as a series of short (3-month) projects, typically 3 or 4 of them in parallel, each concluded by a 1-week "hands on" workshop.
Numediart is the result of collaboration between Polytech.Mons (Information Technology R&D Department) and UCL (TELE Lab), with a center of gravity in Mons, the cultural capital of Wallonia. It also benefits from the expertise of the Multitel research center on multimedia and telecommunications. As such, it is the R&D component of MONS2015, a broader effort towards making Mons the cultural capital of Europe in 2015.
The COST SID project (2007 - 2011)
Sonic Interaction Design is the exploitation of sound as one of the principal channels conveying information, meaning, and aesthetic/emotional qualities in interactive contexts. The Action proactively contributes to the creation and consolidation of new design theories, tools, and practices in this innovative and interdisciplinary domain. While being advanced through a few sparse projects, this field relies on the COST - SID Action to strengthen the links between scientists, artists, and designers in the European Research Area. The COST - SID platform stands on four legs: (i) perception, cognition, and emotion; (ii) design; (iii) interactive art; (iv) information display and exploration. These are each supported by the research and development of the requisite new interactive technologies. Due to the breadth of its application spectrum, the COST - SID Action has the potential of affecting everyday life through physical and virtual interactive objects, as today there is the possibility to design and actively control their acoustic response so that it conveys an intended aesthetic, informational, or emotional content.
The SERKET project (2006 - 2009)
The goals of SERKET are twofold:
- define the requirements and the specifications of an open security platform for public places and events
- demonstrate the new architectural principle for security systems on realistic scenarios, by integrating heterogeneous sensors (video, audio, human, etc.), by applying advanced fusion technologies to multimedia information and by automatically assessing threats.
The ECLIPSE project (2006 - 2012)
There are various methods of analysis for classifying vocal pathologies, but none is really powerful. First, "perceptive" analysis allows the doctor to rate the quality of the voice according to several criteria; the problem with this method is the subjectivity of the judgement. That is why specialists prefer "acoustic" analysis, a computer-assisted method that computes a series of objective parameters from the vocal signal to characterize the patient's voice. But this method is only effective for analyzing sustained vowels, and not continuous speech, which would be more suitable. Moreover, strongly hoarse speakers are unable to produce pseudoperiodic speech.
The ECLIPSE project aims to develop acoustic analysis software for any type of voice and any degree of hoarseness. The project implements the simultaneous analysis of vocal signals and of images of vocal cord vibration. In addition to a clinical prototype, it aims to produce a portable device for monitoring at-risk patients in their workplace.
The TANIA project (2006 - 2009)
Within the TANIA project, we aim at designing a decision support tool for anesthesiologists. The research involves diverse fields of applied mathematics, in particular data mining and signal processing techniques.
The MOUSTIC project (2005 - 2007)
The MOUSTIC project aims at developing new frameworks, complementary to the existing ones, for the dissemination of road information in Wallonia.
It would use new dissemination channels, which we propose to develop and integrate into the existing stages of the WHIST project (Walloon Highway Information System for Traffic).
The system creates a free communication channel on top of existing broadcasts.
During radio transmission, information will be hidden in the form of pseudo-random noise inaudible to humans.
A low-cost receiver will decode this information and either synthesize it as speech or display it on a screen.
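One common way to hide data as low-level pseudo-random noise is direct-sequence spread spectrum, and the minimal sketch below assumes that technique; it is not taken from the MOUSTIC design. All names, the chip length, the embedding amplitude and the test tone are hypothetical. A real system would also shape the noise below the psychoacoustic masking threshold to keep it inaudible, which this sketch does not attempt.

```python
import math
import random


def pn_sequence(length, seed):
    """Pseudo-random +/-1 chips; transmitter and receiver share the seed."""
    rng = random.Random(seed)
    return [rng.choice((-1.0, 1.0)) for _ in range(length)]


def embed(host, bits, chip_len, alpha, seed):
    """Add each bit to the host audio as a low-level signed copy of the PN sequence."""
    if len(host) < len(bits) * chip_len:
        raise ValueError("host signal too short for the payload")
    pn = pn_sequence(chip_len, seed)
    out = list(host)
    for i, bit in enumerate(bits):
        sign = 1.0 if bit else -1.0
        for j, chip in enumerate(pn):
            out[i * chip_len + j] += alpha * sign * chip
    return out


def decode(received, n_bits, chip_len, seed):
    """Correlate each frame with the PN sequence; the sign of the sum gives the bit."""
    pn = pn_sequence(chip_len, seed)
    bits = []
    for i in range(n_bits):
        frame = received[i * chip_len:(i + 1) * chip_len]
        corr = sum(x * c for x, c in zip(frame, pn))
        bits.append(1 if corr > 0 else 0)
    return bits


# Hypothetical example: a 440 Hz tone sampled at 8 kHz stands in for the broadcast.
payload = [1, 0, 1, 1, 0, 0, 1, 0]
chip_len = 4096
host = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(len(payload) * chip_len)]
marked = embed(host, payload, chip_len, alpha=0.1, seed=42)
recovered = decode(marked, len(payload), chip_len, seed=42)
```

The receiver needs only the shared seed and the chip length: the host audio is largely uncorrelated with the pseudo-random sequence, so its contribution to each correlation sum stays small while the embedded term grows with the chip length.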
The IRMA project (2005 - 2008)
The objective of IRMA is to design and develop an innovative modular interface for personalised, efficient, secure and economically viable multimodal search and navigation in indexed audiovisual databases. It will allow contextual, intuitive and natural search complemented by fluid navigation. In this way, IRMA will provide an environment that makes the best possible use of the intelligence of the search engine's user.
The COST 277 project (2004 - 2005)
The main objective of this COST Action is to improve the quality and capabilities of the voice services for telecommunication systems through the development of new nonlinear speech processing techniques. The proposed new mathematical methods are expected to provide advances in generic speech processing functions. Examples of these are: higher quality speech synthesis, more efficient speech coding, improved speech recognition, and improved speaker identification.
The IC&C project (2004 - 2006)
The IC&C project aims to develop a natural human-machine interface for computer-aided drawing and design systems. Unlike conventional interfaces such as mice, keyboards, icons and menus, the IC&C project proposes a novel interface based on software agents combining the interpretation of freehand drawing, image interpretation and the recognition
The DOMINI project (2004 - 2006)
This project deals with the development of computerized medical records, which calls upon expertise in hospital needs analysis, in data-processing technologies and in computational linguistics. It also requires taking into account the legal aspects of protecting privacy and medical data.
The F3M project (2004 - 2008)
The goal of the project is to assess the usability of a solution based on wearable computers connected through a wireless network to improve workflow in field maintenance, for instance for planes in the aviation sector and for trains in the railway sector. Our concept will equip every field technician with a mobile wearable computer allowing real-time communication with colleagues and with a central server that supervises the whole field maintenance process and is connected to the existing, more traditionally used maintenance database.
The MAIS project (2004 - 2007)
The objective of MAIS is to develop a low-cost, low-consumption, secure smart card that will be readable from a distance.
The main applications of the project will be freight train traceability and inclusion in windshields. For the latter application, the project partners work in close collaboration with Glaverbel.
The DREAMS project (2003 - 2008)
Sleep scoring is essential for the detection of sleep pathologies in hospitals. It is usually performed manually by visual inspection of polysomnograms (PSG: mainly EEG, EMG and EOG). Automated techniques exist, but fail to provide reliable results for pathological sleep.
The DREAMS project aims precisely at producing automated sleep scoring techniques for cases of sleep pathology.
The iMed project (2003 - 2006)
The iMed project is about the design of a method to automatically detect emboli in the vessel tree of the pulmonary artery, from HCT (helicoidal computed tomography) millimeter slices.
The MERCATOR project (2003 - 2007)
In the context of preoperative image visualization and computer-assisted surgical planning, the Mercator project aims at updating the plans made before the operation by integrating real-time information resulting from intra-operative events, in order to readjust the plans and the initial data to the actual evolution during the operation or the radiotherapy.
The SYPOLE project (2003 - 2006)
Blind or partially sighted people number 17.5 million in Europe and about 75,000 in Wallonia. For most of them, much of the information that exists in written or image form is not easily accessible. The main aim of the Sypole project is to address these needs by building a prototype device that is portable, autonomous, small and easy to use for blind or partially sighted people. This kind of device will be able to recognize text and coloured forms, such as logos, and to auto-generate a speech
The SIMILAR project (2003 - 2007)
The SIMILAR European Network of Excellence will create an integrated task force on multimodal interfaces that respond intelligently to speech, gestures, vision, haptics and direct brain connections by merging into a single research group excellent European laboratories in Human-Computer Interaction (HCI) and in Signal Processing.
SIMILAR will develop a common theoretical framework for fusion and fission of multimodal information using the most advanced Signal Processing tools constrained by Human Computer Interaction rules.
SIMILAR will develop a network of usability test facilities and will establish an assessment methodology.
SIMILAR will develop a common distributed software platform available for researchers and the public at large through www.openinterface.org
SIMILAR will address Grand Challenges in the field of edutainment, interfaces for disabled people and interfaces for medical applications.
SIMILAR will establish a top-level foundation which will manage an International Journal, Special Sessions in existing conferences, organize summer schools, interact with key European industrial partners and promote new research activities at the European level.
TCTS Lab's contribution will be on Grand Challenges related to TTS and ASR technologies, and their integration into a multimodal framework. We will also work on enhancing Brain Computer Interfaces. SIMILAR is considered a central project for the evolution of our lab.
The ARMAGEDDON project (2003 - 2004)
Armageddon is an opera sung and played in real time by human-controlled robots. Created by Art Zoyd; the robot voices are taken from the MBROLA Project (under Max/MSP).
The STOP project (2003 - 2006)
The STOP Project aims at studying the relationship between speech dynamics and voice quality, based on home-made tools for efficient source-tract separation.
The CAPA project (2002 - 2004)
The CAPA (Automatic Classification of Agricultural Products) project involves four labs from three universities, which combine their respective skills to develop an automatic classification system for agricultural products, such as apples, according to the quality norms currently applied in practice. Quality will be estimated from any marks, the color, or the shape of the products. The aim is to obtain a concrete prototype that demonstrates the algorithmic and mechanical possibilities of automatic selection of fruits or vegetables.
The MODIVOC project (2002 - 2004)
Speech-based interfaces are about to be used in many applications, the most demanding of which require recognizing any speaker (without prior training of the machine), even in noisy conditions.
The techniques required to achieve this are mostly available, but their use in real portable applications is limited by their memory and CPU consumption. MODIVOC aims at:
- simplifying ASR algorithms
- increasing their robustness
- dispatching CPU load among portable computers in a network
- specifying generic models to apply this solution in heterogeneous environments
The NUMBROLA project (2001 - 2005)
NUMBROLA is an extension of MBROLA towards corpus-based, non-uniform unit (NUU) selection techniques in speech synthesis. The goal of NUMBROLA is to provide a standard concatenative synthesizer to people active in NUU research. A French database and a first version of the software have been made available. We are currently working on an improved version, based on a modified MBROLA algorithm: TP-MBROLA.
The COST 278 project (2001 - 2008)
The main objective of this Action is to create knowledge in several problem areas of spoken language interaction in telecommunications in order to achieve communicative interfaces that provide a natural human-computer interaction through more cognitive, intuitive and robust interfaces, whether monolingual, multilingual or multimodal.
The scientific programme emphasises speech and dialogue processing in multimodal communication interfaces, issues related to robustness and multilinguality, human-computer dialogue theories, and models and systems and associated tools for the establishment of interactive systems. The programme also involves the evaluation of telecommunication applications in which spoken language is the only or one of many types of input or output modalities.
The MLRR project (2000 - 2001)
The goal of this program is to transcribe a symbolic input, i.e. a string of symbols belonging to some alphabet, into a symbolic output according to a regular grammar described in terms of a system of multi-level rewriting rules (MLRR). "Symbols" and "alphabet" have to be understood here as generic terms: they can be characters, phonemes, syllables, words, phrases, etc.
This project is closed but the software is available in Open Source format.
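The multi-level rewriting idea can be illustrated with a small sketch. This is not the MLRR software or its rule format, which are not described here: the rule representation (pairs of symbol tuples without context conditions), the first-match-wins strategy and the names `apply_level` and `transcribe` are all assumptions made for illustration. Each level scans the symbol string from left to right, and the output of one level becomes the input of the next.

```python
from typing import List, Sequence, Tuple

# Hypothetical rule format: (left-hand side, right-hand side), both symbol tuples.
Rule = Tuple[Tuple[str, ...], Tuple[str, ...]]


def apply_level(symbols: Sequence[str], rules: List[Rule]) -> List[str]:
    """Rewrite a symbol string once, left to right, with one level of rules.

    At each position the first rule whose left-hand side matches is applied;
    symbols matched by no rule are copied unchanged.
    """
    for lhs, _ in rules:
        if not lhs:
            raise ValueError("a rule needs a non-empty left-hand side")
    out: List[str] = []
    i = 0
    while i < len(symbols):
        for lhs, rhs in rules:
            if tuple(symbols[i:i + len(lhs)]) == lhs:
                out.extend(rhs)      # emit the replacement symbols
                i += len(lhs)        # skip the consumed input symbols
                break
        else:
            out.append(symbols[i])   # no rule matched: copy the symbol
            i += 1
    return out


def transcribe(symbols: Sequence[str], levels: List[List[Rule]]) -> List[str]:
    """Apply each level in turn; the output of a level feeds the next one."""
    current = list(symbols)
    for rules in levels:
        current = apply_level(current, rules)
    return current


# Toy example: characters at level 1, phoneme-like symbols at level 2.
levels = [
    [(("p", "h"), ("f",)), (("e",), ())],   # level 1: "ph" -> "f", drop "e"
    [(("o",), ("oU",))],                     # level 2: "o" -> "oU"
]
print(transcribe(list("phone"), levels))     # ['f', 'oU', 'n']
```

Because the symbols are plain strings, the same functions work whether a symbol stands for a character, a phoneme, a syllable or a word.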
The ARTHUR project (2000 - 2003)
The ARTHUR prototype system will serve as a point of convergence for the most advanced information technology research groups of the Walloon Region, around the theme of intelligent and user-friendly information technologies. By focusing on a specific activity, assisting the interventions of an emergency physician, it is possible to model a complete chain in an integrated and original way, including research in areas as topical as voice-driven intelligent human-machine interfaces, multicast for secure communications, the creation and storage of active and secure multimedia documents, and user-friendly graphical interfaces.
The RESPITE project (1999 - 2002)
REcognition of Speech by Partial Information TEchniques ESPRIT Long Term Research RTD Project Ref. 28149.
RESPITE extended and applied two novel technologies, missing data theory and multi-stream theory, to the problem of robust automatic speech recognition (ASR), with particular application to cellular phones and in-car environments. It also supported studies whose purpose was to inform this endeavour. The specific measurable objectives were to:
- develop techniques for identifying reliable data,
- advance the theory of multi-stream processing,
- advance the theory of missing and masked data handling,
- inform the above by obtaining new perceptual data on speech recognition,
- combine missing data and multi-stream processing with existing robust ASR methods,
- evaluate all this within a framework of demonstrator ASR applications to cellular phones and in cars.
The DEMOSTHENES project (1998 - 1999)
Acquiring a good command of spoken Dutch is a non-trivial task for most French-speaking learners of the language.
To this end, two Belgian research teams have combined their expertise in speech recognition (Polytechnique - Mons) and software development for foreign language learning (Namur University) to produce multimedia courseware for Dutch pronunciation, which detects and corrects the typical errors made by French-speaking learners, using the hybrid HMM/ANN systems mastered at TCTS Lab. The final product discriminates pronunciation errors at the phoneme level.
The EULER project (1997 - 2001)
For years, non-coordinated research effort on the design of text-to-speech (TTS) systems has led to unavoidable cross-system and cross-language incompatibility. The EULER project aimed at producing a unified, extensible, and publicly available research, development and production environment for multilingual TTS synthesis. EULER has led to the development of a corpus-based French TTS system. The project is no longer supported, but the software components are still available.
EULER has been reworked into eLITE, by the TTS team of MULTITEL ASBL.
The MBRDICO project (1997 - 2001)
MBRDICO is a talking dictionary using MBROLA as a back-end speech synthesizer. Text processing is performed using a complete GNU GPL package for automatic phonetization training (letter/phoneme alignment, decision tree building, stress assignment) and duration/intonation generation. French, US English, and Arabic are available. We no longer work directly on this project, but all its sources are available for use or extension. This work is the result of a collaboration between:
- Faculté Polytechnique de Mons
- Carnegie Mellon University
- University of Edinburgh
The MBROLIGN project (1997 - 2001)
MBROLIGN is a fast MBROLA-based text-to-speech aligner. It is provided free for use in non-commercial applications. The goal of this project is to create large phonetically and prosodically labeled databases for as many languages as possible, thereby drastically expanding the reach of speech technology. This project is currently closed, but the software is available for database creation.
The W project (1997 - 2001)
The W project aimed at creating a fast computer keyboard driver for people with speech disabilities. The related software is based on the grade II Braille languages developed by associations of blind people all over the world, and it minimizes the number of keystrokes needed to utter a word (the name of the project is the grade II abbreviation for "word" in English).
This project has been extended by MULTITEL ASBL in the framework of the FASTY EC/FP5 Project.
The THISL project (1997 - 2000)
Thematic Indexing of Spoken Language (EC RTD Long Term Research Project 23495)
The aim of the THISL project was to produce a broadcast news retrieval demonstrator for the BBC. The approach adopted was to transcribe radio and television broadcasts using the Abbot speech recognizer and then to index the resulting transcriptions using the thislIR information retrieval system - similar to a web search engine - which allows users to search for news items of interest to them. ThislIR returns a list of news clips most relevant to each query which users can listen to. Demonstrators have been produced with both text and spoken query interfaces.
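The transcribe-then-index approach can be sketched in a few lines. This is not thislIR or the Abbot recognizer: the transcripts below are invented placeholders standing in for recognizer output, the scoring is a plain term-frequency count, and the names `build_index` and `search` are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def build_index(transcripts):
    """Map each word to the clips containing it, with occurrence counts."""
    index = defaultdict(Counter)
    for clip_id, text in transcripts.items():
        for word in text.lower().split():
            index[word][clip_id] += 1
    return index


def search(index, query, top_n=3):
    """Return up to top_n clip ids, ranked by summed query-term counts."""
    scores = Counter()
    for word in query.lower().split():
        for clip_id, count in index.get(word, {}).items():
            scores[clip_id] += count
    return [clip_id for clip_id, _ in scores.most_common(top_n)]


# Placeholder transcripts (hypothetical; a real system would use ASR output).
transcripts = {
    "clip1": "election results announced tonight",
    "clip2": "weather storm warning tonight storm",
    "clip3": "football results",
}
index = build_index(transcripts)
print(search(index, "storm warning"))  # ['clip2']
```

A spoken query interface would follow the same path: the query is first transcribed by the recognizer, then passed to `search` as text.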
The SPRACH project (1995 - 1998)
SPeech Recognition Algorithms for Connectionist Hybrids (ESPRIT Long Term Research RTD Project Ref. 20077)
The goal of the proposed project is to further improve the current state-of-the-art in continuous speech recognition using Artificial Neural Network (ANN) and Hidden Markov Model (HMM) approaches. Pursuing the theoretical and development work successfully carried out under the WERNICKE project (ESPRIT Basic Research Project 6487, October 1992-October 1995), this new project, referred to as SPRACH (SPeech Recognition Algorithms for Connectionist Hybrids), will extend the research to robust and flexible speech recognition systems that can easily be adapted to new languages and new domains with new lexica and new syntaxes.
The COST 250 project (1995 - 2000)
Speaker Recognition in Telephony
The COST 249 project (1994 - 2000)
The main objective of the project is to co-ordinate research efforts in the area of multilingual continuous speech recognition for future public network services. This will be accomplished by establishing a unified language-independent speech recognition concept, and by investigating specific topics within the framework of this concept. This way it should be possible to validate the partners' efforts in signal processing, statistical pattern recognition and linguistic processing in a more unified way.
The OOBP project (1994 - 2005)
OOBP is a programming paradigm developed at TCTS Lab since 1994. It is defined as Object Oriented Programming around processes and combines OOP and block descriptions.
Plug and Play Software extends OOBP by defining input and output data as abstract streams.
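As a rough illustration of blocks connected through abstract streams, consider the following sketch. It does not reproduce the OOBP or Plug and Play Software APIs, which are not described here: the `Block` class, the `connect` helper and the use of Python iterators as streams are all assumptions made for illustration.

```python
from typing import Callable, Iterable, Iterator


class Block:
    """A processing block: consumes an input stream, yields an output stream."""

    def __init__(self, fn: Callable[[float], float]):
        self.fn = fn

    def process(self, stream: Iterable[float]) -> Iterator[float]:
        # The block only needs something iterable, not a concrete container.
        for sample in stream:
            yield self.fn(sample)


def connect(source: Iterable[float], *blocks: Block) -> Iterator[float]:
    """Plug blocks together: each block reads the previous block's output."""
    stream = iter(source)
    for block in blocks:
        stream = block.process(stream)
    return stream


gain = Block(lambda x: 2 * x)
offset = Block(lambda x: x + 1)
print(list(connect([1, 2, 3], gain, offset)))  # [3, 5, 7]
```

Since each block sees only an iterable, the source could be a list, a file reader or a live audio feed without changing the blocks themselves.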
The HIMARNNET project (1993 - 1995)
The project covers the development and assessment of neural network techniques for improving the robustness of medium vocabulary (50-100 words), speaker-independent, isolated word recognisers for telephone transmission quality speech. The dominant technology is Hidden Markov Models (HMMs) but this has significant limitations, some of which could be alleviated by the judicious use of artificial neural networks (ANNs) or hybrid combinations of both techniques. Direct comparisons of ANN-based, HMM-based, and hybrid ANN/HMM techniques for speech recognition will be made. The developments will be integrated and validated in the context of a telephone application including speech recognition capabilities. A number of prototypes have been demonstrated on low cost commodity systems. The telephone application developed within the project will be the basis for product development by Tedas.