
BASIC DATA
- Education, educational technologies, audiovisual industry, public administrations
- Other
Spanish Ministry of Science, «Retos Investigación» R&D&I Projects 2018
Multilingual subtitling of classrooms and plenary sessions
The main aim of the project is to advance the state of the art in Automatic Speech Recognition (ASR) and Statistical Machine Translation (SMT) for two kinds of audiovisual collections: lecture recordings and plenary sessions. To that end, the project will address several open research challenges in ASR and SMT: i) channel variability, noise and reverberation; ii) far-field speech recognition; iii) speaker diarization; iv) multispeaker recognition; v) on-line speech recognition; vi) neural machine translation; vii) translation of sentences containing ASR errors.
The technology developed will be tested in two use cases. The first is “Videoapunts”, a repository of classroom video recordings developed at the Universitat Politècnica de València. The classes are recorded using microphone arrays and the open-source Opencast platform, which supports the management of educational audio and video content. The second draws on the transparency portals of public administrations to create a repository of video recordings of plenary sessions.
Albert Sanchis Navarro
Associate Professor (Universitat Politècnica de València)
VRAIN