Étude de transformées temps-fréquence pour le codage audio faible retard en haute qualité
Institution:
Rennes 1Disciplines:
Directors:
Abstract EN:
In recent years there has been a phenomenal increase in the number of products and applications which make use of audio coding formats. Among the most successful audio coding schemes, we can list the MPEG-1 Layer III (mp3), the MPEG-2 Advanced Audio Coding (AAC) or its evolution MPEG-4 High Efficiency-Advanced Audio Coding (HE-AAC). More recently, perceptual audio coding has been adapted to achieve low delay audio coding and to become suitable for conversational applications. Traditionally, the use of filter bank such as the Modified Discrete Cosine Transform (MDCT) is a central component of perceptual audio coding and its adaptation to low delay audio coding has become a very popular research topic. Low delay transforms have been developed Fin order to maintain the performances of this main component while reducing dramatically the associated algorithmic delay. This work presents a low delay block switching tool which aliows the direct transition between long transform and short transform without the insertion of transition window. The same principle has been extended to define new perfect reconstruction conditions for the MDCT with relaxed constraints compared to the original definition. A seamless reconstruction method has been derived allowing to increase the flexibility of transform coding schemes with the possibility to select a transform window independently from the previous and the following frames. Additionally, based on this new approach, a new low delay window design procedure has been derived allowing to obtain an analytic defmition. Those new approaches have been successfully applied to the newly developed MPEG low delay audio coding (LD-AAC and ELD-AAC) allowing to significantly improve the quality for transient signais. Moreover, the low delay window design has been adopted in G. 718, a scalable speech and audio codec standardized in ITU-T and has demonstrated its benefit in terms of delay reduction while maintaining the audio quality of a traditional MDCT.
Abstract FR:
Pas de résumé disponible.