Journal article, IEEE/ACM Transactions on Audio, Speech and Language Processing. Year: 2018

Perceptually Controlled Reshaping of Sound Histograms

Gaël Mahé
Mériem Jaidane
  • Role: Author
  • PersonId : 842812

Abstract

Many audio processing algorithms perform optimally only for specific statistical distributions of the signal, which not all signals satisfy. When the original signal is available, we propose to add an inaudible noise so that the distribution of the signal-plus-noise mixture is as close as possible to a given target distribution. The proposed generic algorithm (independent of the application) iteratively adds a low-power white noise to a flat-spectrum version of the signal, until either the target distribution or the audibility threshold of the noise is reached. Audibility is assessed through a frequency masking model. Two implementations of this sound reshaping are described, depending on the scale of the targeted transformation and on the intended application: Histogram Global Reshaping (HGR), which changes the global shape of the histogram, and Histogram Local Reshaping (HLR), which locally "chisels" the histogram while keeping its global shape unchanged. These two variants are illustrated by two applications in which the noise generated by the algorithm must be inaudible: "sparsification" for source separation (HGR), and low-pass filtering of the histogram for application of the quantization theorem (HLR). In both cases, the target histogram is reached or almost reached and the transformation is inaudible. The experiments show that source separation performs better with HGR and that HLR enables a better application of the quantization theorem.
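The iterative principle described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm. It assumes the input is already a flat-spectrum signal, replaces the frequency masking model with a crude global noise-power budget (`max_noise_db`, relative to the signal power), and moves the samples toward the target by rank-based histogram matching rather than by adding white noise. The function name `reshape_histogram` and all of its parameters are hypothetical.

```python
import numpy as np


def reshape_histogram(x, target_samples, max_noise_db=-30.0,
                      step=0.05, tol=1e-3, max_iter=200):
    """Nudge x toward a target distribution in small steps.

    Stops when the samples are within `tol` of their target quantiles
    (target reached) or when the next step would push the added noise
    above `max_noise_db` relative to the signal power. That budget is a
    simplified stand-in for a frequency masking model (assumption).
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Rank of each sample, then the target quantile matching that rank.
    ranks = np.argsort(np.argsort(x))
    goal = np.quantile(np.asarray(target_samples, dtype=float),
                       (ranks + 0.5) / n)
    # Maximum allowed noise power (linear scale).
    budget = np.mean(x ** 2) * 10.0 ** (max_noise_db / 10.0)

    y = x.copy()
    for _ in range(max_iter):
        if np.mean(np.abs(y - goal)) < tol:
            break  # target distribution reached
        candidate = y + step * (goal - y)
        if np.mean((candidate - x) ** 2) > budget:
            break  # noise budget ("audibility") reached
        y = candidate
    return y, y - x


# Usage: push a Gaussian signal toward a sparser (Laplacian) histogram.
rng = np.random.default_rng(0)
x = rng.standard_normal(4096)
target = rng.laplace(size=4096)
y, noise = reshape_histogram(x, target, max_noise_db=-20.0)
snr_db = 10.0 * np.log10(np.mean(x ** 2) / np.mean(noise ** 2))
print(f"signal-to-added-noise ratio: {snr_db:.1f} dB")
```

The two exit conditions of the loop mirror the two stopping criteria named in the abstract: the target distribution is reached, or the added noise would become audible. A perceptually meaningful version would replace the global power budget with a per-frequency masking threshold.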
Main file
SoundHistogramReshaping_TASLP2018pub.pdf (1.07 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-01828960, version 1 (13-07-2018)

Identifiers

Cite

Gaël Mahé, Mériem Jaidane. Perceptually Controlled Reshaping of Sound Histograms. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (9), pp. 1671-1683. ⟨10.1109/TASLP.2018.2836143⟩. ⟨hal-01828960⟩
