Perceptually controlled doping for audio source separation

Gaël Mahé; Everton Z Nadalin; Ricardo Suyama; Joao M. T. Romano

Journal article in EURASIP Journal on Advances in Signal Processing, Year: 2014

Gaël Mahé

Role: Author
PersonId: 20279
IdHAL: gael-mahe
ORCID: 0000-0002-3864-2266
IdRef: 070431590

Laboratoire d'Informatique Paris Descartes

Everton Z Nadalin

Role: Author

Ricardo Suyama

Role: Author

Engineering Modeling and applied Social Sciences

Joao M. T. Romano

Role: Author

Laboratory of Signal Processing for Communications

Abstract

The separation of an underdetermined audio mixture can be performed through sparse component analysis (SCA), which, however, relies on the strong hypothesis that the source signals are sparse in some domain. To overcome this difficulty when the original sources are available before the mixing process, informed source separation (ISS) embeds a watermark in the mixture, whose information can help a subsequent separation. Though powerful, this technique is generally specific to a particular mixing setup and may be compromised by an additional bitrate compression stage. Thus, instead of watermarking, we propose a 'doping' method that makes the time-frequency representation of each source sparser while preserving its audio quality. This method is based on an iterative decrease of the distance between the distribution of the signal and a target sparse distribution, under a perceptual constraint. We aim to show that the proposed approach is robust to audio coding and that using the sparsified signals improves source separation compared with using the original sources. In this work, the analysis is limited to instantaneous mixtures and focuses on voice sources.
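The idea of iteratively sparsifying a signal under a bounded modification can be illustrated with a toy sketch. The abstract does not specify the distance measure, the target distribution, or the perceptual model, so everything below is an assumption made for illustration: plain multiplicative shrinkage stands in for the distance-decreasing update, the L1/L2 ratio stands in for the sparsity measure, and a fixed per-coefficient bound (`max_change`) stands in for the perceptual constraint. The names `l1_l2_ratio`, `dope`, `max_change`, `step`, and `keep_fraction` are hypothetical and do not come from the paper.

```python
import math
import random


def l1_l2_ratio(coeffs):
    """Sparsity proxy (an assumption, not the paper's measure):
    a smaller L1/L2 ratio indicates a sparser vector."""
    l1 = sum(abs(c) for c in coeffs)
    l2 = math.sqrt(sum(c * c for c in coeffs))
    return l1 / l2 if l2 > 0 else 0.0


def dope(coeffs, max_change, step=0.2, n_iter=20, keep_fraction=0.1):
    """Toy sparsification of a list of real coefficients.

    The largest `keep_fraction` of coefficients (by magnitude) are left
    untouched. All others are shrunk toward zero a little at each
    iteration, but never moved further than max_change[k] away from
    their original value. That bound is a crude stand-in for a
    perceptual (e.g. masking-based) constraint.
    """
    original = list(coeffs)
    current = list(coeffs)

    # Magnitude of the smallest coefficient that is still "kept".
    n_keep = max(1, int(len(original) * keep_fraction))
    threshold = sorted((abs(c) for c in original), reverse=True)[n_keep - 1]

    for _ in range(n_iter):
        for k, c in enumerate(current):
            if abs(original[k]) >= threshold:
                continue  # dominant coefficient: leave as is
            proposal = c * (1.0 - step)  # shrink toward zero
            # Project back into the allowed interval around the original.
            lo = original[k] - max_change[k]
            hi = original[k] + max_change[k]
            current[k] = min(max(proposal, lo), hi)
    return current


if __name__ == "__main__":
    random.seed(0)
    x = [random.gauss(0.0, 1.0) for _ in range(256)]
    # Hypothetical bound: each coefficient may change by at most half its size.
    bound = [0.5 * abs(c) for c in x]
    y = dope(x, bound)
    print("L1/L2 before:", l1_l2_ratio(x))
    print("L1/L2 after: ", l1_l2_ratio(y))
```

In this sketch the bound is what limits how sparse the result can become: a looser `max_change` allows more shrinkage at the cost of a larger deviation from the original coefficients, which loosely mirrors the trade-off between sparsity and audio quality described in the abstract.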

Keywords

Informed source separation (ISS); Sparse component analysis (SCA); Doping watermarking; Sparsification

Domains

Signal and Image Processing [eess.SP]

Main file

10.1186%2F1687-6180-2014-27.pdf (1.15 MB)

Origin: Publisher files allowed on an open archive

https://u-paris.hal.science/hal-01839081

Submitted on: Friday, July 13, 2018, 17:33:44

Last modified on: Friday, August 5, 2022, 11:40:57

Long-term archiving on: Monday, October 15, 2018, 13:30:34

Dates and versions

hal-01839081, version 1 (13-07-2018)

Identifiers

HAL Id: hal-01839081, version 1

Cite

Gaël Mahé, Everton Z Nadalin, Ricardo Suyama, Joao M. T. Romano. Perceptually controlled doping for audio source separation. EURASIP Journal on Advances in Signal Processing, 2014, 2014, pp.27 - 27. ⟨hal-01839081⟩

Collections

LIPADE UP-SCIENCES
