Information

Type
Conference series, symposium, congress
Venue
Ircam, Salle Igor-Stravinsky (Paris)
Duration
45 min
Date
March 23, 2022

This session will present recent results from the Analysis/Synthesis team. It will start with a short presentation of the new features in version 1.3.0 of the Ircam Singing Synthesis Software ISiS, and will continue with more exploratory results demonstrating the use of deep neural networks for manipulating the singing and spoken voice, using the mel spectrogram as a parametric speech representation.

ISiS Version 1.3.0 (Guillaume Doras)
Accessing and manipulating the intermediate parameter representation of the singing voice.

Neural Vocoder (Axel Roebel)
Multi-band Excited WaveNet vocoder for resynthesis of spoken and singing voice from mel spectrograms.

Pitch Manipulation (Frederik Bous)
A deep auto-encoder with a bottleneck that disentangles F0 from the mel spectrogram.

Manipulation of perceived speech attitude (Clément Le Moine Veillon)
A deep neural network for manipulating the perceived attitude in speech.

IRCAM

1, place Igor-Stravinsky
75004 Paris
+33 1 44 78 48 43

Institut de Recherche et de Coordination Acoustique/Musique

Copyright © 2022 Ircam. All rights reserved.