Thoughts on auxiliary audio losses using V-Diffusion #54
brentspell started this conversation in Ideas · Replies: 0 comments
First, thanks for creating this repo; it's a great resource for audio ML.
The Moûsai paper hints at additional/perceptual losses in the Future Work section. I'm curious whether this would be possible to do in the V-Diffusion framework, since the denoiser predicts the "velocity" of the noise instead of the clean audio. Do you know of a transformation that could be applied to the model outputs at training time, for comparing against ground truth using an additional criterion?
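One such transformation does exist in the v-prediction setting. With the parameterization of Salimans & Ho's "Progressive Distillation" paper, the noisy input is x_t = α·x₀ + σ·ε and the velocity target is v = α·ε − σ·x₀, which gives the identity x̂₀ = α·x_t − σ·v̂ whenever α² + σ² = 1. That reconstructed x̂₀ can then be compared against the ground-truth audio with any auxiliary criterion (e.g. a spectral or perceptual loss). The sketch below is illustrative only (function names and the cosine schedule are assumptions, not from this repo):

```python
import numpy as np

def x0_from_v(x_t, v, alpha, sigma):
    """Recover a clean-signal estimate from the noisy input and a
    predicted velocity, using x0 = alpha * x_t - sigma * v.

    Valid for variance-preserving schedules where alpha**2 + sigma**2 == 1.
    """
    return alpha * x_t - sigma * v

# Demo with the exact v target: forward-noise a "clean audio" signal,
# form v, and check that the transformation recovers x0 exactly.
rng = np.random.default_rng(0)
x0 = rng.standard_normal(16000)       # stand-in for clean audio
eps = rng.standard_normal(16000)      # Gaussian noise
t = 0.3                               # diffusion time in [0, 1]
alpha = np.cos(t * np.pi / 2)         # assumed cosine schedule
sigma = np.sin(t * np.pi / 2)

x_t = alpha * x0 + sigma * eps        # noisy input to the denoiser
v = alpha * eps - sigma * x0          # the v-prediction target
x0_hat = x0_from_v(x_t, v, alpha, sigma)
assert np.allclose(x0_hat, x0)
```

At training time the model's predicted v̂ would replace the exact target, so x̂₀ is only an estimate (noisy at large t), and an auxiliary loss such as a multi-scale STFT distance between x̂₀ and the ground-truth waveform could be added to the usual v-space MSE.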