Added `torchaudio.models.Tacotron2()` #669

kaiidams · 2022-07-30T13:12:38Z

Added Tacotron2 implementation from https://github.com/pytorch/audio/blob/e502df0106403f7666f89fee09715256ea2e0df3/torchaudio/models/tacotron2.py

The file has a LICENSE notice from NVIDIA.

PyTorch has four pretrained models:

char based Griffin-Lim
char based WaveRNN
phoneme based Griffin-Lim
phoneme based WaveRNN

The Griffin-Lim variants sound worse than the WaveRNN ones. The phoneme-based models require DeepPhonemizer, which is written with PyTorch, to process the input text.

One of the pretrained models from PyTorch, converted to TorchSharp:
tacotron2_english_phonemes_1500_epochs_wavernn_ljspeech.pth
https://drive.google.com/file/d/11TnhmCSUy7aO1pv9CBi7Qivhz25nl_2l/view?usp=sharing

GeorgeS2019 · 2022-07-30T17:42:04Z

#598 (comment)

test/TorchSharpTest/TestTorchAudioModels.cs

NiklasGustafsson · 2022-08-01T14:23:09Z

src/TorchSharp/TorchAudio/Modules/Tacotron2.cs

+            RegisterComponents();
+        }
+
+        public (Tensor, Tensor, Tensor, Tensor) forward(


Doc comments on all public methods, please.

NiklasGustafsson · 2022-08-01T14:23:35Z

src/TorchSharp/TorchAudio/Modules/Tacotron2.cs

+            }
+        }
+
+        public class Prenet : nn.Module


A public class -- please add doc comments.

NiklasGustafsson · 2022-08-01T14:24:27Z

src/TorchSharp/TorchAudio/Modules/Tacotron2.cs

+    /// Tacotron2 model based on the implementation from
+    /// Nvidia https://github.com/NVIDIA/DeepLearningExamples/.
+    /// </summary>
+    public class Tacotron2 : nn.Module


I added a couple of comments below, but there are public methods and classes in this class that should have doc comments.

NiklasGustafsson · 2022-08-02T03:02:11Z

@kaiidams -- I'm going to make another release this week, since someone found a very serious performance bug in tensor creation. I'd love to get this PR in the release, but no pressure -- if you don't have time to work on it, I'll include it without doc comments, and then create an issue that I'll assign to you to add them later.

Added torchaudio.models.Tacotron2()

fb62383

kaiidams force-pushed the tacotron2 branch from c8c5bc9 to fb62383 on July 30, 2022 16:24

NiklasGustafsson suggested changes Aug 1, 2022

Merge branch 'main' into tacotron2

f2122be

NiklasGustafsson and others added 3 commits August 1, 2022 20:23

Merge branch 'main' into tacotron2

e5302a4

Added error checks for PackedSequence.

cd59037

Added more unit tests for tacotron2.

c798130

NiklasGustafsson merged commit 2f0523e into dotnet:main Aug 2, 2022
