Implementation with PyTorch.
Base model
- LSTM using MFCC audio features (a minimal sketch of this variant follows the requirements list below)
- CNN (a simplified NvidiaNet) with LPC features
Requirements
- Python 3
- PyTorch v0.3.0
- numpy
- librosa & audiolazy
- scipy
- etc.
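For orientation, here is a minimal sketch of what the LSTM variant could look like in PyTorch; the class name `MFCCToBlendshapeLSTM` and the feature/blendshape dimensions are illustrative assumptions, not the actual classes in `models.py`.

```python
import torch.nn as nn

class MFCCToBlendshapeLSTM(nn.Module):
    # A minimal sketch, not the repository's actual model; dimensions are assumptions.
    def __init__(self, mfcc_dim=39, hidden_dim=128, blendshape_dim=51, num_layers=2):
        super(MFCCToBlendshapeLSTM, self).__init__()
        # Sequence model over per-frame MFCC features
        self.lstm = nn.LSTM(mfcc_dim, hidden_dim, num_layers, batch_first=True)
        # Per-frame regression to blendshape weights
        self.fc = nn.Linear(hidden_dim, blendshape_dim)

    def forward(self, x):
        # x: (batch, frames, mfcc_dim)
        out, _ = self.lstm(x)
        return self.fc(out)  # (batch, frames, blendshape_dim)
```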
Scripts to run
- `main.py`: change the net name and set the checkpoints folder to train different models (a minimal training-loop sketch follows this list)
- `test_model.py`: generate blendshape sequences given extracted audio features (needs audio features as input)
- `synthesis.py`: generate blendshapes directly from an input wav (needs the input audio path as an argument)
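A hedged sketch of the kind of training loop `main.py` might drive, reusing the `MFCCToBlendshapeLSTM` class sketched above; the file names, loss, optimizer settings, and checkpoint path are assumptions, not the script's actual configuration.

```python
import numpy as np
import torch
import torch.nn as nn

# Assumed combined feature/blendshape files (see the preprocessing section below)
features = torch.from_numpy(np.load("train_features.npy")).float().unsqueeze(0)
targets = torch.from_numpy(np.load("train_blendshapes.npy")).float().unsqueeze(0)

model = MFCCToBlendshapeLSTM()              # class sketched above
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.MSELoss()                    # per-frame regression loss

for epoch in range(100):
    optimizer.zero_grad()
    loss = criterion(model(features), targets)
    loss.backward()
    optimizer.step()

torch.save(model.state_dict(), "checkpoints/lstm_mfcc/best.pth")
```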
Classes
- `models.py`: classes for the LSTM and CNN (simplified NvidiaNet) models
- `models_testae.py`: advanced models with an autoencoder design
- `dataset.py`: class for loading the dataset
Input preprocessing
- `misc/audio_mfcc.py`: extract MFCC features from input wav files (a minimal extraction sketch follows this list)
- `misc/audio_lpc.py`: extract LPC features
- `misc/combine.py`: combine individual audio feature/blendshape files into a single file for data loading
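A minimal sketch of MFCC extraction with librosa, in the spirit of `misc/audio_mfcc.py`; the number of coefficients and the file names are illustrative assumptions, not the script's actual parameters.

```python
import librosa
import numpy as np

def extract_mfcc(wav_path, n_mfcc=13):
    # Load the waveform at its native sample rate
    y, sr = librosa.load(wav_path, sr=None)
    # One MFCC vector per analysis frame
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.T  # (frames, n_mfcc)

np.save("example_mfcc.npy", extract_mfcc("example.wav"))
```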
To build your own dataset, preprocess your wav/blendshape pairs with `misc/audio_mfcc.py` or `misc/audio_lpc.py`, then combine the resulting feature/blendshape files with `misc/combine.py` into a single feature/blendshape file (a minimal sketch follows).
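A minimal sketch of the combine step, assuming per-clip `.npy` feature files; the file names and layout are assumptions, not what `misc/combine.py` actually expects.

```python
import numpy as np

# Hypothetical per-clip feature files produced by the extraction step
feature_files = ["clip1_mfcc.npy", "clip2_mfcc.npy"]
combined = np.concatenate([np.load(f) for f in feature_files], axis=0)
np.save("train_features.npy", combined)  # single file for the data loader
```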
Modify `main.py`: set the model to the one you need and specify the checkpoint folder.
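For example, the switch might look like the following; the variable names are assumptions for illustration, not `main.py`'s actual ones.

```python
# Hypothetical configuration inside main.py
net_name = "LSTM"                         # or "CNN" for the LPC-based model
checkpoint_dir = "checkpoints/lstm_mfcc"  # separate folder per experiment
```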
- Both `test_model.py` and `synthesis.py` can be used to generate blendshape sequences. `test_model.py` accepts extracted audio features (MFCC/LPC); `synthesis.py` takes a raw wav file as input (an inference sketch follows this list).
- State the arguments and it will produce a blendshape test file.
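A hedged sketch of offline inference from extracted features, in the spirit of `test_model.py`, reusing the `MFCCToBlendshapeLSTM` class sketched earlier; the checkpoint and file paths are assumptions. (Modern PyTorch API is shown; v0.3.0 would wrap tensors in `Variable` instead of using `torch.no_grad()`.)

```python
import numpy as np
import torch

model = MFCCToBlendshapeLSTM()            # class sketched earlier
model.load_state_dict(torch.load("checkpoints/lstm_mfcc/best.pth"))
model.eval()

features = np.load("example_mfcc.npy")    # (frames, mfcc_dim)
with torch.no_grad():
    x = torch.from_numpy(features).float().unsqueeze(0)  # add batch dimension
    blendshapes = model(x).squeeze(0).numpy()            # (frames, blendshape_dim)

np.save("example_blendshapes.npy", blendshapes)
```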