End to end text to speech system using gruut and onnx
Library for Textless Spoken Language Processing
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
A hackers AI voice assistant, built using Python and PyTorch.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Vocal Remover using Deep Neural Networks
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow