In defence of metric learning for speaker recognition
-
Updated
Mar 26, 2024 - Python
In defence of metric learning for speaker recognition
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
The authors' implementation of the "Neural Head Reenactment with Latent Pose Descriptors" (CVPR 2020) paper.
Speaker identification with VGGVox network
[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"
Python toolkit for speech processing
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.
Few-shot learning experiments mostly on speaker recognition.
说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
A benchmark analysis of some Speaker Verification techniques based on Deep Learning.