In defence of metric learning for speaker recognition
Updated Mar 26, 2024 - Python
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
The authors' implementation of the "Neural Head Reenactment with Latent Pose Descriptors" (CVPR 2020) paper.
Speaker identification with VGGVox network
Python toolkit for speech processing
[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
A benchmark analysis of some Speaker Verification techniques based on Deep Learning.
Implementation of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
Few-shot learning experiments mostly on speaker recognition.
Speaker recognition repository: speaker embeddings with ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.