DANet-For-Speech-Separation

Pytorch implement of DANet For Speech Separation

Chen Z, Luo Y, Mesgarani N. Deep attractor network for single-microphone speaker separation[C]//2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017: 246-250.

Requirement

Pytorch 0.4.0
librosa 0.7.1
PyYAML 5.1.2

Due to the RNN multi-GPU parallel problem, only PyTorch 0.4.0 is supported.

Training steps

First, you can use the create_scp script to generate training and test data scp files.

python create_scp.py

Then, in order to reduce the mismatch of training and test environments. Therefore, you need to run the util script to generate a feature normalization file (CMVN).

python ./utils/util.py

Finally, use the following command to train the network.

python train.py -opt ./option/train.yml

The repository model code draws on DANet from naplab. It is found through experiments that the loss value cannot be decreased, and I have limited ability and have not identified the cause of the problem. Therefore, only training codes are released for reference only.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
config		config
data_loader		data_loader
logger		logger
model		model
trainer		trainer
utils		utils
.gitignore		.gitignore
README.md		README.md
cmvn.ark		cmvn.ark
create_scp.py		create_scp.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DANet-For-Speech-Separation

Requirement

Training steps

About

Releases

Packages

Languages

JusperLee/DANet-For-Speech-Separation

Folders and files

Latest commit

History

Repository files navigation

DANet-For-Speech-Separation

Requirement

Training steps

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages