# Voice2Series: Reprogramming / Prompting Acoustic Models for Time Series Classification
Paper | Colab Demo | Video | Slides
We provide an end-to-end approach (the reprogramming layer) for reprogramming acoustic models on raw-waveform time series, using a differentiable mel-spectrogram layer from kapre. No offline acoustic feature extraction is required, and all layers are differentiable.
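The reprogramming idea can be sketched in a few lines of numpy: zero-pad the short target series to the acoustic model's input length, then add a trainable perturbation on the padded region. This is a minimal illustrative sketch (function names, shapes, and the masking choice are assumptions for illustration, not the repo's API):

```python
import numpy as np

def reprogram(x, delta, acoustic_len=16000):
    """Pad time series x to the acoustic input length and add the trainable delta."""
    padded = np.zeros(acoustic_len, dtype=np.float32)
    padded[: len(x)] = x                 # place the target series at the front
    mask = np.zeros(acoustic_len, dtype=np.float32)
    mask[len(x):] = 1.0                  # perturb only the padded (unused) region
    return padded + mask * delta         # reprogrammed waveform for the acoustic model

x = np.sin(np.linspace(0.0, 3.14, 96)).astype(np.float32)  # a 96-step series
delta = np.random.default_rng(0).normal(0, 0.01, 16000).astype(np.float32)
y = reprogram(x, delta)
print(y.shape)  # (16000,)
```

In the actual pipeline this reprogrammed waveform is fed through the differentiable mel-spectrogram layer into the pretrained acoustic model, and `delta` is trained by backpropagation.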
A PyTorch version of the reprogramming layer can be found in ICASSP 23 Music Reprogramming.
Update: if you have used the ECG 200 dataset in this code, please `git pull` and refer to the issue for a reported label-loading error (it has been fixed).
Environment: Tensorflow 2.2 (CUDA 10.0) and Kapre 0.2.0.
PyTorch note: in response to strong interest from the community, we will also provide PyTorch V2S layers and a framework incorporating the new torchaudio layers. Feel free to email the authors for further reprogramming collaborations.
- Option 1 (from yml):

```shell
conda env create -f V2S.yml
```

- Option 2 (from clean Python 3.6):

```shell
pip install tensorflow-gpu==2.1.0
pip install kapre==0.2.0
pip install h5py==2.10.0
pip install pyts
```
- Random Mapping

Please also check the paper for the validation details. Many thanks!

```shell
python v2s_main.py --dataset 0 --eps 20 --mod 2 --seg 18 --mapping 1
```
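Label reprogramming maps several source (acoustic) classes onto each target class. A minimal numpy sketch of such a many-to-one random label mapping follows; the function names, class counts, and aggregation-by-sum are illustrative assumptions, not the repo's API:

```python
import numpy as np

def random_label_mapping(n_source, n_target, per_target, seed=0):
    """Randomly assign `per_target` distinct source classes to each target class."""
    rng = np.random.default_rng(seed)
    chosen = rng.permutation(n_source)[: n_target * per_target]
    return chosen.reshape(n_target, per_target)  # row t: source classes for target t

def map_probs(source_probs, mapping):
    """Aggregate source-class probabilities into target-class scores."""
    return np.stack([source_probs[..., idx].sum(axis=-1) for idx in mapping], axis=-1)

mapping = random_label_mapping(n_source=30, n_target=2, per_target=3)
probs = np.full((1, 30), 1.0 / 30.0)     # a uniform source posterior, for illustration
target_scores = map_probs(probs, mapping)
print(target_scores)  # each target score is 3/30 = 0.1
```

The predicted target class is then the argmax over the aggregated scores.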
- Result

```shell
Epoch 14/20
3601/3601 [==============================] - 4s 1ms/sample - l
```