You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@natelawrence
I would love it if you added read/write support for Gentle alignments and YouTube Automatic Captions XML/VTT formats that contain per-word timestamps.
I'd like to put a $150 bounty on both:
Hyperaudio editor reads/writes Gentle JSON.
Hyperaudio editor reads/writes YT AC XML.
For context, Gentle is a forced aligner that (provided an audio file and a transcript) will generate per-word (and even sub-word) timecodes. Their source code is available here on Github.
This second JSON file uses homophone substitutions for words that are not in Gentle's pronunciation dictionary in order to acquire more reliable phoneme timecodes for more words. ASR Timed Text Format Test 2 [Gentle] H.json
The transcripts have some minor custom markup.
[+] = the beginning of a sentence
\\ = the end of a sentence
|| = the end of a paragraph
The corresponding audio file can be obtained here.
@natelawrence
I would love it if you added read/write support for Gentle alignments and YouTube Automatic Captions XML/VTT formats that contain per-word timestamps.
I'd like to put a $150 bounty on both:
Hyperaudio editor reads/writes Gentle JSON.
Hyperaudio editor reads/writes YT AC XML.
https://twitter.com/natelawrence/status/1653373649758605313?s=20
The text was updated successfully, but these errors were encountered: