Read write Gentle JSON #160

Open
maboa opened this issue Jul 6, 2023 · 2 comments
Labels
enhancement New feature or request user request

Comments

maboa (Member) commented Jul 6, 2023

@natelawrence
I would love it if you added read/write support for Gentle alignments and YouTube Automatic Captions XML/VTT formats that contain per-word timestamps.

I'd like to put a $150 bounty on both:
Hyperaudio editor reads/writes Gentle JSON.
Hyperaudio editor reads/writes YT AC XML.

https://twitter.com/natelawrence/status/1653373649758605313?s=20

maboa added the enhancement (New feature or request) label Aug 10, 2023
scarsellifi pushed a commit to scarsellifi/hyperaudio-lite-editor that referenced this issue Nov 10, 2023
…r-type

flag omission of data-player-type

natelawrence commented May 25, 2024

For context, Gentle is a forced aligner: given an audio file and a transcript, it generates per-word (and even sub-word) timecodes. Its source code is available on GitHub.
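A minimal sketch of what reading such an alignment could look like. It assumes the Gentle output has a top-level `words` array whose entries carry `word`, `case`, `start` and `end` (in seconds), with `case` set to `"success"` for words that were aligned. Those field names are an assumption and should be checked against the attached example files. `load_gentle_words` is a hypothetical helper, not part of Gentle or Hyperaudio.

```python
import json


def load_gentle_words(gentle_json: str) -> list[dict]:
    """Return per-word timings in milliseconds from a Gentle-style alignment.

    Assumed input shape (not confirmed in this thread):
    {"words": [{"word": ..., "case": "success", "start": s, "end": s}, ...]}
    """
    data = json.loads(gentle_json)
    words = []
    for entry in data.get("words", []):
        # Words Gentle could not align are assumed to lack start/end times.
        aligned = (
            entry.get("case") == "success"
            and "start" in entry
            and "end" in entry
        )
        if aligned:
            start_ms = round(entry["start"] * 1000)
            duration_ms = round(entry["end"] * 1000) - start_ms
        else:
            start_ms = None
            duration_ms = None
        words.append(
            {
                "text": entry.get("word", ""),
                "start_ms": start_ms,
                "duration_ms": duration_ms,
            }
        )
    return words


# Hand-written sample data for illustration only (not taken from the attachments).
sample = json.dumps(
    {
        "transcript": "hello world again",
        "words": [
            {"word": "hello", "case": "success", "start": 0.5, "end": 0.9},
            {"word": "world", "case": "not-found-in-audio"},
            {"word": "again", "case": "success", "start": 1.25, "end": 1.75},
        ],
    }
)

for word in load_gentle_words(sample):
    print(word)
```

Unaligned words are kept with `None` timings rather than dropped, so the transcript text survives a read/write round trip even where the aligner found no match.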

🔰 I'm attaching two example Gentle JSON files (for the same recording) below.
ASR Timed Text Format Test 2 [Gentle] P.json

This second JSON file uses homophone substitutions for words that are not in Gentle's pronunciation dictionary in order to acquire more reliable phoneme timecodes for more words.
ASR Timed Text Format Test 2 [Gentle] H.json

The transcripts have some minor custom markup.
[+] = the beginning of a sentence
\\ = the end of a sentence
|| = the end of a paragraph
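As a rough illustration of how an importer might use these markers, the sketch below splits a marked-up transcript into paragraphs of sentences. It assumes the markers appear literally as `[+]`, two backslashes and `||`, and that they are placed as in the invented sample string. Neither assumption has been verified against the attached files, and `split_marked_transcript` is a hypothetical helper.

```python
SENTENCE_START = "[+]"
SENTENCE_END = "\\\\"  # two literal backslash characters
PARAGRAPH_END = "||"


def split_marked_transcript(text: str) -> list[list[str]]:
    """Split a marked-up transcript into paragraphs, each a list of sentences."""
    paragraphs = []
    for block in text.split(PARAGRAPH_END):
        sentences = []
        for chunk in block.split(SENTENCE_END):
            # Drop the sentence-start marker and surrounding whitespace.
            sentence = chunk.replace(SENTENCE_START, "").strip()
            if sentence:
                sentences.append(sentence)
        if sentences:
            paragraphs.append(sentences)
    return paragraphs


# Invented sample; the raw string keeps the double backslashes literal.
sample = r"[+] Hello there. \\ [+] How are you? \\ || [+] Fine, thanks. \\ ||"
print(split_marked_transcript(sample))
```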

The corresponding audio file can be obtained here.

natelawrence commented

Also note that although HyperAudio Converter can import Gentle files and convert them to HyperAudio hypertranscripts, the hypertranscript format it produces is outdated and cannot be imported directly by HyperAudio Lite Editor.
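For illustration, here is a sketch of emitting word-level markup from aligned words. It assumes the editor expects one `<span>` per word carrying `data-m` (start time in milliseconds) and `data-d` (duration in milliseconds) attributes. That convention is an assumption not stated in this thread and should be checked against the editor's current import code. `words_to_spans` and the tuple layout are hypothetical.

```python
from html import escape


def words_to_spans(words: list[tuple]) -> str:
    """Render (text, start_ms, duration_ms) tuples as one paragraph of spans.

    Assumed target markup: <span data-m="START_MS" data-d="DURATION_MS">word </span>
    """
    spans = []
    for text, start_ms, duration_ms in words:
        safe_text = escape(text)
        if start_ms is None:
            # Unaligned word: keep the text but emit no timing attributes.
            spans.append(f"<span>{safe_text} </span>")
        else:
            spans.append(
                f'<span data-m="{start_ms}" data-d="{duration_ms}">{safe_text} </span>'
            )
    return "<p>" + "".join(spans) + "</p>"


# Invented sample data for illustration only.
print(words_to_spans([("Hello", 500, 400), ("R&D", None, None)]))
```

Whether unaligned words should be written without timing attributes or given interpolated times is a design choice left open here.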
