Hi, I want to share some software I've written to align an audio recording with a script. It works by transcribing an audio file, splitting up the file into segments, and aligning the best (most recent) take to the script. I hope it can make editing a less daunting process for volunteers.
It's currently hosted at https://kgullion.github.io/vorsovox/ and runs entirely in the browser. I've put up an example file of me (very poorly) reading the preamble of the US Constitution, along with the script, here.
To use, add the files under the select tab, wait for the transcription to finish, and export the highlighted segments. Each segment can be edited by dragging the edges and previewed by double-clicking. For the test data, the output looks something like this:
It does a pretty good job of aligning the file even though the transcriber had some trouble with my slurred pronunciation at times. This is because it first extracts metaphones (phonetic encodings of each word) from both the script and the transcription and aligns those instead of the raw text. It should work even better on files where the speaker is trying to speak clearly.
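For anyone curious how aligning on sound rather than spelling can work, here is a rough Python sketch of the idea. It is not the Vorsovox source: `phonetic_key` is a deliberately crude stand-in for a real metaphone encoder, `align` is a hypothetical helper name, and the matching uses Python's standard `difflib.SequenceMatcher`, which may well differ from what the app does.

```python
import re
from difflib import SequenceMatcher


def phonetic_key(word: str) -> str:
    """Crude phonetic key (hypothetical helper, NOT the real Metaphone algorithm).

    Keeps the first letter, drops later vowels and h/w/y, and collapses
    repeated letters, so similar-sounding spellings tend to share a key.
    """
    letters = re.sub(r"[^a-z]", "", word.lower())
    if not letters:
        return ""
    key = letters[0] + re.sub(r"[aeiouhwy]", "", letters[1:])
    return re.sub(r"(.)\1+", r"\1", key)


def align(script: str, transcript: str) -> list[tuple[int, int]]:
    """Return (script_word_index, transcript_word_index) pairs that match by key."""
    script_keys = [phonetic_key(w) for w in script.split()]
    transcript_keys = [phonetic_key(w) for w in transcript.split()]
    # Compare the two key sequences instead of the raw words.
    matcher = SequenceMatcher(None, script_keys, transcript_keys, autojunk=False)
    pairs = []
    for block in matcher.get_matching_blocks():
        for offset in range(block.size):
            pairs.append((block.a + offset, block.b + offset))
    return pairs


if __name__ == "__main__":
    script = "We the People of the United States"
    transcript = "we the peeple of the united stayts"  # made-up transcription errors
    for s_idx, t_idx in align(script, transcript):
        print(script.split()[s_idx], "<->", transcript.split()[t_idx])
```

In this toy example "peeple" and "stayts" still line up with "People" and "States" because their keys are the same, even though the spellings differ. A real metaphone encoder handles far more cases than this stand-in does.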
This is still very much beta software. I've got a few more features I'd like to add but you're the subject matter experts, so if you have any ideas, feel free to add them. If you dabble in programming at all, you can find the source code here.
Vorsovox - open-source auto-alignment software
-
- LibriVox Admin Team
- Posts: 24590
- Joined: October 17th, 2010, 9:23 pm
- Location: Basking by the Bayou
Would you be so kind as to make a little video showing how this would work? It seems confusing to me, but then most things are confusing to me! A walk-through video would be great.
-
- LibriVox Admin Team
- Posts: 39409
- Joined: April 3rd, 2008, 3:55 am
- Location: Melbourne,Australia
I'm not sure we really are the subject matter experts - we record PD texts read by people for free distribution.
I know some people use our recordings to improve their non-native language skills and read along with the recording, so it might help them; I don't really know.
So I'm moving this to Off Topic.
And everyone is welcome to discuss it here -
Anne