All hail cb4960!
I've spent the last four days figuring out how to use subs2srs to make an Anki deck for Tonari no Totoro. I'm not quite done yet. I'm leaving out all the false starts and errors.
Note: if I mention using a program, it doesn't mean it's the best program. It's just the first program I found when I googled for an answer. Feel free to recommend better ones.
Here's how I did it:
1. I bought My Neighbor Totoro. (DVD)
2. I used DvdFab 7 to take out the VOB's.
Why?
The newest version uses encryption to make the disk look corrupt. Even Handbrake chokes on it.
Is this really necessary?
Not for most DVDs. You should be able to manually copy the VOB files to your harddrive.
What are VOBs?
VOBs are the Video Object Files that contain the movie.
3. I used SubRip v1.50 Beta 4 to turn the English subtitles (and timings) into a .srt file.
Why?
I wanted the timings to use on the Japanese subtitles. The English voice actors have to match the mouth flaps when they do a dub, so the timings were going to be good enough to work with.
How does it work?
You click Open Vob (or image sequence, or hardsubbed files), and pick the directory that has the vobs in it and click start. A box pops up and asks you to fill in the first letter it can't recognize. You do this for 26 letters, plus upper case, plus punctuation, and it automagically writes out a text file for you. It's pretty neat, actually.
4. I found the Japanese script for Tonari no Totoro, and hit Google Translate.
What? Why?
Google translate turns out terrible translations, but when you copy/paste selected text, it gives you the original Japanese followed by the mangled English on the same line, like so:
(サツキ) お父さん キャラメル (Satsuki) Caramel Dad
(父さん) おっ ありがとう (Father) Thank you Oh
くたびれたかい? Kai tired?
ううん No
This is a great timesaver for lining up subs.
4. I did something really complicated with OpenOffice to turn the .srt file into something I could manipulate in a spreadsheet.
WHAT?!?
Srt files look like this:
27
00:02:43,070 --> 00:02:46,471
-Dad, want some carameI?
-Thanks. How you doing back there?
28
00:02:46,573 --> 00:02:48,040
-Fine.
-Are you tired? Oop!
What I needed was to put everything on one line like this:
A spreadsheet seemed like the thing to do, with cells for:
Number | Timing range | English translation | Japanese script line |
Why?
So I could mangle it back into an .srt file format after I lined up the script with the timings.
Okay, start the complicated explanation.
5. Complicated explanation:
Using Open Office writer, open the .srt file as a text file.
Go to Edit: Find/Replace.
` Pick the "More Options" button
Click "Regular Expressions"
Search for: $
(Dollar sign, which means linebreak in in Regular Expression speak.)
Replace with: @
(@ means nothing - it's a placeholder for the next step)
Then:
Search for: @
Replace with: \t
(\t means tab in Regular Expression speak. Get the backslash direction right, though.)
Congratulations! You now have a tab delimited text file that any spreadsheet in the world will recognize!
You can reverse the process when you are done working in the spreadsheet. Export -> tab delimited does the trick, and then open the file in a word processor and do these steps backwards. Microsoft Word doesn't use Regular Expressions, but they do let you use special characters and find/replace the same way.
6. Time to cut and paste!
What?!? More steps?
Yes.
Cut and paste the Japanese script with the Google Translate stuff into a column next to the English subtitles. You should be able to do hundreds of lines at a time. This beats doing this in Aegisub, where I could not figure out how to do it except one line at a time, and you lose the English lines as a reference for whether you are lined up correctly.
Cut and paste the Japanese script without the Google Translate stuff into the next column.
The Japanese script is going to have two lines where the English subtitles have one. So, after you cut and paste the script, you have to go back and combine lines, and then delete the empty cell to move things up. Do this for both columns of Japanese script.
7. Finishing work
I'm not done yet, but the finishing steps are:
a. delete the columns with the English subs
b. delete the terrible google translate translation column
c. export the spreadsheet as tab delimited text
d. import the text file into a word processor and exchange the tabs for returns. (Step 5 in reverse)
e. save as a text file. Hopefully the word processor will keep it as UTF-8, or you'll lose all your kana
8. Checking
Open the video in VLC and after starting the movie, go to Video: Subtitles Track: Open File and try out your file.
9. Now that it doesn't sync, download Aegisub, and open your subs in that program.
In Aegisub, after you open your subs, go to Audio: Open audio from video.
After Aegisub crashes, download BeSweet and BeLight. Dump BeSweet into the same folder as BeLight.
Open BeLight, and use it to open one of the VOBs (or better yet, avi) from the movie and pick the WAV tab. You want to save the audio track as a WAV. Do this for each VOB.
Why does Aegisub crash?
Unless it is working with WAV files, it decompresses the audio file into your computer's memory - as a giant WAV file! In my case 972MB of wav files. More than my computer could handle, given the other things I had open.
10. Now you have an important choice:
You can either edit the timings with multiple srt files, the original VOBs for video, and multiple WAV files
OR
You can edit the timings with the one srt file you've just spent hours creating, and you rip the VOBs into one video file, and join the WAVs into one giant WAV file.
# If you want to mess with multiple VOBs & multiple WAVs, you'll have to split your srt file to match.
# If you want to work with what you've created, you'll need DVDfab or Handbrake to turn your VOBs into a single video file (I think mkv format works.) You can use Audacity (yet another free program) to connect all your WAV files.
11. Use Aegisub to edit your subtitle timing.
12. Import your video file into Subs2SRS, along with your subtitle files. Hit preview a bunch of times to see if it complains.
13. There's a 'preview timings' thing in Subs2SRS. You should check to see if it works.
14. Run Subs2SRS.
15. Write a long-winded post describing how you did it.
16. ???
17. Profit!
Thanks for reading this far. It took me 2 1/2 hours to write this up, including going back and checking steps in the different software programs. Please let me know how I can improve this.
I've spent the last four days figuring out how to use subs2srs to make an Anki deck for Tonari no Totoro. I'm not quite done yet. I'm leaving out all the false starts and errors.
Note: if I mention using a program, it doesn't mean it's the best program. It's just the first program I found when I googled for an answer. Feel free to recommend better ones.
Here's how I did it:
1. I bought My Neighbor Totoro. (DVD)
2. I used DvdFab 7 to take out the VOB's.
Why?
The newest version uses encryption to make the disk look corrupt. Even Handbrake chokes on it.
Is this really necessary?
Not for most DVDs. You should be able to manually copy the VOB files to your harddrive.
What are VOBs?
VOBs are the Video Object Files that contain the movie.
3. I used SubRip v1.50 Beta 4 to turn the English subtitles (and timings) into a .srt file.
Why?
I wanted the timings to use on the Japanese subtitles. The English voice actors have to match the mouth flaps when they do a dub, so the timings were going to be good enough to work with.
How does it work?
You click Open Vob (or image sequence, or hardsubbed files), and pick the directory that has the vobs in it and click start. A box pops up and asks you to fill in the first letter it can't recognize. You do this for 26 letters, plus upper case, plus punctuation, and it automagically writes out a text file for you. It's pretty neat, actually.
4. I found the Japanese script for Tonari no Totoro, and hit Google Translate.
What? Why?
Google translate turns out terrible translations, but when you copy/paste selected text, it gives you the original Japanese followed by the mangled English on the same line, like so:
(サツキ) お父さん キャラメル (Satsuki) Caramel Dad
(父さん) おっ ありがとう (Father) Thank you Oh
くたびれたかい? Kai tired?
ううん No
This is a great timesaver for lining up subs.
4. I did something really complicated with OpenOffice to turn the .srt file into something I could manipulate in a spreadsheet.
WHAT?!?
Srt files look like this:
27
00:02:43,070 --> 00:02:46,471
-Dad, want some carameI?
-Thanks. How you doing back there?
28
00:02:46,573 --> 00:02:48,040
-Fine.
-Are you tired? Oop!
What I needed was to put everything on one line like this:
Code:
27 00:02:43,070 --> 00:02:46,471 -Dad, want some carameI? -Thanks. How you doing back there? (サツキ) お父さん キャラメル (Satsuki) Caramel Dad (父さん) おっ ありがと (Father) Thank you OhNumber | Timing range | English translation | Japanese script line |
Why?
So I could mangle it back into an .srt file format after I lined up the script with the timings.
Okay, start the complicated explanation.
5. Complicated explanation:
Using Open Office writer, open the .srt file as a text file.
Go to Edit: Find/Replace.
` Pick the "More Options" button
Click "Regular Expressions"
Search for: $
(Dollar sign, which means linebreak in in Regular Expression speak.)
Replace with: @
(@ means nothing - it's a placeholder for the next step)
Then:
Search for: @
Replace with: \t
(\t means tab in Regular Expression speak. Get the backslash direction right, though.)
Congratulations! You now have a tab delimited text file that any spreadsheet in the world will recognize!
You can reverse the process when you are done working in the spreadsheet. Export -> tab delimited does the trick, and then open the file in a word processor and do these steps backwards. Microsoft Word doesn't use Regular Expressions, but they do let you use special characters and find/replace the same way.
6. Time to cut and paste!
What?!? More steps?
Yes.
Cut and paste the Japanese script with the Google Translate stuff into a column next to the English subtitles. You should be able to do hundreds of lines at a time. This beats doing this in Aegisub, where I could not figure out how to do it except one line at a time, and you lose the English lines as a reference for whether you are lined up correctly.
Cut and paste the Japanese script without the Google Translate stuff into the next column.
The Japanese script is going to have two lines where the English subtitles have one. So, after you cut and paste the script, you have to go back and combine lines, and then delete the empty cell to move things up. Do this for both columns of Japanese script.
7. Finishing work
I'm not done yet, but the finishing steps are:
a. delete the columns with the English subs
b. delete the terrible google translate translation column
c. export the spreadsheet as tab delimited text
d. import the text file into a word processor and exchange the tabs for returns. (Step 5 in reverse)
e. save as a text file. Hopefully the word processor will keep it as UTF-8, or you'll lose all your kana

8. Checking
Open the video in VLC and after starting the movie, go to Video: Subtitles Track: Open File and try out your file.
9. Now that it doesn't sync, download Aegisub, and open your subs in that program.
In Aegisub, after you open your subs, go to Audio: Open audio from video.
After Aegisub crashes, download BeSweet and BeLight. Dump BeSweet into the same folder as BeLight.
Open BeLight, and use it to open one of the VOBs (or better yet, avi) from the movie and pick the WAV tab. You want to save the audio track as a WAV. Do this for each VOB.
Why does Aegisub crash?
Unless it is working with WAV files, it decompresses the audio file into your computer's memory - as a giant WAV file! In my case 972MB of wav files. More than my computer could handle, given the other things I had open.
10. Now you have an important choice:
You can either edit the timings with multiple srt files, the original VOBs for video, and multiple WAV files
OR
You can edit the timings with the one srt file you've just spent hours creating, and you rip the VOBs into one video file, and join the WAVs into one giant WAV file.
# If you want to mess with multiple VOBs & multiple WAVs, you'll have to split your srt file to match.
# If you want to work with what you've created, you'll need DVDfab or Handbrake to turn your VOBs into a single video file (I think mkv format works.) You can use Audacity (yet another free program) to connect all your WAV files.
11. Use Aegisub to edit your subtitle timing.
12. Import your video file into Subs2SRS, along with your subtitle files. Hit preview a bunch of times to see if it complains.
13. There's a 'preview timings' thing in Subs2SRS. You should check to see if it works.
14. Run Subs2SRS.
15. Write a long-winded post describing how you did it.
16. ???
17. Profit!
Thanks for reading this far. It took me 2 1/2 hours to write this up, including going back and checking steps in the different software programs. Please let me know how I can improve this.
