Back

Podcasts to srs

#1
It's nothing special, but I thought someone else could benefit from this so I'm going to explain with a (hopefully) simple step-by-step tutorial how to quickly make .lrc files for podcasts that come with transcript, so that they can be used with subs2srs.

What you'll need:
- mp3 file which you want to srs
- transcript of said audio file*
- the lyric editor provided with Minyliric free**
- subs2srs

*For podcasts with transcripts look into The Podcast Thread, there's at least 3-4 items on that list that include transcripts.
Or you could take a look into podcastle.jp, though I recommend caution especially with episodes with a small number of edits.
Audiobooks are also an option.

**you can use other software to obtain the same result:
Mac users: Karatunes
Linux users: karaoke lyric editor
Windows: ALSong
This guide will use minilyrics to explain the steps, but the procedure shouldn't be much different with alternative software.

The software I use is the one mentioned in this thread, which is... well, not optimal for the task imho, but I couldn't find anything else and I'm pretty satisfied with the results anyway.
A word of caution, even though you can use the program with no limitations for as long as you like, it WILL bug you reminding to go for the premium edition everytime you open the media player you associate with it. Personally I don't mind and there's probably a way to have it open exclusively when you need it to work on some .lrc file, but I didn't look for it.

If that didn't put you off, go through the installation and when asked select the media player you intend to use in combination with this software. I chose windows media player, but it seems to support about every media player that I can think of.

When you are done, open the media player and the Minilyric window should pop up automatically.
Now open the file with the podcast audio. In my example I'll be using this one http://files.podcastjuice.jp/jyounetu/plusp20100430.mp3 from the 情熱大陸+P podcasts.
Starting with podcasts of people reading something, or even audiobook files, should prove easier as there's usually a bit of pause between sentences and there are no people talking over each other, which would make it harder to get the timing down.

Proceed to open the lyric editor:
[Image: 39087677.png]

Now take the podcast transcript ( http://jounetsu.cocolog-nifty.com/plusp/...-e98a.html for the mp3 I'm using) and paste it in:
[Image: 54593932.png]

Now you need to make sure that each sentence is on its own line. Depending on the text you can probably use the period marks as a reference, though often you'll need to first listen to the audio once to look for pauses and indicators of when would be a good time to break that sentence.
Remember that 1 line of text = 1 flashcard so adjust it accordingly. Also eliminate all blank lines (there should be an option in the editor or you can do it manually).

Then place the cursor anywhere on the first line. You should have something like this:
[Image: 76089452.png]

What you'll be using from now on, is the orange button on at the upper-left corner of the editor window
[Image: buttonth.png]
Clicking this button sets the current time as the starting time of the selected line of text, and automatically selects the next line, which means you need to click it right before the selected sentence starts. In other words, what you'll be watching is not the currently selected line, but the previous one: when that one ends it's time to click the button (except for the very first line).

In the mp3 I'm using for this example there quite a bit of intro and commercials, the transcribed part actually starts at around 2:18. So I search for the starting point and pause there on the media player. Go back to the editor and press the orange button:
[Image: 91096550.png]
Now press play either on the media player or on the lyric editor (can also use ctrl+p) and listen to the first sentence being read. When it ends, click the orange button:
[Image: 22749995.png]
*Click*
[Image: 83943653.png]
etc, until you reach the end, then use the "File->Save lyrics as..." option to save your .lrc file (you'll want to save it in a new folder to use it later with sub2srs)

Tips:
- If you mess up you don't need to redo it from scratch, just select the sentence you need to fix and rewind the player to a couple of seconds before that, when the previous sentence ends click the orange button and continue from there (if you need to fix the following sentences too).
- You can use ctrl+pagup/pagdown to go 2 sec back/forward in time, useful if you got distracted and missed the ending point of a sentence.
- Try not to press the button too soon or too late obviously, but don't be maniacal about it. Using sub2srs padding option can usually make up for some minor timing mistakes.
- If there's some audio/cm/noise/silence that you want to skip in between sentences you need, make a blank line on the lyrics editor between those and treat the unneeded part as a sentence, timing it to the blank line. You can exclude it from being exported later with sub2srs.

I won't go in much detail on how to use sub2srs since the general usage is already explained in the original thread with great detail.
What you need to change compared to normal video/sub processing is:
- Insteads of the folder containing japanese subtitle files, you'll be providing the path of the folder where you saved the .lrc file when it asks for "Subs1..."
- Disable generate snapshots, else it'll ask for the folder with the video files.
- Under the "Generate audio clips" option, change source settings from "Extract from video" to "Audio..." and select the folder with the mp3 file (make sure that the folder only contain that file).
I also recommend making use of pad timings and the context options (text only).

I've uploaded a pair of decks that show the final result:

http://www.megaupload.com/?d=TRJ3LY19
2 episodes from 情熱大陸+P, c.ca 100 sentences
original text/audio:
目で『聴く』
カッコいい大人になる方法
(first one is missing context text)

http://www.megaupload.com/?d=Z3IX3DZV
an episode from tbs radio - a piece of laughing, 200~ sentences
Audio
Transcript from podcastle.jp

I've edited the format a bit for my personal usage but it should be easy to adapt them to whatever format you use normally.
Note: there's obviously no english translation, I also cleared the "notes" field because I was using english definitions for the most part and not everyone likes them. Readings are auto-generated and may be off, I usually correct them (listening to the audio) when I see the card the first time.

What I usually do to study with these decks is:
- get the first exposition to the text/audio while making the .lrc file.
- import into anki (optional: regenerate all readings) and study them as new cards, going through them in the correct order (basically listening to it again but a sentence at a time).
- whenever there's new vocab, grammar point, tidbit of info I want to add to the card, I hit the edit button and add the definition/explanation/stuff to the "notes" field, which will show in the answer.
- Then I review them normally as sentence cards.

Aaaand... that's about it. I probably managed to make it sound more complicated than it is, but making the timed .lrc file shouldn't take much more time than it would take to listen to the original audio, once you get used to it.
I'm honestly no expert in this field, so if anyone has suggestions/software/criticism to make the whole process easier or more useful, then by all means please share.

Since this is basically my first post with some substance to it, I would also like to take the occasion to thank cb4960 for that incredibly useful piece of software that is sub2srs, Nuriko for the podcast thread which has basically become my main source of japanese material for immersion, ファブリス for this awesome website that is at the core of this awesome community, and the RevTK community itself as a whole for being informative, resourceful, helpful and for making this journey to mastering the japanese language a much more fulfilling one.
You guys rock!
Edited: 2010-05-19, 3:59 pm
Reply
#2
One, awesome steps. MUCH better than my ham-handed method of using audacity and mark splits.

Two: Do you know of ANY iPod app that could play these lyrics/subs while an audio is playing? Reason being, I have lots of drama audio ripped onto my iPod, be shame not to have lyrics rolling across then screen at the same time.

PS: If iPod automatically does this with the lyric entry for songs in the audio file properties then I'll feel silly.
Reply
#3
Nukemarine Wrote:PS: If iPod automatically does this with the lyric entry for songs in the audio file properties then I'll feel silly.
Time to start feeling silly - most ipods do this (except the shuffles)
Reply
May 16 - 30 : Pretty Big Deal: Save 31% on all Premium Subscriptions! - Sign up here
JapanesePod101
#4
Epic!!!!
Reply
#5
@Nukemarine
Looks like it was already answered, anyway I wouldn't know for sure since I have no i(Pod/Phone/Pad) to go by, but I'm guessing that just having a .lrc file with the same name of the mp3 file or using the "Save lyrics in mp3 file" option from the lyric editor should do the trick.
Reply
#6
awesome.
Reply
#7
I should be more specific - most iPods will show lyrics whilst a song is playing - just paste them into the Lyrics ID3v2 field in your tag editor (e.g. Itunes). If you actually want dynamic (time synced) lyrics you'll need an app like LRC player (http://apptrackr.org/?act=viewapp&appid=335485481)

p.s. cool system Seizar - I wonder if there's a similar tool for the mac?
Edited: 2010-05-19, 1:36 pm
Reply
#8
Blahah Wrote:
Nukemarine Wrote:PS: If iPod automatically does this with the lyric entry for songs in the audio file properties then I'll feel silly.
Time to start feeling silly - most ipods do this (except the shuffles)
I think by silly, I mean I didn't think about doing it. Again, one of those things that seem so obvious after the fact: Rip audio from dramas, put audio on iPod, convert subtitles of said dramas and put on the iPod also.

Now, I know there's a lyrics screen which I knew about. Yeah, I could have copied the subtitle file over but it doesn't automatically advance (and would have required splitting up a lot of subtitle files). If the .lrc format works with the iPod then that's perfect as the it allows longer battery life at expense of no video (I can live with that). I messed with the .lrc format but could not seem to get it to work. A little advice would be appreciated.
Reply
#9
Has anyone used this and Transcriber (per balloonguy's interactive audiobook tool) -- http://forum.koohii.com/showthread.php?p...8#pid95168 ? If so, can you compare them? As of yet, subs2srs can't use .trs files, so it'd be cool if they were similar enough that one could convert .trs to .lrc.
Edited: 2010-05-19, 2:36 pm
Reply
#10
For mac users you can use Karatunes instead of minilyrics to do the lyric timing.

@Nukemarine what iPod do you have? If it's a touch or iPhone you can definitely have the lyrics synced e.g by using the LRC player app I linked in previous post.
Reply
#11
Blahah,

Thanks for the links. I looked at it, and I don't think it'll work for my method just yet. Again, we're talking about converted subtitles files so these are not exactly about top 100 music lists. However, if there's a cache storage, then it's not a stretch to be able to store LRC files directly in there via a ssh or jailbreak method.

It's not a big deal, but seems useful for those on long commutes to be able to listen and read subtitles at the same time.
Reply
#12
@Nest0r
I had a quick look at the program you mentioned, it seems to me that if you export the transcript to limsi label format (I honestly have no idea what this is for actually) it is similar enough to be converted with a simple script (I guess, I have no scripting experience myself except for basic stuff). You do lose speaker informations though.
It looks something like this:
Code:
0.0 ((Yeah)).
0.258 {inhale} He's hilarious. {laugh}
2.41 He's great.
3.055 {inhale} He's really a trip. I know. But it really shows you,
4.77 I mean, you know, you really don't have to put up with the Anthony's of the world
compare to .lrc format:
Code:
[00:00.00] ((Yeah)).
[00:00.25] {inhale} He's hilarious. {laugh}
[00:02.41] He's great.
[00:03.05] {inhale} He's really a trip. I know. But it really shows you,
[00:04.77] I mean, you know, you really don't have to put up with the Anthony's of the world.
(I don't know what's up with that conversation lol, I just opened the example .trs file)

@Blahah
Thanks, I had started looking for something like that but managed to miss karatunes even though it's listed in the wikipedia page for LRC file format. I'll add a list of similar software in the first post.
Reply
#13
Nukemarine Wrote:Thanks for the links. I looked at it, and I don't think it'll work for my method just yet. Again, we're talking about converted subtitles files so these are not exactly about top 100 music lists. However, if there's a cache storage, then it's not a stretch to be able to store LRC files directly in there via a ssh or jailbreak method.
Nukemarine you can just drop all the .lrc files into the Documents folder of the App via SSH, with each .lrc file named the same as the corresponding mp3/m4a in your iTunes library. Then you can just choose files to play through with the player and it will highlight the current line of subs.

I've got it working right now for Japanese hip-hop, but it would work perfectly for audio from a dorama/anime
Edited: 2010-05-19, 4:57 pm
Reply
#14
Blahah, looks like I'll have some fun with it then.

Personally I think podcast like "Japanese Listening Advanced" are good candidates to not only turn into a .LRC but also to share with others. If not for an subs2srs then for simple reading and listening method all on the iTouch.
Reply
#15
You should add Aegisub to your alternative list of script making software. It's a bit more visual, as you can see the audio waves or the awesome Sprectrum analyzer lets you see voice print easier. Plus, it's what the fansubbers use, so there is a big amount of support for it.

Edit: doesn't output .lrc format, but if your dumping it directly to sub2srs it's great. But it probably is not too difficult to do a conversion either.
Edited: 2010-05-19, 10:37 pm
Reply
#16
So how does Aegis Sub compare to Transcriber and the Lyric Fetcher thingy in this thread? Which of these are easiest? I've only used Transcriber, which is very easy, but in retrospect, I'm not sure why I started using it instead of .ass or .lrc programs. Kage Shibari (balloonguy's interactive ebook/audiobook tool) works for Aegis files, and subs2srs with .lrc files, so perhaps I should ditch Transcriber?
Edited: 2010-05-19, 11:05 pm
Reply
#17
yes more anki decks!
Reply
#18
I think Aegisub is pretty easy and worth a shot. Left click to set your starting time, Right click to set your ending time. Hit enter then place the times for the next line. Also, Lots of ways to playback just the beginning or end of the audio segment.

I glanced at youtube, looking at several timing tutorials for Aegisub. This one here seems pretty good.




In this video he is writing the script as he times. It's pretty much the same if you load in a plain text file transcript.

If your just timing audio and want a preview, you can use a blank "Dummy Video" to see your subtitles in action.

Sadly, I'm not seeing much for LRC <-> ASS converters.

This shareware app does LRC <-> SSA/SRT conversion (Formats you can export from Aegisub)
http://download.cnet.com/Lyrics-Subtitle...66948.html

I'll probably see about using my coding skills to see about writing my own format converter sometime. Maybe one that also supports Transcriber format.
Edited: 2010-05-20, 12:21 am
Reply
#19
If you use Aegisub or other similar software to create subtitle files for mp3 audio, then you can reproduce that audio with KMPlayer and use its "subtitle explorer".

You can see the subtitle/lyrics line by line on the main window, or as a whole on the subtitle explorer (where the current line is highlighted).

I posted about using KMPlayer and AGTH for interactive use of the subtitles, and you can do the same with music. It would look something like this (but without video)

[Image: 3b8807fd7b8ef3205804fb4f5c563ad32g.jpg]

I just downloaded Minilyrics and it's totally awesome. The only thing you can't do with it is to click a line of the lyric and go to that point in the song. If you need that option, you can give KMPlayer and its subtitle explorer a try.
Reply
#20
My only problem with the above is that like other programs, it seems to display the subtitles onscreen for videos? Which went against my main reasons for wanting interactive subtitles when I first asked for them. Though ironically the tool balloonguy came up with I ended up using primarily for interactive audiobooks.
Reply
#21
You can show or hide subtitles by simply presing Alt+X. You can even load more than 1 subtitle file at the same time and display them at once or one at a time. In that case, each subtitle file gets its own tab in the subtitle explorer.
Reply
#22
Do you mean show/hide onscreen subtitles while retaining the interactive sidebar subtitles? Cool, I'll have to check it out, if I ever lose my obsession for interactive audiobooks. ;p

I forgot though, it must also be able to remove the timestamps from the interactive lines, in addition to no onscreen subs. I guess that's the other reason I never bothered testing it out. Balloonguy's tool doesn't work with all media, so if I ever need a backup for video, guess I'll try out KMPlayer and its subtitle explorer.

Meanwhile, guess I should test out Minilyric thingy and Aegisub for adding timecodes. Or just wait for cb4960 to add a .trs parser and forget these comments ever happened.
Edited: 2010-05-20, 4:17 am
Reply
#23
Got to say this is awesome.man this will make stuff more enjoyable to srs.
Reply
#24
Anyone know where to find podcasts that are accompanied by transcripts?
Reply
#25
KREVA Wrote:Anyone know where to find podcasts that are accompanied by transcripts?
There should be a recent thread for podcasts, just gotta find it first. Forgot where though.
Reply