Use subs2srs to Create Anki Decks Based on Your Favorite Movie or Show

Index » Learning resources

Reply #726 - 2011 March 10, 4:51 am
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

I made a list of your tools, hope I didn't miss anything. You really ought to make a mega-thread. ;p

Audio Lesson Studio - Link - “Allows you to create simple, user-defined, .mp3 audio lessons that may be used as an aid for learning languages or just about anything else - use your imagination! Audio can be based on existing media or automatically generated using Text-To-Speech.”

Capture2Text - Japanese OCR Utility - Link - “A utility that allows you to quickly capture a small portion of the screen, OCR it and (by default) save the result to the clipboard.”

cb's Japanese Word Frequency List Generator - Link - “A utility for generating word frequency lists from files that contains Japanese text.”

cbJisho - E-J Dictionary Based on Word Frequency - Link - “An English-to-Japanese dictionary that sorts results based on relative frequencies in blogs, newspapers, and novels.”

JNovel Formatter - Link - “A utility that will convert a Japanese novel (in text form) to one or more specially formatted HTML files.”

JReadability - Link - “A utility that will compute the Obi-2 Reading Level and Hayashi Score for the provided Japanese texts.”

Rikaichan + JDIC Audio Firefox Extension - Link - A modification of Rikaichan to include JDIC vocabulary audio support.

subs2srs - Link - “A utility that allows you to create SRS (for example Anki) import files based on your favorite foreign language movies and TV shows to aid in the language learning process.”

vobsub2text - Link - “A utility that uses OCR technology to automatically convert VOBSUB subtitles (.idx/.sub) to subrip (.srt) subtitles.”


Bonus:

background typing - Link - Autohotkey script that lets you type/paste text into a designated program without switching from the active window.

bassplayer - Link - Media player with adjustable volume to go with the shared Anki plugin Custom Media Player 2. Usage: Link

Frequency list for Japanese novels - Link - List generated using the Word Frequency Generator.

JDIC Audio Extraction Tool - Link - Tools for extracting desired JDIC audio files from the linked offline JDIC vocabulary audio archive.

JReadability results for a large batch of novels - Link - A sorted list of readability scores for Japanese novels.

Last edited by nest0r (2011 March 10, 5:42 am)

Reply #727 - 2011 March 10, 8:08 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

nest0r wrote:

I made a list of your tools, hope I didn't miss anything. You really ought to make a mega-thread. ;p

Well you've certainty made it easy for me.

Maybe we can start a "RevTK Software Developer Showcase" thread where any member can create a single post showing off any Japanese learning software that they have personally created and feel others might be interested in.

Reply #728 - 2011 March 10, 11:21 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Good idea. Feel free to edit things as they suit you. I'm sure the links especially'll expire quickly. ;p

Advertising (register and sign in to hide this)
JapanesePod101 Sponsor
 
Reply #729 - 2011 March 12, 7:54 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Hello,

I have just released version 23.0 of subs2srs.

Download subs2srs v23.0 via SourceForge

What Changed?

Added an Advanced Video Clip Options dialog which allows you to:
1) Enable subs2srs to generate iPhone/iPod/iPad compatible video clips. This can be useful for AnkiMobile users.
2) Select the video codec to use (MPEG4 or h.264)
3) Select an h.264 profile
4) Select the audio codec to use (AAC or MP3)
5) Select video/audio bitrates

cb4960

Last edited by cb4960 (2011 March 12, 7:56 pm)

Reply #730 - 2011 March 12, 8:24 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Great, thanks. smile

Japanese everywhere forever!11~

Reply #731 - 2011 March 13, 1:32 am
Cranks Member
Registered: 2010-10-21 Posts: 477

Just bought me some Anime DVDs (bargain marts for the win.) Thankfully they play with subs on my computer. Yeah, for screen capture devices and subs2srs.

Update came at just the right time... THANK YOU!!!

Reply #732 - 2011 March 13, 7:59 am
Cranks Member
Registered: 2010-10-21 Posts: 477

Officially, happy to have those babies any time now CB4960. I had to adjust back down to my normal iPhone resolution, but I have to say that the card is second to none. Definitely, the best card I have seen in terms of ease to make (screen capture video with subs [no typing needed!], add in timings using Mylyrics, run through subs3srs with video & image, import to deck - a possible 45 minutes if not counting screen capture time.)

Impressive to say the least!

I think having that original video there to remind me of the context makes all the difference in the world.

Last edited by Cranks (2011 March 13, 8:00 am)

Reply #733 - 2011 March 20, 12:24 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Hello,

I have just released version 23.1 of subs2srs.

Download subs2srs v23.1 via SourceForge

What Changed?

Removed the Advanced Video Clip Options dialog and moved the iPod Support checkbox to the main dialog.

After additional testing of v23, I have determined that ffmpeg has timing/cutting issues with h.264 and .mp4.

With h.264, ffmpeg only cuts on the last keyframe, which could be several seconds before the time that you actually want to cut.

When cutting an .mp4 (regardless of codecs used), the cut will take place ~0.5 seconds before it should.

The iPod supported video files in v23.1 still use h.264 and .mp4 but I've helped the timing issues a little bit by reducing the keyframe interval in ffmpeg.

cb4960

Reply #734 - 2011 March 20, 4:19 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Interesting, I didn't even know there was a problem, will have to make some substantive decks. ^_^

Reply #735 - 2011 March 20, 8:24 pm
ta12121 Member
From: Canada Registered: 2009-06-02 Posts: 3190

finally going to start using this. I have the subtitle file for the haruhi movie+the haruhi movie. I plan to use this for transcription practice/fun

Reply #736 - 2011 April 01, 7:22 am
aphasiac Member
From: 台湾 Registered: 2009-03-16 Posts: 1036

cb4960, thanks for all your hard work, this program is great - I've just made my first deck, it's so easy to use!

My only problem/feature suggestion is a small one. The drama I'm using (Nihonjin no Shiranai Nihongo) has japanese srt subs captured directly from when it was broadcast on TV. Several times in the show, the character has a long speaking part, and so a sentence has to be continued into the next subtitle. To signify this, the end of the first subtitle contains a little → (indicating "to be continued").

I understand you can put before and after lines on the card to add context, BUT half a sentence that cuts off while the character is talking is not that useful to study.

This might be technically difficult, but it would be handy to be able to select 2 (or more) sequential cards in the "preview" mode, and combine them into 1 card. This takes the beginning time from the first card, the end time from the last card, and combines all the text with 1 long piece of audio.

A neater way might even be to combine cards that have a → or ... on the end of the L2 sub with the next card. However, as this is my first subs2srs drama, I'm not sure if this is common practise with japanese subs, or whether this feature would be redundant. Other long-term users might be able to shed more light - is this a common problem?

Anyway thanks again, keep up the good work!

Reply #737 - 2011 April 01, 7:47 am
Tori-kun このやろう
Registered: 2010-08-27 Posts: 1193 Website

@cb thanks for all the programm and nest0r for posting a comprehensible list!!!
@aphasiac, as I'm on my way finishing core2k (you know), erm and finishing watching the movie 'Nobody Knows' I will also start with Nihonjin no shiranai nihongo. Perhaps we could share our decks then? The show looks promising. If you need DDL links for the raws, pm me.

Edit: The vobsub2textsub tool does not work properly I guess. Only the .png files are created but I cannot find anything looking like a .srt file or so @cb/nest0r/et al. sad(
(Yeah, I used the .idx from Nobody Knows)

Last edited by Tori-kun (2011 April 01, 7:59 am)

Reply #738 - 2011 April 01, 8:20 am
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Tori-kun wrote:

@cb thanks for all the programm and nest0r for posting a comprehensible list!!!
@aphasiac, as I'm on my way finishing core2k (you know), erm and finishing watching the movie 'Nobody Knows' I will also start with Nihonjin no shiranai nihongo. Perhaps we could share our decks then? The show looks promising. If you need DDL links for the raws, pm me.

Edit: The vobsub2textsub tool does not work properly I guess. Only the .png files are created but I cannot find anything looking like a .srt file or so @cb/nest0r/et al. sad(
(Yeah, I used the .idx from Nobody Knows)

Works for me. I didn't realize there are Japanese-language vobsubs for Nobody Knows. Did you try it with other .idx files?

Last edited by nest0r (2011 April 01, 8:24 am)

Reply #739 - 2011 April 01, 8:24 am
Tori-kun このやろう
Registered: 2010-08-27 Posts: 1193 Website

Heh, now it works, nest0r.. strange, but anyway. I recognised that the ocred files, srts, have lots of mistakes in there. any way to correct them/smoothen them out?

Reply #740 - 2011 April 01, 8:30 am
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Tori-kun wrote:

Heh, now it works, nest0r.. strange, but anyway. I recognised that the ocred files, srts, have lots of mistakes in there. any way to correct them/smoothen them out?

Cool. Well, you could try fiddling with colour and scale. It's imperfect but better than nothing, used in conjunction with the sub/idx on cards.

On the page before this we talk a bit about it: http://forum.koohii.com/viewtopic.php?p … 94#p137094

Reply #741 - 2011 April 01, 9:01 am
Tori-kun このやろう
Registered: 2010-08-27 Posts: 1193 Website

Well, define what means "boring", aphasiac. Despite of their boring content I still did not manage to understand all the core2k cards (not talking about KO2001 at all! That's really difficult to grasp for me!).. I mean the known cards are going pretty fine, but the new ones not.

Nihonjin no shiranai nihongo is indeed nice due it's lenght of 30min per episode and the availablitiy of both eng and jap subs.

Which drama deck are you talking about concretely? I couldn't find one. d-addict's torrents are sometimes not even working for me, therefore I refer to asia-torrents or even megaupload direct downloads of the raws.

I just want to finish of core6k, the next step, as soon as bloody possible XP

Reply #742 - 2011 April 01, 9:03 am
Tori-kun このやろう
Registered: 2010-08-27 Posts: 1193 Website

hxxp://lets-look.com/index.php?/topic/2105-index-j-dramas/

Reply #743 - 2011 April 01, 9:08 am
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

We should probably keep this thread for specific updates/feedback on the software, and leave digressions about decks and resources and strategies elsewhere.

Reply #744 - 2011 April 01, 9:22 am
aphasiac Member
From: 台湾 Registered: 2009-03-16 Posts: 1036

Indeedy, i will delete my excess posts to reduce clutter.

Reply #745 - 2011 April 03, 11:50 am
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

aphasiac wrote:

cb4960, thanks for all your hard work, this program is great - I've just made my first deck, it's so easy to use!

My only problem/feature suggestion is a small one. The drama I'm using (Nihonjin no Shiranai Nihongo) has japanese srt subs captured directly from when it was broadcast on TV. Several times in the show, the character has a long speaking part, and so a sentence has to be continued into the next subtitle. To signify this, the end of the first subtitle contains a little → (indicating "to be continued").

I understand you can put before and after lines on the card to add context, BUT half a sentence that cuts off while the character is talking is not that useful to study.

This might be technically difficult, but it would be handy to be able to select 2 (or more) sequential cards in the "preview" mode, and combine them into 1 card. This takes the beginning time from the first card, the end time from the last card, and combines all the text with 1 long piece of audio.

A neater way might even be to combine cards that have a → or ... on the end of the L2 sub with the next card. However, as this is my first subs2srs drama, I'm not sure if this is common practise with japanese subs, or whether this feature would be redundant. Other long-term users might be able to shed more light - is this a common problem?

Anyway thanks again, keep up the good work!

You're welcome!

Maybe I can create an option to automatically combine lines that end with a certain user-specified character such as "→".

In the meantime, you can open the subtitle file with Aegisub, select 2 lines that you want to combine, right-click and select "Join (concatenate)".

Last edited by cb4960 (2011 April 03, 11:54 am)

Reply #746 - 2011 April 03, 11:52 am
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Tori-kun wrote:

Edit: The vobsub2textsub tool does not work properly I guess. Only the .png files are created but I cannot find anything looking like a .srt file or so @cb/nest0r/et al. sad(
(Yeah, I used the .idx from Nobody Knows)

vobsub2text is more of a proof of concept rather than a useful tool at this point in time.

Reply #747 - 2011 May 14, 1:12 pm
overture2112 Member
From: New York Registered: 2010-05-16 Posts: 400

Apparently you can't use more than 1 instance of subs2srs at a time because one of the temp files tries to get used by the other process, causing it to fail towards the end.  Any chance you'd mind changing the temp file's name to include a timestamp or something so it can be safe to use multiple subs2srs instances at once?

Reply #748 - 2011 May 14, 2:42 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Hello,

I have just released version 23.2 of subs2srs.

Download subs2srs v23.2 via SourceForge

What Changed?

Added a GUID to temporary file names to allow multiple instances of subs2srs to run simultaneously. (Thanks overture2112!).

cb4960

Reply #749 - 2011 May 14, 3:57 pm
overture2112 Member
From: New York Registered: 2010-05-16 Posts: 400

cb4960 wrote:

Hello,

I have just released version 23.2 of subs2srs.

Download subs2srs v23.2 via SourceForge

What Changed?

Added a GUID to temporary file names to allow multiple instances of subs2srs to run simultaneously. (Thanks overture2112!).

cb4960

Wow that was fast! Thanks!

I know subs2srs calls ffmpeg with -threads, but it doesn't manage to peg all my cores since presumably a lot of time is spent waiting on IO, so running multiple instances off multiple hard drives should make it a lot easier to process shows quickly.

Last edited by overture2112 (2011 May 14, 4:47 pm)

Reply #750 - 2011 May 19, 9:25 am
Miguelitius Member
From: Portugal Registered: 2009-08-25 Posts: 36

Hi everyone.

I'm having a problem with subs2srs and I would appreciate some help:

I'm trying to subs2srs "Laputa: Castle In The Sky", I insert both japanese and english subs (in .idx/.sub), the video, output, etc. When I click "preview" or "go" the little box pops up saying the usual:

"Processing subtitles for episode 1 of 1
Note: Vobsubs (.idx/.sub) can take a couple of minutes to process."

And the bar starts to load... BUT it just reaches the end and nothing happens, subs2srs just freezes there (and stays like that for hours if I don't end the process with task manager).

From what I tested already, the problem is not from the video, as it works with other subs, even .idx/.sub ones.

Thanks in advance.