RECENT TOPICS » View all
sarenya wrote:
Hello! I have read this whole thread, though not recently, so I apologize if this information is present somewhere and I've missed it.
I'm currently living in Japan and so have access to all sorts of fun Japanese movie content (Lord of the Rings dubbed, say). I'm wondering what the best way of going all the way from a physical DVD all the way to Anki cards is. It's my understanding that this is currently to manually convert the DVD to a movie, then convert the DVD subtitle tracks to external subtitle files, then feed that collection of files into subs2srs. If this is the case, that's fine, but if there are shortcuts available I'd be eager to exploit them.
In any event, this in an incredible tool and an amazing use of technology to take some of the gruntwork out of getting language resources.
For the video file, you can directly use the .VOB files on the DVD. Just open the DVD in Windows Explorer, navigate to the VIDEO_TS folder and drag the appropriate .VOB files over to whatever your subs2srs video folder is set to.
Of course, you still need the subtitles. To extract the subtitles, you can follow this fairly easy tutorial using VobSub.
However, I've only tested this process with my R1 Azumanga Daioh DVD's and I don't know if it works well with other DVD's. Good Luck!
Last edited by cb4960 (2009 October 19, 9:53 pm)
i have a request.
Would it be possible to get the subs for GTO live action series?
I don't think decks are necessary to make for everyone. What people need to do is get SUB FILES SUB FILES SUB FILES
I'm so sad.
sarenya wrote:
Hello! I have read this whole thread, though not recently, so I apologize if this information is present somewhere and I've missed it.
I'm currently living in Japan and so have access to all sorts of fun Japanese movie content (Lord of the Rings dubbed, say). I'm wondering what the best way of going all the way from a physical DVD all the way to Anki cards is. It's my understanding that this is currently to manually convert the DVD to a movie, then convert the DVD subtitle tracks to external subtitle files, then feed that collection of files into subs2srs. If this is the case, that's fine, but if there are shortcuts available I'd be eager to exploit them.
In any event, this in an incredible tool and an amazing use of technology to take some of the gruntwork out of getting language resources.
I have done the dark knight , men in black , casino royale , etc...
I even picked straight from streaming for long show like gintama when I could nt find all the stuff (I came up with 3 different strategies just for streaming) so now I m pretty experienced with this stuff . I think I ve tried about all different strategies possible , even wrote a spreedsheet for personal use in order to to compare them . And as I ve been doing this for months I know how each of those really work on the long term (at least for me ).
So take my work for granted : unless you re a die hard fan of the movie (did this flick too by the way:D ) or you already have the japanese subtitles perfectly timed , do NOT mistake quantity for quality by importing the whole godamn movie . time consuming , mind confusing , fun spoiling
I must go with Khatz on this one : it s sentence picking not media mining .
Remark : It would be too long to go into details so I ll go straight to my point
I ll make a difference between net time and raw time :net time means how much time you REALLY spend on a task : for extracting a video it can take you up to 5 hours depending on what you re looking for but you can do something else during this time
Favorite way :
- just extract the japanese video : DVDfab is the best mix for effectiveness and simplicity . You can get it quicker or in better quality with other combination of tools see remark
net time 5 minutes raw times 3-4 hours
-demux the audio : virtualdub is the most basic tool . see remark net time 2 minutes
- and from then watch the movie whenever you feel like it
2 different ways
first time you watch the media (movie , anime , drama ,etc...):
watch with the timer (so mouse on the bottom of the screen) and snapshot to avoid breaking the flow . Zsscreen is the best .Here also there are several ways but just go for screenshot . see remark
avoid doing more than 30 shots . Otherwise it gets tedious . When I do more than 30 shots , I just take the first 30 ones and everything else goes in a stock folder .
from there you do the timing :
load the audio in aegisub (you can do it from the video but it s way longer . Aegisub can also extract from the video but it s still longer .) net time 2 minutes
the most efficient way for me is to use "on top" with aegisub so that I might see the screenshot taken in the background with the timer while aegisub is on the foreground . This way I speed up to the scene I m interested in . Once you mastered the shortcuts it takes about 5 minutes for 20 sentences and 10 for 50 sentences (I get quicker once in the mood). I don t like typing in Aegisub : it s awesome for timing if you know the shortcuts but it s very awkard for typing no matter how good you are . I import straigt away in anki and do the typing . This way you can review right on the spot .NEVER DO IT A CHORE . That s where sentence mining is wrong.
when you already know the movie : just watch it any time you want . It works even better as you have a grasp on the whole movie so you know what is the more interesting to you .Watch 4-5 scenes , A single one , change movies if you want . Just have fun . Repeat the same procedure as mentioned above except this time there is no need for snapshot : you just stop and input the sentence the way it is in aegisub.
The more movie you have the better it works . So a 700 mo -1,2 ga size is plenty enough : I have about 20 movies on my hard disk right now and I couldn t afford a 6,6g a piece (not to mention the HD behemot) . High quality pictures are all wrong : they re wrong about movie extraction , wrong about movie storage , wrong about anki storage and online save , wrong about anki review.
N.B: forgot to mention . I use premade subtitles or script once in a while (got the kenshin 追憶編 textfile today .) because it would be plain stupid not to take advantage of them but as it s binding as for the media choice I don t make a requirement and most of time I just watch what I want and write the subs myself. More fun.
N.B2: there are obviously plenty of other strategies and technique details I glossed over . For instance I never talked about extracting DVD subtitles for the timing as it goes clearly into media mining and I m against it .Didn t talk about either about VBRflux for the audio or splitting extraction and compression as I would be compelled to talk about tools. As I said earlier there s so much ground to cover I just talked about my favorite strategy : less time consuming , better result , more fun.
Last edited by ghinzdra (2009 October 21, 12:37 am)
Alternatively (if you have subtitles) you can use subs2srs, import into Anki, tag your favorite lines within Anki and then just export those lines to your sentences deck.
Last edited by cb4960 (2009 October 21, 8:44 pm)
export function don t work with me : it always export a lot of stuff I don t want . For instance when I wanted to export a JPLT1 list it spilled out Fate stay night , advent children ,etc.... I have hundred of megs of media contents exported even when I m just dealing with a text file.
subtitles is double-edged : sure it makes you gain some time . But on the long term it can get in your way . You should study with what you like or what you need . Not what you have .
Is it possible for one of you smart guys to write an "install subs2srs on Mac OSX for dummies"?
I really donīt get it... Yes, Iīm that dumb! ![]()
Ok, seriously, I downloaded and installed Mono, but I donīt understand what Iīm supposed to do next. The ffmpeg thing isnīt working for me, and all this terminal-da-vinci-code speak is way over my head. Pretty please, with sugar on top, help? ![]()
Zorlee wrote:
Is it possible for one of you smart guys to write an "install subs2srs on Mac OSX for dummies"?
I really donīt get it... Yes, Iīm that dumb!
Ok, seriously, I downloaded and installed Mono, but I donīt understand what Iīm supposed to do next. The ffmpeg thing isnīt working for me, and all this terminal-da-vinci-code speak is way over my head. Pretty please, with sugar on top, help?
I have never used Mac OS X, but I searched a little. ffmpeg seems to be available in Fink if you enable unstable (is Fink like apt-get or Synaptic package manager in Ubuntu?).
Running subs2srs.exe (in terminal):
1) Open terminal window. Application -> Utilities -> Terminal
2) Find subs2srs.exe. Type "locate subs2srs.exe" and you should get the path to where it is saved. Will look like this: /full/path/to/sub2srs.exe
3) Go to the folder by typing "cd /full/path/to/"
4) Run subs2srs with "mono subs2srs".
I'm finally getting on the dorama bandwagon. Dl'ing bucketloads of it now. I used to thing it would be too bandwidth heavy but i've worked out I can follow a few per week and seeing as theres a couple coming up with Japanese subs theres no better time. Also getting a few complete series from the past with subs to get me started
I'm looking foward to working through a few series during december/january with the intent of comprehension aswell as output!
I got subs2srs working, I just used my mini-laptop with Linux.
Iīm having some troubles with the shared decks though. I really wanted to use the Death Note decks and the 流星の絆 decks, but I canīt unzip them. I get an error saying something along the lines of "A checksum error occurred during Unrar, or an invalid password was entered".
Iīm using MacPar Deluxe on OSX. I havenīt tried my Linux laptop yet, and will try to unzip them there now. Iīm assuming the shared decks are in tip top shape?
Thank you guys!
I have a question on how to use the anki decks. I downloaded some of the shared decks and the dramas/animes for each deck. Now when I watch the drama I don't understand most of what is being said. Then when I want to srs the sentences I notice that I often know all the vocabs and grammar but couldn't understand because of my bad listening comprehension.
Would you srs those sentences anyway for the sake of improving my listening even though they don't contain any new vocab words or grammar? Or should I skip them and only focus on those sentences which add new information?
Sorry if this has been asked before but I'm new to this srsing from dramas and would like to hear what has worked best for you.
Well, people have different opinions on that "issue" youīre talking about (understanding the words when you see it written, but not when you hear it).
I guess you have to figure out a way that works for you. My way of doing it is SRSing pretty much all sentences that I donīt understand using only my ears, in combination with getting new vocab from other sources. I want to work on listening comprehension and vocabulary... But hey, just do something, anything, and if doesnīt work - change it! ![]()
Sequa wrote:
I have a question on how to use the anki decks. I downloaded some of the shared decks and the dramas/animes for each deck. Now when I watch the drama I don't understand most of what is being said. Then when I want to srs the sentences I notice that I often know all the vocabs and grammar but couldn't understand because of my bad listening comprehension.
Would you srs those sentences anyway for the sake of improving my listening even though they don't contain any new vocab words or grammar? Or should I skip them and only focus on those sentences which add new information?
Sorry if this has been asked before but I'm new to this srsing from dramas and would like to hear what has worked best for you.
SRS'ing what you know visually but can't quite grasp audibly yet is well worthwhile! I watch a dorama ep tonight and noticed that the intro I watched without subs I couldn't quite pick up on something that I got straightaway when I saw the same bit with subs. I understood most of the episode too xD Can't wait to SRS it.
Just realized I haven't used the new version of the program to create decks/audio. Hopefully I can do that soon and give feedback. Yep, just thought I'd randomly say that, since I know cb4960 is dying to know my opinions! ^_^
randomkrazy wrote:
i have a request.
Would it be possible to get the subs for GTO live action series?
Seconded.
zodiac wrote:
What format is the file? If it's mkv you can google mkvtools.
Or you can also open the file as a new subtitle from Aegisub and save the subtitles to a separate file.
Last edited by Sebastian (2009 December 07, 12:22 pm)
Here's another request: Exclude lines *longer* than X milliseconds? Is that possible or already done and I overlooked it? (Or split them, I don't know what'd be better/easier to implement... )
Last edited by ruiner (2009 December 09, 8:58 pm)
cb4960 wrote:
Have Fun!
cb4960
Dear cb4960,
thank you very much for sharing this application on this board!
my friend enjoys me saying "hey, hey, we are not drunken yet!"
... mada mada ...
It is fun, learning how to talk like a drunken japanese pop-idol :-)
PS: it would be great to have this thread "sticking" on top of this forum.
frlmarty wrote:
PS: it would be great to have this thread "sticking" on top of this forum.
It was... for a while. But suddenly disappeared. (or was that it's brother.) f(^^;)
You can now find it in the learning resources stickied post.
I've found that subs2srs needs to be restart between runs, otherwise the audio timings gradually get out of sync (in my case with idx/sub files). Windows 7, subs2srs v16.
Haven't been able to confirm this with a different setup, Mono (2.4.3 and 2.6) is having issues with OS X 10.6.2.
Has anyone else come across this?
cb4960 wrote:
Hello,
I have just released version 14 of subs2srs.
Download subs2srs v14 via SourceForge
Added sequence marker. As suggested by Nukemarine a few posts back, a sequence marker will now added to the second field of the import file.
It has the format:
[3-digit episode number]_[4-digit line number]_[timing info].
So "001_0280_00.18.22.300" means that the line appeared in episode 1 @ 18 minutes 22.3 seconds and it was the 280th line in the episode.
cb4960
Ha, *now* I am going to start using this. Can't believe it took me this long to appreciate it. I think I was worried I couldn't search for the timings in the browser, but clearly one can. Very handy. You're a mensch, cb4960.
Last edited by ruiner (2009 December 17, 7:39 pm)
ruiner wrote:
Here's another request: Exclude lines *longer* than X milliseconds? Is that possible or already done and I overlooked it? (Or split them, I don't know what'd be better/easier to implement... )
Just noticed there's a cropping option. So cool.
Also, any chance for multiple span selections? Perhaps some button for 'add more spans' or somesuch...
Thanks as always.
Last edited by ruiner (2009 December 19, 12:15 am)
this time I need help:
I have been able to set up the first deck from a movie.
now, with the second deck it goes wrong (retried already 3 times):
the anki shows japanese plus english subtitle at once, the answer is an incomplete part of the subs in english and japanese.
did anyone experience this fail before?
could you help me?
thank you!
ruiner wrote:
ruiner wrote:
Here's another request: Exclude lines *longer* than X milliseconds? Is that possible or already done and I overlooked it? (Or split them, I don't know what'd be better/easier to implement... )
Just noticed there's a cropping option. So cool.
Also, any chance for multiple span selections? Perhaps some button for 'add more spans' or somesuch...
Thanks as always.
Oops, sorry to add another feature idea/request, but here's another anyway ;p - The ability to check/uncheck Leading/Trailing Expressions, videos, et cetera? Because personally, most of the time I'd just use the text lines and not bother with media.

