Toki_wo_Kakeru_Shoujo_Anki.rar

Index » Learning resources

 
ahibba Member
Registered: 2008-09-04 Posts: 528 Website

In ajatt.pseudosphere.net:

zort wrote:

I’ve created an Anki deck containing all 1,559 lines of dialog from the anime Toki wo Kakeru Shoujo. Each card contains:
1) An Japanese transcription of a line of dialog
2) An English translation of the line
3) An audio clip of the line
4) A snapshot taken at the halfway point of the line

Everything was extracted from the CrystalNova-KAA release of this anime via my own automated tools. Duplicate lines were purposefully left intact (but they’re tagged, so they should be easy to delete should you so desire).

Toki wo Kakeru Shoujo Anki Deck on Mediafire

To use, simply unrar and double-click on the .anki file.

If the link is ever removed, just post here and I’ll create a mirror.

Expect releases of more anime related Anki decks after the holidays (I can do anything with SRT or ASS/SSA soft subs).

Happy Learning,
zort

Anyone have this file?

Toki_wo_Kakeru_Shoujo_Anki.rar

http://74.125.95.132/search?q=cache:lY6 … k&cd=2
- Show quoted text -

wccrawford Member
From: FL US Registered: 2008-03-28 Posts: 1551

Did you post there, like he asked?

playadom Member
Registered: 2007-06-29 Posts: 468

Doing so would be impossible, on account of the forum being down.

Advertising (register and sign in to hide this)
JapanesePod101 Sponsor
 
nac_est Member
From: Italy Registered: 2006-12-12 Posts: 617 Website

Whoa, that would be pretty awesome...

ahibba Member
Registered: 2008-09-04 Posts: 528 Website

wccrawford wrote:

Did you post there, like he asked?

The answer:

playadom wrote:

Doing so would be impossible, on account of the forum being down.

I don't have other way to contact the poster (zort). I hope he have an account here in RevTK.

cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Hello cb4960 here (aka zort),

Here is the MediaFire link to the deck:
http://www.mediafire.com/download.php?tdmzk2mnmit

I've been using this deck for the last month and I'm really enjoying it. Personally, I prefer it over my iKnow deck because the sentences are completely in the context of one of my favorite movies.

Also, In the original description I forgot to put
5) The auto-generated reading of the Japanese line via kakasi.

Here is a screenshot:
http://img240.imageshack.us/img240/8124/tokiwokakerushoujokh2.th.jpg

Hope this helps someone.

Last edited by cb4960 (2009 January 25, 10:26 am)

stoked Member
From: Switzerland Registered: 2009-01-09 Posts: 378 Website

This looks amazing. Thanks for sharing!

hknamida Member
From: Sweden Registered: 2007-08-16 Posts: 222 Website

I love this movie. I'll archive the deck for when I'm done with iKnow in X years.

saru_yo Member
From: Germany Registered: 2009-01-10 Posts: 24

wow, thanks for sharing, I love that movie! :D

ahibba Member
Registered: 2008-09-04 Posts: 528 Website

Welcome to RevTK (cb4960)

Thank you very much for for creating this great deck and reuploading it.

stoked Member
From: Switzerland Registered: 2009-01-09 Posts: 378 Website

Does anyone know what happened to this forum? Has it moved? Or is it dead? Why?

Reply #12 - 2009 January 25, 1:04 pm
ahibba Member
Registered: 2008-09-04 Posts: 528 Website

stoked wrote:

Does anyone know what happened to this forum? Has it moved? Or is it dead? Why?

The administrator of this forum said:

Adam wrote:

No one uses it anymore, or at least, not a great amount. I'm also struggling to afford my hosting. I have all the files backed up, however.

Reply #13 - 2009 January 25, 1:12 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Someone contacted me via email wanting to know how I created the deck.

Here is a link to the utility that I made (including a readme):
http://www.mediafire.com/download.php?ynk2j14zmyb

You will need the .Net framework installed.

Note: Don't try to break it, it has rather poor exception handling (reason being that I created it for personal use).

--------------------------------------------------------------------------------

From the readme:


Overview of tools used

subs2srs – A little utility that I created. You input 1 or 2 subtitle files (in SRT,ASS, or SSA format). All it does is parse through subtitles and generate text file(s) in a desired format to be used as input for other tools. If the subtitle files aren’t 1:1, the closest line (chronologically) will be used.

These tools are just the ones I used. Feel free to come up with a better toolchain.

Mp3splt – The tool I used to make the audio clips.
Avs2avi+AviSynth+ImageWriter plugin – Tools that I used to take the snapshots.
Kakasi – Tool that I used to generate the readings.
MKVextractGui – Tool that I used to extract the audio from the MKV video file.
Anki – My SRS.

If you need Japanese subtitles, you might find some here:
http://kitsunekko.net/JapaneseAnimeSubtitles.aspx


Overview of usage

Run subs2srs. subs2srs is basically a utility that will parse the subtitles and allow you to output the parsed info to a file in a one-line format of your choosing.

Read the help file to get an idea of what kinds of things you can do (Help->Formatting…).

You must enter at least one subtitle file.

If you have multiple episodes, use the batch feature. For example, if you have a 23 episode show, enter “1-23” in the Batch text box. Use $batch_num to substitute the current episode number in the subtitle file name or the output format or output file name. $batch_num always outputs 2 digits with a leading zero if necessary.

Enter the output format (you will have to refer to the help file for the tokens you may use.

The output format must look like this:
“<output format>” <token1> <token2> …

The quotes are required.

Enter the output file name (you can use the $batch_num token here).

Click Go.


Example

(This example does not create images or readings – see the advanced section below for that)

That probably did not make too much sense. So here is an example that you can follow:

Assume that we want to run the program for all 23 episodes of the anime Clannad. We will then split the audio files and create the Anki import files.

In a directory called I:/Temp2, create a directory structure like this:

./Clannad
  ./Anki
  ./Media
    ./Clannad_01_Audio
    ./Clannad_02_Audio
    ./Clannad_03_Audio
              .
              .
              .
    ./Clannad_23_Audio
  ./Subs
./TempAudio


Put all the subtitle files in I:/Temp2/Clannad/Subs (there should be 23 Japanese subtitle files and 23 English subtitle files). They are named as follows:
Clannad_01_jap.ass
Clannad_01_eng.ass
Clannad_02_jap.ass
Clannad_02_eng.ass
Clannad_03_jap.ass
Clannad_03_eng.ass
Etc.

Put all the audio files in I:/Temp2/Clannad/TempAudio (there should be 23 files, one for each episode): They are named as follows:
01 - Clannad.mp3
02 - Clannad.mp3
03 - Clannad.mp3
Etc.

The Anki folder will hold all anki import files.
The Media/Audio folders will hold all of the split audio files.

Run subs2Srs

In Subs1, enter I:/Temp2/Clannad/Subs/Clannad_$batch_num_jap.ass
In Subs2, enter I:/Temp2/Clannad/Subs/Clannad_$batch_num_eng.ass

In Frame rate, enter 23.976
In the Replace… with … enter ; and , respectively. This will remove any instance of ; in the dialog. It to prevent Anki import from thinking it’s a separator.
In Batch, enter 1-23
Make sure the Include styled lines is unchecked so that we don’t parse any signs or karaoke effects.

In Output 1 Format we will put a format that will generate a .bat file that will split each audio file with mp3splt (assuming mp3splt is located in C:/WINDOWS/mp3splt.exe). Enter this:
"C:/WINDOWS/mp3splt.exe -q -o Clannad_{0}_{1:00.}.{2:00.}.{3:00.}-{4:00.}.{5:00.}.{6:00.}.mp3 -d I:/Temp2/Clannad/Media/Clannad_{0}_Audio "I:/Temp2/Clannad/TempAudio/{0} - Clannad.mp3" {1:00.}.{2:00.}.{3:00.} {4:00.}.{5:00.}.{6:00.}" $batch_num $s_min_t $s_sec $s_hsec $e_min_t $e_sec $e_hsec

Yes, that is one big line. {0} corresponds to the $batch_num token. {1} corresponds to the $s_min_t token. Etc. The “:00.” Means make it two digits with a leading zero and no decimal (Keep in mind that this is always the case for $batch_num so you don’t have to use this format when using $batch_num).

This is what one of the lines of the .bat files might look like when its generated:
C:/WINDOWS/mp3splt.exe -q -o Clannad_01_00.07.09-00.08.41.mp3 -d I:/Temp2/Clannad/Media/Clannad_01_Audio "I:/Temp2/Clannad/TempAudio/01 - Clannad.mp3" 00.07.09 00.08.41

In the Output 1 Output, enter this:
I:/Temp2/Clannad/TempAudio/Clannad_$batch_num_mp3splt.bat

In Output 2 Format we will put a format that will generate the files that we will use for importing into anki.
Enter this:
"[sound:Clannad_{0}_{1:00.}.{2:00.}.{3:00.}-{4:00.}.{5:00.}.{6:00.}.mp3];{7};{8}" $batch_num $s_min_t $s_sec $s_hsec $e_min_t $e_sec $e_hsec $sub1_line $sub2_line

This is what one of the lines of the semicolon separated anki import files might look like when its generated:
[sound:Clannad_01_00.07.09-00.08.41.mp3];この町は嫌いだ;I hate this city.

In the Output 2 Output, enter this:
I:/Temp2/Clannad/Anki/Clannad_$batch_num_anki.txt

Click Go!

The .bat files were created in I:\Temp2\Clannad\TempAudio
You can double click each one, or just create another batch file that will run each one. They will split the audio and put the split files in the appropriate folder. Move all the files to your Anki media directory.

Import the Anki files one by one.

You are done.


Advanced Topic – Generate Snapshots

For snapshots use avisynth and avs2avi.

Use something like the following format and file names:

"ImageWriter("Clannad_{0}_", {1}, {1}, "jpg")" $batch_num $m_frame
I:/Temp2/Clannad/Media/Clannad_$batch_num_Images/Clannad_$batch_num_snapshot.avs

Your should also change the format of the anki file:
"[sound:Clannad_{0}_{1:00.}.{2:00.}.{3:00.}-{4:00.}.{5:00.}.{6:00.}.mp3];<img src="Clannad_{0}_{7:000000}.jpg">;{8};{9}" $batch_num $s_min_t $s_sec $s_hsec $e_min_t $e_sec $e_hsec $m_frame $sub1_line $sub2_line

Be sure to prepend something like this to each AVS file:
DirectShowSource("I:/Temp2/Clannad/Media/Clannad_01_Images/Clannad_01.mkv").LanczosResize(256,144)
ConvertToRgb

You can run a command like this for each AVS file (takes about 5 minutes per file):
I:/Dev/Utils/AVS2AVI/avs2avi.exe I:/Temp2/Clannad/Media/Clannad_01_Images/Clannad_01_snapshot.avs -o n -c null


Advanced Topic – Generate Readings

For the reading I use something like the following format and file names (this one is for toki wo kakeru shoujo):
";{0}" $sub1_line
I:/Temp2/Toki_wo_Kakeru_Shoujo/Anki/Toki_wo_Kakeru_Shoujo_01_kakasi.txt

Then generated the readings with kakasi and appended them to the Anki file:
cd /cygdrive/i/Temp2/Toki_wo_Kakeru_Shoujo/Anki
cat Toki_wo_Kakeru_Shoujo_01_kakasi.txt | iconv -f UTF-8 -t EUCJP | /cygdrive/c/kakasi/bin/kakasi.exe -JH -p | iconv --unicode-subst=? --byte-subst=? --widechar-subst=? -f EUCJP -t UTF-8 | paste -d '\0' Toki_wo_Kakeru_Shoujo_01_anki.txt - | tr -d '\r' > Toki_wo_Kakeru_Shoujo_01_anki_reading.txt

Last edited by cb4960 (2009 January 25, 1:31 pm)

Reply #14 - 2009 January 25, 2:21 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Whoa.

Reply #15 - 2009 January 25, 2:47 pm
Ji_suss Member
From: Toronto Registered: 2008-08-22 Posts: 96

That's what Chinese characters used to look like....

Reply #16 - 2009 January 25, 2:57 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

My technical writing skills leave a lot to be desired smile

Reply #17 - 2009 January 25, 3:03 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

cb4960 wrote:

My technical writing skills leave a lot to be desired smile

Well, I'm saying whoa because of the immensity of what I've witnessed! The sheer awesomeness. You're the bomb. I'm still processing this... the possibilities....

Reply #18 - 2009 January 25, 3:52 pm
nac_est Member
From: Italy Registered: 2006-12-12 Posts: 617 Website

zort, you're my hero.

Reply #19 - 2009 January 25, 3:58 pm
Ben_Nielson Member
From: Japan Registered: 2008-12-19 Posts: 164

Yeah, this is pretty freaking sweet.

Reply #20 - 2009 January 25, 5:19 pm
Smackle Member
Registered: 2008-01-16 Posts: 463

After doing this, I get the batch file and the anki text file, but when I run the batch file, I don't get the split mp3s. I don't know where they go.

Also, where can I find ImageWriter? My search skills are not very good and I turned up too many different results.

Reply #21 - 2009 January 25, 5:31 pm
nest0r Member
Registered: 2007-10-19 Posts: 5236 Website

Smackle wrote:

After doing this, I get the batch file and the anki text file, but when I run the batch file, I don't get the split mp3s. I don't know where they go.

Also, where can I find ImageWriter? My search skills are not very good and I turned up too many different results.

ImageWriter seems to be built into current versions of Avisynth.

Reply #22 - 2009 January 25, 5:33 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Smackle wrote:

After doing this, I get the batch file and the anki text file, but when I run the batch file, I don't get the split mp3s. I don't know where they go.

If, for example, this was a line in your batch file:

C:/WINDOWS/mp3splt.exe -q -o Clannad_01_00.07.09-00.08.41.mp3 -d I:/Temp2/Clannad/Media/Clannad_01_Audio "I:/Temp2/Clannad/TempAudio/01 - Clannad.mp3" 00.07.09 00.08.41

The bold part (the part after "-d") is the directory that the split files will be sent to.

Smackle wrote:

Also, where can I find ImageWriter? My search skills are not very good and I turned up too many different results.

It should be included by default with AviSynth (http://avisynth.org/mediawiki/Main_Page)

Reply #23 - 2009 January 25, 5:42 pm
Smackle Member
Registered: 2008-01-16 Posts: 463

"C:\Program Files\mp3splt\mp3splt.exe - q - o Sore_{0}_{1:00.}.{2:00.}.{3:00.}-{4:00.}.{5:00.}.{6:00.}.mp3 -d C:\Program Files\Subs2SRS\Sore\Media\Sore_{0}_Audio "C:\Program Files\Subs2SRS\Sore\TempAudio\{0} - Sore.mp3" {1:00.}.{2:00.}.{3:00.} {4:00.}.{5:00.}.{6:00.}" $batch_num $s_min_t $s_sec $s_hsec $e_min_t $e_sec $e_hsec

Is what I put in the output 1 output, but I cannot find anything in Media except empty folders.

Last edited by Smackle (2009 January 25, 5:48 pm)

Reply #24 - 2009 January 25, 5:56 pm
cb4960 Member
From: Los Angeles Registered: 2007-06-22 Posts: 917

Smackle wrote:

"C:\Program Files\mp3splt\mp3splt.exe - q - o Sore_{0}_{1:00.}.{2:00.}.{3:00.}-{4:00.}.{5:00.}.{6:00.}.mp3 -d C:\Program Files\Subs2SRS\Sore\Media\Sore_{0}_Audio "C:\Program Files\Subs2SRS\Sore\TempAudio\{0} - Sore.mp3" {1:00.}.{2:00.}.{3:00.} {4:00.}.{5:00.}.{6:00.}" $batch_num $s_min_t $s_sec $s_hsec $e_min_t $e_sec $e_hsec

Is what I put in the output 1 output, but I cannot find anything in Media except empty folders.

Don't put a space between "-" and the option. So "- q - o" should be "-q -o".

Reply #25 - 2009 January 25, 5:59 pm
mikankun Member
From: Massachusetts Registered: 2008-11-22 Posts: 49 Website

Are the English readings just the subtitles matched up with the audio?