@thurd
Mean? That's not mean, that's clever I would say, haha
People do not want their subtitles to be stolen so easily for the sake of mining *cynical* Anyway, there IS a way converting the .sub/.idx subtitle files into proper txt-files, but as our language is Japanese this procedure is so time consuming that it is not worth.
Just for the sake of completeness: there is an app - can't find its name right now on my pc, such a huge mess here o0 - where you have to enter every single "character", in this case Hiragana, Katakana and KANJI (!), manually, per hand for the whole script in order to build a complete database, for later, automatic parsing of other idx/sub files, which is saved (store it well if you have it for Japanese built up lol this will take you years, I swear. Perhaps somebody could develop a more proper OCR method for this, f.e. like extracting/converting the KO2001 example sentences, which were pictures, too, into text. Saw the thread on this forum somewhere...)
Talking about such a mass of subtitles one should open up a new spreadsheet for requests of subtitles for the sake (what a nice word, just discovered it!) of completeness, accessible for everyone to add/edit and organise, because the spreadsheet looks rather.. chaotic and some files exist twice.
Additionally I would add the information if the film, the japanese subs are provided for, has a japanese dub actually.. otherwise the japanese subtitles are senseless, following AJATT.