kanji koohii FORUM
Script to steal the audio from the news from TBS - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Learning resources (http://forum.koohii.com/forum-9.html)
+--- Thread: Script to steal the audio from the news from TBS (/thread-3115.html)

Pages: 1 2


Script to steal the audio from the news from TBS - Tobberoth - 2009-05-25

mentat_kgs Wrote:Not faster, but it resumes downloads and checks if the file was already downloaded.
Ah, awesome.


Script to steal the audio from the news from TBS - mentat_kgs - 2009-05-26

I just updated it to use each article's title as filenames.

If you don't want this behavior, change the configuration option

USE_JAPANESE_NAMES = true

to

USE_JAPANESE_NAMES = false


Script to steal the audio from the news from TBS - sethg - 2009-05-26

Latest update broke it for me. Got this:

Quote:browsing http://news.tbs.co.jp/
spider.rb:67:in `iconv': ".BtBeI=Be9T$N8x@"... (Iconv::IllegalSequence)
from spider.rb:67:in `clean_text'
from spider.rb:21:in `initialize'
from spider.rb:82:in `new'
from spider.rb:82
from spider.rb:76:in `collect'
from spider.rb:76



Script to steal the audio from the news from TBS - mentat_kgs - 2009-05-26

Just fixed it.


Script to steal the audio from the news from TBS - mentat_kgs - 2009-05-27

Script broke with today's news again and I fixed it again.