mentat_kgs Wrote:Not faster, but it resumes downloads and checks if the file was already downloaded.Ah, awesome.
2009-05-25, 3:06 pm
2009-05-26, 8:58 am
I just updated it to use each article's title as filenames.
If you don't want this behavior, change the configuration option
USE_JAPANESE_NAMES = true
to
USE_JAPANESE_NAMES = false
If you don't want this behavior, change the configuration option
USE_JAPANESE_NAMES = true
to
USE_JAPANESE_NAMES = false
2009-05-26, 9:45 am
Latest update broke it for me. Got this:
Quote:browsing http://news.tbs.co.jp/
spider.rb:67:in `iconv': ".BtBeI=Be9T$N8x@"... (Iconv::IllegalSequence)
from spider.rb:67:in `clean_text'
from spider.rb:21:in `initialize'
from spider.rb:82:in `new'
from spider.rb:82
from spider.rb:76:in `collect'
from spider.rb:76
Advertising (Register to hide)
May 16 - 30 : Pretty Big Deal: Save 31% on all Premium Subscriptions!
- Sign up here
2009-05-26, 12:53 pm
Just fixed it.
2009-05-27, 9:23 am
Script broke with today's news again and I fixed it again.
