Script to steal the audio from the news from TBS

Index » Learning resources

 
Reply #26 - 2009 May 25, 3:06 pm
Tobberoth Member
From: Sweden Registered: 2008-08-25 Posts: 3364

mentat_kgs wrote:

Not faster, but it resumes downloads and checks if the file was already downloaded.

Ah, awesome.

Reply #27 - 2009 May 26, 8:58 am
mentat_kgs Member
From: Brasil Registered: 2008-04-18 Posts: 1671 Website

I just updated it to use each article's title as filenames.

If you don't want this behavior, change the configuration option

USE_JAPANESE_NAMES = true

to

USE_JAPANESE_NAMES = false

Reply #28 - 2009 May 26, 9:45 am
sethg Member
From: m Registered: 2008-11-07 Posts: 505

Latest update broke it for me. Got this:

browsing http://news.tbs.co.jp/
spider.rb:67:in `iconv': ".BtBeI=Be9T$N8x@"... (Iconv::IllegalSequence)
    from spider.rb:67:in `clean_text'
    from spider.rb:21:in `initialize'
    from spider.rb:82:in `new'
    from spider.rb:82
    from spider.rb:76:in `collect'
    from spider.rb:76

Advertising (register and sign in to hide this)
JapanesePod101 Sponsor
 
Reply #29 - 2009 May 26, 12:53 pm
mentat_kgs Member
From: Brasil Registered: 2008-04-18 Posts: 1671 Website

Just fixed it.

Reply #30 - 2009 May 27, 9:23 am
mentat_kgs Member
From: Brasil Registered: 2008-04-18 Posts: 1671 Website

Script broke with today's news again and I fixed it again.