buonaparte's audio and text links

それから
夏目 漱石 
A recording of Sorekara by Natsume Soseki. It took more than three years to complete, and that's something!
http://18.art-studio.cc/~koenoizumi/KOEw...EKARA.html
Japanese texts:
http://www.aozora.gr.jp/cards/000148/card1746.html
http://www.aozora.gr.jp/cards/000148/card56143.html

I must have posted an English translation somewhere, too - I don't remember.
Reply
buonaparte Wrote:http://18.art-studio.cc/~koenoizumi/KOEw...EKARA.html
I know that voice - she did:
天の灝気の薄明に優しく会釈をしようとして、
Reply
People usually refuse to improve, not SHE.
She's undergone a complete metamorphosis. From an awful amateur in every respect to almost a professional. Impressive.
I mean the reader.
Edited: 2014-09-18, 6:08 am
Reply
RawToast お巡りさん,
would you mind updating your mirror?
I might add something from time to time, not as often as I used to, but you never know....

It wouldn't be bad at all if more mirrors were created.

[Image: 2jc8nk7.png]
Bye.
Reply
"小説の博士が愛したのは、オイラーの公式、
  e^πi + 1 = 0
ということになっている。πは円周率、iは虚数(-1の平方根)。eはπと同じように循環しない無理数で、
  2.718281828459045……
とどこまでも続いてゆく。"

- 『博士の愛した数式』を巡って・小川洋子

EDIT: Only now did I notice that you posted the same image in another thread. 小川洋子's story was interesting nevertheless.
Edited: 2014-09-20, 4:43 pm
Reply
Another audiobook in Japanese:

Frankenstein,
or the Modern Prometheus
by
Mary Wollstonecraft (Godwin) Shelley

フランケンシュタイン
112files 16h04min.m3u
files 1-45
http://www.voiceblog.jp/junkoropin/?il=10&io=40
files 46-112

Reply
buonaparte Wrote:I need a tool that would be able to convert Japanese texts with kanji to spaced hiragana, something similar to this http://nihongo.j-talk.com/kanji/, but able to handle large files - novels.
In case you (or somebody else) are still looking for something like that:

Code:
#!/usr/bin/env python
# -*- coding: utf-8 -*-
#
# Convert Japanese text to spaced hiragana.
#
# This tool converts Japanese text (encoded as utf-8) stored in file 'file.in'
# to spaced hiragana text stored in 'file.out' (also utf-8 encoded).
#
# For the conversion, the tool requires MeCab to be present in:
# 'C:\Program Files (x86)\MeCab\bin\mecab.exe'
#
# Usage:
# $ python convert-to-spaced-hiragana.py

import subprocess
import re
import codecs

fn_tmp_in = '.mc-in'
fn_tmp_out = '.mc-out'
command = [
    r'C:\Program Files (x86)\MeCab\bin\mecab.exe',
    fn_tmp_in,
    '-o', fn_tmp_out
]
def mecab(line):
    file = codecs.open(fn_tmp_in, 'wb', 'utf-8')
    file.write(line)
    file.close()

    subprocess.call(command, shell = True)

    file = codecs.open(fn_tmp_out, encoding='utf-8')
    mout = file.read()
    file.close()
    return mout

hira = \
u'がぎぐげござじずぜぞだぢづでどばびぶべぼぱぴぷぺぽあいうえおかきくけこさしすせそたちつてと' + \
u'なにぬねのはひふへほまみむめもやゆよらりるれろわをんぁぃぅぇぉゃゅょっ'
kata = \
u'ガギグゲゴザジズゼゾダヂヅデドバビブベボパピプペポアイウエオカキクケコサシスセソタチツテト' + \
u'ナニヌネノハヒフヘホマミムメモヤユヨラリルレロワヲンァィゥェォャュョッ'
pattern_a = re.compile(r'^(.*\t).*?,.*?,.*?,.*?,.*?,.*?,.*?,(.*?),.*')
pattern_b = re.compile(r'^(.*)\t.*?,.*')
def parse_line(line):

    mout = mecab(line)
    mlines = mout.split('\n')

    sout = u''
    for ml in mlines:
        if pattern_a.match(ml):
            p = pattern_a.sub(r'\1\2', ml)
            pair = p.split('\t')
            if pair[0] == pair[1]:
                sout += pair[0]
            else:
                pair_1 = u''
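                for ch in pair[1]:
                    if ch in kata:
                        pair_1 += hira[kata.index(ch)]
                    else:
                        # Keep characters with no entry in the table
                        # (e.g. the long-vowel mark) instead of dropping them.
                        pair_1 += ch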
                
                has_kanji = False
                for ch in pair[0]:
                    if ch not in hira:
                        has_kanji = True
                        break
                
                if has_kanji and len(sout) > 0:
                    sout += u' '
                sout += pair_1
                if has_kanji:
                    sout += u' '

        elif pattern_b.match(ml):
            p = pattern_b.sub(r'\1', ml)
            sout += p
    return sout

#print u'建物のいちばん上にある一対の鳳凰の像も修理のために降ろしました。'.encode('utf-8')
#print parse_line(u'建物のいちばん上にある一対の鳳凰の像も修理のために降ろしました。').encode('utf-8')
#exit(0)

file_in = codecs.open('file.in', encoding='utf-8')
file_out = codecs.open('file.out', 'wb', 'utf-8')
for line in file_in:
    lo = parse_line(line) + u'\n'
    file_out.write(lo)
file_out.close()
file_in.close()
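
A side note on the hira/kata lookup strings in the script above: the same katakana-to-hiragana step can also be done arithmetically, because in Unicode the katakana U+30A1..U+30F6 sit exactly 0x60 above the matching hiragana. The following is only a minimal sketch, assuming Python 3; the name kata_to_hira is made up for illustration and is not part of the posted script. Characters outside that range, such as the long-vowel mark ー, are passed through unchanged.

```python
# Sketch only: assumes Python 3. kata_to_hira is a hypothetical helper name.
# Katakana U+30A1..U+30F6 are exactly 0x60 above their hiragana counterparts.
KATA_START = 0x30A1
KATA_END = 0x30F6
OFFSET = 0x60


def kata_to_hira(text):
    """Return text with katakana replaced by hiragana.

    Anything outside the katakana range (kanji, punctuation, the
    long-vowel mark, Latin letters) is left as it is.
    """
    return ''.join(
        chr(ord(ch) - OFFSET) if KATA_START <= ord(ch) <= KATA_END else ch
        for ch in text
    )


if __name__ == '__main__':
    print(kata_to_hira('タテモノ'))
    print(kata_to_hira('コーヒー'))
```

This avoids keeping two hand-typed tables in sync, at the cost of being a little less obvious to read.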
Edited: 2014-11-21, 6:10 am
Reply
The Bible
Text with furigana (almost all the kanji have furigana in hiragana; the katakana there is not furigana but a reference)
http://download.jw.org/files/media_bible/67/bi12_J.pdf
Audio - nice, professional voice.
http://www.jw.org/apps/TRGCHlZRQVNYVrXF?...tCMSLang=E

I posted a parallel Japanese-English line-by-line text in html format some time ago.


You can get a Korean New Testament and a Mandarin Bible with pinyin + audio there too.
Parallel line-by-line texts in html format were posted before.
Reply
buonaparte Wrote:RawToast お巡りさん,
would you mind updating your mirror?
I might add something from time to time, not as often as I used to, but you never know....

It wouldn't be bad at all if more mirrors were created.

Bye.
The mirror should have been running the mirror script once a week, but that doesn't seem to be happening. I've started a manual update of the site to get it up to date.
Reply
Hi.

Does anyone know where I can find,

"Read Real Japanese Essays: Contemporary Writings by Popular Authors" in PDF and Mp3 format?

Thanks.
Reply
Quote:blogs.yahoo.co.jp willmollywilly
http://users.bestweb.net/~siom/martian_mountain/!L-R/
6 Harrrrrriiii Pottttaaaa sentence-by-sentence books, + 4 Hannnibbballl books by Thomassss Haaaarrisss.
This one is very bulky after unpacking.
How do you use this one?
Reply
Click on Haarrrrii1/
download these
Haaarrrii1.7z.001
Haaarrrii1.7z.002
Haaarrrii1.7z.003
And join them to make one compressed .7z file (a zipped file), and then decompress it

and then do the same with Haarrrrii2/ :)
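
If you would rather script the joining step, here is a rough sketch in Python 3. It assumes the parts are plain byte-wise splits named .001, .002, and so on, all sitting in the same folder, and that gluing them together in order gives back the single .7z file. The helper name join_parts is made up for this example.

```python
# Sketch only: assumes Python 3 and byte-wise split parts named
# <prefix>.001, <prefix>.002, ... in one folder.
import glob
import shutil


def join_parts(prefix, out_path):
    """Concatenate prefix.001, prefix.002, ... into out_path.

    Returns the list of part files that were joined, in order.
    """
    # Zero-padded three-digit suffixes sort correctly as plain strings.
    parts = sorted(glob.glob(glob.escape(prefix) + '.[0-9][0-9][0-9]'))
    if not parts:
        raise FileNotFoundError('no parts found for ' + prefix)
    with open(out_path, 'wb') as out:
        for part in parts:
            with open(part, 'rb') as src:
                shutil.copyfileobj(src, out)
    return parts
```

For the files listed above, the call would be something like join_parts('Haaarrrii1.7z', 'Haaarrrii1.7z'), after which the resulting Haaarrrii1.7z is decompressed as usual.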
Reply
Thanks, psychopatate.
Reply
buonaparte Wrote:I posted a parallel Japanese-English line-by-line text in html format some time ago.
Where?
Thanks.
Reply
Ramazzati Wrote:Where?
http://users.bestweb.net/~siom/martian_m...ineHTML.7z



.7z.001 etc files
no need to join them, just unpack with 7-zip.
Reply
buonaparte Wrote:
Ramazzati Wrote:Where?
http://users.bestweb.net/~siom/martian_m...ineHTML.7z



.7z.001 etc files
no need to join them, just unpack with 7-zip.
Great, thank you.
Reply
buonaparte, unable to mirror your website but if this is of any use as a form of backup, here is everything from Index of /~siom/martian_mountain, excluding files from the following:
! L-R the most important passages.htm
!0 Japanese What’s to learn BEGIN HERE.doc
!0 My Granny.gif
Chinese/
Japanese Core 2000 example layout.htm
Korean/
Littérature japonaise.7z
iKnow6000/
mL-R/

The rest is 10GiB all in their respective folders, direct download:
https://mega.co.nz/#F!EMpAyToL!I4zAeMOXkY4ufTqkbDxsEw
Reply
Just an update, but the first entry under "Links"

"Cultural Interviews with Japanese-Speaking Executives
http://www.laits.utexas.edu/orkelm/japan/index.html"

Has been taken down.

I sent an E-mail to Orlando Kelm and Midori Tanaka, the UTA professors that created the site, and it was apparently taken down because the format of the videos is outdated.

Thankfully, they have preserved the content of the old site, and transferred it to http://culturalinterviews.wikispaces.com/

Unfortunately, the interviews don't include links to their corresponding videos, which are now hosted on Youtube, but Professor Kelm says that they are forthcoming. For now, the videos are located here: https://www.youtube.com/user/orlandocourses

They were very quick to reply, and seemed quite pleased that people were taking advantage of the materials hosted on the site. So, probably no need to worry about the interviews vanishing into the ether. At least, not enough to freak out and E-mail two random profs out of the blue... (;^◇^;)
Reply
Ars longa, vita brevis.
http://users.bestweb.net/%7Esiom/martian...taRhei.avi
Japan that is no longer there.
Reply
Malala (NHK)
http://users.bestweb.net/%7Esiom/martian.../Malala.7z
Human beings bless her name.
Reply
NHK クローズアップ現代
http://www.nhk.or.jp/gendai/movie/
Short educational movies about contemporary issues.
Plus transcripts (more or less).

Choose a movie; if you want more details click
詳しくはこちら
Then scroll down and click
続きを読む
to read the rest.
All the parts of an episode here:
全文表示
Reply
16歳 不屈の少女 
http://users.bestweb.net/%7Esiom/martian...0140108.7z
An 11-minute movie clip narrated in Japanese, except for some 70 seconds at the beginning (an interview with Malala Yousafzai in English, but subtitled in Japanese) + transcript in Japanese.
Reply
Malala is 18 years old now. NHK + FNN
http://users.bestweb.net/~siom/martian_m.../!UPDATES/
Malala18yo.7z
Reply
A guy created an interesting L-R tool (French-English).
Have a look:
http://www.languagelovers.net/fr-en-alice/

His post about it:
http://how-to-learn-any-language.com/for...40784&PN=1
Reply
Just a note to say a huge thanks for this work.

As a side note, I hope that this will work for a Japanese person who is finally getting serious about her English reading and listening comprehension (聞き取り).
Reply