a totally innocent thread about 日本語 books

Index » Learning resources

hereticalrants
Member
From: Winterland
Registered: 2009-10-23
Posts: 289

Well, that's the funny thing, innit?

It would appear that the 日本語 version of OpenOffice has all of that stuff included in it.
(I thought you were talking about something else)

Carry on.

Reply #77 - 2010 March 11, 10:15 pm
ruiner
Member
Registered: 2009-08-20
Posts: 751

Wow, I think I've collected nearly every novel on the Japanese internetz. In legitimately purchased ebook format, I mean. I'm sure the people reading this thread aren't missing much from what they have also bought in remarkably similar collections, however.

Reply #78 - 2010 March 11, 10:50 pm
ta12121
Member
From: Canada
Registered: 2009-06-02
Posts: 3187

I'm going to buy a HUGE load of mangas and novels very soon.....

Advertising (register and sign in to hide this)
JapanesePod101
Sponsor
 
Reply #79 - 2010 March 12, 1:54 am
Jarvik7
Member
From: 名古屋
Registered: 2007-03-05
Posts: 3940

I'm sad no one picked up on my Naked Lunch reference sad

Reply #80 - 2010 March 12, 2:23 am
ruiner
Member
Registered: 2009-08-20
Posts: 751

Jarvik7 wrote:

I'm sad no one picked up on my Naked Lunch reference sad

I picked up on it. Was I supposed to say something? ;p

Reply #81 - 2010 March 12, 2:25 am
Jarvik7
Member
From: 名古屋
Registered: 2007-03-05
Posts: 3940

A wink-wink/nudge-nudge would have been nice.

Reply #82 - 2010 March 12, 11:49 am
ruiner
Member
Registered: 2009-08-20
Posts: 751

Jarvik7 wrote:

A wink-wink/nudge-nudge would have been nice.

Don't be such a baby. You've been in Japan for too long, and become infantilised.

Reply #83 - 2010 March 15, 11:29 pm
Asriel
Member
From: 東京
Registered: 2008-02-26
Posts: 1343

When I want to read something, I know that I like to have something physical in my hands. That's why I like things like the Amazon Kindle, etc..except that they are just way too expensive.

If only they would come up with some way to read ebooks on Nintendo DS or something...That sure would be handy.

Although, I'll bet getting it to work with Japanese books/fonts might take some extra setting up.

Reply #84 - 2010 March 16, 2:27 am
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

I don't have a DS but that's cool. Or it would be cool if I had an enormous collection of Japanese books.

Reply #85 - 2010 March 16, 8:14 am
yukamina
Member
From: Canada
Registered: 2006-01-09
Posts: 761

Aren't there some reading "games" for the DS? Novels, literature, or at least some visual novels.

Reply #86 - 2010 March 16, 8:20 am
wccrawford
Member
From: FL US
Registered: 2008-03-28
Posts: 1548

yukamina wrote:

Aren't there some reading "games" for the DS? Novels, literature, or at least some visual novels.

Quite a few, actually, but they are all sadly higher than my current skill level.  I'll get there eventually.  smile

Reply #87 - 2010 March 16, 8:58 am
Asriel
Member
From: 東京
Registered: 2008-02-26
Posts: 1343

as for lit software, there are a few, but how many are there that have hundreds of books of your choice? not to mention being able to change them whenever you please

especially if one happened to have a smorgasbord of japanese texts at their disposal

Reply #88 - 2010 April 01, 3:19 am
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

Well, following developments here, I definitely haven't anything good to offer, but I would like to innocently point you to this URL -- http://sn.im/akecgu07 -- which only resembles a shortened URL with randomly scrambled letters and numbers, it certainly doesn't resemble a 'sega', 'pup', or 'toad'.

Just for the record, if I ever happen to totally legitimately upload a public domain shadow-bound book after figuring out the previously linked tools (Transcriber; Kage Shibari--okay that's not the official name, I'm just testing it out), I would hope others would happen to have the source text and similar tools and be able to reference them in case I made any mistakes, though I'd be 99% sure it was mistake-free. Edit: Although it's possible if I ever OCR something--something in the public domain, I mean, words like 忘れた might end up with the wrong kanji, or maybe I'd be too dense to pay attention to paragraph demarcation in vertical writing.

Anyway, I hope no one else pitches in to such imaginary projects, because that would result in a respository of interactive audio-/e-books!

Edit: Make sure not to attempt any kind of quick update by visiting this site: http://sn.im/jr1fsnyl -- especially if you just read Asriel's comment which doesn't reference any particular text.

Last edited by nest0r (2010 April 01, 5:15 am)

Reply #89 - 2010 April 01, 4:27 am
Asriel
Member
From: 東京
Registered: 2008-02-26
Posts: 1343

Just real quick at the beginning (haven't gone through the whole thing):
だれかの忘れた子供用の小さな 靴がひところがっているだけになるような公園だった。
僕は両親が帰ってくるまで


I can tell you that this is one project that I would never like to participate in. In fact, I'm staying far away from looking for potential interesting-sounding books that someone could make *.trs files for

Last edited by Asriel (2010 April 01, 4:29 am)

Reply #90 - 2010 April 01, 4:29 am
kendo99
Member
From: TN
Registered: 2010-03-08
Posts: 182
Website

"Shadow bound" is a sweet moniker for the process you're not pioneering here, nest0r.

Reply #91 - 2010 April 01, 5:39 am
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

@Asriel - I would say 'fixed--see edit above you' if I had any involvement in this imaginary thing you're on about, but I don't so I won't. By the time one theoretically gets the hang of the OCR revision process, one might have forgotten to go back and correct the first page. But hypothetically I'm sure that's all they missed? Although if a handful or even a cpl people happened to cite a paragraph # (as is sometimes listed at the tops of vertical writing pages) and 'clear' or whatever, I'm sure that would put minds at ease. For instance I could say 1-19: clear, but my perspective wouldn't be fresh. Not that I have a perspective.

Anyway, I'm sure more organized people than me could think of better ways to not arrange such a proofing process.

@kendo - hehe, yeah, I'm trying to jazz it up. It's not just studying, it's something cool like ninjutsu!!

Also: *ahem* rhinospike *cough*

Last edited by nest0r (2010 April 01, 6:02 am)

Reply #92 - 2010 April 08, 1:24 pm
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

So anyone else bother making any more materials yet? I was busy focused on French Core 2000+, but intend to make more shadow-bound books. Or rather, small to mid-sized texts. From source such as what hasn't been posted a couple comments above... I wonder how many people at RhinoSpike would be willing to read the first page or so of a bunch of light--I mean of public domain works that don't have audio...

Then there's articles, essays, etc. Blog entries...

Edit: And remember, when you have the timings for these via Transcriber, not only will they be interactive audio/e-texts, but you might eventually be able to use them in cb4960's subs2srs! ;p books2srs?

Last edited by nest0r (2010 April 08, 1:56 pm)

Reply #93 - 2010 April 18, 9:57 pm
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

nest0r wrote:

wccrawford wrote:

nest0r wrote:

BTW, I'll probably think of an obvious solution and feel silly later, but any tips for when one has a hypothetical batch of .txts with a certain encoding, and wishes to make of the texts a searchable database, esp. when their default OS explorer doesn't search them properly?

2 things come to mind...  Use Linux instead, or...  Write some kind of PHP (or other language) script that gives you a nice index of them and let your browser help you with the searching.

Edit:  lol and that's not irony big_smile  I've always said that I'm learning JP to read the books that don't get translated.  After I read it in English, I'll mainly read it in Japanese for practice more than anything.  (Same as Harry Potter.)

Too much trouble. I'll figure something out. Feels like the answer's just out of reach, something simple. Wonder if Adobe Catalog works for .txt/Japanese... or perhaps there's something like Devonthink for Windows...

EDIT: Guess I can make do with searching via MS Word (File-->Search), even though I then must click the results to open in Word and Find the search terms in question. Good enough for now.

Just an update for searching all these hypothetical texts, here: http://forum.koohii.com/viewtopic.php?p … 79#p101079

Reply #94 - 2010 April 19, 12:04 am
Tomnook
Member
From: USA
Registered: 2009-10-09
Posts: 21

If say... A random person had a batch of files. How would they go about making the notepads display Japanese? Would this person have to extract the files in a special way? I understand that they could simply open the notepads in MS Word, but to be able to have notepads display kanji could also be nice.

Said person would also most likely be very thankful to the OP for some random reason or another.

Reply #95 - 2010 April 19, 1:00 am
nest0r
Member
Registered: 2007-10-19
Posts: 5234
Website

Tomnook wrote:

If say... A random person had a batch of files. How would they go about making the notepads display Japanese? Would this person have to extract the files in a special way? I understand that they could simply open the notepads in MS Word, but to be able to have notepads display kanji could also be nice.

Said person would also most likely be very thankful to the OP for some random reason or another.

No problem... ;p

You can always just open in Firefox or another text editor. Why do you want Notepad to do it?

If it's just the name Notepad you want, you can always use: http://www.flos-freeware.ch/notepad2.html (Open the .txt and then File-->Encoding-->Record) or Notepad++, which also can 'replace' Notepad: http://notepad-plus.sourceforge.net/uk/site.htm

If you're using Word, you can do this: http://forum.koohii.com/viewtopic.php?pid=94780#p94780

Last edited by nest0r (2010 April 19, 1:05 am)

Reply #96 - 2010 June 06, 10:23 am
iSoron
Member
From: Canada
Registered: 2008-03-24
Posts: 487

nest0r wrote:

By the way, what are some good URL shorteners? snipURL is one I like, because it allows you to customize the URL!

Sorry to bump such an old thread, but I just found two great URL shorteners.

    sharehub: http://tra.kz/gz91a
    tra.kz: http://sharehub.me/ad1b7

Their urls look quite similar. I wonder what would happen if we swapped their random codes.

Last edited by iSoron (2010 June 06, 10:24 am)

Reply #97 - 2010 June 06, 12:24 pm
pm215
Member
From: UK
Registered: 2008-01-26
Posts: 1352

Totally unrelatedly, I'd like to propose a hypothetical anti-piracy mechanism based on the principle of putting the offending material on incredibly slow servers so that all the would-be pirates die of old age before getting their hands on the material...

Reply #98 - 2010 June 06, 6:43 pm
FooSoft
Member
From: Seattle, WA
Registered: 2009-02-15
Posts: 498
Website

I don't know if anyone cares, but supposedly if one were to have a bunch of novels that were encoded in shift_jis and wanted to convert them to utf-8 while stripping away the annoing readings that are inlined in the text (thus preventing rikaichan from working), they could use this python script I wrote for this purpose:

Code:

#!/usr/bin/python

import sys
import os
import re

def cleanup_contents(contents):
    for reading in re.findall(u"\u300a[^\u300b]+\u300b", contents):
        contents = contents.replace(reading, u"", 1)
    return contents

def process_file(filename, backup):
    filename_bak = filename + ".bak"

    if backup and os.path.exists(filename_bak):
        return

    file_read = open(filename, "rb")
    contents = file_read.read().decode("shift_jis", "replace")
    file_read.close()

    if backup:
        os.rename(filename, filename_bak)

    file_write = open(filename, "wb")
    file_write.write(cleanup_contents(contents).encode("utf-8", "replace"))
    file_write.close()

params = sys.argv[1:]
backup = "-b" in params
filenames = [param for param in params if param[0] != "-"]

for filename in filenames:
    process_file(filename, backup)

Last edited by FooSoft (2010 June 06, 6:44 pm)

Reply #99 - 2010 June 06, 9:15 pm
ninetimes
Member
Registered: 2008-10-08
Posts: 114

Woah, long code quotes totally shattered the forum's formatting.

Edit: Damn, was kind of a long shot but was hoping inserting another cell might extend your post out.  But instead...

Last edited by ninetimes (2010 June 06, 9:15 pm)

Reply #100 - 2010 June 06, 9:44 pm
FooSoft
Member
From: Seattle, WA
Registered: 2009-02-15
Posts: 498
Website

Yeah, looks like code tags might be having issues...

































Some buffer lines smile