RECENT TOPICS » View all
I just finished RTK1 and like a lot of you, I'm doing the sentences method. I'm sure everyone else that is doing this is having similar problems. Where do you get all these sentences from. Now I'm using the file that people made for KO2001 and it's great, but for some people who don't have the book, they are stuck without sentences or have to find things(type things) themselves. Anyways, my idea is to use Lang-8. Lang-8 is a website where you write a blog in Japanese and then Japanese people correct it for you(grammar, kanji errors,ect).Now if we write a blog and someone corrects our sentences, we can use these corrected sentences in our SRS. However I think the really useful part about this is, we can post these sentences without having to worry about copyright laws. If we could get 10 people to write a blog every week and then post 5 of the corrected sentences with hiragana and translations, we could all have 50 new sentences each week. In 6 months we could have 1200 or more sentences.
After we have a good amount we could also compile them into a spreadsheet or divide them into 3 or more spreadsheets(beginner, intermediate and advanced, ect). This is also a good chance for people to practice writing in Japanese. The real reason I think this will be helpful is people won't have to type things up as much. For me too much of my time is spent looking for sentences and then typing the sentences. With this everyone will have a huge repository of sentences about many different things, they can just cut and paste into their SRS. And they won't have to worry about lack of translations or readings, ect. What do you think?
Good luck. I personally would have actually started the beginner spreadsheet as a proof of concept before posting. If you want to get an idea like this started, you really have to show people it's doable, and not just post about how you think it's a good idea.
I dont think its a good idea, because the sentences arent originating from native speakers. So, they might be 'correct' per se, but still perhaps a bit odd.
Of course, if someone really wanted to do that, they could just look at many of the corrected entries that are already viewable on there...
Chandlerhimself wrote:
I'm sure everyone else that is doing this is having similar problems. Where do you get all these sentences from.
You're in Yokohama and you're having trouble finding Japanese sentences?!!!!
Of course I can find sentences. I can't find them with readings and translations that I can easily put into a SRS. I can't just ram my phone bill into the DVD drive of my computer.
I know they aren't coming from native speakers, but this could be fixed by having a native speaker look over them and make sure they don't sound too strange. Besides textbooks aren't exactly natural and even native speakers don't all speak the same way, so even when you are learning something from a friend you might be picking up something that only your friend says or only certain people say. I don't think you can "sentence mine" that problem away. The only way to learn what words people say most often is to speak to Japanese people and read lots of Japanese from different sources. Unless you're planning to do 100,000 sentences I don't know if you can learn to speak and write naturally through SRS(please don't say AJATT, because I've had people tell me his Japanese is good, but not like a typical Japanese person).
My idea was just a way for people to get some free sentences, so they didn't have to spent there time preparing to learn Japanese. If I have 2 hours to study everyday, I want to spend at least 1 and a half of those hours studying. I don't want to spend 1 hour looking for(typing) sentences.
Translations are not necessary and often a hindrance. All you need is to understand. As for readings, ask somebody. I mean, you're in Japan.
Who says Khatzumoto's Japanese isn't typical? You might as well back this with something if anybody's going to take what you say half seriously.
Besides textbooks aren't exactly natural and even native speakers don't all speak the same way, so even when you are learning something from a friend you might be picking up something that only your friend says or only certain people say. I don't think you can "sentence mine" that problem away.
I think you're problem here is that you called it "the sentences method". It's a bit irritating to see people call it that, when it's not. Sentence collecting is simply a tool for effective review of what you know. The real learning is the massive Japanese reading, exposure and (hopefully) conversing. If that's not going to get you good at Japanese, I don't know what will.
Chandlerhimself wrote:
I can't find them with readings and translations that I can easily put into a SRS. I can't just ram my phone bill into the DVD drive of my computer.
I see. I guess dictionaries are out of fashion these days.
Hey, guys, Chandlerhimself is just tossing out an idea. Feel free to agree with his thoughts or not, but the level of snark on this thread is rather high.
Its not a bad idea, but I think one of the strengths of using a book like KO is you are making the most use of your study time, in the sense that you are studying the Highest frequesncy kanji compounds and voc.
After RTK, I did a little sentnece minning, and surfing websites for sentences, but I found it too time consuming and also I wasnt ever sure if word I found was a common word or not, without checking.
The sthing about KO is it prioritizes your Kanji compunds, readings, and voc study, so that you can be up and reading most things (60-70% at book one, 90% at book two) asap.
I felt that if a lot of people were contributing to blogs, then you would get a range of sentneces for sure, but they might range from Final Fanasty words to computer terms, and while that may be a good thing in the long run, I think compound frequency study is a better plan of attack.
Still, the idea has some merit. Perhapase we could make a KO book 3 with the frequesncy list for the kanji that KO 1and 2 dont cover, using lang 8.
Life is strange, you live in Yokohama (I envy you) and have trouble in finding sentences, I live in Italy and have the exact opposite problem. I have too much sentences to add in Anki ^__^.
Namely I'm doing KO, plus I read Death Note (and there are plenty of sentences to mine there). Again I watch a lot of doramas and every word pops to my ears is a potential source of sentences from dictionary examples. And there are all those casual sentences I meet here and there too (receipts from online sellers, advertising and so on).
Since a lot of sentences from KO are a bit trivial for me, I decided to not add every sentence to Anki, maybe giving precedence to "native" material would be better (using Tiger&Dragon's script is more challanging ^_^).
By the way, in about three months of AJATT I have about 1900 facts in Anki (with 900 kanji). I think there are a lot of sentences out there, I don't think it's necessary to do this blog posting activity. Best to spend time in mining native sources. ^_^
Lang-8 has a lot of Japanese written by native speakers as well. I wouldn't enter in corrected sentences too much, since they started as non-native and then often they are corrected to be passable Japanese, but maybe not the best.
Lot's of Japanese people use the site and write comments, blog posts, and forum topics in native Japanese. So those would probably be a good source. Just bear in mind the style, which is written Japanese for the internet...which will of course be different than spoken Japanese or more formal writing.
Some people here aren't reading my posts. I never said I don't have sentences. I'm sure I have over 100,000 sentences in my apt right now. What don't have is time to waste typing them into anki and figuring out what they mean. The reason the KO2001 project is so good is, it saves time. I can go and copy everything from a spreadsheet and import it into anki in less than 5 min. I can't type 15 example sentences in 5 min. Maybe some people here are good at typing.
zazen666- I don't think this should replace KO I think you could use both at the same time. You said when you started it took too much time to mine for sentences. I just want to give people like you(and me), who don't have time, something they can put into their SRS and then use their time for studying and not searching. Your idea for a KO vol3 is good. I think it would be great if everyone got together and made a list of sentences using those kanji.
For me it makes no sense for everyone to make their own cards when we can all just share cards. I'm sure everyone on here has different things they want to do when they learn Japanese, but for the most part it wouldn't be a bad idea to use another persons' sentences if it saves time.
Chandlerhimself wrote:
zazen666- I don't think this should replace KO I think you could use both at the same time. You said when you started it took too much time to mine for sentences. I just want to give people like you(and me), who don't have time, something they can put into their SRS and then use their time for studying and not searching. Your idea for a KO vol3 is good. I think it would be great if everyone got together and made a list of sentences using those kanji.
For me it makes no sense for everyone to make their own cards when we can all just share cards. I'm sure everyone on here has different things they want to do when they learn Japanese, but for the most part it wouldn't be a bad idea to use another persons' sentences if it saves time.
I like your idea in theory-the KO project did save me a LOT of time (i typed in half the book myself before the project was born). Hmm, maybe we should start a new thread, since this one seems to have jumped off track. (I dont people should jump down your throat for throwing out ideas)
We could consider the KO vol3. get the list and divide up the Kanji. It doesnt necessary have to use lang 8. We could track down sentences and submit them.
Yet another idea I had was to create SUbject based sentences. I am reading some books on Astronomy now in Japanese and I search around on the internet a bit trying to find a decent list of related terms to plug into Anki to give me a heads up. Words like, solar system, galaxy, light year, light speed, oxygen, hydrogen, shooting star, etc etc.
I thought something like that might be usefull. Loke 50 sentneces with specific voc per each topic. Topics could be what ever. Science, SF, History, Music, Games, whatever.
for me persoanlly, anything like this would really be one the side for me. It took me a year to get thru KO1 and I am satisfied with the results, so I plan to keep on trucking thru KO2-which might take another year-and just read books for fun on the side. Its really had to say what the needs will be a year from now.
Last edited by zazen666 (2008 September 03, 4:29 am)
I like the idea of subject based sentences. If we had 20 people working on it and each person found 5 sentences, you could have 100 new sentences about any subject the next day. I thought Lang-8 was a good idea, because it's not copyright protected and it's a chance for me(and others) to see their mistakes and correct them. However the main thing was just getting more sentences. The is no reason why we have make things harder on ourselves.
Chandlerhimself wrote:
I don't think you can "sentence mine" that problem away. The only way to learn what words people say most often is to speak to Japanese people and read lots of Japanese from different sources. Unless you're planning to do 100,000 sentences I don't know if you can learn to speak and write naturally through SRS(please don't say AJATT, because I've had people tell me his Japanese is good, but not like a typical Japanese person).
You complain that AJATT guy doesn't have natural Japanese, so your solution is to mine sentences that were originally created by non-native speakers?
But like you said right there, the way to really learn is by talking and reading. Now, when you are reading something, you will have to figure out the readings and meaning of the sentences anyways. So this is NOT an extra step that you have to take when adding stuff to your SRS!
And really, I don't see how you can have a shortage of sentences, there are a few other spreadsheet projects going on besides KO, and all of the books in these projects are fairly cheap. There is also the tanaka corpus and the tanuki corpus, where you can find thousands of sentence pairs. And then, there is also the eijiro dictionary ( http://www.alc.co.jp/ ) which has thousands upon thousands of sentence pairs.
So really, there is absolutely no shortage of sentences out there, which are ready to bring into an SRS. I do think there is some merit to collecting and organizing them in a particular way though, such as a "book 3" to KO.
There are plenty of sentences around on the internet. Just cut and paste, it's so much easier than typing them up. Here's my blog about signs, it's Anki ready:
http://itsajapanesesign.blogspot.com/
Here's Albion's blog about small print:
http://gochuui.blogspot.com/
Here is a RevTK thread with parallel texts, translated texts, Japanese and audio:
http://forum.koohii.com/viewtopic.php?id=804
Here's Tae Kim's guide to Japanese Grammar:
http://www.guidetojapanese.org/
And here are all the Tae Kim sentences in a spreadsheet:
Basic Grammar (149 sentences)
http://spreadsheets.google.com/pub?key= … output=txt
Essential Grammar (305 sentences)
http://spreadsheets.google.com/pub?key= … output=txt
Special Expressions (263 sentences)
http://spreadsheets.google.com/pub?key= … output=txt
Advanced Grammar (98 sentences)
http://spreadsheets.google.com/pub?key= … output=txt
Here are some online scripts:
Tonari no Totoro, My Neighbour Totoro, となりの トトロ
http://user.chollian.net/~akizuki/siryo%5E/totoro.htm
Princess Mononoke, もののけ姫
http://user.chollian.net/~akizuki/siryo%5E/mononoke.txt
http://www.nigels.com/mononoke.html
Death Note,
http://lunar.littlestar.jp/stardust/eng … /DN-1.html
Matrix Revolutions, マトリックス レボリューション
http://homepage.mac.com/tomo_sakuray/Ma … pt_JP.html
Alice in Wonderland, 不思議の国のアリス, by Lewis Carroll,
http://www.hp-alice.com/lcj/honyaku/yaku_index.html
http://www.genpaku.org/alice01/alice01j.html
Through the Looking Glass, 鏡の国のアリス by Lewis Carroll,
http://www.genpaku.org/alice02/alice02j.html
Just cut and paste, there are dorama scripts too, although I don't have those sites at hand, just search.
I agree that the ideal is to never, ever type up any sentences, just use all that time to post links to forums, ah..., I mean study more. So I proposed before to have people sign up for a free blogger blog and post some non-copywrite stuff that they find. If everyone did it then we soon would have a large deposit of authentic Japanese which would forever be available. Anyway, I talk about it in this thread:
http://forum.koohii.com/viewtopic.php?id=1831
zazen666 wrote:
Yet another idea I had was to create SUbject based sentences. I am reading some books on Astronomy now in Japanese and I search around on the internet a bit trying to find a decent list of related terms to plug into Anki to give me a heads up. Words like, solar system, galaxy, light year, light speed, oxygen, hydrogen, shooting star, etc etc.
I thought something like that might be usefull. Loke 50 sentneces with specific voc per each topic. Topics could be what ever. Science, SF, History, Music, Games, whatever.
The ALC site has a selection of theme-based sentences. There are 50 themes (including "宇宙") and nearly 3,000 sentences in all. It takes time to copy & paste them individually, but if you want them all neatly in a spreadsheet then email me.
In fact those theme databases are only a small part of ALC's sentence collections. The main page is here.
Thanks for those Tae Kim links, Phauna. Did I miss a previous link to them on the forum?
いただきます。
Ah, they were on the old failed AJATT forum for a while, Pauline C & P ed them. The not yet failed new AJATT forum can be found here:
http://ajatt.pseudosphere.net/
And soon it will also be filled with sentences hopefully. Dorama, anime and movie scripts alone, if gathered together in one place, would be more than enough sentences for anyone.
Last edited by phauna (2008 September 03, 5:13 pm)
timcampbell wrote:
Hey, guys, Chandlerhimself is just tossing out an idea. Feel free to agree with his thoughts or not, but the level of snark on this thread is rather high.
Sorry, this is a bit off-topic and probably a post for another section altogether, but IMO the level of snark on this entire forum is rather high. Maybe it's just a reflection of the demographics (mostly overachieving young males, it seems)? It's a shame, as I don't think that reflects the attitude of the owner, and it is after all a site aimed specifically at learners. Just my observation...
Katsuo wrote:
zazen666 wrote:
Yet another idea I had was to create SUbject based sentences. I am reading some books on Astronomy now in Japanese and I search around on the internet a bit trying to find a decent list of related terms to plug into Anki to give me a heads up. Words like, solar system, galaxy, light year, light speed, oxygen, hydrogen, shooting star, etc etc.
I thought something like that might be usefull. Loke 50 sentneces with specific voc per each topic. Topics could be what ever. Science, SF, History, Music, Games, whatever.The ALC site has a selection of theme-based sentences. There are 50 themes (including "宇宙") and nearly 3,000 sentences in all. It takes time to copy & paste them individually, but if you want them all neatly in a spreadsheet then email me.
In fact those theme databases are only a small part of ALC's sentence collections. The main page is here.
Hey-thanks for that.
As of late, the hard core approach to "the sentence method" has been the subject of some blogs. And I agree with them for the most part.
Consider: I make a new account on RTK, add in 3005 kanji the first day. What have I accomplished? The kanji were always there. Ok, so I take those 3005 kanji, add a story to each and every one of them. Have I accomplished much? Not yet. Ok, I can now just start reviewing the kanji, right? Yeah, but I'm going to be missing virtually every one. The key part was that I had access to 3005 stories that I can add fairly quickly, but just having that is not what'll really count.
I approached the sentence method in a similar way I approached the kanji: one at a time. Sure, that might be one at a time but 30 in one day. With the kanji, I looked at the story and the kanji and drew it down. Nothing radical about it. With the sentences, I look over it, get any definitions or notes that are needed, then write it down once. Again nothing radical. Those sentences don't go from spreadsheet to the SRS until they are vetted.
Even then, it does not mean a thing to me until they get about a month between reviews (kanji and sentences). Anki is great as these cards are listed as mature, and RTK I look at it as stack 4+.
Guess my point ends up is: By the time you get to blogs as your sentence source (and blogs are your thing of enjoyment to read), you should be able to cut and paste a japanese sentence, put in the appropriate definitions (probably using online j-j dictionaries), maybe even add an audio clip. After 1000 or more sentences, you rely less on the english translation. Not sure why, it just comes about. The big thing is you're not typing in the sentences. You just read a blog, saw a word in a sentence you didn't know. Looking up the word, you now understand the sentence fully. Now you're using that sentence to help you remember what that word means. Copy and paste the whole way.
I was thinking we'd use the coordinated spreadsheets for the beginner level stuff we all need at the, well, beginning of our studies. Early on, I saw the futility of throwing every sentence into a spreadsheet (Rosetta Stone, upto unit 3, ughh). My current thinking now is the minimum to get us each into what we like: anime, drama, Japanese girlfriend, walking downtown Yokohama, etc.
What I notice: I'm going to do what I like to do (view Dramas, Karaoke, mangas, walking around). As I do sentences, I tend to get more enjoyment from what I like to do (see above). At times, I was putting off what I liked to do to do sentences. This led to a bit of burnout now and again (including the last week). So, I do what I like to do, realize there's more to figure out so I'd like to go back to RTK and sentences again.
Sorry for the ramble. Needed to get a bit off the chest now and again.
Okay, probalby an absolute newb question, but .... about those spreadsheets in general (I've never heard about them before I came to this site anyway).
How exactly can I put them into Anki?
Copy and paste every single sentence? That's not that much faster than typing, so I suppose there's a faster way to do it?
Please enlighten me.
Thanks and sorry for the OT.
Thanks for the links to those sentences. I didn't know about those resources. I didn't know all those resources were public. I'm glad they are all in one thread now, so everyone can find them easily.
phauna wrote:
I agree that the ideal is to never, ever type up any sentences, just use all that time to post links to forums, ah..., I mean study more. So I proposed before to have people sign up for a free blogger blog and post some non-copywrite stuff that they find. If everyone did it then we soon would have a large deposit of authentic Japanese which would forever be available. Anyway, I talk about it in this thread:
http://forum.koohii.com/viewtopic.php?id=1831
On the subject of copyright and subject-based collections, wouldn't just taking a few sentences from published books be fine? Not whole paragraphs or anything like that, just a couple of choice sentences? Does that fall into 'fair use'?
Using the books about space as example and blogs as the method of delivery, you could take sentences with the more important words (star, planet, comet, solar system, etc.) and post those. Or start a blog for a novel/manga/drama/etc., and take 5/10/15/however-many-you-like sentences you find interesting or useful from a chapter or episode to post.
The goal wouldn't be to have a complete catalogue of every sentence and phrase, just a couple of examples. By keeping a limit on how many you take from each chapter/episode (which you can go beyond if you want), it doesn't seem as massive a task as "mine the whole of 涼宮ハルヒの憂鬱" and hopefully makes it easier for people to get into and keep going.

