Trinity

Index » Feedback

Reply #26 - 2008 January 07, 9:11 pm
dingomick Member
From: Gifu_Japan Registered: 2006-12-16 Posts: 234

Oh. My. God. I've been away from RvTK completely for the last couple weeks since I'm studying for the kanken Feb 3rd (grade schoolers know a f*ck lot of words!!) so I hadn't read any of your updates. Trinity looks AMAZING! yikes It is exactly what I have imagined for a review system. Anki is close, but it scatters learning goals that are intertwined (sentences, vocabulary, readings), and the entry process, while flexible, is often cumbersome.

I think Trinity remembering kanji/vocab you already know and cross-linking them wherever they appear is huge. It's what makes computer review better. Information can simultaneously exist in multiple "places" for mulitple purposes and is not bound to a static location. I shouldn't have to enter 小学校 4 times, once for the word, and 3 times for each kanji, and then have to enter to every time I may use it in a different example sentence.

The mapping feature for the on-yomi is genius. You obviously understand the benefit of incorporating multiple learning methods to support retention.

Will the sentence entry feature automatically recognize vocabulary? If they have to be selected, there is at least a simplified process of clicking and picking, right?

So for reviews, does it just quiz you as an SRS for each entry title? Kanji and vocab are obvious. Will sentences work the same? You will be presented a sentence/"word list" and you fail or pass it if you successfully read it?

I know it's a longshot, but is there any way that Trinity will be available offline? Could there be some sort of database syncing, at least for reviews, that allows you review in a browser offline but have the site recognize your progress afterwards? (Entry could be kept online since that involves so many other databases, but would be megasweet if it was offline too!)

synewave Member
From: Susono, Japan Registered: 2006-06-23 Posts: 864 Website

Will we be able to import lists into Trinity?

Reply #28 - 2008 January 08, 1:27 am
dingomick Member
From: Gifu_Japan Registered: 2006-12-16 Posts: 234

Btw, could this be done by, say, yesterday? tongue I'd love to use it for my kanken reviews.

Advertising (register and sign in to hide this)
JapanesePod101 Sponsor
 
Reply #29 - 2008 January 08, 3:35 am
vosmiura Member
From: SF Bay Area Registered: 2006-08-24 Posts: 1085

dingomick wrote:

Btw, could this be done by, say, yesterday? tongue I'd love to use it for my kanken reviews.

+1. :p

Reply #30 - 2008 January 08, 2:00 pm
ファブリス Administrator
From: Belgium Registered: 2006-06-14 Posts: 4021 Website

meolox wrote:

Excitement rising, really hope I finish by the time this comes online.

Thanks! Well if you haven't finished it can be helpful for RtK1 too. I don't bother clearing my red stack anymore, I just clear the characters when they come up in a word I'm adding in Trinity. This has the advantage that the word helps remember the character. Sometimes there's a blurry line between the compound mnemonic that naturally comes out of the keyword combinations, and the story for the kanji itself, but that's a minor problem.

dingomick wrote:

I shouldn't have to enter 小学校 4 times, once for the word, and 3 times for each kanji, and then have to enter to every time I may use it in a different example sentence.

Thank you, looks like you've figured it all pretty much tongue  To be precise, you will have to enter a word for every sentence it appears in, if you find this association to be useful. If the word is already familiar, you don't have to attach it to the sentence. If you don't remember for sure what the word means, chances are you didn't study it yet.

Trinity does not currently parse the sentences. It would be difficult, but possible to use a specialized program to parse the sentences and extract the related vocabulary. But there are also advantages to doing it manually. One is that the user can decide what words to attach and limit the attached vocabulary to those that are new or challenging at that point. I think having an exhaustive list with each sentence , listing all the particles, and common adjectives or verbs that may be present would make the related vocabulary less effective, particularly when you review sentences, as it will show the linked vocabulary when you flip the card.

Another reason it's not automatic is that unless you're a beginner, you don't want to create hundreds of flashcards for words you know already. So you can add only those words that you would want to review.

Will the sentence entry feature automatically recognize vocabulary? If they have to be selected, there is at least a simplified process of clicking and picking, right?

It's easy to enter words... if you know already how to read them, that's the trick. Otherwise you might have to do some copy/paste from the sentence to the search box, to lookup the word with kanji. You can use cursor keys to browse the lookup results and the enter key to add a word, or click the words directly.

Will sentences work the same? You will be presented a sentence/"word list" and you fail or pass it if you successfully read it?

Sentences have their own flashcards. So you have two types of flashcards in Trinity : vocab and sentences. How you rate your answers for sentences is up to you. You may want to pass if you remembered the reading alone, or to pass if you remembered the whole meaning. If you do use sentences a lot, it would be better to pass vocab even if you remember only the reading (especially new exemplary words added while learning onyomi groups), but pass sentences only when you remember the full meaning.

I know it's a longshot, but is there any way that Trinity will be available offline? Could there be some sort of database syncing, at least for reviews, that allows you review in a browser offline but have the site recognize your progress afterwards?

It's possible with Google Gears for example. I didn't feel it was worth the effort to do this for RtK1. Well, who knows maybe it's not so hard to implement, I have yet to look at it in details. But for Trinity this is something I really want to do, though I wouldn't wait for it.

synewave wrote:

Will we be able to import lists into Trinity?

Sure, but it's not in there yet. For the Trinity beta, which could last 2-3 months I want to see how people use it from the grounds up. I 'm not sure I will add import during that time but who knows. Importing CSV's should be fairly easy.

Reply #31 - 2008 January 09, 3:14 am
Chadokoro_K Member
From: Berkeley, CA - Uji, Japan Registered: 2006-08-22 Posts: 158

Trinity looks very very sweet!

ファブリス, will it be possible for Trinity -- especially the ON readings component -- to include RTK3 kanji? (Pretty please.)

For the ON readings you wrote:

"The current targets are of 1908 kanji for the chinese readings. This number corresponds to the subset of kanji from RtK1 for which there is a chinese reading and where the reading actually appears into a word marked as a "priority" entry into the Japanese dictionary EDICT."

There is also quite a good-sized subset of kanji from RTK3 for which there are Chinese readings that also appear in words marked as a "priority" entry in the Japanese dictionary edict.

I would love to be able to tackle all of the ON readings at once using kanji chains, memory palaces ... oh, and in conquering the "china wall'. (That looks soooo motivating!)

Perhaps a "Include RTK3" checkbox could be created. This way those who have done both RTK 1 & 3 can have the entire ON subset displayed while those who have only studied RTK1 would only see the RTK1 subset of ON readings.

Thanks for all of your hard work.

ファブリス Administrator
From: Belgium Registered: 2006-06-14 Posts: 4021 Website

Hi Chadoro_K

You're right about the RtK3 kanji. In fact some onyomi groups become substantially larger, the largest one, コウ contains ~66 kanji within RtK1 but increases to ~109 with RtK3. At the same time, the total number of groups only increase by ~15.

Chadoro_K wrote:

Perhaps a "Include RTK3" checkbox could be created. This way those who have done both RTK 1 & 3 can have the entire ON subset displayed while those who have only studied RTK1 would only see the RTK1 subset of ON readings.

There will be support for RtK3 as I have been thinking about letting the user choose different "targets", such as the general-use kanji. While I have to think of the implementation of those ahead (for setting up the database etc.) I can't prioritize a RtK3 target at the moment. It won't be in the beta of Trinity.

I also need to think about a way for users to add any kanji with the correct reading into a kanji chain, if someones wants to incorporate the alternative readings; that is one character in multiple onyomi groups.

I think it's efficient to do a long chain at once, but it will be difficult to study it all at once for groups like コウ, especially adding an exemplary compound for each. Once you have a chain and a story or spatial relationships between the characters, I don't see any reason why you couldn't add to it. When you'll "extend" a chain later you'll have to be careful to connect the characters to the exisiting plot.

The chain editor with its visual interface and spatial relationships can make this easier because if you place a character on the "map", you can see immediately closeby characters and can adapt your story or mental image to create a relationship between them.

Reply #33 - 2008 January 09, 3:47 pm
markl11 Member
Registered: 2007-07-24 Posts: 32

Fabrice, any chance of having a CSV *export* function as well?

This would be extremely useful for me because I spend a lot of time off-line. Sometimes it can be days or even weeks sometimes with no access to your site.

So, if I have info in a CSV I could do my own review during those periods (I wouldn't need any import/export of 'Lietner learning info' - simple CSV export would be fantastic, if at all possible :-)   [ pretty please :-)   ]


I made just such a file for the Kanji & stories (for transfer to off-line SRS) for just such a purpose, but what a chore it was to do that manually! I really don't want to have to manually create files again for sentences.


Also, have you considered the possibility of adding a way to lookup/grab defintions from the Yahoo J-J dictionary for those of us that want J definitions and examples?

http://dic.yahoo.co.jp/

I believe that the Yahoo dictionary uses some pretty authoritative sources (please corrrect me if I am wrong, someone!).

Anyway, hope to see Trinity sooooooon,

Mark

Reply #34 - 2008 January 16, 6:02 am
johnzep Member
From: moriya, ibaraki Registered: 2006-05-14 Posts: 373

So any news on the Trinity front? ^_^

ファブリス Administrator
From: Belgium Registered: 2006-06-14 Posts: 4021 Website

Hya,

sorry for the lack of news.

Last weekend I did mostly database optimizations and creating more tables to speed up querries etc.

One thing that got me excited was that I finally saw a optimization decent enough to allow for making mini-games or quizzes that could find out possible kanji compounds with your known kanji or kanji+reading combos. This could be fun. With or without optimization this kind of query could require as much as 2000 times 5000 combinations, so there is still a limit in that it will work by looking at what are the 50 most recently learned kanji+readings for example, and then from that selection, it could quizz you on new compounds you don't know yet, but you are supposed to be able to read. That's just possibilities though, important for me and I needed to plan ahead, but not something that will be in the Trinity beta.

This weekend, just when I thought I was going to get some serious work done, there were hacking attempts on the site, and I had to investigate at ways of making the database secure, update all passwords etc. It's a good thing but it's a bit frustrating as well.

I think also last week I had a new design for the Kanji chains page. The screenshot I posted was just a "proof of concept" so to speak, but it's really not pretty. Another thing I experimented was to do a viewport and scrolling just like Google maps. However I am going to use a fixed view, because I think if the user needs to constantly pan around a kanji chain, it puts more attention on the interface and not enough on the material to study. Instead, I will have a panel that you can toggle on and off like Google Map's left pane, so you can get a full page view. For very long chains however a vertical scrollbar will be necessary.

I also need to really clear up this Trinity concept.

A post from leosmith about graded readers got me thinking. Right now I have Sentences, Vocab and Kanji. However the 2nd part in this "Trinity" is not really vocab, but vocab lists.

I didn't see much use for vocablists yet. But I've realised now that you could effecitvely paste a Japanese text excerpt into the "notes" area, and add the related vocabulary. If those are grouped into vocab list "packs" (I have sentence "packs" at the moment), then graded texts can be shared like this. This is great however I wonder if I could automatically "grade" the content.

One thing is the word frequency but is the word frequency directly related to the level of difficulty of the text? Will the word frequency data from EDICT suffice, etc. etc.

tuuli Member
From: new york Registered: 2007-11-10 Posts: 44

I love the idea of sharing graded (or un-) texts!  One thing I've recently been enjoying on this site is the Reading practice - and I was just thinking that it would be great to be able to save all the texts I've inserted so I could go back to them and use them for studying.
Thanks!

Reply #37 - 2008 January 20, 4:41 pm
meolox Member
Registered: 2007-08-31 Posts: 386

Wow a hacking attempt, how unfortunate sad
I remember you saying you'd hope to get this online by January ファブリス, what's the latest estimation?

vosmiura Member
From: SF Bay Area Registered: 2006-08-24 Posts: 1085

Is there anything that could use some more coding hands, Fabrice?

Reply #39 - 2008 January 21, 6:03 pm
ファブリス Administrator
From: Belgium Registered: 2006-06-14 Posts: 4021 Website

Hya, thanks for the offer.

One thing that I still haven't solved is how to parse Japanese sentences. This can be complex and there's no point in reinventing the wheel. Being able to parse sentences could make the sightreading page dramatically more useful as it could plug into Trinity for adding vocab, sentences etc.

However I don't know of many programs to do this.

I saw there is something called kakasi but the documentation is hard to find, if that's not the one I saw that was all in Japanese and I can't read at that level yet.

There's some hints about it on Rikai's about page too. It looks like David Rudick wrote his own parser afterwards, hopefully I won't need to.

If someone manages to run kakasi and can tell me how it works practically, if I have some Japanese text and I want to find out all the words in it, that could be very useful.

Reply #40 - 2008 January 21, 7:01 pm
styrmis Member
From: United Kingdom Registered: 2007-01-17 Posts: 30

ファブリス、

You might want to get in touch with resolve, since Anki successfully uses Kakasi to determine possible readings for entered sentences.

If you download the Anki source and take a look at the file libanki/anki/support/japanese.py you'll find code which invokes Kakasi for this purpose.

Reply #41 - 2008 January 24, 3:54 pm
vosmiura Member
From: SF Bay Area Registered: 2006-08-24 Posts: 1085

What's the server code written in?

Would be real handy to be able to click on words.  Actually I usually use Rikaichan and copy definitions to clipboard to add to my sentence notes, but if the questions were presented online I wouldn't need to add many word notes to my flashcards since I could always just review with Rikaichan running and hover over the words I need to check.

Reply #42 - 2008 January 28, 2:50 am
Chadokoro_K Member
From: Berkeley, CA - Uji, Japan Registered: 2006-08-22 Posts: 158

ファブリス wrote:

Hi Chadokoro_K

There will be support for RtK3 as I have been thinking about letting the user choose different "targets", such as the general-use kanji. While I have to think of the implementation of those ahead (for setting up the database etc.) I can't prioritize a RtK3 target at the moment. It won't be in the beta of Trinity.

I also need to think about a way for users to add any kanji with the correct reading into a kanji chain, if someones wants to incorporate the alternative readings; that is one character in multiple onyomi groups.

ファブリス,
A belated thank you for this reply.

It is great that Trinity will (eventually) support RtK3 and other targets (such as general-use kanji).

And it's also great that Trinity will allow users to add any kanji with the correct reading into a kanji chain. (Yeah!)

Thanks for all of your hard work on this!

I realize that you have a lot to do to even get the beta online but I think I speak for most everyone when I say,

首を長くして待っています。  smile

Last edited by Chadokoro_K (2008 January 28, 2:50 am)

Reply #43 - 2008 January 29, 8:52 am
Murten Member
From: Karlstad, Sweden Registered: 2007-09-09 Posts: 46

I havent been writing a word here about trinity since I read about it one month ago, which I apologize for, cause ever since I read about it the first time Ive checking for updates about it all the time, read the same news over and over, and Ive been really really excited about it!

I just wanna say to you ファブリス that Im another one of those who are really looking forward to trinity coming online, and Im really grateful for all the work you have been doing and still are doing right now, its amazing!

I hope youre not getting discouraged by the length of that to-do-list, but hanging in there! Im sure studying japanese is gonna be so much funnier and more effective with trinity online, I just cant wait to dive into it!

Last edited by Murten (2008 January 29, 8:53 am)

watashimo Member
From: Germany Registered: 2007-04-28 Posts: 76

Just read the Trinity news for the first time and it sounds awesome. I lost my sentences for Supermemo due to a defect HDD. Time to try something new. The concept sounds great, a system that includes most areas you need for your Japanese studies (Kanji, vocabulary, sentences). I can see myself spending hours per day on your website once Trinity is implemented (currently trying to add all the kanjis into my boxes ;D).

頑張って。

pm215 Member
From: UK Registered: 2008-01-26 Posts: 1354

For sentence parsing, I believe that the parser Jim Breen used to do initial indexing of the Tanaka corpus example sentences was ChaSen. There's a (Japanese) sourceforge site:  http://chasen-legacy.sourceforge.jp/ and it also seems to be in Debian and Ubuntu.

My experience with the examples is that it definitely made mistakes, but then any automatic parser is going to do that...

I just tried installing it and if you feed it

むかし、むかし、ご存知のとおり、うさぎとかめは、山の上まで競争しました。

it outputs

Code:

むかし  ムカシ  むかし  名詞-副詞可能
、      、      、      記号-読点
むかし  ムカシ  むかし  名詞-副詞可能
、      、      、      記号-読点
ご存知  ゴゾンジ        ご存知  名詞-一般
の      ノ      の      助詞-連体化
とおり  トオリ  とおり  名詞-非自立-副詞可能
、      、      、      記号-読点
うさぎ  ウサギ  うさぎ  名詞-一般
と      ト      と      助詞-並立助詞
かめ    カメ    かめ    名詞-一般
は      ハ      は      助詞-係助詞
、      、      、      記号-読点
山の上  ヤマノウエ      山の上  名詞-固有名詞-地域-一般
まで    マデ    まで    助詞-副助詞
競争    キョウソウ      競争    名詞-サ変接続
し      シ      する    動詞-自立       サ変・スル      連用形
まし    マシ    ます    助動詞  特殊・マス      連用形
た      タ      た      助動詞  特殊・タ        基本形
。      。      。      記号-句点
EOS

which isn't too bad (apart from the way it breaks down verbs rather more than you might expect). NB: it expects the input in EUC-JP and provides output the same way.

tomhogers Member
From: Venezuela Registered: 2005-12-22 Posts: 39

Hi Fabrice,

I use ChaSen with J-Gloss and find it "adequate" but, of course, not perfect. It is one of the very few free parsers available that will actually parse words written in kana (if not the only one).
A Windows version is also available with its own dictionary "ipadic" included.

Installation files may be downloaded from:
http://chasen.aist-nara.ac.jp/chasen/di … on.html.en

Cheers,

Tom

tomhogers Member
From: Venezuela Registered: 2005-12-22 Posts: 39

To see the difference using ChaSen for parsing, take a look at the following screen catches of JGloss using its own parsing engine (only parses Kanji) and then with the ChaSen parser included.
I used the same example as pm215.

http://www.flickr.com/photos/98763744@N00/2262762531/

http://www.flickr.com/photos/98763744@N00/2262762571/

As you can see, ChaSen parses the kana words but, as pm215 mentioned, also treats some verbs rather strangely.


Cheers,

Tom

pm215 Member
From: UK Registered: 2008-01-26 Posts: 1354

tomhogers: you might want to try to get jgloss to match up the kind-of-verb annotations ChaSen puts out with EDICT's v5r etc markup so you can avoid picking ones it can't be. Or at least have a special case for suru! :-)

Jim Breen talks about the Tanaka corpus here, and mentions some of the things he did with the ChaSen output:
http://www.csse.monash.edu.au/~jwb/papi … mples.html
The most obvious and easy one is that whereever ChaSen gives you two words you try looking them up in the dictionary as a single word, and if there's a match use that instead. (This counteracts its tendency to split compounds down further than is ideal.)

It ought not to be too hard either to just spot where it outputs lines for masu-form, past tense and the other obvious ones, and glue the relevant syllables back onto the preceding verb. (I had a quick scan of the docs but couldn't immediately spot anything to do this automatically.)

I note that you can tweak its output format with -F (help in -Fh), which is probably a good idea for this kind of automated usage.

Reply #49 - 2008 March 06, 2:37 pm
ファブリス Administrator
From: Belgium Registered: 2006-06-14 Posts: 4021 Website

I've finally come to my senses regarding the necessary refactoring of the site's code and decided to use an existing php framework, instead of rewriting my own (I hear a big "Well DUH!" in the crowd wink).

I'm going to use symfony. I was actually reading symfony's documentation as inspiration for writing my own simplified framework over the last two weeks or so.

I've looked at several other frameworks like CakePHP, CodeIgnition and Zend Framework's MVC implementation. Except perhaps Zend Framework, the other frameworks I saw seemed to be really excellent into their own domains, but not truly all purpose and fully configurable as symfony appears to be.

I still have some reservations with symfony's ORM : "Propel" and "Criterion" seem really burdensome for accessing the database. Speed is also a concern, but the very flexible caching in symfony should more than make up for it.

Using a framework will also make it more likely in the future that I work with another developer, something that today is far too impractical without a solid code base, conventions and documentation.

Another thing I discovered last month is the excellent Aptana IDE. The built-in FTP connections and synchronization is already saving me lots of time.

It could take several weeks to convert the site to symfony, but I think it will be well worth it.

elmimmo New member
From: Tokyo Registered: 2007-09-24 Posts: 7 Website

ファブリス wrote:

It's possible with Google Gears for example. I didn't feel it was worth the effort to do this for RtK1.

My biggest gripe with RtK1 is that I need an internet connection, hence, cannot use it on the go while having lunch, in a cafe or something, meaning I can only really use it when I get back home, late, already dozing... and so kanji start to pile up :-/

Add my ballot to the "Yey for Google Gears ed.!" box. (^_-)

(sorry for the off-topic post, but since a ref. to Google Gears was made...)

Last edited by elmimmo (2008 November 17, 9:06 pm)