Back

New toys for Japanese text analysis

#7
vix86 Wrote:I'm still trying to figure out what a condcordancer is.

Also, aren't the dependency-parsers pretty much doing exactly what MeCab does? The thing used in the Japanese support plugin to help split sentences into parts.
http://en.wikipedia.org/wiki/Concordancer

For the second bit, no, it's showing you the dependency relations of words/strings, but it does first use morphological analyzers like Mecab (e.g. Juman or Chasen for KNP and CaboCha, respectively): For a more detailed explanation of KNP: http://citeseerx.ist.psu.edu/viewdoc/dow...1&type=pdf

See also: http://en.wikipedia.org/wiki/Dependency_grammar

Edit: More: http://jones.ling.indiana.edu/~mdickinso...es-2x3.pdf

AntConc is really easy to use (for Japanese, make sure to go into Global→Language settings and select UTF-8 or Shift_JIS, &c.), so you could play with that to get a feel for what these tools do.
Edited: 2011-05-25, 1:24 pm
Reply

Messages In This Thread