Incorrect stroke counts

Reply #1 - 2012 June 01, 11:30 pm
Sebastian
Member
Registered: 2008-09-09
Posts: 520

Let's share here any incorrect stroke count we happen to find on this site.

I'll start:

2756 鋤 Plow: It's 15 strokes, but the site says 14.

Reply #2 - 2012 June 07, 8:58 am
ファブリス
Administrator
From: Belgium
Registered: 2006-06-14
Posts: 3699

OK, I've put it on the todo list.

Reply #3 - 2012 June 07, 10:31 am
ファブリス
Administrator
From: Belgium
Registered: 2006-06-14
Posts: 3699

Okay, here's what happens: my Perl script uses manually edited data that was provided by the community for all of the RTK 1 data, except for the onyomi, which are sourced from KANJIDIC2.xml.

For the RTK3 kanji (#2042 - #3007), I only had a list of keywords that someone kindly provided, so the stroke counts are also sourced from KANJIDIC.

However, it appears many entries have multiple stroke counts. Here's what it looks like for #2756 "plow":

Code:

<stroke_count>15</stroke_count>
<stroke_count>14</stroke_count>

I changed the script to use only the first stroke count, which I'll assume is the most common one. I compared the new data with the previous version using opendiff and found 51 differences, all of which appear to be RTK3 kanji. I verified about ten of them, and they all seem to have needed a correction.
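For anyone who wants to reproduce the "first stroke count only" rule, here is a minimal sketch in Python. It is not the Perl script mentioned above, which isn't shown in this thread. The sample XML, the function name first_stroke_counts, and the surrounding <character>/<literal>/<misc> layout are assumptions based on how KANJIDIC2 entries are generally structured; only the two <stroke_count> lines come from this post. As far as I know, the KANJIDIC2 documentation treats the first count as the accepted one and any later ones as common miscounts, which would fit the results of the spot check.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down sample; the real KANJIDIC2 entries carry many more fields.
SAMPLE = """<kanjidic2>
  <character>
    <literal>鋤</literal>
    <misc>
      <stroke_count>15</stroke_count>
      <stroke_count>14</stroke_count>
    </misc>
  </character>
</kanjidic2>"""


def first_stroke_counts(xml_text):
    """Map each kanji to its first listed stroke count."""
    root = ET.fromstring(xml_text)
    counts = {}
    for character in root.iter("character"):
        literal = character.findtext("literal")
        # find() returns the first match in document order,
        # so any additional <stroke_count> elements are ignored.
        first = character.find("misc/stroke_count")
        if literal is not None and first is not None:
            counts[literal] = int(first.text)
    return counts


print(first_stroke_counts(SAMPLE))
```

For the full KANJIDIC2.xml you would parse the file from disk rather than from a string, but the selection logic stays the same.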

I'm going to update the site with the new data and assume the rest is correct, so I can move on to other todos.

However, I've posted the files on Dropbox (table_kanjis_update_2012_06.zip, 106KB), so if anyone has some free time and is willing to go through them, you can compare the two utf8 files with a diff program and check whether the new stroke counts are correct.
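If a plain diff is too noisy, the comparison can also be narrowed to stroke counts only. The sketch below is in Python and makes no claim about the layout of the utf8 files in the zip, since that isn't described here. It works on two plain kanji-to-count dictionaries, and loading them from the files is left out. The function name changed_stroke_counts and the sample data are hypothetical; the 鋤 values are the ones from this thread.

```python
def changed_stroke_counts(old, new):
    """Return {kanji: (old_count, new_count)} for kanji whose count differs."""
    return {
        kanji: (old[kanji], new[kanji])
        for kanji in old.keys() & new.keys()  # only kanji present in both versions
        if old[kanji] != new[kanji]
    }


# Illustrative data only: the previous and updated counts for two kanji.
old = {"鋤": 14, "一": 1}
new = {"鋤": 15, "一": 1}

for kanji, (before, after) in sorted(changed_stroke_counts(old, new).items()):
    print(f"{kanji}: {before} -> {after}")
```

Each reported pair can then be checked by hand against a dictionary or a stroke order diagram.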

Edit (17:46): The data is now updated on the live site.
