![]() |
|
Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - Printable Version +- kanji koohii FORUM (http://forum.koohii.com) +-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html) +--- Forum: Learning resources (http://forum.koohii.com/forum-9.html) +--- Thread: Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards (/thread-5802.html) |
Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-23 vileru Wrote:The capabilities and interface of the program are beyond my expectations. We're lucky to have you here, cb. However, one thing I think the "Getting Started" page can benefit from is more clarity on what type of files can be imported. I originally tried importing an HTML file and then a .doc file to no avail, but then everything imported perfectly once I tried a .txt file.Thanks for the feedback. I'll try to find a way to clarify this in a future version. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-23 Tori-kun Wrote:Is there a chance to make Epwing2Anki format the .tsv output file conveniently like this?Maybe I can put it in the available fields list so you can choose how to position it. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-23 rich_f Wrote:Also, is it possible to add a flag for "Full" searches, a la EBWin? I think that may be contributing to some of my "misses," -- for example, 恥をかく doesn't show up in Kenkyuusha 5th edition when I run the program normally. BUT it's in there, just not as a headword. It's found under 恥, with its own definition. Maybe in case a definition or sentences don't show up using the regular mode, try with Full mode? (Because using Full mode on everything is a)overkill and b)slows everything down considerably.)Sounds like a good idea for some future version. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-23 rich_f Wrote:Oh, I think I figured out at least *part* of my problem with PDIC. The ini file that comes with Eijiro has to be installed in the main PDIC program directory with the other Eijiro PDIC files-- so just copy everything in the PDIC-UNI directory and paste it in the program's directory.When you get it figured out, I'll see what I can do. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - Sebastian - 2012-07-23 Would it be possible to make Epwing2Anki recognize not-ascii characters in paths? Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-07-24 One thing I noticed: In order to get fewer words left out of the final list, use all of the dictionaries you have, even if you won't use their output. Using Tatoeba for sentences reduced my "Can't find a sentence for this" number from 40 to 17 out of 332 words. I used the "mark source" feature to flag all of the Tatoeba stuff, and I can just nuke it from the list as I feel like it, and my placeholders are still there for when I want to add sentences from DictScrape and/or ALC. If there was just a way to take care of those last 17 words, as well as the 7 that don't have any definitions at all, so that they at least show up in the output list in the order I put them, then that would be perfect. So here's a Feature Request for something else to add to the preferences: Something like, "Include all entries from input list in output list in order, even if nothing is found." It would make it easier to troubleshoot why those entries don't get found, and would probably be a lot easier to implement than Full Search. The current feature to append the input list doesn't quite work here. It only appends the input list if something is found. If nothing is found, then nothing is appended, so then I have to go back and compare 1500 or so entries in the output to the original 332-word input list... and that's a hassle. Another Feature Request would be a button to save the log at the end that gives the final report of what was found and what wasn't, so I have something to reference to without having to create a new doc and copy/paste. But it's not a big priority. Just something that would be nice. I'd rather have the other thing. Re Eijiro and EPWING: I stopped banging my head against the wall trying to convert the current version of Eijiro to a useable EPWING. I need to get a better understanding of how EBStudio creates EPWING dictionaries, and I need a better idea of how to create the proper kind of index for Eijiro using grep and those kinds of tools. (It would be nice to have hiragana gloss for the entries, for example... but that's just not feasible right now.) I'll just keep using ALC. It's good enough for the job. Eijiro is really cool as a resource, but I don't have a week to figure this out. It goes on the someday/maybe list. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-24 Sebastian Wrote:Would it be possible to make Epwing2Anki recognize not-ascii characters in paths?I can certainly try. rich_f Wrote:One thing I noticed: In order to get fewer words left out of the final list, use all of the dictionaries you have, even if you won't use their output. Using Tatoeba for sentences reduced my "Can't find a sentence for this" number from 40 to 17 out of 332 words. I used the "mark source" feature to flag all of the Tatoeba stuff, and I can just nuke it from the list as I feel like it, and my placeholders are still there for when I want to add sentences from DictScrape and/or ALC.For the 17 that don't have example sentences, you can just copy them from the results dialog, paste them into a new word list, and run Epwing2Anki again without the "Create a separate card for each example sentence" option checked. But yeah, I can probably add your suggested feature. rich_f Wrote:Another Feature Request would be a button to save the log at the end that gives the final report of what was found and what wasn't, so I have something to reference to without having to create a new doc and copy/paste. But it's not a big priority. Just something that would be nice. I'd rather have the other thing.Sounds easy enough to do. Until then, you can right-click on the Result dialog, select View Source, and then do File -> Save. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-07-25 cb4960 Wrote:For the 17 that don't have example sentences, you can just copy them from the results dialog, paste them into a new word list, and run Epwing2Anki again without the "Create a separate card for each example sentence" option checked. But yeah, I can probably add your suggested feature.Yeah, but it's the same problem-- they're out of order, and I'd have to find them in the list to put them back in. I could also just copy them from the original list, but either way it's time-consuming when you have a large list. Thanks for looking into adding it as a feature. cb4960 Wrote:Until then, you can right-click on the Result dialog, select View Source, and then do File -> Save.Doh. Didn't know I could do that. That's useful enough for now. Thanks for the info. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-28 Hello, I have just released version 1.3 of Epwing2Anki. Download Epwing2Anki v1.3 via SourceForge What Changed? ● Added support for the 『研究社 新英和・和英中辞典』 EPWING dictionary. (Thanks kazeatari!). ● Added the "Add placeholders in import file for words that were not found" option to the Fine-Tune Options dialog. Also, when the "Create a separate card for each example sentence" option on the Setup Card Layout page is checked, a placeholder will be created for entries that don't have example sentences. (Thanks rich_f!). ● Added the "Prepend short name of source dictionary to definition" option to the Fine-Tune Options dialog. You can pick from 1) No, 2) Yes, or 3) Yes, if dictionary is not the primary dictionary. ● Added the "Append short name of source dictionary to examples" option to the Fine-Tune Options dialog. You can pick from 1) No, 2) Yes, or 3) Yes, if dictionary is not the primary dictionary. ● In the Fine-Tune Options dialog "Specific to 『研究社 新和英大辞典 第5版』" was renamed to "Specific to J-E Dictionaries" ● Upgraded to eplkup 1.3 which has ability to follow links (needed for『研究社 新英和・和英中辞典』 example sentence support). ● Added punctuation to examples in the Choose Examples dialog. ● If the settting.e2a file is from a previous version of Epwing2Anki, ignore it and use the default settings for the current version instead. (This can happen if the user installed Epwing2Anki over a previous install). ● Show error message when non-ASCII characters are found in the path of an EPWING dictionary. ● Fixed case that allowed blank cards to be generated when a word was not found in the main dictionary list but was found in example dictionary list. cb4960 Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-07-29 You sir, rock. ![]() EDIT: Just ran my list of 332 words through, and it generated ~2200 sentences, and left the placeholders for all of the "huh?" words in place perfectly so I can deal with them as I need to. No problems at all. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-29 rich_f Wrote:You sir, rock.Great, thanks again for testing. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-07-29 Hello, I have just released version 1.4 of Epwing2Anki. Download Epwing2Anki v1.4 via SourceForge What Changed? ● In manual disambiguation mode, automatically choose an entry in current dictionary based on the reading and expression of the previous manual disambiguation for the same word in a previous dictionary. This automatic selection will only take place if a single entry matches the reading and expression. ● For dictionaries that are only being searched for their example sentences (meaning that they are disabled in the Setup Dictionaries page but enabled in the Setup Examples page), automatically remove entries that do not have example sentences. This prevents the case were the user is pointlessly asked to manually choose between entries that have no examples. These changes are an attempt to make manual disambiguation mode a quicker process when using a large number of dictionaries. cb4960 Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-07-29 The program unexpectedly quit on me when I was selecting a word list file to process. I was moving out of the Downloads directory to the Documents directory when it just suddenly exited. When I restarted it, it was right back where it was, but I thought I'd post the log just for info purposes: Quote:17:05:55.012: Epwing2Anki version: 1.4.0.0When I restarted it, it ran through my list (the same list I've been testing with all versions), and having the Chuujiten as well helped get down the number of example-less sentences from 17 to 14. EDIT: Also, not sure what you can do with this, but the definition of 素敵 shows up as an example sentence in both 研究社 dictionaries, but the definition field is blank. Another weird thing-- I set the 中辞典 as top priority, and sometimes I'll have blank fields with the prepended (中辞典) just sitting there, like in the definition field for 素敵, or for a blank example sentence for 否定文... it's the only entry for 否定文, so I'm assuming it's a placeholder? Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-08-04 Hello, I have just released version 1.5 of Epwing2Anki. Download Epwing2Anki v1.5 via SourceForge What Changed? ● When definition for highest priority dictionary is blank, use the definition in the next highest priority dictionary and so on. Old behavior was to just use definition from the highest priority dictionary even if it was blank. ● When the "Add placeholders in import file for words that were not found" is checked and "Append short name of source dictionary to examples" is set to Yes, do not append the source dictionary if the the word has no examples. (Thanks rich_f!) ● 『研究社 新英和・和英中辞典』: If an example sentence is in "kana[で|する|と|な|に|の]" format then consider it to be part of the definition. (kanji[で|する|と|な|に|の] is already supported). This covers the case that rich_f brought up with 素敵. ● Renamed "Remove entries with definitions that don't contain alpha characters (a-z or A-Z)" to "De-prioritize definitions that don't contain alpha characters (a-z or A-Z)". ● When "De-prioritize definitions that don't contain alpha characters (a-z or A-Z)" is checked, allow removed entries to be used for example sentences. And if no other definitions are found, use the non-alpha definition. ● 『研究社 新英和・和英中辞典』: For example sentences, fixed case where Japanese and English were not separated correctly. ● 『研究社 新和英大辞典 第5版』: Replaced ugly tilde with nicer tilde. cb4960 Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - kitakitsune - 2012-08-04 Can you add a back button to the program? Like if I choose the wrong examples can I just hit back instead of doing the whole set over again? Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-08-04 kitakitsune Wrote:Can you add a back button to the program?Good idea. I'll try to add it to the next release. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-08-13 I have another suggestion for the "options" list. Daijirin has an annoying habit of having sample sentences like this: ___開店。 (大辞林2) With the word you want in there omitted. (The word I want in there is 本日 (ほんじつ).) It would be neat if there was a way to tell the program that when it encounters this series of characters: ___ in a 大辞林 sample sentence, it should replace the ___ with the word from the list. Also, a general question-- cb, I seem to recall one of your programs could take a word list and compare it against another file. Is there one that does that, or am I just remembering something wrong? What I'm looking for is something where I can export my deck into a tsv/txt file, and compare it with a word list before wasting a lot of time finding out I'm adding words I've already added. It would be a lot faster than searching word by word, especially if the word list is ~2-4k long. (I'm building one of those right now for N1, based on a few vocab builder books, but I don't want to waste time learning words I already know!) Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - kazeatari - 2012-08-13 Thank you for the Kenkyuusha chuujiten ^___^ While I was reading v1.4 new features... Quote:In manual disambiguation mode, automatically choose an entry in current dictionary based on the reading and expression of the previous manual disambiguation for the same word in a previous dictionary. This automatic selection will only take place if a single entry matches the reading and expression....I thought that maybe you can make epwing2anki add a tag for words that are written in the same way (homographs) and a tag for words that have the same reading (homophones), because they are often the most hard to learn. So if I want I can select the tag on anki and focus only the words that are probably the most tricky. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - vileru - 2012-08-14 Out of all the community apps thus far, this one has been the most useful. I could've never efficiently made such great cards without EPwing2Anki. Thank you, cb. Behold the glory!
Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - rich_f - 2012-08-15 Actually, I thought of *one more feature* that would make this program perfect (for me). I have always had a "reading" field in my deck, where Anki automatically generates the reading in hiragana for sentences. (Using MeCab, I think? I forget.) What would be awesome would be a way to generate hiragana/furigana reading for the example sentences on the fly as well, to save the trouble of having to get Anki to do it for each entry. (Anki doesn't generate the reading field on import, unfortunately. At least not for me.) If there isn't a way to do include it in E2A, is there a program out there that already does this? (In batches, anyway.) I don't care if a few readings are a little off, I can always tweak that later. I just want the 90-95% that I know are low-hanging fruit to be taken care of. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - somstuff - 2012-08-15 ^Seconded :] Though it is great already Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-08-15 rich_f Wrote:I have another suggestion for the "options" list. Daijirin has an annoying habit of having sample sentences like this:I'll see what I can do. However, I'm only going to fill in the blanks with the dictionary form of the word. rich_f Wrote:Also, a general question-- cb, I seem to recall one of your programs could take a word list and compare it against another file. Is there one that does that, or am I just remembering something wrong? What I'm looking for is something where I can export my deck into a tsv/txt file, and compare it with a word list before wasting a lot of time finding out I'm adding words I've already added.I do have something that does that, but I don't think I released it yet. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-08-15 kazeatari Wrote:While I was reading v1.4 new features...This might be better suited for another program. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - cb4960 - 2012-08-15 rich_f Wrote:Actually, I thought of *one more feature* that would make this program perfect (for me). I have always had a "reading" field in my deck, where Anki automatically generates the reading in hiragana for sentences. (Using MeCab, I think? I forget.)That's doable. One of my unreleased programs does this so it will mostly be a copy-paste job. Epwing2Anki - Tool For Automatically Generating Anki Vocabulary Cards - Oniichan - 2012-08-15 rich_f Wrote:...Also, a general question-- cb, I seem to recall one of your programs could take a word list and compare it against another file. Is there one that does that, or am I just remembering something wrong? What I'm looking for is something where I can export my deck into a tsv/txt file, and compare it with a word list before wasting a lot of time finding out I'm adding words I've already added.Rich_f, here is a simple, though not elegant, solution in the meantime: - create a column in openoffice calc of all the words you've learned (column A) - then paste the list you want to filter into column B - next, paste this into the first cell in column C =MATCH($B1;$A$1:$A$3018;0) <--replacing 3018 with the final row number of your list in column A. Be sure to use the '$'s exactly as shown, they ensure that the next step will work as intended - copy and paste the content of C1 (or "drag" it) into the remaining cells in column C with an entry beside them in B - You should now have a list of numbers and '#NA's. - select column B and C and 'data>sort' them based on column C (ascending). This should place all of the #NA words on top. These are the ones you want as they have no matches in column A. The numbers refer to the matching entries in the range you selected (not necessarily to the row number if your list doesn't begin in row 1.) - copy and paste these words into a text document or delete the all other cells and save as a text file or csv (I'm not sure what format Epwing2anki needs). hth EDIT: In case you have a reason to keep your unknown list in a certain order (besides alphabetical/a-ka-sa-ta), create a "key" column in D by typing '1' into D1 and '2' into D2, then selecting both and dragging the corner of D2 downward until you reach the last row used by column B. Now, when you sort the results as in the second to last step above, select columns B,C AND D. Then, after deleting the matches in column B, re-sort columns B and D based on D (ascending). |