
Extracting vocabulary

#1
It seems I cannot access the online Anki deck repository with Anki 1.2.1, so I will have to create my own SAT vocabulary deck. I'm not preparing for the test itself; I just want to polish up my English a little, as I think it would also be beneficial for my Japanese learning.

I found two websites with extensive lists of English vocab I have never seen before. I want to extract the words somehow and create a *.tsv Excel sheet that I can import comfortably into Anki. Entering everything manually would cost a lot of time :/

I have no idea how to do that, though. Perhaps someone could teach me how?

Anyway, here you go.

http://is.gd/ej4Mjb
http://is.gd/o0ylen

If you find/have other resources at hand, please let me know.
#2
Tori-kun Wrote: If you find/have other resources at hand, please let me know.
I recently bought a Collins Cobuild English Dictionary (5th edition), which I feel is excellent for SRS purposes. It's E-E, and the definitions are great because they are full sentences that describe the meaning clearly and concisely. Here's an example:

concise = Something that is concise says everything that is necessary without using any unnecessary words.

I like to add an example sentence (also included in the book) to Anki with the word underlined, with the definition on the back of the card.

Of course, I mainly use the electronic version, which makes adding the sentences to Anki easier. It's still manual labor, though. Note that the 4th and 6th edition e-dictionaries are not as good, so the 5th edition is your best bet.
#3
Yay, a fellow!

I have the Oxford Advanced Learner's Dictionary (8th edition) on CD. Right now, I write down all the words I encounter while reading books in English (currently "God Is Not Great" by Christopher Hitchens and "Fifty Shades of Grey" lol) and while learning Japanese vocabulary.

We could create a Google spreadsheet, right? As far as I know, it's possible to import those into Anki (I'm still on Anki 1.2.1, just so you know).

I also want to add example sentences to the vocabulary. However, I kind of need my German translation on the back of the card haha. Some learning habit, I suppose.

If you're up for the spreadsheet idea, let me know!


[Edit]

I wonder how to convert this dictionary into EPWING. It could be usable with E2A then...

Also, there is http://dict.cc, which seems to be free and open source. However, I have no idea whatsoever how to utilise these databases T-T
Edited: 2013-02-09, 11:42 am
#4
OK, I started a spreadsheet. I hope this will be beneficial for all those wanting to polish up their English skills.

Just start adding some rare/erudite terms! :O
#5
Tori-kun Wrote: Seems like I cannot access the online Anki deck repository with Anki 1.2.1, so I will have to create my own SAT vocabulary deck.
You could easily install Anki 1.2 and Anki 2 side-by-side. Several of us did this for almost a year during the alpha and beta phases.
#6
I'm fine with the old Anki.

Gosh, if I even knew what this "extraction" process is called, I would google it and learn how to code something up lol
#7
Though I'm not going to diss my Oxford dictionary, I prefer dictionary.com; it rarely disappoints. It also has a word of the day, random quizzes, and "did you know this word" :)
#8
Tori-kun Wrote: I found two websites with extensive lists of English vocab I have never seen before. I want to extract the words somehow and create a *.tsv Excel sheet that I can import comfortably into Anki.
Did you miss those export buttons? Or if you just select the text in the tables, at least Safari and Chrome on OS X copy it as TSV.
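
Once the copied table is pasted into a file, a quick clean-up pass can get it ready for Anki's import. This is only a minimal sketch: it assumes the paste landed in a hypothetical file pasted.tsv with the word in the first column and the definition in the second, which may not match how those sites lay out their tables. The sample rows are made up.

```shell
# Work in a scratch directory so no real files are overwritten.
cd "$(mktemp -d)"

# Made-up sample standing in for text copied from a web table.
printf 'abate\tto lessen\textra\n\nabate\tto lessen\textra\nzeal \tgreat enthusiasm\n' > pasted.tsv

# Trim spaces around the word, skip rows without a word, drop repeated
# words, and keep only the first two tab-separated columns.
awk -F'\t' -v OFS='\t' '
  { gsub(/^ +| +$/, "", $1) }
  $1 != "" && !seen[$1]++ { print $1, $2 }
' pasted.tsv > deck.tsv

cat deck.tsv
```

The resulting deck.tsv has one "word, tab, definition" row per word, which is the shape a tab-separated import expects.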

I just made another text file for English words to review. I exported the GRE Words List and Vocabulary 4000 (English) Anki decks as TSV and selected words to review after filtering the lists by word frequency in subtitles:

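# Words ranked 50,000 to 200,000 in the subtitle frequency list (first space-separated field)
freq=$(sed -n '50000,200000p' frequency_subs.txt | cut -d' ' -f1)
# Print each line of words.txt whose text before the first colon is exactly one of those words
while IFS= read -r l; do grep -Fxq -- "${l%%:*}" <<< "$freq" && printf '%s\n' "$l"; done < words.txt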
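
For long lists, the same kind of filter can also be sketched as a single awk pass instead of one grep per word. It assumes the formats the commands above imply: frequency_subs.txt has one space-separated entry per line with the word first and the most frequent words at the top, and each line of words.txt starts with the word followed by a colon. The tiny sample files and the rank range 2 to 3 are made up so that the sketch runs as is.

```shell
# Work in a scratch directory so no real files are overwritten.
cd "$(mktemp -d)"

# Made-up sample data in the assumed formats.
printf 'the 900\nhouse 500\nabate 20\nzeal 10\n' > frequency_subs.txt
printf 'abate: to lessen\nhouse: a building\nthe: article\nquixotic: idealistic\n' > words.txt

# First file: remember the words whose rank (line number) is within lo..hi.
# Second file: print lines whose text before the first colon is such a word.
awk -v lo=2 -v hi=3 '
  NR == FNR { if (FNR >= lo && FNR <= hi) keep[$1]; next }
  { w = $0; sub(/:.*/, "", w) }
  w in keep
' frequency_subs.txt words.txt > review.txt

cat review.txt
```

With the real files, lo=50000 and hi=200000 would correspond to the range used above.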
Edited: 2013-05-24, 9:14 pm
#9
Oh yes, I missed them! Thanks for pointing that out! Is there a way to copy those into the spreadsheet, replacing existing entries?
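
Outside the spreadsheet, one possible way to merge an exported list into an existing one so that new rows replace old ones is a small awk pass. A rough sketch, assuming both lists are TSV files keyed by the word in the first column; old.tsv and new.tsv are hypothetical names with made-up rows:

```shell
# Work in a scratch directory so no real files are overwritten.
cd "$(mktemp -d)"

# Made-up sample data: "abate" exists in both lists.
printf 'abate\told definition\nzeal\tgreat enthusiasm\n' > old.tsv
printf 'abate\tto lessen\nterse\tbrief\n' > new.tsv

# Read new.tsv first so its rows win, then keep rows from old.tsv
# only for words not seen yet. Sort for a stable order.
awk -F'\t' '!seen[$1]++' new.tsv old.tsv | LC_ALL=C sort > merged.tsv

cat merged.tsv
```

The merged file could then be pasted or imported back into the sheet in one go.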

Here's the spreadsheet btw: http://is.gd/t4vc4M

[Edit] I struggle with Excel -- does anyone know how to keep the first column fixed and unaffected by "Sort A->Z"? I don't want to pull it up every time :/
Edited: 2013-02-09, 2:44 pm