Joined: Oct 2007
Posts: 4,582
Thanks:
0
By the way, what do you think would be the easiest way to import those swac files into Anki? Can't decide what to find/replace on (the limit of my scripting abilities). Nevermind I see, the index.xml thingy has text/audio indicators.
Edited: 2010-04-07, 1:56 am
Joined: Mar 2010
Posts: 65
Thanks:
0
Wow nest0r, you were going to do all 5000 by hand? That's true dedication. Where is the audio from? This is not from the pod by the way, it's from the totally innocent book that you found lying around first off. I haven't read the other thread, but if it involves audio I'll take a look now, thanks!
Joined: Mar 2010
Posts: 65
Thanks:
0
Oh, I think I'm confused. I assumed there was some kind of freely available TTS plugin you were talking about.
If we are indeed thinking of the same (harrumph! dictionary), then this should suit your needs right? It's the entire thing, 5000 in total. I think you can just import this into anki now, at least I did... What's the second step? If there's something I'm missing let me know. For example I ignored all the thematic lists (animals etc.) but it's easy enough to do with my script.
Joined: Oct 2007
Posts: 4,582
Thanks:
0
It actually only takes a few minutes, relatively speaking, to do big batches (i.e. from freq. list to Anki). But the initial stage of getting the words into the proper format was the dullest, involving lots of copy/pasting. ;p
And I get the audio from the same place as before, I think I underestimated how much stuff there is. I mean not for every word, but for most of them so far with less duplicates than I thought, though it remains to be seen if this holds up for the least frequent words. As long as I end up with ~6000 sentences I'll be happy. It's just a neurosis thing at this point. ;p
Edit: Oh just read the above comment. I'm glad you ignored those themed things, I have been as well since they don't fit into the frequency numbers. The other steps involve grabbing sentences and audio and sticking into Anki. I discarded the other information for individual words, because I just use StarDict when learning words via the dictionaries one might find where I referenced in the recent Chinese thread.
I'll also probably make a deck from those swac thingies once I'm done. The .xml format is easy enough to parse for Anki. Actually I might not bother since the way they have them set up is good enough as a reference rather than for study... or I'll do something in between.
Edit 2: Maybe someone will post the process they used to make a certain deck if they happen to make another big update which might come shortly but I wouldn't know because that someone is definitely not me, as that would be wrong. Then other someones could use that process to make decks for Chinese, perhaps.
Edited: 2010-04-08, 11:48 pm
Joined: Mar 2010
Posts: 65
Thanks:
0
Well my next two projects (again, when i have time) is to write scripts to pull all audio based on the list I just created, from either the pod people (sentences) and/or swac (words) and automatically remove all duplicates. By pulling audio, I mean just collecting the links so you can import it as remote media in anki, which would do all the downloading for you. However if someone is really already far enough along with this and is feels a kind of zen calm in copying and pasting, then I'd hate to spoil that and duplicate work. :-)
Joined: Mar 2010
Posts: 65
Thanks:
0
Well I'm not sure if I understood either, but wow, brilliant! In that case if someone needs help in any way let a certain person know.
Joined: Oct 2007
Posts: 4,582
Thanks:
0
BTW @Nemotoad, there IS a TTS plugin for Anki for Windows (see Shared Plugins). I just got it working. Nice to have if you want some quick and easy supplementary audio for cards where it's not available. Just hit a key to tell it to play Q or A side, you can specify voices, and customize what special characters to ignore (like one shared deck I acquired has the reading in brackets next to the kanji form of the word, so to keep her (it) from repeating the word twice in a row, you just block opening and closing brackets in the plugin file...
Edit: Loquendo's Bernard and Juliette are great French TTS voices (well as much as any TTS is great)... you can find them 'around'.
Edited: 2010-04-09, 3:17 pm
Joined: Jan 2011
Posts: 10
Thanks:
0
Since Segapuptoad is no longer with us. Would someone be so kind as to direct me on my spiritual jorney to french illumination?
Joined: Sep 2012
Posts: 63
Thanks:
0
Oh well :/
I'm interested ^^, I've been following this thread for a while now.
jobhuntingman, can you please link us to it somewhere else proper ? (if not, can you e-mail me the link please ?)