Okay, I had a couple of hours free, so I dumped Fedora 16 on a box I'm not using, spent another couple of hours getting everything to run, and got everything installed properly to the point where the plugin works!
It looks nice in Fedora. The problem I'm having with my Fedora install is that it's slow as mud, and I'm running off of an SSD. (Just updated the firmware, too. No help.) Ubuntu was even worse, so I don't know what's up with that box.
But the plugin works. I used it a bit, and it's nice. Here's my feedback on the interface:
Generally, everything needs to be more streamlined. In the final version, you'll want something that works similarly to YomiChan, where you can assign which fields will go where automatically in Preferences. Setting up a new model/deck just for this is a little tricky. I'd like to be able to dump sentences in my current deck as it is, and leave it to me to figure out in which fields I want to stick the example sentences, translations, and definitions. I may want some of the stuff in the "word" deck in every sentence card, all crammed in one field, and I may want to forgo the whole idea of a "word" deck. (But I know it's easier to import that way, so I know it's important for alpha.)
I suppose we could just export the deck to .csv, then import the bits we want into our regular decks, but that adds extra work.
It might be easier to have a way to read in a text file list of words and kana and generate the appropriate fields. (So again, something akin to something else that YomiChan does-- it reads a txt file in Anki and lets you work on it, just in a slightly different manner.) That's kind of what it's doing now, it's just a matter of getting the text/kana from a txt file the user selects as opposed to a deck.
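To make the text-file idea concrete, here's a rough sketch of what reading a word list could look like. The file format is purely my assumption (one entry per line, word and kana separated by a tab, kana optional, lines starting with # ignored), and the names WordEntry and read_word_list are made up for illustration; none of this is taken from the plugin itself.

```python
import io
from typing import Iterator, NamedTuple, TextIO


class WordEntry(NamedTuple):
    """One line of the (assumed) word list: a word plus its optional kana."""
    word: str
    kana: str


def read_word_list(handle: TextIO) -> Iterator[WordEntry]:
    """Yield word/kana pairs from a tab-separated text file.

    Assumed format: "word<TAB>kana", one entry per line. Blank lines and
    lines starting with "#" are skipped; a missing kana becomes "".
    """
    for raw_line in handle:
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split("\t")
        word = parts[0].strip()
        kana = parts[1].strip() if len(parts) > 1 else ""
        yield WordEntry(word, kana)


# In-memory stand-in for a file the user would pick.
sample = io.StringIO("勉強\tべんきょう\n# comment\n\n食べる\tたべる\nねこ\n")
entries = list(read_word_list(sample))
```

With a real file you'd pass in open(path, encoding="utf-8") instead of the StringIO, and each entry would then go to the scraper the same way a card from the "word" deck does now.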
I think being able to use the fields you scrape as you like is important. So if I want 10 cards for 勉強, and I just want to study sample sentences, then I would want to have the scraper plunk the word, the word in kana, and the definition in the same place in all of my cards. In my main deck, I just have a "meaning" field where I dump the sentence translation (if needed), and any definitions I want. Again, this is something that should be configurable in preferences.
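Something like the following is what I have in mind for the configurable mapping. It's only a sketch: the field names ("Expression", "Meaning"), the scraped item names, and the FIELD_MAP preference are all hypothetical, not how the plugin stores anything today.

```python
from typing import Dict, List

# Hypothetical preference: note field name -> scraped items to put in it,
# in order. Several items can be crammed into a single field.
FIELD_MAP: Dict[str, List[str]] = {
    "Expression": ["sentence"],
    "Meaning": ["translation", "word", "kana", "definition"],
}


def build_note_fields(scraped: Dict[str, str],
                      field_map: Dict[str, List[str]],
                      separator: str = "<br>") -> Dict[str, str]:
    """Combine scraped items into note fields according to field_map.

    Items that are missing or empty are skipped, so leaving something out
    (a translation, say) doesn't leave a stray separator behind.
    """
    note = {}
    for field_name, sources in field_map.items():
        values = [scraped[key] for key in sources if scraped.get(key)]
        note[field_name] = separator.join(values)
    return note


scraped = {
    "word": "勉強",
    "kana": "べんきょう",
    "definition": "study",
    "sentence": "毎日日本語を勉強します。",
    "translation": "",  # empty, so it is skipped
}
note = build_note_fields(scraped, FIELD_MAP)
```

The point of keeping the mapping as plain data is that the Preferences screen would only have to edit that one dictionary, and the same scraped results could be dropped into whatever deck layout someone already has.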
I thought of something else-- not everyone wants English translations. If there's a way to turn off English translation selection in the preferences for people who just want the sentences (for the folks who are big on monolingual), then that would be an attractive feature. I'm not picky about that, but I know some people are. They could then run the sentence cards through the sentence glosser plugin to get definitions in JP if they want.
Also, a button to start the plugin from the Anki main screen would be great. Or something along with Ctrl-J.
Ah, I know what would also be great-- an input box in the plugin window, so you can create cards on the fly. Say, for example, I'm reading a book in Yomichan, and I come across a word I *really* want to learn. I want to be able to copy the word, open DictScrape, pop it in the input window, and generate 10 sentence cards of it. (Then maybe flag some of them to run through the Sentence Glosser to pick up all of the definitions of some of the trickier sentences.)
Anyway, these are some random thoughts for now, so you get feedback ASAP. Even as an alpha, it's great. I think the most important thing is to get it so it will run on Win/Mac so you can get more testers.
EDIT: Also, periods. I like JP-style periods in my sentences. DictScrape seems to cut them off for some reason. (Maybe something else for Preferences?)
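I don't know where the period actually gets dropped, so this is just a guess at what a preference for it could look like. The function name normalize_ending and the keep_period flag are made up, and it assumes the scraped sentence arrives as a plain string.

```python
JP_PERIOD = "。"
# Endings that already close a sentence, so no period should be added.
SENTENCE_ENDINGS = (JP_PERIOD, "？", "！", "?", "!")


def normalize_ending(sentence: str, keep_period: bool = True) -> str:
    """Keep (or restore) the final JP-style period, or strip it.

    keep_period=True:  add "。" if the sentence has no closing punctuation.
    keep_period=False: remove any trailing "。".
    """
    stripped = sentence.strip()
    if not keep_period:
        return stripped.rstrip(JP_PERIOD)
    if stripped and not stripped.endswith(SENTENCE_ENDINGS):
        return stripped + JP_PERIOD
    return stripped


with_period = normalize_ending("毎日勉強します")
without_period = normalize_ending("毎日勉強します。", keep_period=False)
```

Adding a period back is a workaround at best, since it would also tack one onto sentence fragments; not stripping it in the first place would be cleaner if that turns out to be possible.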
EDIT2: Figured out why my box was running so slow-- it's a Celeron box I've been meaning to turn into a plain vanilla fileserver.
EDIT 3: Got it running on a faster box in Ubuntu 12.04LTS. The instructions on this page:
http://tightwadtechnica.com/?page_id=4163
were *really* useful in getting lxml et al. running. (Use apt-get instead of aptitude, but it works with aptitude as well, if you install aptitude.)
I'll do some more testing over the next few days, and see what I can break.
Edited: 2012-07-12, 2:09 pm