I use mahou a lot, but never for sentence-mining. It's convenient in that it has a great cross-reference for the kanji, including its frame number in Heisig and a bunch of other dictionaries and learning resources (KO2001, etc.). Sometimes I'll use the example sentences for reference when I'm trying to write something, but because they are translated from English by American college students, I do not use them as examples of 'real' Japanese for my SRS. (If you want sentences galore, Tanuki seems to be a better bet, but does not provide English translation.)
Here is an excerpt description of the origin of these sentences:
"From inspection, it appears that many of the sentence pairs have been derived from textbooks, e.g. books used by Japanese students of English."
For the full description, see the link below:
http://www.csse.monash.edu.au/~jwb/tanakacorpus.html
You are correct that it has been modified for use in WWJDIC, but note that the bar for removal of 'bad' sentences is set pretty low - only grammatically "wrong" sentences have been removed, with no regard to whether they are "natural" Japanese sentences. Generally they are high on pronouns and read like something out of an old textbook:
"As described below, the Tanaka Corpus has been edited and adapted to be used within the WWWJDIC dictionary server as a set of example sentences associated with words in the dictionary. In order to adapt the corpus for this role, it has been edited as follows:
1. an initial regularization of the punctuation of the Japanese and English sentences was carried out, then duplicate pairs were removed, reducing the original file from 210,000 pairs to 180,000 pairs;
2. sentences which differed only by differences in orthography (e.g. kana/kanji usage, okurigana differences), numbers, proper names, minor grammatical points such as plain/polite verb usage, etc. were reduced to single representative examples;
3. sentences where the Japanese consisted of a short Japanese statement in kana were removed;
4. sentences with spelling errors, kana-kanji conversion errors, etc. were corrected;
5. sentences where the English version did not match the Japanese were edited to make the two versions agree;
6. where the sentences contain gender-specific language or words, the English portion has been tagged with [M] or [F] respectively;
7. sentences where the Japanese was too garbled to derive a valid English equivalent were removed."