Here's a random thought: Being able to search Google with a 'sounds like' option (example: http://cogweb.ucla.edu/SearchTips.html)... Is this already implemented and I missed it? I also wonder at how it works. Integrate a database of known homophones? But then there's the 'fuzzy' stuff, like if you wanted to search Google for lyrics you heard but weren't sure of and could only roughly approximate.
The above example doesn't seem to work well, as if it simply uses wildcards, though it did give me the result 'share' when I searched for 'cher'.
Maybe they can use some form of captcha or misheard lyrics sites somehow to create a database of commonly misheard stuff? I threw in 'captcha' because of the 'recaptcha' OCR thing: http://recaptcha.net/learnmore.html
Edit: Hmm, how about OCRing Japanese vobsubs by incorporating kanji captcha (and kana) into sites like this somehow? As a game or for Serious Business™.
The above example doesn't seem to work well, as if it simply uses wildcards, though it did give me the result 'share' when I searched for 'cher'.
Maybe they can use some form of captcha or misheard lyrics sites somehow to create a database of commonly misheard stuff? I threw in 'captcha' because of the 'recaptcha' OCR thing: http://recaptcha.net/learnmore.html
Edit: Hmm, how about OCRing Japanese vobsubs by incorporating kanji captcha (and kana) into sites like this somehow? As a game or for Serious Business™.
Edited: 2010-04-04, 9:33 pm

instead of 泣、and the next character it gives you is 立.