for proofreading
* a sentence which is not set as belonging to someone is not to be trusted (though you can still ask whether the sentence is correct, or, if it is correct, you can "adopt" it or post a comment saying it is "ok"; if everyone does the same, the proofreading will be really quick)
* you can still check the owner's profile to see whether they are a native speaker or not; most of the time, active contributors have set in their profile their level in every language they contribute in
in the near future, we're going to add a special "ok" tag, to make proofreading more reliable and to have a systematic way to know whether a sentence is correct
for downloading, every week we make an export of the database; as the database is fed by Internet users, it's only normal that every Internet user can reuse it for whatever they want
http://tatoeba.org/fre/download_tatoeba_..._sentences
we don't have a per-language file, but basically you can filter on the "cmn" lang (it's the international code for Mandarin Chinese, as there are also Cantonese, Shanghainese and Teochew); maybe if a lot of people are interested in the download feature, we will consider making a file per language directly available
for bandwidth, we hope you're all grown-ups, guys; the project is 100% free, you can even download everything and reuse it to make money. we're nice (no? :p), so be nice: don't download it several times a week and everything will be fine
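to give an idea of what filtering on "cmn" could look like, here is a minimal Python sketch. it assumes the export is a tab-separated text file with one sentence per line and three columns (id, language code, text); that layout, the function names and the file names are assumptions made for illustration, so check them against the file you actually downloaded

```python
from typing import Iterable, Iterator


def filter_lines_by_lang(lines: Iterable[str], lang: str = "cmn") -> Iterator[str]:
    """Yield only the lines whose second tab-separated field equals `lang`.

    Assumed layout (not confirmed by the post): id <TAB> lang <TAB> text.
    """
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        # Skip malformed lines instead of failing on them.
        if len(fields) >= 3 and fields[1] == lang:
            yield line


def filter_file(src_path: str, dst_path: str, lang: str = "cmn") -> int:
    """Copy the matching lines from src_path to dst_path; return how many were kept."""
    kept = 0
    with open(src_path, encoding="utf-8") as src, \
         open(dst_path, "w", encoding="utf-8") as dst:
        for line in filter_lines_by_lang(src, lang):
            dst.write(line)
            kept += 1
    return kept


if __name__ == "__main__":
    # Hypothetical file names; replace them with the export you downloaded.
    count = filter_file("sentences.csv", "sentences_cmn.csv", "cmn")
    print(f"kept {count} sentences")
```

the file is read line by line, so even a big export does not have to fit in memory, and you only need to download it once to extract as many languages as you like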

(by the way, if you want to make a BitTorrent of these files, feel free to do it; it will save us bandwidth and ease the spread of our project. as for any "reuse", we just need you to say where the files/sentences come from)
* search audio sentences only
I will tell you a secret, I'm working on this right now ;-), but to give you an idea, here are the current audio stats
~1800 in Mandarin (~20%)
~200 in Shanghainese (~70%)
~1500 in French (~5%)
~100 in Dutch (~3%)
it's not a really big amount, but all these recordings are high quality, and we're on the way to recording more
for downloading audio, yes you can (I've said it, everything is free and reusable): right-click on the button (save as) and there you go
but I know it's not convenient at all. we're doing the recording with the excellent shtooka project:
http://swac-collections.org/download.php
I don't know if you know, but there you have all the HSK words recorded, and on anki there is a "swac/shtooka" plugin that links your words directly to the corresponding shtooka audio. life is nice, huh? :p
if you want to listen to them online, for example:
http://swac-collections.org/overview.php?lang=cmn
so basically we will give them our collection of sentence audio (they already have the Shanghainese one), per language; this way, with the anki plugin, you will be able to get them into your anki easily
if you have other questions, I'm here
PS: if you or someone else makes a torrent, can you tell me?

(that also goes for any reuse of our data; it's not mandatory, but you know, it's always nice to see that other people find what you do useful)
Edited: 2010-06-30, 3:59 am