kanji koohii FORUM
JapanesePod101 Audio Sentence Mining Project - Printable Version

+- kanji koohii FORUM (http://forum.koohii.com)
+-- Forum: Learning Japanese (http://forum.koohii.com/forum-4.html)
+--- Forum: Group study (http://forum.koohii.com/forum-15.html)
+--- Thread: JapanesePod101 Audio Sentence Mining Project (/thread-6734.html)

Pages: 1 2 3


JapanesePod101 Audio Sentence Mining Project - nalcomis - 2010-11-21

All,

A friend and I are starting a project to sentence mine the dialog portions of lessons from JapanesePod101 and importing them into a shared Anki media deck. We like the JPOD101 stuff because grammar topics are introduced through conversation. We expect to get a lot out of this but, however, extracting individual sentences from a single MP3 and making associated cards is extremely tedious. This is why we would like to collaborate with other people interested in doing the same.

I have already created the master deck, a "temp" deck used to create new cards, and a tracking spreadsheet in Google docs that we are using to track our progress. Packaged Anki deck updates are posted to a Google docs folder which I manually import into the master deck to study from. If a handful of people would like to do this and are willing to create ~20 or so cards a week, the benefits could greatly outweigh the tediousness of creating the cards. I will be honest, it isn't fun...

Anyone interested in participating must be a JapanesePod101 premium subscriber (let's keep this legal..Smile We are using Audacity (freeware audio ripping software) to extract the MP3 files downloaded from the JPOD101 site. I am also in the process of putting together a small tutorial that non-computer savvy people can follow as a guide to creating the cards.

Anyone interested?


JapanesePod101 Audio Sentence Mining Project - loonytik - 2010-11-21

I might be intrested in this. I use the "Real media catcher" program though to extract the mp3. Which rips the mp3 straight away if you click on the button to play the audio for the sentence. Bit less effort than using Audacity Tongue


JapanesePod101 Audio Sentence Mining Project - nalcomis - 2010-11-21

Cool. I just mentioned Audacity because I run Ubuntu, my buddy runs Mac, and my other buddy has a Winders box and Audacity runs on all of them.

All we are looking for is ~20 sentences a week from people... Only takes 30 min or so once you get your "system" down. If I get 10 people doing it, that is 200 new sentences a week. Not bad to supplement your other studies, right? My BIGGEST problem is audio comprehension. I read at the intermediate level, but my auditory recognition is at the prenatal stage. I get jealous when I hear 3-year-olds on the train speaking Japanese.. I think they are geniuses.

Shoot me a PM if you want in.

nalcomis@gmail.com


JapanesePod101 Audio Sentence Mining Project - bladethecoder - 2010-11-21

If you are recording the MP3 files in Audacity and then re-saving them, this will lose some of the quality due to compressing twice. (Like taking a screenshot of a JPEG image.) It would be better to download the original files.

Oh, I missed the part about "extracting individual sentences from a single MP3". So, downloading the dialog clips and then cutting them up?


JapanesePod101 Audio Sentence Mining Project - nalcomis - 2010-11-21

Bladethecoder,

We are just splicing the single file into multiple chunks - each a sentence long. I am actually dropping the bitrate down to 64Kbps to save space on dropbox. They difference in quality is negligible...especially with only spoken voice. They sound fine.


JapanesePod101 Audio Sentence Mining Project - rswarsaw - 2010-11-27

I'd like to get involved... count me in.


JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-27

nalcomis Wrote:extracting individual sentences from a single MP3 and making associated cards is extremely tedious. This is why we would like to collaborate with other people interested in doing the same.
Line-by-line audio is already there, so you can download it to your comp, without having to extract anything.
nalcomis Wrote:Anyone interested in participating must be a JapanesePod101 premium subscriber (let's keep this legal..Smile
It's legal for anyone, because they give you a seven-day free access, so you can download anything you want.


JapanesePod101 Audio Sentence Mining Project - truando - 2010-11-27

I am interested! I have all the tools needed. Let me know what I can do.

I suggest I do the beginner lessons season 1, Lessons 1-100. Or whatever.


JapanesePod101 Audio Sentence Mining Project - loonytik - 2010-11-27

buonaparte Wrote:
nalcomis Wrote:extracting individual sentences from a single MP3 and making associated cards is extremely tedious. This is why we would like to collaborate with other people interested in doing the same.
Line-by-line audio is already there, so you can download it to your comp, without having to extract anything.
nalcomis Wrote:Anyone interested in participating must be a JapanesePod101 premium subscriber (let's keep this legal..Smile
It's legal for anyone, because they give you a seven-day free access, so you can download anything you want.
If your a free member you only listen to audio that is like 1/2 weeks old and not older. so yea...

I don't see Line-by-line audio in Myfeeds(download thingy of Jpod)... You can download all kind of tracks but not line-by-line. Or I am missing something

The line-by-line audio thingy for premium users works with a kind of flash buttons. so you cant really right click and save it . I use a ripper program for that.



I will still pm you nalcomis. I am kind of busy atm.


JapanesePod101 Audio Sentence Mining Project - Blahah - 2010-11-27

Well I just signed up for a free account and am able to download the line-by-line for any lesson. They are all stored in this format:

[EDIT]
I have removed the instructions since buonaparte uploaded the files.


JapanesePod101 Audio Sentence Mining Project - loonytik - 2010-11-27

buonaparte Wrote:
loonytik Wrote:The line-by-line audio thingy for premium users works with a kind of flash buttons. so you cant really right click and save it . I use a ripper program for that.
Of course you cannot just right click and save. There are other ways.
All I can say is:
I have all the line-by-line audio from the site on my comp. When I read the first post in this thread, I was rather puzzled, that's why I let you know that it is not necessary to extract anything.
By the way, I'm not a premium user, have never been.
Well nice download you got there than. I found only simple ones(only with the audio lessons+pdf)(didnt look that hard either though).Thats why I thought heh?Nice hint^^Althought you wont get the newest ones with that. I paid about 70$(or maybe it was even less o.0) for a 2 year membership heh thats notting^^' Which is awesome bang to buck imo.


Blahah awesome found! ( I can even listen to those sounds without logging in LMAO)


JapanesePod101 Audio Sentence Mining Project - Blahah - 2010-11-28

buonaparte it has been public for a long time that you can sign up for a free one-week account and download the whole site - I don't see why they would change this as it's the same thing.

In any case, I'm not interested in scamming them. If they make it available free, I will tell people how to get it. If they don't want it to be free, they are within their rights to hide it. If they don't hide it well enough, I'll find it and tell people how to get it again.

Alternatively you could just torrent the collection that you've already downloaded and I could take the instructions out of the previous post?


JapanesePod101 Audio Sentence Mining Project - HerrPetersen - 2010-11-28

I have already done quiet a number of episodes for CHINESE-pod using the methods outlined in this thread:
http://forum.koohii.com/showthread.php?pid=101186#pid101186
For me this worked like a charm so maybe it would be worth considering for you too.

Since my interest lies in learning Chinese and not Japanese I am not joinging the project though.


JapanesePod101 Audio Sentence Mining Project - Blahah - 2010-11-28

I don't understand, what's wrong with torrenting? It's not illegal or immoral, they give it away for free. If I ever do download them all, I'll torrent them. Obviously it's your decision, but why only share it once? Kinda goes against the community spirit which we all benefit from. One person does a bit of work, gives it back to the community. It makes this a great place to be.

HerrPetersen that does seem a faster method than what nalcomis described


JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-28

By the way, the best introduction to Japanese grammar is to be found here:
http://www.gwu.edu/~eall/vjg/vjghomepage/vjghome.htm


JapanesePod101 Audio Sentence Mining Project - KanjiDevourer - 2010-11-28

buonaparte Wrote:The whole site is to be found at torrents.ru.
Great, over twenty-two gigs of material - that saves me a lot of work downloading! I guess the above instructions can be removed - just register with what is now called rutracker.org.

Quote:By the way, the best introduction to Japanese grammar is to be found here:
http://www.gwu.edu/~eall/vjg/vjghomepage/vjghome.htm
That seems to be a good source of information. Very nice of the George Washington University to have a public page.


JapanesePod101 Audio Sentence Mining Project - ファブリス - 2010-11-28

Admin:

A reminder these are the forum rules.

I will look at this closer coming week. I don't have time for this crap now. It's late, I'm tired.

But I can already tell you that I am tired of all these links (esp. buonaparte's links that don't have any descriptions or source information, btw your behaviour is suspicious to me, are you trying to bump your post count or just plain oblivious that there is an EDIT link with each of your posts?).

Starting from tomorrow, I want source of all links to downloadable materials. Better yet, I want links to normal pages where people can download and judge for themsevles what they are downloading, and how it was offered.



JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-29

Sorry, I did not mean to offend anyone or break any rules. I just shared what people wanted.
Sometimes it is difficult to give the link to the original site as it is no longer there.

All the stuff I posted here was or still is available for free, so anyone can or could download it for themself, it just takes time and some know-how.

I promise I won't be posting anything else, just to save you trouble.

Good luck,


JapanesePod101 Audio Sentence Mining Project - Blahah - 2010-11-29

I agree that perhaps buonaparte went a bit overboard with the links in general, but the links you deleted from this thread ファブリス were not illegal and DID have the source clearly stated - we've been talking about it the whole thread. Please put those links back up as they were really useful and would take a lot of time for someone else to compile.

[edit] perhaps if they were all combined in one post it would be less offensive?


JapanesePod101 Audio Sentence Mining Project - Teskal - 2010-11-29

The best think would be to ask JPOD101 what they think about putting this material online. In some countries, like Germany, it could be a problem to put such material online for everyone. These Files are only free available for 7 days. Without special Software, you cannot download it without problems and normally not all the material in 7 days.

I found it very interesting that it is possible to get so much data so fast, like buonaparte did it. It would be very interesting to know how he did it. But I have an Premium Account till 2012 (and don't think I will cancel it) and so I can get the one-liner everytime.

A Guy I know from another Forum asked already Peter from JPOD101, what they are thinking about this kind of links. It is possible that I can tell more in a few days.


JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-29

If something is on-line and you can access it legally, that means you can download anything you want. If something is not prohibited, it means it is allowed.
You don't need special software, all you have to do is to figure out the urls.
And that is not difficult at all. It is enough to know your webbrowser.
I didn't need their line-by-line audio at all, people asked for it, so I helped them. I was just surprized that somebody wanted to waste hundreds of hours to extract the audio from larger mp3 files.
You don't need days to download, a few hours is enough.

If you can share anki files with audio from Jim Breen's dictionary, I thought you can share anything that you can get legally for free. It just saves time for someone else, nothing more. I don't get any money from it, they don't lose any money from it. What's the problem?

Anyway, I won't be posting any links any longer.
Bread upon the waters, eat and let the others die of hunger.


JapanesePod101 Audio Sentence Mining Project - ファブリス - 2010-11-29

@Blahah: I may have let buonaparte's recent spam fest get to my nerves (plus someone complaining about the thread by email), sorry about that. Delete is permanent and I'm not going to spend hours trying to merge them back from a daily backup. buonaparte can post them again.

As Teskal points out, it does not appear to me that the downloads are truly free. You need at least to register on their site, which means giving them a chance. With that said, JapanesePod101 advertising on this forum DOES generate sales for them. So it's not like I'm worried about stealing from them. The point, really, is that this is a forum to share knowledge, not for download fests.

This forum is primarily for sharing experience and knowledge, and helping each other out with informative topics. If you are going to post downloads, then at least put a minimum of effort to give side information such as the source(s) and what the download contents are. Nobody wants to download from shady websites with little to no information on what they are downloading.

If you really care about others, well first off, you don't post stuff "just in case" someone might need them. You spend time actually trying to answer other's posts, or you collaborate on group efforts. Posting stuff "just in case" someone needs it is just ego masturbation. Only on this board, I don't give a damn about your post count, or how much resources you can share.

OP Wrote:let's keep this legal..
Then you can share scripts and work together to build scripts that take input from specific downloads, run tools on them, and generate the output you need. No need to share the actual data.


JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-29

ファブリス Wrote:Posting stuff "just in case" someone needs it is just ego masturbation. Only on this board, I don't give a damn about your post count, or how much resources you can share.
Sorry to have upset you. I really didn't mean to. I thought I gave enough information each time.
People do share resources here too, so I thought nobody would mind.

As I said before somewhere else, I thought that making one thread with many different topics would be frowned upon. It did not intend to spam your site or something.

If you want I can delete all my posts, if it is too much trouble for you. I realize it is tough being an admin.

Good luck again,


JapanesePod101 Audio Sentence Mining Project - ファブリス - 2010-11-29

Yes, that is what I wrote. Although let's not put it out of context, I am clearly annoyed at your recent posts. Fair? Maybe not. Handle the heat and make better posts, or stop posting links, it's your choice.

You registered like 4 days ago. Yesterday you spammed like 10 different topics each with a download link to some shady download sites with very little contextual information. For all I know you could be purposely posting illegal download links and then try to get me in trouble!


JapanesePod101 Audio Sentence Mining Project - buonaparte - 2010-11-29

ファブリス Wrote:For all I know you could be purposely posting illegal download links and then try to get me in trouble!
Yes, you're perfectly right, I didn't think of that, I did mean well, though. Life is full of surprises.