I'm not sure if it's good to have them all joined together. It could make searching through the whole thing simpler, but then it's all just one giant stream of text, so it could be harder to work through manually.
I was looking at the files in Notepad++ for Windows, and it displays the names of the character codes for a lot of the "junk" characters. For instance, null characters appear where the line breaks should be, so it would probably be good to convert those to actual line breaks, which would make the text more readable.
(EDIT: some info moved up to the first post)
Directions for adding a new chapter to the spreadsheet:
I'm using Notepad++ for Windows, which makes the whole process quite simple.
- First you need to find the files that you need. For example, if you want the script for chapter 2 of the game, load up the game, go to chapter 2, and look at some of the text. Then search the files for some of the text that you read in the game. This should show you which file contains the script for that chapter.
- There will be a lot of junk at the beginning of each file. Just delete it.
- The files contain line breaks that should be replaced by spaces, so do a search and replace over the files. (In Notepad++, search for \n with the search mode set to Extended.) In the EN file, replace the line breaks with a normal space, but in the JP file use a full-width space. This space keeps words and sentences from running together.
- Next, the scripts contain null characters where the actual line breaks should be. In Notepad++, do a search and replace that replaces \0 with \n.
- Now, the scripts probably have some junk characters in them, which you should go through and remove manually. (These correspond to the green highlighted text in the game.)
- Finally, select all of the text that remains and paste it into a spreadsheet: Japanese in one column, English in the other. Make sure the two columns line up. If they don't, you might need to do some adjusting.
- Each chapter of the game might be spread across more than one file, so be sure to get it all. In particular, cutscenes are in separate files.
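If you'd rather script the two search-and-replace steps than do them by hand, here is a rough Python sketch of the same idea. It is not something I've run against the real game files, and the function names (clean_script, pair_lines) are made up for illustration. It assumes you have already deleted the junk at the start of the file and have the rest of the text loaded as a string (how you decode the file depends on its encoding, which isn't covered here). The junk characters from the green text still need to be removed by hand.

```python
from itertools import zip_longest

FULL_WIDTH_SPACE = "\u3000"  # joiner for the JP file


def clean_script(text, joiner=" "):
    """Apply the two replace steps from the directions to one script.

    `joiner` is a normal space for the EN file and a full-width
    space for the JP file.
    """
    # Step 1: line breaks inside the file become the joiner.
    # "\r\n" is normalized first in case the file uses Windows line endings.
    text = text.replace("\r\n", "\n").replace("\n", joiner)
    # Step 2: null characters mark the real line breaks, so split on them.
    # Empty lines are kept on purpose so JP and EN rows stay aligned.
    return [line.strip() for line in text.split("\0")]


def pair_lines(jp_lines, en_lines):
    """Pair JP and EN lines as spreadsheet rows (JP first, EN second).

    If one side is shorter, the missing cells are left empty, which
    makes it easy to spot where the two columns stop lining up.
    """
    return list(zip_longest(jp_lines, en_lines, fillvalue=""))


if __name__ == "__main__":
    # Tiny made-up samples, not real game text.
    en = clean_script("Hello\nworld\0Second\nline\0")
    jp = clean_script("こん\nにちは\0", joiner=FULL_WIDTH_SPACE)
    for jp_cell, en_cell in pair_lines(jp, en):
        # Tab-separated output can be pasted straight into a spreadsheet.
        print(jp_cell + "\t" + en_cell)
```

The order matters: the existing line breaks have to be turned into spaces before the nulls become line breaks, otherwise you can no longer tell the two apart.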
Edited: 2011-03-03, 1:56 pm
