Chrono Compendium

News and Updates => The Great To-Do List => Topic started by: ZeaLitY on February 01, 2011, 06:08:36 pm

Title: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 01, 2011, 06:08:36 pm
God, it brings tears of sadness and frustrated pain to my eyes to think about putting them together in a spreadsheet, but yeah, we need to finish the CT DS script. The first step is getting all of it properly ripped.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Mr Bekkler on February 02, 2011, 04:43:12 pm
This makes me wonder what, if any, legitimate differences there are between the French and English versions.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Lady Marle on February 02, 2011, 10:33:32 pm
No idea but the DS is retranslated from the SNES and PS version

I can do the English and probably get it translated but does this require scripting for EVERY character in every scenerio? How about NPCs (i.e Townsfolk. Obviously Schala, Zeal, Ozzie, etc. is apart of the script) ?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 02, 2011, 10:56:58 pm
Oh, I mean, this'll really just get accomplished through some kind of DS utility. It probably would have been done by now if the C&D hadn't killed that kind of thing at the Compendium. It'll extract the entire script for us, so the main pain for me will just be putting them all in order in a huge spreadsheet. It's something I'm intimately familiar with, of course.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Lady Marle on February 02, 2011, 10:58:00 pm
Oh, I mean, this'll really just get accomplished through some kind of DS utility. It probably would have been done by now if the C&D hadn't killed that kind of thing at the Compendium. It'll extract the entire script for us, so the main pain for me will just be putting them all in order in a huge spreadsheet. It's something I'm intimately familiar with, of course.
Sadly I don't have one of those :\
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on February 03, 2011, 02:07:22 am
Hey guys, I think I've found something for this...

http://whivel.altervista.org/project.php?ref=chronotranslator (http://whivel.altervista.org/project.php?ref=chronotranslator)

This program has a built-in text dumper. With this, we can get all the text.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 03, 2011, 02:55:54 am
Hey guys, I think I've found something for this...

http://whivel.altervista.org/project.php?ref=chronotranslator (http://whivel.altervista.org/project.php?ref=chronotranslator)

This program has a built-in text dumper. With this, we can get all the text.

HAHAHAHAHAHAHHAHAHAHHAA

FUCKING

BEAUTIFUL

I'M REALLY TIRED AND I SUSPECT VEHEK ALREADY KNEW ABOUT THAT PROBABLY BUT IF THERE ARE NO OBJECTIONS YOU GET A TON OF BOUNTY POINTS


(http://chronofan.com/Zeality/rinzler_by_devilshaven-d35gxez.png)

Just tested and it works, but I don't have a European ROM! Can someone grab that app and test it on the European ROM so we can figure out whether French works?

Also, Japanese predictably comes out like this:

[@ITEM 6 (JP) ]
{36}{15}{C2}{AA}{33}{53}{2E}{18}{30}{C3}{8D}{33}{40}{66}{1E}
{6E}{3F}{59}{36}{15}{C3}{95}{C7}{9A}{32}{C3}{9E}{C6}{80}{C2}{91}{2E}
{70}{C2}{99}{1E}{7D}{38}{48}{24}{69}{30}{C3}{83}{35}{3C}{5A}{30}{1B}\0

Vehek or someone, is there some kind of program that can be written to replace all {#} with the correct Japanese characters? I imagine we might have to do it for some special characters in French, too.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 03, 2011, 09:28:54 am
Also, Japanese predictably comes out like this:

[@ITEM 6 (JP) ]
{36}{15}{C2}{AA}{33}{53}{2E}{18}{30}{C3}{8D}{33}{40}{66}{1E}
{6E}{3F}{59}{36}{15}{C3}{95}{C7}{9A}{32}{C3}{9E}{C6}{80}{C2}{91}{2E}
{70}{C2}{99}{1E}{7D}{38}{48}{24}{69}{30}{C3}{83}{35}{3C}{5A}{30}{1B}\0

Vehek or someone, is there some kind of program that can be written to replace all {#} with the correct Japanese characters? I imagine we might have to do it for some special characters in French, too.

The difficult part is likely to be figuring out which number corresponds to what Japanese character so that we can construct a mapping table (French will be easier in that regard, since only the characters with accents, dieresises, or cedilles should be messed up there--that table I should be able to construct if I have a large enough sample of the French text). Given the mapping table, it should be possible to tinker together something that takes {hex} like the above and spits out, say, UTF-8, in no more than a few hours.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 03, 2011, 10:28:05 am
http://www.romhacking.net/forum/index.php/topic,8029.msg125500.html#msg125500

Maybe someone can make a character table? Sorry my proxy failed to connect to photobucket (or the links are just dead).
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 03, 2011, 11:11:26 am
If that is the mapping for the dialogue font, things should be fairly straightforward (it's only kana, romanji, and some symbols, which will yield a table not much longer than the one for the Cross relative alphabet). Unfortunately, my guess is that the actual Japanese dialogue font is that butt-ugly monster with all the kanji on the next page of that thread, which means that those kanji are going to have to be examined to see if they're in any kind of standard order. If they aren't, each is going to have to be ID'd individually.

It's possible that, in order to obtain the entire script, we are going to have to map both character sets.

I'll try to compare the stuff in that thread to SHIFT-JIS and UTF-8 character orders later today (assuming that I can find that information--I know I used to have it for SHIFT-JIS) and see if there are any commonalities.

Update: I am now 98% certain that the order of the kanji in that one font is completely random, just like the rest of the characters in it (no major encoding that I can find places 人 王 山 一 near the front in that order, with or without other stuff in between). Gack. I hope that isn't the one we need.

Another update: I should have read the thread utunnels linked more carefully, as there is a conversion table already posted there that appears to cover the dog's-breakfast-kanji font. If it's accurate, it will make things a lot easier.

(Is it safe to be discussing this here? It isn't quite hacking, but we're dancing kind of close to the edge...)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 03, 2011, 11:28:58 am
Whatever, we can always try another place.
 :evil:
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 03, 2011, 03:46:28 pm
Haha, scripts are too important for the cause, as they say. Full steam ahead.  :kamina
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 03, 2011, 04:16:52 pm
Okay, then. Further information: The kanji table appears to have been created by our very own Vehek, and is, alas, only partial, covering maybe 50% of the glyphs in the font. That means that Vehek may be able to at least confirm whether or not this is the dialogue font. We'll have to wait for him to post.

If we do need this font and Vehek hasn't done any more work on it, I'll post the partial table and a marked-up image of the font, and see if we can crowdsource the remaining glyph IDs--unless someone else has a better idea.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on February 03, 2011, 07:47:12 pm
I'M REALLY TIRED AND I SUSPECT VEHEK ALREADY KNEW ABOUT THAT PROBABLY BUT IF THERE ARE NO OBJECTIONS YOU GET A TON OF BOUNTY POINTS
I didn't know. :oops: In fact, I was trying to see if I could make something that would take a list and use it to arrange dialog.

That means that Vehek may be able to at least confirm whether or not this is the dialogue font. We'll have to wait for him to post.
The big sheet is the font in the Japanese version. I don't really know Japanese, but really, they wouldn't have gotten rid of all the kanji in the Japanese release.

Where do you want me to put what I have?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 03, 2011, 08:11:48 pm
If you've done any work on the table since that RHDN post of March 6, 2009, stuff it in an attachment or something and I'll see about working my way forward from there. Given the complete table, constructing a Perl {hex} to kanji mapper should be easy--it's the kind of operation Perl is really good at.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 03, 2011, 10:19:44 pm
Actually the font files are /msg/big/msgcmn.fnt and /msg/small/msgcmn.fnt, if you have a tool like Tahaxan that can explore NDS files.
 :wink:
Well, just in case you didn't know.

http://www.romhacking.net/docs/435/
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on February 04, 2011, 06:43:34 pm
Here's the table file. Because it still has the English characters and other things, if you try to use it as is with that translation tool, the tool will fail to show the English script due to it rejecting duplicate table entries.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 04, 2011, 08:36:24 pm
Well, I've gotten this far with ZeaLitY's original sample text--

[@ITEM 6 (JP) ]
{36}{15}{C2}{AA}{33}{53}{2E}{18}{30}{C3}{8D}{33}{40}{66}{1E}
{6E}{3F}{59}{36}{15}{C3}{95}{C7}{9A}{32}{C3}{9E}{C6}{80}{C2}{91}{2E}
{70}{C2}{99}{1E}{7D}{38}{48}{24}{69}{30}{C3}{83}{35}{3C}{5A}{30}{1B}\0

maps to

[@ITEM 6 (JP) ]
この??ンイにいる??ンスター
つまりこの????は??????に
ダ??ージをあたえる??ができる。\0

I just need to fix the handling of two-byte characters, and then we'll need to complete that table.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 04, 2011, 08:55:51 pm
Okay, not my biggest PEBCAK of all time, but it may be able to compete for runner-up. 2-byte glyphs now work, so we have:

[@ITEM 6 (JP) ]
このハンイにいるモンスター
つまりこの場合は2ヒキに
ダメージをあたえる?ができる。\0

Or something like that (what I can make out looks sensical enough, although I can only read the hiragana). The question mark is a glyph that's missing from the table, C383.

I'll try to mark up the image of the font to indicate the unID'd glyphs in the morning. And I'll attach my excuse for a program then, for what it's worth--because of a workaround I had to do, chances are that it'll only work on Linux right now.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 05, 2011, 01:41:12 am
It seems this picture matches Vehek's table, somehow.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 05, 2011, 03:02:32 pm
Yes, that would be the one from RHDN. Here's the annotated version:

(http://img225.imageshack.us/img225/460/annotatedfont.png)

Green glyphs are the ones in Vehek's current table (he's gotten about a third of them). White glyphs are the ones we need to add to the table before we can sort out the Japanese-language script. Yellow glyphs are the ones I'm reserving to work on myself (mostly accented Roman characters, plus some symbols and a few kanji). The blue letters and numbers along the edges are indices--write as the row first, then the column. Frex, the index of the lower-case d in the upper right corner is 0F.

What we need to do now is match up the indices with the glyph characters. You do not need to know any Japanese to help with this--a willingness to match leetle tiny pictures with each other is sufficient. http://www.rikai.com/library/kanjitables/kanji_codes.unicode.shtml (http://www.rikai.com/library/kanjitables/kanji_codes.unicode.shtml) has a set of kanji characters if you're not able to type them (you may need the Shift-JIS table linked at the top of that page for a few of the odder things in the font, like the circled numbers). It may be easier to see details if you magnify the image to 200% of its original size (or not). There are still a lot of easily-identifiable glyphs (symbols, half-width kana, and simple kanji) that are not in the table.

Table entries should look like:
08=s
4C=す
C2A8=来

I can't do this alone, people--Vehek's been working on it for two years and has only gotten ~500 of ~1500 glyphs. Please pitch in even if you can only match one or two. Post your entries here so that I can add them to the table.

(ZeaLitY? Your thoughts on offering a small bounty for individual glyphs?)



Anyway, as I see it, there are four steps that need to be accomplished in order to finalize the non-English versions of the CT DS script (feel free to contradict me, everyone):
1. ID the remaining characters in the font
2. Simultaneously, someone uses the utility Acacia Sgt found to extract the script
3. I take the output of 1 and 2 and stuff it through my silly little program to get the Japanese characters
4. ZeaLitY then takes the output of 3 and does the spreadsheet thing

In case anyone's genuinely curious, I've attached the program (rather a grandiose name for <50 lines of code, half of which is boilerplate) that I used to convert the sample text. Requires a Perl interpreter. Also requires Vehek's table file in the same directory. May require that line 8 ("chop($_);") be deleted in order to function under Windows. Usage: perl mapds.pl [file] (with the output being sent to [file].jptxt).
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 05, 2011, 04:24:15 pm
Something like 10 per glyph?

Hah, this is exciting. And the spreadsheet columns can always be copied and pasted into individual script text files if necessary.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Kodokami on February 05, 2011, 06:10:46 pm
Oh man, oh man. That is difficult to read. No chance of higher resolution, I'm guessing?
-----------
C3AA=山
C482=上
C48E=反
C4B6=臣
C6A6=工
CD84=穴
D082=矢
-----------
C883=失
C8A2=止
CA86=平


I'll add more as I go. (bold=new)
10 down, around 1000 more to go! 8)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 05, 2011, 07:02:37 pm
The Compendium forums naturally resize the image to be a little smaller. Try opening it directly in a separate window; makes it marginally easier. (You probably already tried this, though...)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 05, 2011, 07:25:09 pm
@Kodokami: Thanks for those. Unfortunately, I didn't create the original font image (and my guess is that the characters were never larger than the 8pt they seem to be at now). I can resize it, but the image would be of the same quality as if you just downloaded it, slapped it in Photoshop/the GIMP/whatever, and set the zoom to 200%, so I don't think it's worth it.

@ZeaLitY: Yeah, 5-10 is about what I was thinking. I take it you're okay with proceeding this way, then.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 05, 2011, 09:52:34 pm
Yeah. We'll inch our way towards glory. This damn script will yield its secrets! And we'll have the Dream Devourer scene to retranslate, per Arc Impulse's discovery of subtler nuances left out of the Jeremy Harmer translation. (Well, nothing earth-shattering; just subtleties...)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 05, 2011, 10:36:17 pm
Should I type those kanjis? I think I'm at least not bad at reading them.
 :evil:
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Kodokami on February 06, 2011, 01:10:13 am
By the way, if it helps anyone who's working on this, I've been using this website (http://kanji.sljfaq.org/mr.html) to find the right kanji. It's still a pain, but significantly easier when the search is narrowed down.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 06, 2011, 01:27:23 am
These are what I've done so far (still working on the rest).

{ } means it is an empty slot in the picture.
red means previous character is different from Vehek's, or I'm not quite sure.
?? means the character is not quite readable to me.


{ }e{ }toanrsi{ }hl{ }ud
m.cygのwfい…p.b:ー「
なk!!たv,、て'かっIしにS
るTはンうがこんをMれらで?とま
スAもC-Dだ?あPWくすッBお
ルHトイ{ }ラLよOりきさGRアY
EそF{ }わけタN・え0どシドつリ
ダマク1ちみやゃじねガzジデ2
Kレ人プサムロカテせ王3ネミろィ
めキナオ魔様コバ大メば4達ウ1フ
エょボ時j0:力来者ハ前行xVグ
私ノ出ョ何ォャュq6撃U中戦ぞ手
城入5事(ベ『ヴ)チ』体パモ技ビ
間ワ気9ブ場ずツ秘長~村見物2セ
代屋8ゴPへ黒ほ世Q山部/火竜げ
一ゲぶ7宝生回ごケJ—殿ゆ所用ニ
べ今上ソヤ持下女言3思夢ポ法反石
最"地ぬギ強元死)復_ェ(ぜ子ズ
天びH全年勇むひザ方敵ペ水Z家ァ
道当使底命光臣箱ぎ原防ヨ森ふ俺冥
色攻恐ピヘ修剣ホ本現4国空海理広
金变後通{Flame}士作未ざ次+待町+ぐ始
話不M号戻伝.動無老跡連ば5食知
{Dark}性分度開先聖Xぼ自虹6器層闘」
ヒ誰9化日調*心吸古窟外R域御取
里単ゅヌ北民封刑鳥{Heaven}落橋定開能発
墟発名守星倒工目{Droplet}逃帰果装備助成
少賢炎小画込巨G近早太{Heart}軍味説残
洞明以專{Star}電切界妃々印D高賴騒祭
終属務放仕議ゼヲ{note}路合眠宿探暗木
A口黄好直ゥ收立期遠刀負面解携掛
院団返兵教消勝悪除初置必完換砂感
奥頭身失突段乱%新交陽{small note}具闇料階
昔特呼貝願秘決仲会向記男殺Sづ品
父西止飲予宮壊設引友兄続信斬&判
楽同輝風姿要源腕素録我巣売別安違
常育二漠E意進受千裁起武東破内V
半配旅他裏番7急計F{ }Y実扉母主
飛族利混弱白超赤対相機親形線足選
確認転岩挑深丈O望究活了供鉄·X
TB追館夫礼平在可救永登異乗険考
万師氷危夜船L押付音表覚血酋効嘆
墓庫鐮C去運声ゾ態着左員塔鎧射差
毒休研術的和情罪三8閉N過渡然酒
右緑存影製証并走数買ぽ愛応更—共
加幼依得雷姉翼葉示役歴重係君??兜
ぴx正報岬邪??文有視卵殻弓怪投操
伏書悲種捕等查材絶角弾円娘憶字陸
踊処野耐監室排銃倍祖x歌I鐘建精
呪ぁ率限念車遺像弟鋼球侵美拳僕彼
多虫流敗珠薬両暴再械敷警幻辺滅輪
姫波速ぷ苦章削{ }南検緒移捜史病??
迷罠亡獄穴管演銀羽住紋郎値約震希
十型令許良島%頂客狩虜花張片戒鏡
質送→傷非泣困結ユ肉参極降爆寄却
隠草熱似牙冒干亭牢>廷èé銅[]
奇殴湿拔減殊授樹想読位冷造忘格脱
折央K惑月怒触真断謝霊治構契功獣
{ }{upper arrow}商択注故災害寝堂縱観帝í鬼服
巻衣怨打玉談細??購軽孫編損P{right arrow}験
旋店歩ぇ暦障宙集爪盗将継側景改浮
遊導築喰舞剛丸粉烈語価告貴届避優
護至因ぺ適経誇U華試襲霊宇顔習都
禁求①②保床滝問題ú越鳴魅財布冠
英枚矢針雪ぉ{??}組測識退增脳若旦怖
Z徵与硬志系ヅ“”翻訳WLVHM
{ }{ }謎束散惠関燕壁棲焼糧八VS到
威容叉朱雀豆漆袋激柄雄衛盾槍刈吹
息瞬栄頑驚末酬準ぃ皆每密裂ぅ件量
業骨図アクセサリー堅澄刃抱第ぢ<
替Àà各整市制際蹴{ }{ }髪遥短例雰
囲竸慣??句眼難還誓省崩兼灯拝③控
腹崎五討溪谷昇&嵐笑刻紅砲胴屑鉢
嶺隊魂煙脚踏伸棒誘咆哮静徐否催疾
閃召緊払奴歯貸恩狙珍悟労久泊疲植
補麗産析遭遇条義扱溜醒砕府描飾点
咲鋭根{ }寂座惜縛#織◯ゎゐゑヂヮ
ヰヱヵヶ{ascii characters...}
{ascii characters...}
{ascii characters...}
噛河柱{ }{ }{ascii characters...}{(R)}{TM}
{(C)}~朝憧憬祈沈默荒愉快燃錯律廊祝
晴門基順接項況杯嬌派標頃拾類資賞
既式幅紫低響仮預宴序盤撒伐由??微
典跳潰泡即睡臭衝痛恨鼻吐陏申済仏
騷充託暮畑支援勢鎖嫌才陣獲指揚慢
欲陥周承歳歓迎茂採苗鍛逸洗練凄興
逆借奪僞映聴詳縮霸摔彗貫奏粒畜積
歪鉤温般磨司清掃織匹??包妖霧憂純
柔??七浴靭養抽狂筋甘貯藏評級鉱群
囚淵ⅠⅡ環境鑑妙虚憐並孤涯背飼<
独JQabcdefghijklm
npqrtuwxyz一台尻尾奈塊
$;=@[\]^`{|}~
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on February 06, 2011, 01:37:41 am
While my table was initially based off comparing my dumps to the SNES script and filling it in from that, I never got around to inputting all the kanji from the SNES version.

For instance, that C383 in alfador's test of his program was 事 in the SNES version.
And CC8D is 鐘.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 06, 2011, 07:34:08 am
OK, got most of them done.
Some characters are really hard to read without context, hmm.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 06, 2011, 10:45:40 am
Thanks, utunnels. Looks like I have my work cut out for me, distilling all that down into table entries. If no one gets the remaining kanji in the next couple of days, I'll try to enlarge them--no promises of getting anything useful, though.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 06, 2011, 11:07:08 am
Hmm, it seems there are still several duplicates -- which means one of them could be wrong.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Kodokami on February 06, 2011, 12:19:58 pm
Dang, utunnels, you did that fast. :shock:
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 06, 2011, 01:46:07 pm
@utunnels: I think there are 7 duplications in your table:  一 発 秘 織 衛 開 霊 (mind you, uniq also claims that you have 265 copies of the の kana, so take that with a grain of salt). More later.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 06, 2011, 08:48:38 pm
Yeah, 一 is the most difficult one you know.
-—一ー all look similar to each other, except the length.

The first 発 was wrong, but I haven't recalled what it should be so far. LOL
The first 秘 shoud be 神, LOL, I must be thinking about 樹海の神秘, the name of a CT sound track: http://www.chronocompendium.com/Term/Secret_of_the_Forest.html
The first 織 could be 績, still not sure about both of them though.
I've already fixed 衛 yesterday, but I think I need to see some context to confirm them.
The first  開 should be 聞.
Actually, I'm only sure about the first 霊, the second looks closest to 雲 if it is not 霊, hmm.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 12, 2011, 11:08:50 am
Progress report: I've managed to mangle utunnels' contribution around into the format I need (actually, that was a few days ago now). I'm currently trying to get a better idea of which glyphs are still genuinely unID'd, since my browser settings destroyed the colour codes in utunnels' text.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 12, 2011, 11:28:36 am
Just search for ?? pair, each stands for an unID'd kanji.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 12, 2011, 11:48:56 am
Except that then I have to calculate the index, which is error-prone if I try to do it by hand, so that I can check it against Vehek's list to make sure it isn't a control code or something freaky. Anyway, it turns out that the remaining unknowns are:
C69F
CB9E
CBA6
CCBF
CEB7
D086
D1B3
D486
D4BE
D68A
D691
(I know I said I would do D486 myself, and I did spend a long time staring at it, but I was not able to match it to a UTF-8 character.) If no one figures them out soon, I'll try to extract that characters from the font image and blow them up a couple of notches.

I still don't know which characters differ between utunnels' and Vehek's versions of the table.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 12, 2011, 11:51:49 am
ばァ
Should be these 2, I think you can use my version safely.  :P
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 12, 2011, 01:44:37 pm
Okay, if it's just two kana, I'll just go ahead and use your version. In the unlikely event that one of them is wrong, it should show up fairly early and obviously once we start squirting the text through.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 19, 2011, 10:43:09 am
Only about a week late on this... Anyway, the remaining missing characters:

(http://img201.imageshack.us/img201/473/missingcube.png)

I scaled this up using cubic interpolation, which seems to have clarified some things (and blurred others, but you can't have everything). For comparison's sake, here's an upscale using no interpolation:

(http://img191.imageshack.us/img191/449/missing.png)

If no one can get these within the next few days, we're probably going to have to fill them in from context.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 19, 2011, 11:10:16 am
OK, CB9E must be 帽(cap, hat), since CB9F is 兜(helmet).
D691 from your picture looks mostly like 蒙, if it is not 豪.
CBA6 from your picture looks like 青, and there is no reason to not have one in such a game.
C69F is 廃, since it and its next one make the word 廃墟(ruin).
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 20, 2011, 12:06:14 pm
Added, thanks. Do you think there's any chance at all of IDing the remainder, or should we proceed to the next stage and hope to fill them in from context?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 20, 2011, 12:17:24 pm
Well, I think just use {xxxx} for the remainder. The chance for them to appear more than once is very small, so we can easily find and replace them later.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 22, 2011, 11:04:00 am
Okay, I'm attaching the table as it currently exists (I even mangled it around to have DOS/Win line endings). Has anyone used that utility AcaciaSgt found to dump the script yet? (I'd try myself, but in the absence of other information I have to assume that it's Windows-only, which means that it would be easier to do for just about anyone else.)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 22, 2011, 04:52:22 pm
I assume we don't need to dump the Japanese ROM for whatever reason. The Japanese script is the same on the NTSC ROM, right?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 22, 2011, 06:12:38 pm
I think it's safe to assume that unless someone has evidence to the contrary (I mean, why would they write a new Japanese script for the English release?) I'll try to have some results from these tomorrow.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on February 23, 2011, 10:27:12 am
Do we have the table file for the small font?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 23, 2011, 11:00:19 am
No, but I'm not sure it's all that important--spot checks suggest that the actual script was probably encoded entirely from the big font table (I've found random chunks of 1000AD, 600AD, a couple of different endings, and the Lost Sanctum in the dozen or so files I've looked at). We can put a second table together to make sure, I suppose.

Attached are all the files from the BIG subdirectory of the Eng/JP dump with the Japanese (and a few other random characters) reconditioned according to the table. The three leading junk characters in each file are harmless, but not derived from the original text (they're a "this is UTF-8 encoded!" indicator for boneheaded applications that can't figure it out for themselves).

I'll start looking at the Eng/Fr dump now.

UPDATE: It looks like the main part of the PAL dump doesn't need any further processing--all the accented characters are already in place in the French text (so I would guess that the dumping utility was created originally with the PAL version in mind). Most of the content in the SMALL directory is readable here too (the exception is part of system.msg.txt), and none of it appears to be script--mostly battle messages, I think. ZeaLitY, do you think we need to do the small table for the Eng/JP version?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on February 23, 2011, 03:23:00 pm
Prior scripts have had all the messages, I guess. Would it be a lot of work to chase down the small table as well?

Hah...wow, the hour's approaching of having to make the new script spreadsheet.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on February 23, 2011, 03:49:31 pm
I think the relevant font is the second image on the third page of the RHDN thread utunnels linked back in reply #8, the smaller one below the main font. Mostly romaji, kana, and less than a dozen kanji. It shouldn't be difficult to build the table, but I'm not up to doing it myself right now (if someone else does it, I can squirt the files through it without any problems).
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on April 04, 2011, 03:04:20 am
OK, CB9E must be 帽(cap, hat), since CB9F is 兜(helmet).
D691 from your picture looks mostly like 蒙, if it is not 豪.
CBA6 from your picture looks like 青, and there is no reason to not have one in such a game.
C69F is 廃, since it and its next one make the word 廃墟(ruin).

Hmm, after checking alfador's files, it appears D691 is 豪.
item_mes2.msg.txt
[@ITEM 332 (JP) ]
黄金で作られた豪華な兜\0

D086 should be 轟
[@ITEM 48 (JP) ]
{D086}轟雷を呼ぶ\0

D4BE should be 脅
[@ITEM 24 (JP) ]
時の闇の彼方で待つ
新たなる{D4BE}脅威「夢喰い」を倒す\0

[@ITEM 84 (JP) ]
闘技場係員「挑戦したい
  バトルランクを選んでくれ.
  ランクの後の金{D1B3}は参加{CCBF}用だ.\0

D1B3 (I thought you typed D1B6?) should be 額 and CCBF is 費




Oh snap, it seems some characters became question marks, or it was just me?

For example:
竜の里の?{CEB7}に入るための証
In which ?{CEB7} should be 祭壇.



[@ITEM 190 (JP) ]
すべてを裁く{D68A}判の鐮\0

D68A is 審.

Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 04, 2011, 09:23:11 am
Thanks for reminding me that I still have work to do on this. I'll see if I can correct the table and squirt the scripts back through. In the meanwhile, can you ID the kanji in the attached font (there's less than a dozen of them, mostly towards the end)? Given those, I should be able to construct the other table for the combat messages, at which point we should have everything.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on April 04, 2011, 09:33:53 am
Before 3: 王

Before R: 魔

Above J & P: 幼体

Before heart: 手 入

Below arrows: 火水天冥虹原石

Above last line: 決定


BTW: I think you should check those question marks first.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 04, 2011, 10:13:57 am
I'm pretty sure I already know where the question marks were coming from: they were being subbed in for characters not present in the table (there are some gaps at the end among the meta-substitutions like {Chrono} and {Tech}). I've fixed it so that we should now get the two-byte code instead.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on April 04, 2011, 10:26:33 am
Quote
[@ITEM 22 (JP) ]
電アイス\0

[@ITEM 23 (EN) ]
電Ice\0

[@ITEM 24 (JP) ]
電ケアル\0

[@ITEM 25 (EN) ]
電Cure\0

[@ITEM 26 (JP) ]
電ヘイスト\0

[@ITEM 27 (EN) ]
電Haste\0

Something must be wrong here.

Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 04, 2011, 10:34:42 am
...Damn. Incorrect substitution on C784, maybe. Which file is that from?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on April 04, 2011, 10:49:48 am
It's tech.msg.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 04, 2011, 11:02:31 am
...Mmph. Looks like whatever's going on there is Not My Fault--the script as dumped really does have {C7}{84} prefixing those tech names. It could be a bug in the script dumper, or something loony in the script itself.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: utunnels on April 04, 2011, 12:21:43 pm
[@ITEM 41 (EN) ]
{COUNT} damage!
?'s {D7B6} is slowed!\0


Well I saw a lot of ?'s and {D7B6}, I wonder what are they.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 04, 2011, 12:51:17 pm
D7B6 is another entry in the meta range--between D7B7={CronoNick} and D7B5={Tech?}, which I believe I assigned to what appeared to be the tech name in "Learned [tech]" messages--meaning that it's a placeholder for another complete string, possibly a tech/enemy tech/attribute name. The ?s are likely another meta entry, possibly the name of the Nth enemy in a given fight. Either way, the information can't be deciphered based on the font image--we need message samples from the actual game.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 10, 2011, 01:26:23 pm
Augh. Everything Always Takes Longer Than You Think It Will, but I think that my part of this can finally be put to bed.

The attachment contains all the corrected script and message files (both the BIG and LITTLE directories), plus both file tables, both font images, and the mapping script as it was when I ran these through. Several corrections have been made to the big font table (tabone.txt.jptxt)--those mystery kanji in tech.msg turned to be due to the star character having somehow been omitted from the table, and there were a couple of similar blunders that are now fixed. There are still a few meta-substitutions and maybe one or two kanji missing, but I think they're few enough now that they can be dealt with by hand--they should be clearly marked with a four-digit code.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: FaustWolf on April 10, 2011, 10:44:42 pm
alfador, looks like the attachment didn't go through.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: alfadorredux on April 11, 2011, 08:20:39 am
Trying again.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Tg on February 04, 2012, 03:00:14 pm
Any news on the script making? :)
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on March 13, 2015, 11:20:49 pm
Looking over at the to-do list, I remembered this already had quite the progress behind it. Looking back it seems most of it is still there. I think this could be easily finished if resumed.
Title: Bestiary
Post by: Lavos, a one-eyed tick on December 22, 2019, 10:03:36 pm
Seeing as the Bestiary contains data for every enemy in the game, including the bonus areas, would it be possible for anyone to copy it down here? I'm not sure if it's already been done or not, but being able to see all the data for all the enemies in one spot would be helpful. I've ran through the CT DS port many times and still can't seem to find some of the enemies, even some early-game.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on September 01, 2020, 01:21:56 am
Merged into this one.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on September 11, 2020, 12:20:39 am
I didn't knew if to make a new thread elsewhere for this or not, but...

On the subject of scripts, I decided to compile the ones from the Steam version. I even made a little excel file in an attempt to make a side by side comparison of a little sample text. Like the ones made for the SNES Retranslation. Some I went ahead and did things like preserve the in-game formatting, and even replace some of the <Name_X> parts for the PC's in a few of the languages.

Other than that, the dumped text as-is is in the other zip file. So, if the DS scripts are still a work in progress, here's the Mobile/PC ones in the meantime. Well, they should be almost exact as the DS ones, at least.

I think I got all the text files pertaining to in-game text. I might recheck it later if I missed anything.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on September 11, 2020, 12:30:12 am
Oh nice! I forgot, but was this the first time it had been translated to all the other languages (than Japanese/English/French)?

...Man, my memory is completely gone; the English translation used was also the DS Tom Slattery one, right?
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on September 11, 2020, 12:33:57 am
Oh nice! I forgot, but was this the first time it had been translated to all the other languages (than Japanese/English/French)?

...Man, my memory is completely gone; the English translation used was also the DS Tom Slattery one, right?

I think for the German, Spanish, and Italian ones they come from the DS version, since it got released in Europe after all. At least, I'd assume so. The Korean and Chinese ones would be entirely new, I think.

It takes the DS script, yes, though I think some time ago in the Discord someone posted images about finding minute changes. So there are changes, if very few, from what I understand.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: ZeaLitY on September 11, 2020, 12:48:03 am
Haha, just have to ask, is the Spanish version a certain dialect? Al fin hice algun progreso en mis estudios de español, y me pregunto si la traducción tiene unos mexicanismos geniales...I might as well replay the game using that version next.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on September 11, 2020, 12:56:53 am
It's from Spain. It says so on Steam.

No Latin American Spanish translation for CT yet I'm afraid.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on September 11, 2020, 02:46:15 am
As far as I know, the DS version only had Japanese, English, and French. The other languages came with the mobile ports.

I think the mobile port changed some text because the previous versions referred to the controller buttons, while the mobile port apparently popped up colored buttons when necessary. Some or all of it may have been reverted in the Steam version. (After a quick look, the files ending with "pad" use controller buttons.)
Examples:
Quote from: DS
Catchy rhythm, huh?
You can use the X, Y, L, and R Buttons to dance along!
Quote from: Mobile
Catchy rhythm, huh?
You can use the color buttons to dance along!
Quote from: DS
There is a note.

"The Director of Research ought to have the security lock release code written down somewhere in this room.

"He's a strange one, though, always writing in codes and what have you. Knowing him, he probably wrote it down backwards."

There is a note written by the Director of Research.
"Security lock release code: LLAB
I picked it because, well...I like things that are round."
Quote from: Mobile
There is a note.
"The Director of Research ought to have the security lock release code written down somewhere in this room.

"He's a strange one, though, always writing in riddles and what have you. Knowing him, he probably came up with some crazy color code."

There is a note written by the Director of Research.
"Security lock release code: Tomato."
"I picked it because, well...veggies are good for you. A tomato a day keeps the doctor away!"
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on September 11, 2020, 03:22:56 am
Those would be expected, but I think there were other changes that didn't had to do with the controllers.

Like, from the Discord chat:

Magus08/27/2018
:<
Frog: Are you Mad!!??!
Magus: I said he was dead. I didn't say it couldn't be undone.
...I think this is its own translation
I don't think this was the DS game's translation

how's this
AcaciaSgt08/27/2018
Hmm, looking up the script, it does seem to be different.
Magus: I said the fool was dead. I did not say his death could not be undone.

He also brought up that Queen Leene no longer mentions the leading party member after finding the Rainbow Shell. A picture was even uploaded, but I can't find it.

Anyway, I have an on-going playthrough of the Steam version in NG+ (I'm getting those ending achievements, as it were), so I can check out both cases if it's truly different from the DS script. Finding the DS script isn't hard when there's lots of video and picture LP's out there.

Meanwhile, it's kinda hard to find playthroughs of the Mobile/PC version. Most I found stopped before reaching those points.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on September 11, 2020, 03:28:56 am
I checked the files you uploaded. The line's the same as in the DS version; they must have misquoted it. Also, the change to Queen Leene's line was already there in the DS localization (the use of the party member name remained in the Japanese text).
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Acacia Sgt on September 11, 2020, 03:37:49 am
Oh really? Well, that solves that then.

Now that I think about it, I think I was likely misremembering as well. Like, the comparison was from SNES to DS, not DS to Mobile/PC. At least with Queen Leene's line.
Title: Re: [10000] Rip the complete English, Japanese, and French scripts of CT DS
Post by: Vehek on September 11, 2020, 01:28:12 pm
If we're making a unified spreadsheet rather than a separate one, then we need to account for those lines that changed even in the Japanese DS version.
An eternity ago (https://www.chronocompendium.com/Forums/index.php?topic=2761.msg65222#msg65222), I read from the BBS logs on Ms. Kotoi's site (http://radical.or.tv/bbslog/log46.htm) that some lines of dialogue were changed in the PS1 release, the examples given were colors to button names and Taban's line in the ending. Thanks to them leaving the substitute string entries in the US release code, I know from hacking research exactly which ones got changed, though not the Japanese text directly (I checked videos).
The only changes that weren't to button references were censorship of underage drinking. These seem to have carried over into the DS release, though the last one (Taban's) merged back in the "you're the hero" sentence.

The DS version probably has some revisions of its own, but the only one I know for sure is the guy who manages the races at the fair. And that line in the developer's ending now referencing the original game's release year.
Quote
レース中さ。
だまって見てな!

もし よそうをしたいなら
はしっている全員が そこに
そろった時 オレに話しかけな!


There's a race underway.
Be quiet and watch!

If you want to try and guess the next winner,
talk to me once the runners have lined up over
there after the race.