Does anyone know exactly what this means? I'm trying to slurp playstation.jp, and they've got their index based off of an ActiveX object, so Perl (and every site backup tool I've tried) can't use it. *However*, each items page is straight HTML, and the item number is in the URL. Example: http://www.jp.playstation.com/software/title/slps20006.html The problem is, I would need a scheme to come up with all the item numbers. They don't appear to be totally consecutive. Even I'm too nice to hammer their site looking for every single s + 3 letter combination + 5 number combination. So, is there any easy way to suppose each item number, or is it purely hit or miss?
I just had too much time, so here's what I could find out. Basics: The way the code has to look like is determined in general by the Recording Industry Association of Japan (read here: http://ja.wikipedia.org/wiki/規格品番), or Sony just sticks to the regulations. The first two letters are the company code, followed by the type of recording (P for Playstation software) and an internal (i.e. Sony's) category code. And now, the results of my brute force attack (last game in that category stated for reference): SLPS20xxx: Third Party (general) SLPS20504: 大都技研公式パチスロシュミレーター 新・吉宗 SLPS25xxx: From Software, Tecmo, D3 (large publishers) SLPS25869: 必勝パチンコ★パチスロ攻略シリーズ Vol.12 CR新世紀エヴァンゲリオン~使徒、再び~ SLPM62xxx: Konami, Koei, Capcom, Taito, Square Enix, Sega, etc. SLPM62779: ぷよぷよ! スペシャルプライス SLPM65xxx: Konami, Square Enix, etc. SLPM65999: ドラッグ オン ドラグーン2 continued at SLPM66001 for titles from June 2005 SLPM66999: ふしぎ遊戯 朱雀異聞 SLPM67xxx: ??? SLPM67015: School Days L×H SCPS11xxx: published by Sony (SCEI) SCPS11034: ガチャろく2~今度は世界一周よ!!~ SCPS15xxx: published by Sony (SCEI) SCPS15120: ラチェット&クランク5 激突!ドデカ銀河のミリミリ軍団 The categories are not consistent though and I also have no clue about the difference between the two Sony categories. It won't help to find a specific game but well... at least I learned something new. Oh and, as you noted, the numbers don't just go from 0 to 999. There are quite a lot of gaps.
Indeed. Bummer, but thanks for the attempt. Someone in a Perl forum caught my vast oversight, and now I can successfully slurp the site (it wasn't Active X, just tricky HTML).
GaijinPunch, here's what you're looking for: http://mirage.just-size.net/slps/slps.html and http://www.sonyindex.com/ could be helpful too.
Also on the same domain, here's a list of japanese demo/preview discs: http://mirage.just-size.net/slps/taiken.html