Posts: 238
Joined: Nov 2008
Reputation:
0
This might sound like a very uneducated question, but how and with what markup language should one write a website so that it's;
a) easy to scrape, well organized..
b) one is still able to style it, whether it's with css or xls or...
I don't need/except some to give me a complete tutorial, how to make, but more some pointers as to which markup I should look into.
I've tried looking at thetvdb.com and themoviedb.org, but I epically :p fail to understand with what it's been written.
Fiinix Design presents:
Posters, for Movies, TV Shows, Games, Arcade etc.
» Latest Poster-Pack: The Silhouettes, TV Shows
» Upcoming Poster-Pack: To Be Announced
» Game & Emulator Poster, request here
» Movie/TV Genre Poster, here
Posts: 12,706
Joined: Nov 2003
Reputation:
129
spiff
Team-Kodi Member
Posts: 12,706
both of those offer xml based api's.
the only thing needed to make a site scrapeable is a pattern that can be described using regular expressions. repeatability is the key..
Posts: 12,706
Joined: Nov 2003
Reputation:
129
spiff
Team-Kodi Member
Posts: 12,706
my eyes hurt, please drop the upper case. don't see the point of the <header> entries, not that it matters since they can easily be skipped.
the platform tags should be xml'ized, i.e. just have multiple
<platform>
..
</platform>
<platform>
..
</platform>
instead of using img_url then name then img_url then name... much easier to parse and much more xml'ish.
brief overlook only mind you
Posts: 12,706
Joined: Nov 2003
Reputation:
129
spiff
Team-Kodi Member
Posts: 12,706
url format does not matter. you just need to remember, you are storing xml, so you need to escape special chars, in particular;
& -> &
" -> " (prob not relevant in a url)
Posts: 238
Joined: Nov 2008
Reputation:
0
I'll keep that in mind. Yes I'm aware that formatting doesn't matter when storing URL data, but it matters when I want to be able to display it in a browser as well. And that's what I'm having issues with currently, then again, I should probably ask somewhere else for that...
Fiinix Design presents:
Posters, for Movies, TV Shows, Games, Arcade etc.
» Latest Poster-Pack: The Silhouettes, TV Shows
» Upcoming Poster-Pack: To Be Announced
» Game & Emulator Poster, request here
» Movie/TV Genre Poster, here