2010-01-03, 02:56
just noticed today that i couldn't scrape any movies from my ubuntu system. looking at the logs i get:
Code:
16:51:57 T:140189498325328 M:835964928 DEBUG: FileCurl::Open(0x7fffd6390d08) http://akas.imdb.com/find?s=tt;q=the%20warlords%20(2007)
16:51:57 T:140189498325328 M:835964928 INFO: easy_aquire - Created session to http://akas.imdb.com
16:51:57 T:140189498325328 M:835727360 DEBUG: FillBuffer: curl failed with code 22
attempting a wget:
Code:
$ wget 'http://akas.imdb.com/find?s=tt;q=the%20warlords%20(2007)'
--2010-01-02 16:52:32-- http://akas.imdb.com/find?s=tt;q=the%20warlords%20(2007)
Resolving akas.imdb.com... 72.21.206.70
Connecting to akas.imdb.com|72.21.206.70|:80... connected.
HTTP request sent, awaiting response... 403 Forbidden
2010-01-02 16:52:33 ERROR 403: Forbidden.
but doing a wget with -U:
Code:
$ wget -U 'Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.14) Gecko/20080418 Ubuntu/7.10 (gutsy) Firefox/2.0.0.14' 'http://akas.imdb.com/find?s=tt;q=the%20warlords%20(2007)'
--2010-01-02 16:54:33-- http://akas.imdb.com/find?s=tt;q=the%20warlords%20(2007)
Resolving akas.imdb.com... 207.171.166.140
Connecting to akas.imdb.com|207.171.166.140|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: `find?s=tt;q=the warlords (2007)'
[ <=> ] 42,356 69.8K/s in 0.6s
2010-01-02 16:54:34 (69.8 KB/s) - `find?s=tt;q=the warlords (2007)' saved [42356]
also wget works fine on my mac...
is there an option somewhere to override the user-agent string that the xbmc imdb scraper sends?
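for anyone who wants to reproduce the effect without hitting imdb: curl error 22 is CURLE_HTTP_RETURNED_ERROR (the server answered with an HTTP status of 400 or above), which fits the 403 that wget sees. below is a rough python sketch of what seems to be going on, not anything taken from xbmc itself. it starts a toy local server that rejects any User-Agent not starting with "Mozilla/" (that rule is only my assumption about what the site does), then requests the same path with the default agent and with the firefox string from above. `UAFilterHandler`, `fetch_status` and `run_demo` are made-up names for illustration.

```python
import threading
import urllib.error
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Firefox User-Agent string copied from the wget -U test above.
BROWSER_UA = ("Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.14) "
              "Gecko/20080418 Ubuntu/7.10 (gutsy) Firefox/2.0.0.14")


class UAFilterHandler(BaseHTTPRequestHandler):
    """Toy stand-in for the remote site (assumed behaviour, not IMDb's real rule)."""

    def do_GET(self):
        ua = self.headers.get("User-Agent", "")
        # Assumption: anything that does not look like a browser gets 403.
        status = 200 if ua.startswith("Mozilla/") else 403
        self.send_response(status)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, fmt, *args):
        pass  # silence per-request logging


def fetch_status(url, user_agent=None):
    """Return the HTTP status code for url, optionally overriding User-Agent."""
    headers = {"User-Agent": user_agent} if user_agent else {}
    req = urllib.request.Request(url, headers=headers)
    # Empty ProxyHandler: ignore proxy environment variables for this local test.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(req, timeout=5) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses are raised as HTTPError


def run_demo():
    """Start the toy server, fetch once per User-Agent, return both statuses."""
    server = HTTPServer(("127.0.0.1", 0), UAFilterHandler)  # port 0 = any free port
    threading.Thread(target=server.serve_forever, daemon=True).start()
    url = "http://127.0.0.1:%d/find?s=tt;q=test" % server.server_port
    try:
        return fetch_status(url), fetch_status(url, BROWSER_UA)
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(run_demo())  # prints (403, 200)
```

pointing `fetch_status` at a real url instead of the local one is a quick way to check from another machine whether a given user-agent string is the thing being blocked.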