1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-03-01 14:47:30 +00:00

2037 Commits

Author SHA1 Message Date
Pablo Hoffman
180c091fb2 Fixed encoding issue (reported in #135) when the encoding declared in the HTTP header is unknown. This is the patch proposed by Rolando, with an update to the Request/Response documentation. 2010-02-24 14:01:29 -02:00
Pablo Hoffman
bbef0fe870 Automated merge with http://hg.scrapy.org/users/rolando/scrapy/ 2010-02-20 11:12:37 -02:00
Rolando Espinoza La fuente
7b1ad321e3 examples/experimental: added imdb top movies spider 2010-02-19 21:31:17 -04:00
Pablo Hoffman
cb99edd153 simplified and improved AUTHORS file 2010-02-19 23:16:55 -02:00
Pablo Hoffman
a3d22c7240 Automated merge with http://hg.scrapy.org/scrapy-0.8/ 2010-02-19 23:11:24 -02:00
Pablo Hoffman
60961e5499 minor documentation fix (refs #135) 2010-02-19 23:09:48 -02:00
Pablo Hoffman
c1f8198639 Added RANDOMIZE_DOWNLOAD_DELAY setting 2010-02-19 21:53:18 -02:00
Rolando Espinoza La fuente
4a053a762f examples/experimental: added gooledir crawler 2010-02-19 18:28:16 -04:00
Rolando Espinoza La fuente
a6a3f085a7 docs: added crawlspider v2 outline documentation
Sign-Off: Rolando Espinoza La fuente
2010-02-19 18:22:38 -04:00
Rolando Espinoza La fuente
17d1543929 contrib_exp: added crawlspider v2 package + tests
Sign-Off: Rolando Espinoza La fuente
2010-02-19 18:19:01 -04:00
Rolando Espinoza La fuente
7ddd4441e3 utils.python: added equal_attributes() to compare two objects arbitrary attributes
Sign-Off: Rolando Espinoza La fuente
2010-02-19 17:57:48 -04:00
Rolando Espinoza La fuente
7235040936 merged upstream 2010-02-19 17:41:45 -04:00
Pablo Hoffman
23fcf48a89 Automated merge with http://hg.scrapy.org/scrapy-0.8/ 2010-02-19 16:34:01 -02:00
Pablo Hoffman
53dfc4d3dd fixed bug which was causing the DOWNLOAD_DELAY setting to be ignored (the spider download_delay attribute was working though) 2010-02-19 16:32:30 -02:00
Pablo Hoffman
a67c389728 Automated merge with http://hg.scrapy.org/scrapy-0.8/ 2010-02-19 15:44:23 -02:00
Pablo Hoffman
51faec5dcd fixed bug which was considering DOWNLOAD_DELAY as an int setting, where it should be a float 2010-02-19 15:42:54 -02:00
Daniel Grana
8dc95bf105 Automated merge with ssh://hg.scrapy.org/scrapy-0.8 2010-02-18 16:52:45 -02:00
Daniel Grana
91f4d6dc51 docs: adds another spider example that yields multiples requests/items from a single callback 2010-02-18 16:51:05 -02:00
Pablo Hoffman
d337aeb7e7 Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-31 18:11:43 -02:00
Pablo Hoffman
57d60eae39 sort settings doc alphabetically by setting name 2010-01-31 18:11:13 -02:00
Pablo Hoffman
843b371968 Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-28 10:56:49 -02:00
Pablo Hoffman
b1c2756778 added spiders to some error logging in Engine and Scraper core components 2010-01-28 10:56:04 -02:00
Pablo Hoffman
407f0671c1 Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-18 18:17:08 -02:00
Pablo Hoffman
67858af83c fixed doc typo 2010-01-18 18:16:58 -02:00
Pablo Hoffman
49ac3bd2d2 Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-17 05:20:23 -02:00
Pablo Hoffman
496db55518 added some encoding aliases not provided in Python by default - fixes #130 2010-01-17 05:19:51 -02:00
Pablo Hoffman
c0da2edf6f Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-13 15:52:06 -02:00
Pablo Hoffman
08eeaf98a2 fixed description of LOG_STDOUT setting 2010-01-13 15:51:08 -02:00
Pablo Hoffman
8e5a3baf1e Automated merge with http://hg.scrapy.org/scrapy-0.8 2010-01-13 12:21:26 -02:00
Pablo Hoffman
48739ae60c install.rst: added explanation about why libxml2 2.6.28 or above is required 2010-01-13 12:20:24 -02:00
Rolando Espinoza La fuente
1402da31c5 docs: fixed typos and updated code examples 2010-01-11 12:28:22 -04:00
Rolando Espinoza La fuente
7bbc14dd15 templates: updated code 2010-01-11 12:26:43 -04:00
Pablo Hoffman
e7ef2eba1e Automated merge with http://hg.scrapy.org/scrapy-0.8 2009-12-13 14:24:10 -02:00
Pablo Hoffman
d60412ce19 titlecased Scrapy easy_install and some fixes to sign_release.sh script 2009-12-13 14:23:31 -02:00
Pablo Hoffman
34971ea6e7 bumped version to 0.9-dev 2009-12-12 18:15:18 -02:00
Pablo Hoffman
8a801e7166 Added tag 0.8 for changeset eef0b17d8752 2009-12-12 18:02:42 -02:00
Pablo Hoffman
980ca75d0d Removed tag 0.8-rc1 0.8 2009-12-12 18:02:25 -02:00
Pablo Hoffman
cd9a2d8edb removed rc1 from version 0.8 (good enough for a stable release) 2009-12-12 17:57:20 -02:00
Pablo Hoffman
422d6facb2 Automated merge with http://hg.scrapy.org/scrapy-stable 2009-12-12 16:52:07 -02:00
Pablo Hoffman
9d50604d24 added |version| to documentation title 2009-12-12 16:51:59 -02:00
Pablo Hoffman
b6493f8bb8 Added tag 0.8-rc1 for changeset 22de0cc4b778 2009-12-12 16:15:49 -02:00
Pablo Hoffman
1fda0e78d5 removed (pretty useless) build_release.sh script - see http://dev.scrapy.org/wiki/ScrapyReleaseProcedure 2009-12-12 15:54:52 -02:00
Pablo Hoffman
024c00a512 title-cased project name in setup.py 2009-12-12 15:48:02 -02:00
Pablo Hoffman
a953efd8e5 Automated merge with http://hg.scrapy.org/scrapy-stable 2009-12-12 15:40:16 -02:00
Pablo Hoffman
f22fe95ae9 changed version to 0.8-rc1 2009-12-12 15:31:03 -02:00
Pablo Hoffman
3d6f598abc added minor delay to httpcache tests, for them to pass on win32 2009-12-12 15:30:21 -02:00
Pablo Hoffman
9c31fc447a Changed character used in unit test because it's not a valid file name char on win32 2009-12-12 11:07:50 -02:00
Pablo Hoffman
3012030b2f Fixed bug in file:// downloader handler with uris containing percent-escaped chars 2009-12-12 10:57:17 -02:00
Pablo Hoffman
cca4be4d64 uncommented some lines in extras/build_release.sh 2009-12-11 23:52:53 -02:00
Ismael Carnales
4ecc909bc1 Fix RobotsTxtMiddleware reference in doc 2009-12-04 15:37:24 -02:00