1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-26 12:03:40 +00:00

3329 Commits

Author SHA1 Message Date
Jordi Llonch
5b118ff4ab added documentation (experimental feature) 2013-03-06 06:36:23 +11:00
Jordi Llonch
d9261f6c54 pluggable sub-commands for scrapy comand-line 2013-03-06 05:25:38 +11:00
Pablo Hoffman
19d0942c74 Merge branch 'shell' of git://github.com/stav/scrapy into stav-shell 2013-03-04 02:14:01 -02:00
Pablo Hoffman
3c8eef99cb docs/contributing: added note explaining what Scrapy contrib is 2013-03-04 01:35:17 -02:00
Pablo Hoffman
7dd360f39f Merge pull request #257 from stav/cleanups
doc: fix typo in spider middleware
2013-03-03 19:17:46 -08:00
Steven Almeroth
81111dd39a fetch command should catch IgnoreRequest exception 2013-03-02 20:13:14 -06:00
Steven Almeroth
f62b6660d4 doc: fix typo in spider middleware 2013-03-02 19:46:31 -06:00
Pablo Hoffman
d5d944fa44 Log overriden Scrapy settings when Scrapy starts.
This is useful for debugging, to quickly find out which settings where
used for a specific spider run. dict settings are omitted for brevity.
DEBUG level is used for consistency with "Enabled
extensions/middlewares" lines (which share a similar purpose).
2013-02-28 11:35:11 -02:00
Pablo Hoffman
8f3a509d44 remove debugging code 2013-02-27 03:52:55 -02:00
Pablo Hoffman
3a6b80259d arg_to_iter: replaced double call to isinstance with a single one. refs #248 2013-02-27 03:51:23 -02:00
Pablo Hoffman
2bbd92742b arg_to_iter: treat items the same way as dicts (ie. non iterables). fixes #248 2013-02-27 02:39:31 -02:00
Pablo Hoffman
7400ceb1ed added 502 to RETRY_HTTP_CODES 2013-02-22 19:12:59 -02:00
Pablo Hoffman
e3f50b97b2 made scrapy/item.py pep8 compliant 2013-02-20 11:38:50 -02:00
Pablo Hoffman
a038f46859 doc: fixed rst title 2013-02-14 11:11:17 -02:00
Pablo Hoffman
22edc44c6c doc: remove links to diveintopython.org, which is no longer available. closes #246 2013-02-14 11:09:40 -02:00
Pablo Hoffman
aeb7fbe221 Merge pull request #240 from llonchj/spider_log
spider.log to pass keyword arguments into twisted log.msg
2013-02-12 18:13:26 -08:00
Pablo Hoffman
07669947c7 Merge pull request #244 from morty/master
scrapy genspider allows you to create a spider module with the same name as the project
2013-02-12 18:11:49 -08:00
Tom Mortimer-Jones
2efd859525 Added check so that genspider cannot create a spider with the same name as the project. 2013-02-12 12:26:38 +00:00
Pablo Hoffman
1ff8b4f831 updated release notes with previous commit 2013-02-12 00:59:25 -02:00
Rolando Espinoza La fuente
5971333a93 added --pdb command option to enable pdb debugger on failure. 2013-02-11 12:34:29 -04:00
Pablo Hoffman
d9043ffa6e reverted previous changed as it broke tests 2013-02-11 11:43:25 -02:00
Pablo Hoffman
2df010a972 Merge pull request #241 from zuhao/master
Fix url_has_any_extension bug
2013-02-11 05:29:14 -08:00
Zuhao Wan
27ca25472f Fix url_has_any_extension bug 2013-02-11 17:19:31 +08:00
Jordi Llonch
6d4b764f29 spider.log to pass keyword arguments into twisted log.msg 2013-02-11 05:00:06 +11:00
Daniel Graña
c0a7040f16 add PHONYs so build does not match build/ path 2013-02-08 15:07:00 -02:00
Daniel Graña
910effd145 get scrapy version from package data 2013-02-06 11:44:26 -02:00
Daniel Graña
79cb0318d0 update travis-ci conf 2013-02-06 10:36:08 -02:00
Daniel Graña
5db45b3825 remove scrapyd, it was migrated to its own repository 2013-02-06 05:24:07 +00:00
Daniel Graña
33ca295129 Merge pull request #235 from whodatninja/patch-1
Fix typo labeling attrs type bool instead of list
2013-02-05 17:27:10 -08:00
whodatninja
8e3b5baac5 Fix typo labeling attrs type bool instead of list 2013-02-05 15:10:41 -05:00
Daniel Graña
f293d08e7a Merge pull request #234 from darkrho/httpcache-dbm-module-fullpath
allow to use full path in HTTPCACHE_DBM_MODULE setting
2013-02-01 09:01:27 -08:00
Rolando Espinoza La fuente
d00e6cf1d7 tests: dummydbm.error should be KeyError. 2013-02-01 12:52:59 -04:00
Rolando Espinoza La fuente
6ed44a3a67 httpcache: added tests for custom dbm module.
And removed the break in the line to improve readability as there
is no strict 80-chars line width convention.
2013-02-01 12:41:52 -04:00
Rolando Espinoza La fuente
a2602df90b httpcache: allow to import submodules within packages as HTTPCACHE_DBM_MODULE setting. 2013-01-31 23:49:36 -04:00
Daniel Graña
e5edb8ec3e Merge branch 'dangra/issue-24' 2013-01-30 16:55:53 -02:00
Daniel Graña
3af240f5f9 Merge branch 'dangra/issue-12' 2013-01-30 16:55:16 -02:00
Daniel Graña
0d3e4b4c43 do not unquote slash and question mark in url paths. fix #24 2013-01-30 11:22:03 -02:00
Daniel Graña
872a22df68 pep8ize sgml link extractors 2013-01-29 18:08:32 -02:00
Daniel Graña
cc69b3aa4c Fix #199 encoding error concatenating link text 2013-01-29 17:56:34 -02:00
Daniel Graña
95fde0a498 register namespaces when XMLFeedSpider uses iternodes mode. fixes #12 2013-01-29 12:00:13 -02:00
Daniel Graña
8e77f27897 Find form nodes in invalid html5 documents
lxml fails to parse invalid html5 documents
This error was reported in scrapy/loginform#3
2013-01-24 17:47:53 -02:00
Daniel Graña
ff04480675 fix exporting nested items as xml. fixes #66 2013-01-24 14:57:53 -02:00
Pablo Hoffman
71d7df9273 simplify InitSpider implementation, fixing one bug (closes #228) and adding support generators return from start_requests() (which the previous version didn't) 2013-01-23 16:28:41 -02:00
Daniel Graña
3cf7f4975b Add 0.16.4 to release notes
Conflicts:
	docs/news.rst
2013-01-23 11:29:38 -02:00
Daniel Graña
c40f947dc1 Merge pull request #229 from christilden/master
fixes spelling errors in documentation
2013-01-22 20:27:40 -08:00
Chris Tilden
aae6aed4fb fixes spelling errors in documentation 2013-01-22 14:52:18 -08:00
Pablo Hoffman
27583922a7 remove unused imports 2013-01-21 14:01:12 -02:00
Pablo Hoffman
65258e3621 Merge pull request #227 from tonal/correct-old-cache
Correct init bag for load FilesystemCacheStorage from old location
2013-01-21 04:57:19 -08:00
Alexandr N Zamaraev (aka tonal)
2c51266a40 Correct init bag for load old scrapy.contrib.httpcache.FilesystemCacheStorage 2013-01-21 13:54:36 +07:00
Pablo Hoffman
6ab8afb992 improve documentation about removing namespaces 2013-01-18 12:35:30 -02:00