mirror of https://github.com/scrapy/scrapy.git synced 2025-02-25 21:03:40 +00:00

1718 Commits

Author SHA1 Message Date
Ismael Carnales
4b5aa30867 minor update to tutorial 2009-08-24 14:34:17 -03:00
Pablo Hoffman
0363040884 doc: added FAQ entry about Accept-Language 2009-08-24 13:56:44 -03:00
Ismael Carnales
85282a4b76 added scrapy commandline scripts doc 2009-08-24 12:02:44 -03:00
Pablo Hoffman
ead8cb9fac removed documentation about ugly DontCloseDomain exception (which will be removed in the future) 2009-08-24 11:57:21 -03:00
Pablo Hoffman
74f706b356 renamed "parse_item" method of XMLFeedSpider to "parse_node", keeping backwards compatibility 2009-08-24 10:54:34 -03:00
Pablo Hoffman
31693eb90f dropped "cache" attribute of Request and Response objects 2009-08-24 10:34:05 -03:00
Pablo Hoffman
79bf4c817a replaced old memoizemethod decorator with a more efficient one (memoizemethod_noargs) 2009-08-24 10:21:04 -03:00
Pablo Hoffman
4f2925420f minor improvements to Response.__repr__ 2009-08-24 09:54:02 -03:00
Pablo Hoffman
b71de57b21 some simplifications to Request and Response classes 2009-08-24 09:47:26 -03:00
Pablo Hoffman
81832773ee ported get_base_url and get_meta_refresh to use WeakKeyDictionary (instead of Request.cache) 2009-08-24 08:58:51 -03:00
Pablo Hoffman
49e11d34c2 switched request_fingerprint to use WeakKeyDictionary for caching (instead of Request.cache) 2009-08-24 08:45:23 -03:00
Pablo Hoffman
0186c6937a HTTP auth middleware: added doc and unittest 2009-08-24 08:07:20 -03:00
Pablo Hoffman
c7e916407d fixed test name 2009-08-24 07:57:00 -03:00
Pablo Hoffman
2cb0cfe7ea simplified some code 2009-08-24 07:29:30 -03:00
Daniel Grana
73e7788ba2 don't try to guess if spider output is iterable for Item and Request objects 2009-08-24 01:25:14 -03:00
Daniel Grana
6e2bd49a81 Host header must include port number when port used for connecting is not default protocol port 2009-08-24 01:16:49 -03:00
Pablo Hoffman
46aaea9f32 doc: improved documentation about debugging leaks 2009-08-23 20:36:00 -03:00
Pablo Hoffman
0ed849248f some improvements to item exporters
- passed previous class attributes to instances attributes
- better handling of constructor arguments
- better coverage on unittests (including encoding)
- updated documentation with new changes
2009-08-23 05:48:35 -03:00
Pablo Hoffman
de89909195 send_catch_log: pass through results from sendRobust 2009-08-22 16:38:25 -03:00
Pablo Hoffman
499dc6677c utils.signal: made send_catch_log function more robust (by using pydispatch.robust.sendRobust) and added unittests 2009-08-22 16:22:31 -03:00
Pablo Hoffman
f204da82d1 disconnecting signal handlers after using them in stats unittests 2009-08-22 16:22:02 -03:00
Pablo Hoffman
825cec51ff fixed bug recently introduced in stats collector closing logic, and added unittests 2009-08-21 23:23:46 -03:00
Pablo Hoffman
7461710ef1 added some missing dots 2009-08-21 21:54:10 -03:00
Pablo Hoffman
9635a7839c rearranged documentation into a better organization
--HG--
rename : docs/topics/index.rst => docs/index.rst
2009-08-21 21:49:54 -03:00
Pablo Hoffman
4761e0c88d minor doc correction 2009-08-21 19:11:59 -03:00
Pablo Hoffman
012fd1cc2f moved api-stability.rst doc to root and updated it
--HG--
rename : docs/misc/api-stability.rst => docs/api-stability.rst
2009-08-21 16:29:23 -03:00
Pablo Hoffman
07c1f3b8d8 updated ugly argument name 2009-08-21 16:13:25 -03:00
Ismael Carnales
5050787380 removed spider templates from project, added subcommands to manage templates in genspider 2009-08-21 16:10:45 -03:00
Pablo Hoffman
ef6f04eb06 moved doc about debugging memory leaks to its own topic and added doc about trackref module 2009-08-21 16:07:16 -03:00
Pablo Hoffman
4980eb4968 added titles to signals doc 2009-08-21 15:07:52 -03:00
Pablo Hoffman
daea3f8a0c sphinx docs: replaced custom :exception: xref by standard :exc: 2009-08-21 15:05:06 -03:00
Ismael Carnales
e2b8817c33 updated project templates to new item 2009-08-21 14:21:22 -03:00
Ismael Carnales
c08d3aa9cc updated tutorial to use new items api 2009-08-21 14:16:27 -03:00
Pablo Hoffman
37760eb13f improved consistency of logging settings to use LOG_* 2009-08-21 08:54:12 -03:00
Ismael Carnales
f0689a9ee8 fixed error in link extractors doc, thanks tarasm 2009-08-21 08:34:53 -03:00
Daniel Grana
bdc0ec6863 remove undefined variable from image pipeline 2009-08-20 20:30:08 -03:00
Pablo Hoffman
e7e1cba4a8 updated some docstrings 2009-08-20 18:39:04 -03:00
Pablo Hoffman
41f1972e47 removed unused TRACE log level and improved logging documentation 2009-08-20 18:17:48 -03:00
Pablo Hoffman
297a8ac76b moved caching resolver to an extension in contrib.resolver 2009-08-20 17:37:46 -03:00
Pablo Hoffman
a767b7044e removed old blocking caching DNS resolver and replaced by a non-blocking one installed as the default reactor resolver
--HG--
rename : scrapy/core/downloader/dnscache.py => scrapy/core/downloader/resolver.py
2009-08-20 17:11:12 -03:00
Pablo Hoffman
cbed0f1ea6 moved send_catch_log to new scrapy.utils.signal module 2009-08-20 16:02:40 -03:00
Pablo Hoffman
8fad0a5744 fixed bug with defer_fail rename 2009-08-20 15:33:35 -03:00
Pablo Hoffman
eeac040fce minor docstring update 2009-08-20 14:40:37 -03:00
Pablo Hoffman
119bb11090 removed unused chain_deferred function, renamed defer_fail to defer_failed 2009-08-20 14:37:38 -03:00
Pablo Hoffman
b763350340 removed unused module: scrapy.contrib_exp.history 2009-08-20 14:29:39 -03:00
Pablo Hoffman
b65c87cb67 removed unused module: scrapy.utils.c14n 2009-08-20 14:23:49 -03:00
Pablo Hoffman
5de409a6a0 removed unused module: scrapy.tests.serialization 2009-08-20 14:20:45 -03:00
Pablo Hoffman
20e82335e2 rename some exporter methods and complete exporter tests refactoring 2009-08-20 14:09:24 -03:00
Ismael Carnales
6b7162f3ca updated JsonLinesItemExporter to new exporters API 2009-08-20 12:58:10 -03:00
Pablo Hoffman
3152e66fec deprecate domain_open signal and handle stats domain open/close directly from the engine 2009-08-20 10:54:58 -03:00