1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-28 08:24:02 +00:00

3649 Commits

Author SHA1 Message Date
Pablo Hoffman
4ac0eea55d Merge pull request #316 from nramirezuy/closespider/errorcount
standarizes close_spider to use signals
2013-05-31 12:20:58 -07:00
nramirezuy
b61a4e719c standarizes close_spider to use signals 2013-05-31 15:54:17 -03:00
Daniel Graña
b4fca90bba merge 0.16.5 release notes 2013-05-30 18:49:00 -03:00
Daniel Graña
58dc16f48d obey request method when scrapy deploy is redirected to a new endpoint 2013-05-30 18:34:59 -03:00
Pablo Hoffman
0effe139e5 Merge pull request #311 from nramirezuy/logstats-6017
removed multispider support from logstats
2013-05-30 08:47:33 -07:00
nramirezuy
14e382ae48 test_closespider fixed for python 2.6 2013-05-30 10:57:06 -03:00
Pablo Hoffman
461eace4af Merge pull request #313 from nramirezuy/closespider-6017
drop multi-spider support from CloseSpider extension
2013-05-30 06:34:52 -07:00
nramirezuy
971ef5c78e removed multispider support, test added 2013-05-28 17:39:37 -03:00
Pablo Hoffman
2bb2ba66f6 simplify scrapy.tests.mockserver 2013-05-28 15:12:38 -03:00
Pablo Hoffman
f35266e6a5 use os.pathsep 2013-05-28 15:10:05 -03:00
nramirezuy
96b8fb69be removed multispider support from logstats 2013-05-28 14:45:02 -03:00
Pablo Hoffman
98aa3efea7 Merge pull request #312 from nramirezuy/cmd/bech/test
added test for bench command
2013-05-28 08:46:31 -07:00
nramirezuy
1ed5643eb2 added test for bench command 2013-05-28 12:42:50 -03:00
Pablo Hoffman
a4b5bfbb5e Merge pull request #309 from amferraz/patch-1
Add FAQ entry referencing Request.meta usage
2013-05-27 09:07:48 -07:00
cacovsky
8007762890 Add FAQ entry referencing Request.meta usage 2013-05-27 13:02:17 -03:00
Nicolás Alejandro Ramírez Quiros
7874eef7af Merge pull request #307 from DeaconDesperado/raise-format
Raise a usage error when an invalid or unrecognized output format is entered on Command line
2013-05-23 12:50:17 -07:00
Mark Grey
485a954571 raise on unrecognized format 2013-05-23 15:11:45 -04:00
Pablo Hoffman
845c64b89d add benchmarking to 0.18 release notes 2013-05-17 10:38:42 -03:00
Pablo Hoffman
ca12886acb update copyright notes 2013-05-16 15:05:52 -03:00
Pablo Hoffman
8e49fed918 minor improvements to benchmarking doc 2013-05-16 13:23:13 -03:00
Pablo Hoffman
76087e336a add scrapy bench command for benchmarking, with documentation 2013-05-16 13:15:25 -03:00
Pablo Hoffman
5c40741d65 added context manager for mock server, moved test spiders into a separate module (scrapy.tests.spiders) 2013-05-16 13:01:02 -03:00
Pablo Hoffman
214bcdf3be Merge pull request #301 from tpeng/fix_telnet_doc
change manager in telnet variable to crawler
2013-05-08 07:00:52 -07:00
tpeng
edbe525d19 change manager in telnet variable to crawler 2013-05-08 11:58:15 +08:00
Pablo Hoffman
99f8d5733a added mock tests for retries (more to come) 2013-05-06 20:04:15 -03:00
Pablo Hoffman
a7b7c89216 improved logging of downloader errors, when they contain tracebacks 2013-05-06 20:03:54 -03:00
Pablo Hoffman
db195ee4df added more methods to mockserver 2013-05-06 20:03:17 -03:00
Pablo Hoffman
c2561ca160 use get_testenv() shortcut instead of get_pythonpath() 2013-05-06 17:35:09 -03:00
Pablo Hoffman
14e7121674 make sure the mockserver forked from tests uses the current installation of scrapy, not the system one 2013-05-06 17:27:59 -03:00
Pablo Hoffman
cf4d4bc094 added mock server test for DOWNLOAD_TIMEOUT 2013-05-06 14:47:24 -03:00
Daniel Graña
495acba223 agents requires an instance of contextFactory 2013-05-06 12:56:13 -03:00
Daniel Graña
3e62ce35a0 cleanup http connection pool on engine stop 2013-05-06 12:56:13 -03:00
Daniel Graña
334ad71a2e adapt for singletons removals 2013-05-06 12:56:13 -03:00
Daniel Graña
ba6545555f empty bodies does not require a body producer 2013-05-06 12:56:13 -03:00
Daniel Graña
db232da068 close pool connections before finishing tests 2013-05-06 12:56:13 -03:00
Daniel Graña
b34606030d remove duplicate context factory handling for non-ssl support in http1.0 2013-05-06 12:56:13 -03:00
Daniel Graña
0a26170086 move ssl context factory to its own module and implement a non-ssl version that warns about pyopenssl support 2013-05-06 12:52:58 -03:00
Daniel Graña
ab3407289c enable persistent connections 2013-05-06 12:50:14 -03:00
Daniel Graña
e4fe7c63b0 add http connection pool and custom ssl context factory 2013-05-06 12:50:13 -03:00
Daniel Graña
a7a354f982 http11 cleanup 2013-05-06 12:45:27 -03:00
paul
ef03603869 Restore handling of HTTPS 2013-05-06 12:45:27 -03:00
paul
46341d5275 Renamed downloader to Http11DownloadHandler and some refactoring
Only for HTTP, not HTTPS
Test on expected body length instead of request method (HEAD case)
2013-05-06 12:45:27 -03:00
paul
4018d25a9b Use twisted.web.client.Agent for download requests (use of HTTP/1.1)
Adds http11.HttpDownloadHandler in scrapy.core.downloader.handlers
2013-05-06 12:45:27 -03:00
Pablo Hoffman
66311db23e mention crawlera in best practices, as a way to deal with bans 2013-05-04 18:20:23 -03:00
Pablo Hoffman
e1f4144391 removed obsolete (and broken) downloader method: is_idle() 2013-04-29 13:04:40 -03:00
Daniel Graña
2a1a4477d3 Merge pull request #297 from scrapy/downloader-gc
Add garbage collector to downloader
2013-04-29 07:38:27 -07:00
Pablo Hoffman
af5c13fa14 Add garbage collector to downloader
This fixes a couple of issues:
- reactor callLater leaks when using download delay (test was
  re-enabled)
- downloader slot leaking on broad crawls (slots were created but never
  removed)
2013-04-29 10:16:30 -03:00
Pablo Hoffman
36ee36000e bind mockserver to 0.0.0.0, to listen on all 127.* range (useful for testing broad crawls) 2013-04-27 04:17:19 -03:00
Pablo Hoffman
9361c89573 remove scrapyd doc, as it was moved to its own repo 2013-04-27 04:15:42 -03:00
Daniel Graña
5ba2b60a4b fix broken doctests 2013-04-25 11:56:56 -03:00