1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-27 13:25:53 +00:00

3406 Commits

Author SHA1 Message Date
Daniel Graña
cdecc760ee default httpcache to rfc2616 policy and improve storage and policy tests 2013-01-04 03:43:25 -02:00
Daniel Graña
0a5586fafd move FilesystemCacheStorage to scrapy.contrib.httpcache 2013-01-03 09:23:18 -02:00
Pablo Hoffman
9f003a73da Merge pull request #217 from Mimino666/item-processing-errormsg
Fixed error message formatting.
2013-01-02 21:04:40 -08:00
Michal Danilak
2cfbd13c17 Added "exclude" parameter testing to unittests. 2013-01-03 02:02:03 +01:00
Michal Danilak
035d1e99a9 Added model validation to DjangoItem. 2013-01-03 01:52:53 +01:00
Michal Danilak
8ea89b277c Fixed error message formatting.
log.err() doesn't support cool formatting and when error occured, the message was:
	"ERROR: Error processing %(item)s"
2013-01-02 22:55:04 +01:00
Pablo Hoffman
ea0967562e Merge pull request #216 from Mimino666/shell-spider-option
Added --spider option to "shell" command.
2013-01-02 09:15:15 -08:00
Daniel Graña
a6ef76ed88 lint and improve images pipeline error logging 2013-01-02 14:14:37 -02:00
Michal Danilak
ea68250d77 Added --spider option to "shell" command. 2013-01-02 17:13:37 +01:00
Pablo Hoffman
e9c5b76242 Merge pull request #215 from kuyan/patch-1
Fixed typo
2013-01-02 06:21:32 -08:00
Natan L
d572f8945e Fixed typo
'persitent' --> 'persistent'
2012-12-31 11:14:01 -08:00
Pablo Hoffman
1aa25cdff2 Merge pull request #214 from tonal/log-level-dropped-item
Make LogFormatter return the log level (and require it)
2012-12-29 19:21:54 -08:00
Alexandr N Zamaraev (aka tonal)
71b071ffeb Log level return from LogFormatter methods 2012-12-29 11:43:41 +07:00
Pedro Faustino
cf5f0203b7 Instead of extending from HttpCachePolicy, following the same approach used for storage selection 2012-12-28 16:11:47 +01:00
Pablo Hoffman
ba257db6e2 Merge pull request #213 from tonal/forget-import
Remove firget imports
2012-12-28 06:42:40 -08:00
Pedro Faustino
492831fc6f Merge branch 'master' of git://github.com/scrapy/scrapy into http-cache-middleware 2012-12-28 15:27:45 +01:00
Pedro Faustino
3e31d06872 Implement single HTTP cache policy 2012-12-28 13:28:35 +01:00
Pedro Faustino
63d0b9f8c8 Remove plural from stat key. 2012-12-28 11:06:04 +01:00
Alexandr N Zamaraev (aka tonal)
3d397de0e9 Possible set log-level in LogFormatter.dropped and LogFormatter.scraped 2012-12-28 13:40:27 +07:00
Alexandr N Zamaraev (aka tonal)
21de68b766 Remove firget imports 2012-12-28 13:01:39 +07:00
Hasnain Lakhani
93a1102189 Implemented policies for HTTP Cache 2012-12-26 16:29:48 -08:00
Pablo Hoffman
51b8feb4ce fixed doc typos 2012-12-26 16:16:53 -02:00
Pablo Hoffman
1e2ee76df2 add documentation topics: Broad Crawls & Common Practies 2012-12-26 14:02:13 -02:00
Pedro Faustino
fdaa35f6e8 Updated the downloader middleware documentation to reflect changes introduced by the support for real HTTP caching. 2012-12-24 19:37:53 +01:00
Pedro Faustino
b2d3f4dd1b Add cache storage support for real HTTP caching. Add real HTTP caching unit tests for both middleware and cache storage. 2012-12-24 16:15:04 +01:00
Pedro Faustino
0e435fb5f9 Add middleware support for the 'no-store' Cache-Control directive. 2012-12-24 13:27:13 +01:00
Pedro Faustino
bb55f39aed Add middleware support for real HTTP caching. Add httpcache setting to allow either real or dummy HTTP caching (for backwards compatibility it's set to use dummy cache by default). 2012-12-24 01:52:45 +01:00
Pedro Faustino
e396509f5b Removing plural from httpcache stats' value names. 2012-12-24 00:24:27 +01:00
Pablo Hoffman
dabb064293 fix bug in scrapy parse command when spider is not specified explicitly. closes #209 2012-12-20 11:59:07 -02:00
Pablo Hoffman
12475fccbe Merge pull request #206 from dangra/downloader-enhancements
AutoThrottle and Downloader enhancements
2012-12-17 10:11:46 -08:00
Daniel Graña
22559833e7 core: drop download inactive slots and get slot key from meta 2012-12-17 16:09:01 -02:00
Daniel Graña
eb2d87259f core: move download slot assignment post middleware evaluation 2012-12-17 16:09:01 -02:00
Daniel Graña
d7daf836d5 Altering delay is enough to auto throttle downloads 2012-12-17 16:08:49 -02:00
Daniel Graña
85c9881724 fix pylint and pep8 warnings 2012-12-14 12:27:53 -02:00
Pablo Hoffman
0bc5e4f9ec Merge pull request #204 from LuanP/patch-1
Update docs/topics/commands.rst
2012-12-10 16:19:25 -08:00
Luan
5582ea28ec Update docs/topics/commands.rst
A short change.
2012-12-10 15:16:02 -02:00
Daniel Graña
0cc138c010 Add 0.16.3 release notes 2012-12-07 18:55:15 -02:00
Daniel Graña
e676ad31d0 Support sending requests trough multiples slots in QPS exmaple spider 2012-12-05 17:06:38 -02:00
Daniel Graña
fd58de493c Remove concurrency limitation when using download delays and still ensure inter-request delays are enforced 2012-12-05 15:08:34 -02:00
Daniel Graña
dcc0a540c0 QPS bench server and a useful spider spider to generate requests 2012-12-05 15:08:34 -02:00
Daniel Graña
a958fb11f0 Merge pull request #202 from saxicek/master
Image pipeline error improvement
2012-12-03 03:06:47 -08:00
Libor Nenadál
5f899aa4ec add error details when image pipeline fails 2012-12-02 23:29:26 +01:00
Pablo Hoffman
9c9a18b3a3 Merge pull request #201 from alexcepoi/test-fixes-mac
improve mac os compatibility
2012-12-02 07:37:53 -08:00
Alex Cepoi
fc405e98aa improve mac os compatibility
Highlights:
* FifoDiskQueue: mixing buffered version of seek with unbuffered version
  of read causes problems
* BSD's find does not default to current directory
* gdbm needs to be closed before it can reopen the same file
* skip PIL tests if jpeg support is not available
2012-12-01 16:39:58 +01:00
Pablo Hoffman
b9a96147ed setup.py: use README.rst to populate long_description 2012-11-25 22:22:33 -02:00
Pablo Hoffman
39274a2457 doc: removed obsolete references to ClientForm 2012-11-23 19:06:47 -02:00
Pablo Hoffman
8ca2ee3d60 Merge pull request #196 from stav/master
The default storage backend is now DbmCacheStorage
2012-11-22 13:02:15 -08:00
stav
99f164fc87 correct docs for default storage backend 2012-11-22 14:05:47 -06:00
Pablo Hoffman
1f0d167037 doc: removed broken proxyhub link from FAQ 2012-11-22 15:10:26 -02:00
Pablo Hoffman
0a00e0fd63 Merge pull request #195 from kalessin/floatdelay
download delay in autothrottle was being casted as int, should be float
2012-11-15 11:18:45 -08:00