mirror of https://github.com/scrapy/scrapy.git synced 2025-02-25 11:24:24 +00:00

3278 Commits

Author SHA1 Message Date
Pablo Hoffman
b7eeeff410 get rid of assertDictEqual (since it's python 2.7+ only) 2013-01-17 13:18:23 -02:00
Pablo Hoffman
c31441a273 revert default HTTP cache policy to dummy (instead of RFC2616) 2013-01-17 13:08:29 -02:00
Daniel Graña
897195186a document new FormRequest parameter named formxpath that matches forms using xpath 2013-01-08 18:36:20 -02:00
Daniel Graña
7527ef97ba Merge pull request #185 from notsobad/master
Added xpath support in FormRequest.from_response
2013-01-08 12:34:06 -08:00
Daniel Graña
75563b3f00 Add list of supported and missing RFC2616 caching features 2013-01-08 18:16:44 -02:00
Daniel Graña
3cbc4d0b94 django is an optional_features, its imports must not fail 2013-01-08 17:56:46 -02:00
Daniel Graña
d8a760bf57 Merge branch 'http-cache-middleware'
Conflicts:
	scrapy/contrib/downloadermiddleware/httpcache.py
	scrapy/contrib/httpcache.py
	scrapy/tests/test_downloadermiddleware_httpcache.py
2013-01-08 17:34:48 -02:00
Daniel Graña
864a7aef87 More httpcache updates
* Change default cache policy to RFC2616
* Update HttpCacheMiddleware documentation
* Move policies to scrapy.contrib.httpcache
* remove a lint error for .has_key() usage in DBM storage backend
2013-01-08 17:26:32 -02:00
Daniel Graña
487299e068 TakeFirst doc says it returns first non-null/non-empty value, zero is a valid value. closes #59 2013-01-08 15:47:33 -02:00
Daniel Graña
672d09ea2e add meta-refresh changes to release notes 2013-01-08 12:30:36 -02:00
Daniel Graña
9527c5819a pep8ize settings 2013-01-08 11:48:36 -02:00
Daniel Graña
defc4f89b5 update metarefresh settings 2013-01-08 11:41:19 -02:00
Daniel Graña
6a2b23883a Add MetaRefreshMiddleware docs 2013-01-08 11:25:38 -02:00
Daniel Graña
076ba40404 update DOWNLOADER_MIDDLEWARES_BASE setting documentation 2013-01-08 10:50:27 -02:00
Daniel Graña
71db7f1b25 Split redirection into status and metarefresh middlewares, also changes httpcompression priority. closes #78 2013-01-08 09:59:38 -02:00
Rolando Espinoza La fuente
fe5d0ce2e0 tests: added downloader middleware manager integration tests for gzipped redirection. 2013-01-08 09:59:38 -02:00
Pablo Hoffman
227a1d666b add doc about disabling an extension. refs #132 2013-01-07 13:16:19 -02:00
Pedro Faustino
5d3a4d755f Update downloader middleware documentation 2013-01-06 18:53:14 +00:00
Pedro Faustino
59dc71f394 Merge branch 'http-cache-middleware', remote-tracking branch 'dangra/http-cache-middleware' into http-cache-middleware 2013-01-06 17:53:25 +00:00
Pablo Hoffman
7f990a4af2 Merge pull request #221 from emschorsch/patch-1
Proposed Changes to DjangoItem documentation
2013-01-04 14:17:54 -08:00
Emanuel Schorsch
f9b130da12 Proposed Changes
I was very confused as to how you actually import DjangoItem.
I searched extensively on the internet looking for actual code so I could see how it worked.
I finally found http://blog.just2us.com/2012/07/setting-up-django-with-scrapy/. It is much easier to understand with full files instead of code fragments.
I also edited where it says "we can see that the model is already saved" as I don't see how it's already saved.
2013-01-04 15:59:04 -05:00
Pablo Hoffman
acb7bad1ff Merge pull request #218 from Mimino666/django-item-validation
Django item validation
2013-01-04 10:35:20 -08:00
Daniel Graña
3f03a2ca50 requests with no-cache set must force revalidation of cached responses 2013-01-04 04:20:45 -02:00
Daniel Graña
cdecc760ee default httpcache to rfc2616 policy and improve storage and policy tests 2013-01-04 03:43:25 -02:00
Daniel Graña
0a5586fafd move FilesystemCacheStorage to scrapy.contrib.httpcache 2013-01-03 09:23:18 -02:00
Pablo Hoffman
9f003a73da Merge pull request #217 from Mimino666/item-processing-errormsg
Fixed error message formatting.
2013-01-02 21:04:40 -08:00
Michal Danilak
2cfbd13c17 Added "exclude" parameter testing to unittests. 2013-01-03 02:02:03 +01:00
Michal Danilak
035d1e99a9 Added model validation to DjangoItem. 2013-01-03 01:52:53 +01:00
Michal Danilak
8ea89b277c Fixed error message formatting.
log.err() doesn't support cool formatting and when error occured, the message was:
	"ERROR: Error processing %(item)s"
2013-01-02 22:55:04 +01:00
Pablo Hoffman
ea0967562e Merge pull request #216 from Mimino666/shell-spider-option
Added --spider option to "shell" command.
2013-01-02 09:15:15 -08:00
Daniel Graña
a6ef76ed88 lint and improve images pipeline error logging 2013-01-02 14:14:37 -02:00
Michal Danilak
ea68250d77 Added --spider option to "shell" command. 2013-01-02 17:13:37 +01:00
Pablo Hoffman
e9c5b76242 Merge pull request #215 from kuyan/patch-1
Fixed typo
2013-01-02 06:21:32 -08:00
Natan L
d572f8945e Fixed typo
'persitent' --> 'persistent'
2012-12-31 11:14:01 -08:00
Pablo Hoffman
1aa25cdff2 Merge pull request #214 from tonal/log-level-dropped-item
Make LogFormatter return the log level (and require it)
2012-12-29 19:21:54 -08:00
Alexandr N Zamaraev (aka tonal)
71b071ffeb Log level return from LogFormatter methods 2012-12-29 11:43:41 +07:00
Pedro Faustino
cf5f0203b7 Instead of extending from HttpCachePolicy, following the same approach used for storage selection 2012-12-28 16:11:47 +01:00
Pablo Hoffman
ba257db6e2 Merge pull request #213 from tonal/forget-import
Remove firget imports
2012-12-28 06:42:40 -08:00
Pedro Faustino
492831fc6f Merge branch 'master' of git://github.com/scrapy/scrapy into http-cache-middleware 2012-12-28 15:27:45 +01:00
Pedro Faustino
3e31d06872 Implement single HTTP cache policy 2012-12-28 13:28:35 +01:00
Pedro Faustino
63d0b9f8c8 Remove plural from stat key. 2012-12-28 11:06:04 +01:00
Alexandr N Zamaraev (aka tonal)
3d397de0e9 Possible set log-level in LogFormatter.dropped and LogFormatter.scraped 2012-12-28 13:40:27 +07:00
Alexandr N Zamaraev (aka tonal)
21de68b766 Remove firget imports 2012-12-28 13:01:39 +07:00
Hasnain Lakhani
93a1102189 Implemented policies for HTTP Cache 2012-12-26 16:29:48 -08:00
Pablo Hoffman
51b8feb4ce fixed doc typos 2012-12-26 16:16:53 -02:00
Pablo Hoffman
1e2ee76df2 add documentation topics: Broad Crawls & Common Practies 2012-12-26 14:02:13 -02:00
Pedro Faustino
fdaa35f6e8 Updated the downloader middleware documentation to reflect changes introduced by the support for real HTTP caching. 2012-12-24 19:37:53 +01:00
Pedro Faustino
b2d3f4dd1b Add cache storage support for real HTTP caching. Add real HTTP caching unit tests for both middleware and cache storage. 2012-12-24 16:15:04 +01:00
Pedro Faustino
0e435fb5f9 Add middleware support for the 'no-store' Cache-Control directive. 2012-12-24 13:27:13 +01:00
Pedro Faustino
bb55f39aed Add middleware support for real HTTP caching. Add httpcache setting to allow either real or dummy HTTP caching (for backwards compatibility it's set to use dummy cache by default). 2012-12-24 01:52:45 +01:00