scrapy

mirror of https://github.com/scrapy/scrapy.git synced 2025-02-24 18:44:20 +00:00

Author	SHA1	Message	Date
Pablo Hoffman	12280c2a95	fix sphinx references in doc	2013-09-25 15:13:17 -03:00
Pablo Hoffman	fc388f4636	Make ITEM_PIPELINE setting a dict This is for consistency with how spider and downloader middlewares are defined. ITEM_PIPELINE_BASE was also added and both remain empty. Backwards compatibility is kept (with a warning) with list-based ITEM_PIPELINES.	2013-09-23 17:50:43 -03:00
cacovsky	71b320914a	Update request-response.rst Fix small doc typo (too many backticks)	2013-09-18 11:45:25 -03:00
Pablo Hoffman	86230c0ab8	added quantal & raring to support ubuntu releases	2013-08-22 21:49:55 -03:00
Mikhail Korobov	034ffae60f	Recommend Pillow instead of PIL. Closes GH-317.	2013-08-18 00:44:01 +06:00
Berend Iwema	32b6364bcd	#327 - Support STARTTLS / SSL option in email sender	2013-08-14 12:59:01 +02:00
Rocio Aramberri	d227d530f6	Added COMPRESSION_ENABLED setting to enable or disable the HttpCompressionMiddleware Added COMPRESSION_ENABLE setting to docs Added COMPRESSION_ENABLED setting to default settings	2013-08-01 11:31:28 -03:00
Dan	1ca31244b0	Fixed ordering of super argument call.	2013-07-16 14:50:10 -04:00
Dan	e12b689c4f	Updated documentation of spider arguments to include required super call.	2013-07-16 14:26:53 -04:00
Mikhail Korobov	1a1c93fafe	tiny FormRequest doc fix	2013-07-15 15:47:34 +06:00
Mikhail Korobov	ac2fadf3ab	DownloaderMiddleware.process_response docs fix "returns an exception" -> "raises an exception"	2013-07-08 19:41:58 +06:00
Mikhail Korobov	39e5da5f66	improve docs for DownloaderMiddleware.process_response	2013-07-08 19:17:29 +06:00
Pablo Hoffman	0f4b70f582	remove no deprecated request_scheduled signal It will be replaced by more accurate scheduler signals (proposal will come soon)	2013-06-27 11:23:24 -03:00
nramirezuy	bef8ade956	removed request_received and added request_scheduled	2013-06-26 16:45:46 -03:00
Pablo Hoffman	819b2776dd	Merge pull request #326 from berendiwema/master Include example of how to stop the reactor from script	2013-06-25 13:30:07 -07:00
nramirezuy	83b2774354	remove wrong default httpcache	2013-06-25 17:01:29 -03:00
Berend Iwema	aec314db09	added a bit more documentation on how to close the reactor when running scrapy from a script	2013-06-25 16:08:22 +02:00
Pablo Hoffman	bbde1d0e0b	Merge pull request #275 from stav/doc doc: Response.replace() cannot take meta argument	2013-06-24 11:09:28 -07:00
Capi Etheriel	50fa46d183	Document CrawlSpider.parse_start_urls method	2013-06-09 04:03:20 -03:00
Pablo Hoffman	8e49fed918	minor improvements to benchmarking doc	2013-05-16 13:23:13 -03:00
Pablo Hoffman	76087e336a	add scrapy bench command for benchmarking, with documentation	2013-05-16 13:15:25 -03:00
Pablo Hoffman	66311db23e	mention crawlera in best practices, as a way to deal with bans	2013-05-04 18:20:23 -03:00
Pablo Hoffman	9361c89573	remove scrapyd doc, as it was moved to its own repo	2013-04-27 04:15:42 -03:00
Nicolás Ramírez	6df274bba5	added copy method to item	2013-04-19 13:23:53 -03:00
Pablo Hoffman	96c2332e0e	fix inaccurate downloader middleware documentation. refs #280	2013-04-02 11:35:32 -03:00
Steven Almeroth	70179c7c0c	doc: remove trailing spaces	2013-03-21 13:57:39 -06:00
Steven Almeroth	0d7747d353	doc: Response.replace() cannot take meta argument >>> response.replace(meta={'foo':1}) Traceback (most recent call last): File "<input>", line 1, in <module> File "/srv/scrapy/scrapy-fork/scrapy/scrapy/http/response/text.py", line 45, in replace return Response.replace(self, args, kwargs) File "/srv/scrapy/scrapy-fork/scrapy/scrapy/http/response/__init__.py", line 77, in replace return cls(args, *kwargs) File "/srv/scrapy/scrapy-fork/scrapy/scrapy/http/response/text.py", line 22, in __init__ super(TextResponse, self).__init__(args, **kwargs) TypeError: __init__() got an unexpected keyword argument 'meta'	2013-03-21 13:49:55 -06:00
Pablo Hoffman	2a5c7ed4da	make Crawler.start() return a deferred that is fired when the crawl is finished	2013-03-20 14:48:59 -03:00
Pablo Hoffman	b347c14b5f	update engine status output on telnet console documentation	2013-03-18 19:12:12 -03:00
Shane Evans	5c2a82f1f7	fix typo	2013-03-17 19:34:55 +00:00
Steven Almeroth	f62b6660d4	doc: fix typo in spider middleware	2013-03-02 19:46:31 -06:00
Pablo Hoffman	7400ceb1ed	added 502 to RETRY_HTTP_CODES	2013-02-22 19:12:59 -02:00
Pablo Hoffman	a038f46859	doc: fixed rst title	2013-02-14 11:11:17 -02:00
Pablo Hoffman	22edc44c6c	doc: remove links to diveintopython.org, which is no longer available. closes #246	2013-02-14 11:09:40 -02:00
Daniel Graña	5db45b3825	remove scrapyd, it was migrated to its own repository	2013-02-06 05:24:07 +00:00
whodatninja	8e3b5baac5	Fix typo labeling attrs type bool instead of list	2013-02-05 15:10:41 -05:00
Chris Tilden	aae6aed4fb	fixes spelling errors in documentation	2013-01-22 14:52:18 -08:00
Pablo Hoffman	6ab8afb992	improve documentation about removing namespaces	2013-01-18 12:35:30 -02:00
Pablo Hoffman	1ba04b1fc3	added remove_namespaces() method to XmlXPathSelector objects	2013-01-18 12:20:03 -02:00
Pablo Hoffman	c31441a273	revert default HTTP cache policy to dummy (instead of RFC2616)	2013-01-17 13:08:29 -02:00
Daniel Graña	897195186a	document new FormRequest parameter named `formxpath` that matches forms using xpath	2013-01-08 18:36:20 -02:00
Daniel Graña	75563b3f00	Add list of supported and missing RFC2616 caching features	2013-01-08 18:16:44 -02:00
Daniel Graña	d8a760bf57	Merge branch 'http-cache-middleware' Conflicts: scrapy/contrib/downloadermiddleware/httpcache.py scrapy/contrib/httpcache.py scrapy/tests/test_downloadermiddleware_httpcache.py	2013-01-08 17:34:48 -02:00
Daniel Graña	864a7aef87	More httpcache updates * Change default cache policy to RFC2616 * Update HttpCacheMiddleware documentation * Move policies to scrapy.contrib.httpcache * remove a lint error for .has_key() usage in DBM storage backend	2013-01-08 17:26:32 -02:00
Daniel Graña	defc4f89b5	update metarefresh settings	2013-01-08 11:41:19 -02:00
Daniel Graña	6a2b23883a	Add MetaRefreshMiddleware docs	2013-01-08 11:25:38 -02:00
Daniel Graña	076ba40404	update DOWNLOADER_MIDDLEWARES_BASE setting documentation	2013-01-08 10:50:27 -02:00
Pablo Hoffman	227a1d666b	add doc about disabling an extension. refs #132	2013-01-07 13:16:19 -02:00
Pedro Faustino	5d3a4d755f	Update downloader middleware documentation	2013-01-06 18:53:14 +00:00
Emanuel Schorsch	f9b130da12	Proposed Changes I was very confused as to how you actually import DjangoItem. I searched extensively on the internet looking for actual code so I could see how it worked. I finally found http://blog.just2us.com/2012/07/setting-up-django-with-scrapy/. It is much easier to understand with full files instead of code fragments. I also edited where it says "we can see that the model is already saved" as I don't see how it's already saved.	2013-01-04 15:59:04 -05:00

1 2 3 4 5 ...

449 Commits