scrapy

mirror of https://github.com/scrapy/scrapy.git synced 2025-02-25 17:43:51 +00:00

Author	SHA1	Message	Date
Brian Lange	b878f60b5a	Add note to item-pipeline documentation explaining order in the ITEM_PIPELINES setting.	2013-11-19 16:12:54 -06:00
tntC4stl3	b51d5d81e4	duplicate 'use' in line 87	2013-11-15 13:56:44 +08:00
Daniel Graña	2df8156431	Drop Python 2.6 support	2013-10-29 13:44:00 -02:00
Pablo Hoffman	911c8082b0	simplified description of crawl command	2013-10-21 14:42:51 -02:00
Pablo Hoffman	e8ee449a2a	Merge pull request #432 from darkrho/crawl-url Removed URL reference in crawl command and .tld suffix in docs for spider names	2013-10-21 09:40:58 -07:00
Rolando Espinoza La fuente	34543c2b2e	DOCS removed .tld suffix for spider names for the sake of consistency.	2013-10-19 23:03:20 -04:00
Daniel Graña	875b07aef8	fix references to old selector naming in docs	2013-10-17 09:33:15 -02:00
Travis Briggs	3043a5ba37	DownloaderMiddleware docs: Update process_request, proper explanation of IgnoreRequest. Also: * Change terminology to eliminate uses of terms such as "request middleware" to refer to the process_request methods of installed middleware. * Remove description of "immediate redirection", as it is misleading. Further changes.	2013-10-17 00:23:21 +00:00
Mikhail Korobov	086b8a20d4	typo fix in TextResponse docs	2013-10-17 04:50:30 +06:00
Pablo Hoffman	951a9f3f4c	Merge pull request #226 from scraperdragon/patch-1 Parameters to Request() in wrong order	2013-10-16 13:15:52 -07:00
Daniel Graña	1461363809	Replace `contenttype` references by `type` The type to choose from is the selector type, not the input type. A content-type doesn't make sense in this context.	2013-10-16 17:37:25 -02:00
Daniel Graña	155ea08ea1	use `sel` name for Selector's instances in docs, internals and shell	2013-10-15 15:58:42 -02:00
Dragon Dave	a3b711bdea	Move callback blob; mention errback	2013-10-15 12:19:42 +01:00
scraperdragon	0ba0d85685	Parameters to Request() in wrong order Implied that callback wasn't the first optional unnamed parameter.	2013-10-15 11:50:43 +01:00
Daniel Graña	ab9462a251	remove more references to libxml2	2013-10-14 16:37:14 -02:00
Daniel Graña	4645f9e03c	Updates docs to reflect unified selectors api	2013-10-14 16:31:20 -02:00
Daniel Graña	bf37f78572	Drop libxml2 selectors backend	2013-10-11 18:02:35 -02:00
Daniel Graña	6d598f0d94	Update selectors docs	2013-10-10 18:24:00 -02:00
Capi Etheriel	bc17e9d412	Adds HtmlCSSSelector and XmlCSSSelector classes, cssselect as optional dependency. Ported .get() from _Element and .text_content() from HTMLMixin Add CSS selectors to scrapy shell Documenting CSS Selectors: Constructing selectors Documenting CSS Selectors: Using Selectors Make CSS Selectors a default feature. Adds XPath powers to CSS Selectors and some syntactic sugar. Removes methods copied over from lxml.html.HtmlMixin. Updating docs to use new CSS Selector super powers. Documenting CSS Selectors: Regular Expressions Moving section after Nesting section, since it mentions it. Documenting CSS Selectors: Nesting Selectors Fix XPath specificity in lxml.selector.CSSSelectorMixin.text Cleaning up unused stuff from cssel.py Changing the behavior of lxml.selector.CSSSelectorMixin.text. Concatenating all of the descendant text nodes is more useful than returning it in pieces (there's xpath() if you need that). Documenting CSS Selectors: CSS Selector objects Documenting CSS Selectors: CSSSelectorList objects Documenting CSS Selectors: HtmlCSSSelector objects Documenting CSS Selectors: XmlCSSSelector objects Fixing some documentations typos and errors Enforcing the 80-char width lines Tidying up CSS selectors and CSSSelectorMixin objects Adding some missing references in documentation. Fixing lxml.selector.CSSSelectorList.text	2013-10-10 18:23:15 -02:00
Pablo Hoffman	8b9526a8f6	Merge pull request #400 from irgmedeiros/patch-2 Update the second code example	2013-10-07 07:57:18 -07:00
Pablo Hoffman	86c6e9433f	remove minor reference to 'scrapy server' command	2013-10-04 14:37:55 -03:00
Pablo Hoffman	37c24e01d7	document bindaddress request meta	2013-10-02 17:13:17 -03:00
Pablo Hoffman	a9c3519897	updated required twisted version to 10.0	2013-10-01 14:07:38 -03:00
Rolando Espinoza	d6e3eae527	docs: added section regarding setting up django's settings.	2013-09-30 09:58:10 -04:00
Rolando Espinoza	0cc1d870db	docs: minor tidy up sample code and missing shell prompts.	2013-09-30 09:58:10 -04:00
Loren Davie	8af0e89e85	Corrected typo.	2013-09-29 17:06:46 -04:00
Loren Davie	f49f5724d5	Added dynamic creation of item classes to practices.rst.	2013-09-28 09:00:48 -04:00
irgmedeiros	9b50409986	Update the second code example Update the second code example to reflect the last change in the first example.	2013-09-27 18:22:33 -03:00
irgmedeiros	d9e0fdc9aa	Update practices.rst With this modification scrapy runs the spider with project settings. The previous example ran only with default settings resulting in ignoring all user settings as pipelines for example.	2013-09-27 17:56:30 -03:00
Daniel Graña	265910aae6	Merge pull request #363 from taikano/sitemap_alternate also fetch alternate URLs from sitemaps, see #360	2013-09-26 09:15:02 -07:00
Pablo Hoffman	12280c2a95	fix sphinx references in doc	2013-09-25 15:13:17 -03:00
Pablo Hoffman	fc388f4636	Make ITEM_PIPELINE setting a dict This is for consistency with how spider and downloader middlewares are defined. ITEM_PIPELINE_BASE was also added and both remain empty. Backwards compatibility is kept (with a warning) with list-based ITEM_PIPELINES.	2013-09-23 17:50:43 -03:00
cacovsky	71b320914a	Update request-response.rst Fix small doc typo (too many backticks)	2013-09-18 11:45:25 -03:00
Stefan	6994959181	renamed to sitemap_alternate_links and added default value, see #360	2013-09-08 10:38:28 +02:00
Stefan	8ed2d0cda1	improved changes to allow retrieval of alternate links in sitemaps, see #360	2013-09-07 12:56:30 +02:00
Pablo Hoffman	86230c0ab8	added quantal & raring to support ubuntu releases	2013-08-22 21:49:55 -03:00
Mikhail Korobov	034ffae60f	Recommend Pillow instead of PIL. Closes GH-317.	2013-08-18 00:44:01 +06:00
Berend Iwema	32b6364bcd	#327 - Support STARTTLS / SSL option in email sender	2013-08-14 12:59:01 +02:00
Rocio Aramberri	d227d530f6	Added COMPRESSION_ENABLED setting to enable or disable the HttpCompressionMiddleware Added COMPRESSION_ENABLE setting to docs Added COMPRESSION_ENABLED setting to default settings	2013-08-01 11:31:28 -03:00
Dan	1ca31244b0	Fixed ordering of super argument call.	2013-07-16 14:50:10 -04:00
Dan	e12b689c4f	Updated documentation of spider arguments to include required super call.	2013-07-16 14:26:53 -04:00
Mikhail Korobov	1a1c93fafe	tiny FormRequest doc fix	2013-07-15 15:47:34 +06:00
Mikhail Korobov	ac2fadf3ab	DownloaderMiddleware.process_response docs fix "returns an exception" -> "raises an exception"	2013-07-08 19:41:58 +06:00
Mikhail Korobov	39e5da5f66	improve docs for DownloaderMiddleware.process_response	2013-07-08 19:17:29 +06:00
Pablo Hoffman	0f4b70f582	remove no deprecated request_scheduled signal It will be replaced by more accurate scheduler signals (proposal will come soon)	2013-06-27 11:23:24 -03:00
nramirezuy	bef8ade956	removed request_received and added request_scheduled	2013-06-26 16:45:46 -03:00
Pablo Hoffman	819b2776dd	Merge pull request #326 from berendiwema/master Include example of how to stop the reactor from script	2013-06-25 13:30:07 -07:00
nramirezuy	83b2774354	remove wrong default httpcache	2013-06-25 17:01:29 -03:00
Berend Iwema	aec314db09	added a bit more documentation on how to close the reactor when running scrapy from a script	2013-06-25 16:08:22 +02:00
Pablo Hoffman	bbde1d0e0b	Merge pull request #275 from stav/doc doc: Response.replace() cannot take meta argument	2013-06-24 11:09:28 -07:00

... 6 7 8 9 10 ...

831 Commits