Paul Tremberth
88c8a523a7
Add warning in docs on performance when using EXSLT regexp functions
2014-01-15 12:52:10 +01:00
Paul Tremberth
a3eba68aca
Drop EXSLT strings and math extensions
2014-01-15 12:28:25 +01:00
Pablo Hoffman
ea2f897b81
Merge pull request #502 from scrapy/doc-fixes
...
DOWNLOAD_DELAY docs clarification
2014-01-14 21:07:42 -08:00
Hobson Lane
a3db95985b
another import name correction by pablo
2014-01-14 21:04:15 -08:00
Pablo Hoffman
3d308d7732
Merge pull request #517 from hoprocker/patch-1
...
Clarify return value from extract_links
2014-01-14 21:01:32 -08:00
Paul Tremberth
2cc26e6f56
Fix typo error
2014-01-14 13:09:18 +01:00
Paul Tremberth
29fc9f3466
Update selectors documentation and tests
2014-01-14 12:56:37 +01:00
Ferdy Rodriguez
807dd25324
fixed error on tor's name
2014-01-13 00:03:58 -06:00
Ferdy Rodriguez
8b9348cfaf
Changed TOR Info as previous was removed from www.mininova.org
2014-01-12 23:46:04 -06:00
Hobson Lane
6ba0857a5c
documentation code example correction corrections per pablohoffman
2014-01-10 10:37:27 -08:00
malcolm m
962e5ef702
Clarify return value from extract_links
2014-01-05 14:42:48 -08:00
Yuri Prezument
060891c01c
Remove unused import from code sample
...
Item pipeline docs - removed unused import from code sample
2014-01-03 15:44:17 +02:00
Mikhail Korobov
a27d91f0a6
Rename BaseSpider to Spider. See GH-495.
2013-12-30 19:46:41 +06:00
Mikhail Korobov
e713733edf
minor fixes to scrapy shell docs
...
* better IPython links;
* MDC link instead of w3schools;
* small formatting fixes;
* show quoted URL in example
2013-12-30 10:27:39 +06:00
Mikhail Korobov
f18ac02987
remove duplicated link extractors link
...
Check http://doc.scrapy.org/en/latest/topics/link-extractors.html - two menu items are highlighted at the left.
2013-12-28 05:40:10 +05:00
Mikhail Korobov
9a999daa2a
DOWNLOAD_DELAY docs clarification:
...
* delay is enforced per website, not per spider;
* document download_delay attribute (it was previously documented only in FAQ about 999 error codes);
* document how CONCURRENT_REQUESTS_PER_IP affects download delays.
2013-12-28 06:30:34 +06:00
Pablo Hoffman
e42e3743fe
quick documentation for #475
2013-12-24 12:19:15 -02:00
Mikhail Korobov
e0cebbfc8f
add a remark about 1%
2013-12-20 23:12:37 +06:00
Mikhail Korobov
943a0bd264
AjaxCrawlableMiddleware in Broad Crawl docs
2013-12-19 01:01:26 +06:00
Mikhail Korobov
a87b3bd1c8
AjaxCrawlableMiddleware
2013-12-19 00:06:47 +06:00
RasPat1
ff21281b95
Note about selector class import
...
This is the salient point of this code compared to the last example. We have a selector now and this is how we use it. Especially since the user has just come from the shell where the pre-instantiated selector is taken for granted.
2013-12-15 13:46:42 -05:00
Daniel Graña
8a7c5b5d81
Add 0.20.2 release notes
...
Conflicts:
docs/news.rst
2013-12-09 18:33:46 -02:00
Pablo Hoffman
f2741c413e
fix method name in tutorial. closes GH-480
2013-12-02 13:24:12 -02:00
Daniel Graña
e34ffc0f42
Add 0.20.1 release notes
...
Conflicts:
docs/news.rst
2013-11-28 16:25:57 -02:00
Pablo Hoffman
339861367e
Merge pull request #425 from audiodude/master
...
DownloaderMiddleware docs: Update process_request and minor cleanups.
2013-11-25 10:33:35 -08:00
Paul Tremberth
14f5817d6b
Modify ItemLoader to support XPath and CSS selectors
...
Deprecate XPathItemLoader (now an alias to the new ItemLoader)
2013-11-21 18:05:24 +01:00
Pablo Hoffman
f87be371a2
better names for HANDLE_* settings, and added doc
2013-11-21 14:33:17 -02:00
Brian Lange
e4c1d8d37d
Elaborate on use of order numbers
2013-11-19 17:51:50 -06:00
Brian Lange
b878f60b5a
Add note to item-pipeline documentation explaining order in the ITEM_PIPELINES setting.
2013-11-19 16:12:54 -06:00
Pablo Hoffman
afe6eaa2fe
Merge pull request #460 from tntC4stl3/master
...
duplicate 'use' in line 87
2013-11-15 04:10:49 -08:00
tntC4stl3
b51d5d81e4
duplicate 'use' in line 87
2013-11-15 13:56:44 +08:00
Daniel Graña
04ff7ecebf
improve 0.20 release notes
...
Conflicts:
docs/news.rst
2013-11-08 17:45:03 -02:00
Daniel Graña
3d18a3c49e
bumped version to 0.21.0
2013-11-08 17:09:00 -02:00
Daniel Graña
d0980e5c9b
Merge 0.20 release notes
2013-11-08 17:06:10 -02:00
Daniel Graña
2df8156431
Drop Python 2.6 support
2013-10-29 13:44:00 -02:00
Pablo Hoffman
911c8082b0
simplified description of crawl command
2013-10-21 14:42:51 -02:00
Pablo Hoffman
e8ee449a2a
Merge pull request #432 from darkrho/crawl-url
...
Removed URL reference in crawl command and .tld suffix in docs for spider names
2013-10-21 09:40:58 -07:00
Rolando Espinoza La fuente
34543c2b2e
DOCS removed .tld suffix for spider names for the sake of consistency.
2013-10-19 23:03:20 -04:00
Daniel Graña
875b07aef8
fix references to old selector naming in docs
2013-10-17 09:33:15 -02:00
Travis Briggs
3043a5ba37
DownloaderMiddleware docs: Update process_request, proper explanation of IgnoreRequest.
...
Also:
* Change terminology to eliminate uses of terms such as "request middleware" to refer to the process_request methods of installed middleware.
* Remove description of "immediate redirection", as it is misleading.
Further changes.
2013-10-17 00:23:21 +00:00
Mikhail Korobov
086b8a20d4
typo fix in TextResponse docs
2013-10-17 04:50:30 +06:00
Pablo Hoffman
951a9f3f4c
Merge pull request #226 from scraperdragon/patch-1
...
Parameters to Request() in wrong order
2013-10-16 13:15:52 -07:00
Daniel Graña
1461363809
Replace contenttype
references by type
...
The type to choose from is the selector type, not the input type. A
content-type doesn't make sense in this context.
2013-10-16 17:37:25 -02:00
Daniel Graña
155ea08ea1
use sel
name for Selector's instances in docs, internals and shell
2013-10-15 15:58:42 -02:00
Daniel Graña
1abb1af0c6
fix typos and wording on selector's introduction
2013-10-15 10:13:43 -02:00
Dragon Dave
a3b711bdea
Move callback blob; mention errback
2013-10-15 12:19:42 +01:00
scraperdragon
0ba0d85685
Parameters to Request() in wrong order
...
Implied that callback wasn't the first optional unnamed parameter.
2013-10-15 11:50:43 +01:00
Daniel Graña
28999590fa
update release notes
2013-10-14 16:41:04 -02:00
Daniel Graña
ab9462a251
remove more references to libxml2
2013-10-14 16:37:14 -02:00
Daniel Graña
4645f9e03c
Updates docs to reflect unified selectors api
2013-10-14 16:31:20 -02:00