Pablo Hoffman
|
832e45073b
|
fixed typo in stats documentation. closes #159
|
2012-07-20 17:13:06 -03:00 |
|
Daniel Graña
|
277ed0ae23
|
Merge pull request #145 from alexcepoi/cookies-changes
domain and path support for request cookies
|
2012-06-25 11:29:04 -07:00 |
|
Alexandru Cepoi
|
177c81745d
|
domain and path support for request cookies
|
2012-06-25 20:17:59 +02:00 |
|
Pablo Hoffman
|
179e3810dc
|
fixed links to doc. closes #150
|
2012-06-24 01:00:33 -03:00 |
|
Alexandru Cepoi
|
f4faa19e31
|
added docs topic debugging spiders
|
2012-06-21 20:03:33 +02:00 |
|
Alexandru Cepoi
|
3e05a2ecf6
|
update docs for parse command
|
2012-06-12 18:28:10 +02:00 |
|
Pablo Hoffman
|
9686f97242
|
added precise to supported ubuntu distros
|
2012-05-12 19:54:36 -03:00 |
|
Pablo Hoffman
|
58e88ed246
|
scrapyd: do not set SCRAPY_FEED_URI/SCRAPY_LOG_FILE if items_dir/logs_dir settings are not set
|
2012-05-08 17:43:00 -03:00 |
|
Pablo Hoffman
|
9c3b9f2968
|
fixed bug in json-rpc webservice reported in https://groups.google.com/d/topic/scrapy-users/qgVBmFybNAQ/discussion. also removed no longer supported 'run' command from extras/scrapy-ws.py
|
2012-05-03 12:05:40 -03:00 |
|
Pablo Hoffman
|
abcac4fcbd
|
updated maintainer to scrapinghub
|
2012-05-02 03:25:35 -03:00 |
|
stav
|
86dba76d1f
|
documentation indentation
|
2012-04-30 13:09:34 -05:00 |
|
Pablo Hoffman
|
d567d8efbe
|
added note to docs/topics/firebug.rst about google directory being shut down
|
2012-04-19 01:34:20 -03:00 |
|
stav
|
f1802289cd
|
small doc typo change to get the fork rolling
|
2012-04-11 12:05:39 -05:00 |
|
Pablo Hoffman
|
27018fced7
|
changed default user agent to Scrapy/0.15 (+http://scrapy.org) and removed no longer needed BOT_VERSION setting
|
2012-03-23 13:45:21 -03:00 |
|
Pablo Hoffman
|
8933e2f2be
|
added REFERER_ENABLED setting, to control referer middleware
|
2012-03-22 16:35:14 -03:00 |
|
Jason Yeo
|
da826aa13d
|
fixed minor mistake in Request objects documentation
|
2012-03-21 10:25:41 +08:00 |
|
Pablo Hoffman
|
175c70ad44
|
fixed minor defect in link extractors documentation
|
2012-03-20 22:56:45 -03:00 |
|
Pablo Hoffman
|
35fb01156e
|
removed some obsolete remaining code related to sqlite support in scrapy
|
2012-03-16 11:55:55 -03:00 |
|
Pablo Hoffman
|
2b16ebdc11
|
added minor clarification on cookiejar request meta key usage
|
2012-02-29 07:19:01 -02:00 |
|
lostsnow
|
5afe4f50c1
|
scrapyd: support bind to a specific ip address
|
2012-02-29 13:47:40 +08:00 |
|
Pablo Hoffman
|
81abb45000
|
fixed bug in new cookiejar documentation
|
2012-02-28 11:08:25 -02:00 |
|
Pablo Hoffman
|
26c8004125
|
added documentation for the new cookiejar Request.meta key
|
2012-02-27 19:58:58 -02:00 |
|
Pablo Hoffman
|
7fe7c3f3b1
|
MemoryUsage extension: close the spiders (instead of stopping the engine) when the limit is exceeded, providing a descriptive reason for the close. Also fixed default value of MEMUSAGE_ENABLED setting to match the documentation.
|
2012-02-23 17:05:06 -02:00 |
|
Pablo Hoffman
|
7b8942a648
|
updated StackTraceDump extension doc
|
2012-02-16 15:14:17 -02:00 |
|
Pablo Hoffman
|
0b0bce7f3c
|
scrapyd: added cancel.json and listjobs.json api methods to documentation
|
2012-01-05 11:23:25 -02:00 |
|
Pablo Hoffman
|
8f42633a94
|
scrapyd: added clarification about how to disable items feeds generation
|
2012-01-05 11:20:50 -02:00 |
|
Pablo Hoffman
|
dbda33efa6
|
scrapyd: added support for storing items by default
Items are stored the same way as logs, in jsonlines format.
Also renamed logs_to_keep setting to jobs_to_keep.
|
2012-01-03 23:08:54 -02:00 |
|
Pablo Hoffman
|
41fd3c4f6c
|
doc: removed duplicated callback argument from Request.replace()
|
2011-12-23 15:55:46 -02:00 |
|
Pablo Hoffman
|
0eeff76227
|
fixed formatting of scrapyd doc
|
2011-12-20 03:18:37 -02:00 |
|
Pablo Hoffman
|
992af8d38f
|
ubuntu repos: added support for oneiric release
|
2011-10-25 14:26:38 -02:00 |
|
Pablo Hoffman
|
c38c49d56a
|
fixed PickeItemExporter bug, added unittest, and added pickle to suported feed exports formats
|
2011-10-25 02:36:51 -02:00 |
|
Pablo Hoffman
|
8bdf288428
|
made scrapyd doc more version agnostic
|
2011-10-23 05:29:54 -02:00 |
|
Pablo Hoffman
|
431441cb52
|
updated documentation to remove references to old issue tracker and mercurial repos
|
2011-09-25 13:06:24 -03:00 |
|
Pablo Hoffman
|
ce03ccd4ec
|
updated documentation about DEPTH_PRIORITY and DFO/BFO crawls
|
2011-09-23 13:22:25 -03:00 |
|
Julien Duponchelle
|
b7c436343a
|
scrapy deploy support git version
|
2011-09-21 22:17:08 +02:00 |
|
Daniel Grana
|
5f1b1c05f8
|
Do not filter requests with dont_filter attribute set in OffsiteMiddleware
|
2011-09-08 15:18:10 -03:00 |
|
Pablo Hoffman
|
bff3d31469
|
scrapyd: updated schedule.json response format
|
2011-09-04 09:29:24 -03:00 |
|
Pablo Hoffman
|
a1dbc62b45
|
removed CONCURRENT_SPIDERS setting (use scrapyd maxproc instead)
|
2011-09-02 18:27:39 -03:00 |
|
Pablo Hoffman
|
40f7075f11
|
added initial documentation about suspend and resume crawls
|
2011-09-02 13:12:27 -03:00 |
|
Pablo Hoffman
|
27dd68a690
|
added SpiderState extension
|
2011-09-02 13:06:59 -03:00 |
|
Pablo Hoffman
|
6a31ab667d
|
minor fix to doc
|
2011-09-01 15:08:23 -03:00 |
|
Pablo Hoffman
|
d98b058c21
|
no longer recommend using labmda's in the doc, as they're not friendly with scheduler persistence
|
2011-09-01 15:06:49 -03:00 |
|
Pablo Hoffman
|
76af0cdd44
|
updated documentation and code to use -s instead of --set option
|
2011-09-01 14:35:37 -03:00 |
|
Pablo Hoffman
|
98b68ca89d
|
scrapyd: documented support for passing setting to spiders in schedule.json
|
2011-08-27 01:31:12 -03:00 |
|
Pablo Hoffman
|
5c6b0631e2
|
minor doc fix
|
2011-08-19 11:42:03 -03:00 |
|
Pablo Hoffman
|
9d97e73a24
|
fixed priority handling on the new scheduler so that it's backwards compatible (ie. bigger priorities are higher). also fixed a few documentation bugs related to requests priority
|
2011-08-19 08:26:41 -03:00 |
|
Pablo Hoffman
|
a3697421c0
|
some minor updates to documentation
|
2011-08-11 09:19:59 -03:00 |
|
Pablo Hoffman
|
19e6da59d8
|
added new downloader middleware: ChunkedTransferMiddleware
|
2011-08-09 03:03:25 -03:00 |
|
Pablo Hoffman
|
984be35461
|
Some telnet console changes:
* renamed manager alias to crawler
* added aliases: spider, slot
* fixed est() function
|
2011-08-08 15:01:08 -03:00 |
|
Pablo Hoffman
|
f7c0aeccc6
|
added note about engine_started signal
|
2011-08-07 03:57:09 -03:00 |
|