Mikhail Korobov
5ac91e4883
DOC remove Dynamic Creation of Item Classes section
...
It was a hack, and dicts-as-items cover most use cases.
Dicts don't allow to attach metadata to fields,
but e.g. adding "_meta" key and removing it in a custom serializer
is no worse than creating classes dynamically.
2015-03-23 18:11:35 +05:00
Julia Medina
cda3922507
Add Response.urljoin() helper
2015-03-19 19:07:52 -03:00
Pablo Hoffman
c81eefaf81
fix doc links
2015-03-19 17:42:48 -03:00
Mikhail Korobov
8ac397670f
DOC move .. module: declaration to a proper place
2015-03-19 21:41:36 +05:00
Faisal Anees
643984e1b4
Updated architecture.rst
...
Added http://krondo.com/blog/?page_id=1327 as a resource
2015-03-18 23:55:22 -03:00
Mikhail Korobov
f16a33f34e
DOC change structure of spider docs:
...
* start with scrapy.Spider, then mention spider arguments,
then describe generic spiders;
* change wording regarding start_urls/start_requests;
* show an example of start_requests vs start_urls;
* show an example of dicts as items;
* as defining Item is an optional step now, docs for Items are
moved below Spider docs.
2015-03-19 05:25:15 +05:00
Mikhail Korobov
817dbc6cbd
DOC mention dicts in documentation; explain better what are Items for
2015-03-19 05:16:14 +05:00
Julia Medina
959aaad205
Document re_first
2015-03-18 21:11:08 -03:00
Mateusz Golewski
127c6c694a
Fix extract_first() docs
2015-03-18 21:11:08 -03:00
Mateusz Golewski
012211accd
Add docs for extract_first()
2015-03-18 21:11:08 -03:00
Daniel Graña
12eedd90d1
fix truncated 0.24.5 release notes. closes #1084
2015-03-18 20:46:28 -03:00
Julia Medina
4fb818a250
Run linkfix over current docs
2015-03-18 20:04:14 -03:00
Julia Medina
ff987fb5a5
Add linkfix rule to docs Makefile
2015-03-18 20:00:03 -03:00
Shadab Zafar
c05f5f175e
Added linkfix script to docs/utils
...
https://github.com/scrapy/scrapy/pull/1041#issuecomment-78143576
2015-03-18 19:41:26 -03:00
Shadab Zafar
5a58d64131
Fix some redirection links in documentation
...
Fixes #606
2015-03-18 19:41:26 -03:00
Nicolás Alejandro Ramírez Quiros
ee82fe0e24
Merge pull request #1016 from SudShekhar/jsonProcessor
...
[MRG+1] Added JmesSelect
2015-03-18 08:11:25 -03:00
Mikhail Korobov
39635e5f55
Allow spiders to return dicts. See GH-1064.
2015-03-18 07:26:56 +05:00
Pablo Hoffman
934584a355
Merge pull request #1020 from jojje/gzip_http_cache
...
[MRG+1] add gzip compression to filesystem http cache backend
2015-03-17 14:32:06 -03:00
Pablo Hoffman
f924567591
Merge pull request #983 from ArturGaspar/linkextractor_css
...
[MRG+1] CSS support in link extractors
2015-03-17 01:07:47 -03:00
nramirezuy
c13e23641b
httpcache dont_cache meta #19 #689
2015-03-16 11:50:04 -03:00
Mikhail Korobov
c1bf8365d5
Merge pull request #1022 from scrapy/docs-reorder
...
DOC reorder topics
2015-03-13 16:40:47 +05:00
Mikhail Korobov
baf5c59386
Merge pull request #1071 from eliasdorneles/updating-request-meta-special-keys
...
updating list of Request.meta special keys
2015-03-13 16:38:19 +05:00
Elias Dorneles
57a5ee0097
added example value to set for proxy meta key
2015-03-12 23:20:44 -03:00
Elias Dorneles
f7031c08ff
updating list of Request.meta special keys
2015-03-10 22:29:07 -03:00
Daniel Graña
c9d7386a32
Add 0.24.5 release notes
2015-02-25 19:36:40 -02:00
Sudhanshu Shekhar
839ffba971
Added the first version of SelectJmes
...
Utilizes jmespath. Also, added tests and documentation for the same.
2015-02-24 22:59:01 +05:30
Mikhail Korobov
6419f76bbc
Merge pull request #1029 from dangra/contribute-to-master
...
DOC contribute to master branch
2015-02-03 02:10:47 +05:00
Nicolás Alejandro Ramírez Quiros
8a3b9b6131
Merge pull request #1011 from SudShekhar/master
...
Extension example fix to something that makes more sense
2015-01-30 15:45:52 -02:00
Sudhanshu Shekhar
e42a1ac1a1
Reset items_scraped instead of item_count
...
items_scraped is the counter that needs to be reset each time we have scraped a specific number of items in the code instead of item_count (which represents the specific number of items needed before a message is logged). Updating the source code to reflect this.
Removed some irrelevant words from the log message.
Signed-off-by: Sudhanshu Shekhar <sudshekhar02@gmail.com>
2015-01-30 23:13:06 +05:30
Daniel Graña
f0bdc14522
Tentative attention message about what document to read for contributions
2015-01-29 15:40:12 -02:00
Mikhail Korobov
a07b4353d6
DOC reorder topics
...
* duplicate "topics/commands" link is removed;
* Request/Response docs are moved to "Basic Concepts";
* settings docs are moved to "Basic Concepts";
* exceptions docs are moved to "Basic Concepts";
* "signals" and "exporters" docs are moved to "Extending Scrapy";
* "Reference" section is dropped because it is empty now.
2015-01-22 22:58:10 +05:00
Jonas Tingeborn
bd5d99a2d2
add gzip compression to filesystem http cache backend
2015-01-21 20:18:11 +01:00
Capi Etheriel
4bc14da59e
Updates documentation on dynamic item classes.
...
Fixes #398
2015-01-19 17:21:56 -02:00
Mikhail Korobov
283d6a5344
DOC a couple more references are fixed
2015-01-19 22:07:03 +05:00
Mikhail Korobov
73e6b35622
DOC fix a reference
2015-01-19 22:02:46 +05:00
Artur Gaspar
b0730a1d16
documentation for CSS support in link extractors
2014-12-11 18:22:08 -02:00
Stefan
3602fc4fcb
fixed the variable types in mailsender documentation
2014-12-10 22:48:09 +01:00
Lev Berman
e04b0aff74
An attempt to resolve #977 , add signal to be sent when request is dropped by the scheduler
2014-11-27 15:10:15 +03:00
tpeng
a69f042d10
add 2 more test cases and minor doc fixes
2014-11-19 15:31:07 +01:00
tpeng
fa84730e70
avoid download large response
...
introduce DOWNLOAD_MAXSIZE and DOWNLOAD_WARNSIZE in settings and
download_maxsize/download_warnsize in spider/request meta, so
downloader stop downloading as soon as the received data exceed the
limit. also check the twsisted response's length in advance to stop
downloading as early as possible.
2014-11-12 12:28:02 +01:00
Jeff Paine
b422312a38
Update docs copyright year range
2014-11-09 21:08:27 -05:00
Lazar-T
13f83f0da0
typo
2014-11-10 06:28:41 +05:00
HalfCrazy
b21a28cc9a
Afterwords->Afterwards
2014-11-10 06:28:09 +05:00
Daniel Graña
2c67bd6c57
pywin32 is required by Twisted. closes #937
...
see:
* http://twistedmatrix.com/trac/ticket/6032
* https://tahoe-lafs.org/trac/tahoe-lafs/ticket/2028
2014-11-05 23:05:54 -02:00
Daniel Graña
6cb8995731
Update install.rst
...
fixes #937
2014-11-05 22:50:52 -02:00
Pablo Hoffman
efe589c643
Merge pull request #882 from ahlen/feature/csvfeed-quotechar
...
[MRG+1] Allow to specify the quotechar in CSVFeedSpider
2014-11-04 11:32:59 -02:00
Lazar-T
38dcf50cd6
comma instead of fullstop
2014-10-25 09:19:50 +06:00
Pablo Hoffman
675fd5ba04
Merge pull request #898 from scrapy/download-timeout
...
[MRG] DOC document download_timeout
2014-10-24 16:52:42 -02:00
Pablo Hoffman
0dce283459
Merge pull request #893 from kmike/less-ads
...
[MRG] DOC simplify extension docs
2014-10-21 17:13:59 -02:00
Mikhail Korobov
7d68b084a4
DOC document download_timeout Request.meta key and download_timeout spider attribute.
2014-10-07 04:23:11 +06:00