Daniel Graña
|
277ed0ae23
|
Merge pull request #145 from alexcepoi/cookies-changes
domain and path support for request cookies
|
2012-06-25 11:29:04 -07:00 |
|
Pablo Hoffman
|
8b30575dd3
|
Merge pull request #147 from tonal/cfg-jsons
Configure json services members
|
2012-06-25 11:27:13 -07:00 |
|
Alexandru Cepoi
|
177c81745d
|
domain and path support for request cookies
|
2012-06-25 20:17:59 +02:00 |
|
Alexandr N Zamaraev (aka tonal)
|
cf968c328f
|
Configure json services members
|
2012-06-25 12:50:48 +07:00 |
|
Pablo Hoffman
|
179e3810dc
|
fixed links to doc. closes #150
|
2012-06-24 01:00:33 -03:00 |
|
Pablo Hoffman
|
700f20b28f
|
Merge pull request #149 from alexcepoi/parse-changes
documentation for `parse` command, debugging spiders section
|
2012-06-22 17:50:40 -07:00 |
|
Alexandru Cepoi
|
2e05cf5685
|
fix small bug with parse command
|
2012-06-21 20:06:50 +02:00 |
|
Alexandru Cepoi
|
f4faa19e31
|
added docs topic debugging spiders
|
2012-06-21 20:03:33 +02:00 |
|
Pablo Hoffman
|
4eeda53b0d
|
Merge pull request #148 from tonal/cfg-launcher
Add launcher class to config
|
2012-06-19 08:14:58 -07:00 |
|
Alexandr N Zamaraev (aka tonal)
|
3bddedfc6d
|
Add launcher class to config
|
2012-06-19 22:01:53 +07:00 |
|
Daniel Graña
|
27689009f1
|
fix urlparse monkeypatches for python 2.7.4. closes #144
|
2012-06-18 11:50:43 -03:00 |
|
Alexandru Cepoi
|
3e05a2ecf6
|
update docs for parse command
|
2012-06-12 18:28:10 +02:00 |
|
Pablo Hoffman
|
e1be9c01bc
|
updated FAQ about bot bans
|
2012-06-08 18:33:53 -03:00 |
|
Daniel Graña
|
3d6edc2f40
|
Merge pull request #135 from alexcepoi/parse-changes
Add --depth option to parse
|
2012-06-07 07:52:27 -07:00 |
|
Alexandru Cepoi
|
3b5cf31198
|
add --verbose option to parse command
|
2012-05-24 16:35:09 +02:00 |
|
Alexandru Cepoi
|
8e3c5f1bf7
|
add --depth field to parse command
|
2012-05-24 15:10:03 +02:00 |
|
Pablo Hoffman
|
8d77005047
|
scrapy shell: start shell in main thread and crawler in secondary thread, instead of the other way around. fixes #100
|
2012-05-22 19:15:54 -03:00 |
|
Pablo Hoffman
|
b33303779a
|
scrapyd.launcher: make SCRAPY_LOG_FILE and SCRAPY_FEED_URI optional
|
2012-05-21 14:29:15 -03:00 |
|
Daniel Graña
|
35ef7de546
|
add travis-ci build status to README
|
2012-05-17 09:07:36 -03:00 |
|
Daniel Graña
|
e77e4b5f6e
|
add precise to travis-ci build enviroments
|
2012-05-17 09:03:28 -03:00 |
|
Daniel Graña
|
3096e46401
|
add requirements file per travis env
|
2012-05-17 08:53:18 -03:00 |
|
Daniel Graña
|
7740581f88
|
Merge remote-tracking branch 'upstream/master'
|
2012-05-17 08:50:28 -03:00 |
|
Daniel Graña
|
c3a3108799
|
multiple build enviroments for travis-ci
|
2012-05-17 08:49:45 -03:00 |
|
Pablo Hoffman
|
1bc18434f5
|
removed obsolete entries from MANIFEST.in
|
2012-05-17 01:09:48 -03:00 |
|
Daniel Graña
|
7fc573a230
|
fix libxml2 test
|
2012-05-16 18:54:02 -03:00 |
|
Daniel Graña
|
f530b0b3eb
|
make libxml2 optional now that lxml is the default
|
2012-05-16 18:17:51 -03:00 |
|
Daniel Graña
|
8376d95ce8
|
add travis-ci build configuration file
|
2012-05-16 16:54:12 -03:00 |
|
Pablo Hoffman
|
b4f368c37e
|
warn if Link objects are instantiated with unicode urls
|
2012-05-16 13:12:25 -03:00 |
|
Pablo Hoffman
|
30b6c77ce5
|
fixed typo in previous commit
|
2012-05-16 09:16:46 -03:00 |
|
Pablo Hoffman
|
b53bc66c76
|
added lxml/libxml2 versions to 'scrapy version' output
|
2012-05-16 09:12:02 -03:00 |
|
Daniel Graña
|
d74a067227
|
require w3lib 1.2 or greater
|
2012-05-15 17:11:51 -03:00 |
|
Daniel Graña
|
ae2ff4d33a
|
update news file with 0.14.4 release notes
|
2012-05-15 16:16:02 -03:00 |
|
Pablo Hoffman
|
9686f97242
|
added precise to supported ubuntu distros
|
2012-05-12 19:54:36 -03:00 |
|
Pablo Hoffman
|
58e88ed246
|
scrapyd: do not set SCRAPY_FEED_URI/SCRAPY_LOG_FILE if items_dir/logs_dir settings are not set
|
2012-05-08 17:43:00 -03:00 |
|
Daniel Graña
|
43732e5042
|
Merge pull request #129 from saxicek/master
ImagesPipeline should not fail if Item['images'] is not defined.
|
2012-05-07 10:38:49 -07:00 |
|
Daniel Graña
|
69078368ab
|
Merge pull request #130 from alexcepoi/lxml-fixes
fix `tail` issue when extracting nodes [lxml]
|
2012-05-07 08:37:36 -07:00 |
|
Alexandru Cepoi
|
045ff8e5a7
|
fix tail issue when extracting nodes [lxml]
|
2012-05-07 17:23:27 +02:00 |
|
Libor Nenadl
|
2b93b0a93c
|
Do not try to set item['images'] if it is not defined in the Item.
|
2012-05-05 14:53:02 +02:00 |
|
Daniel Graña
|
43028876b5
|
Merge pull request #128 from dangra/cannonicalize-missing-url-path
handle missing paths in urls as /
|
2012-05-03 10:27:11 -07:00 |
|
Daniel Graña
|
72b1c2e88b
|
handle missing paths in urls as /
|
2012-05-03 13:56:45 -03:00 |
|
Pablo Hoffman
|
9c3b9f2968
|
fixed bug in json-rpc webservice reported in https://groups.google.com/d/topic/scrapy-users/qgVBmFybNAQ/discussion. also removed no longer supported 'run' command from extras/scrapy-ws.py
|
2012-05-03 12:05:40 -03:00 |
|
Pablo Hoffman
|
abcac4fcbd
|
updated maintainer to scrapinghub
|
2012-05-02 03:25:35 -03:00 |
|
Daniel Graña
|
2681be59f7
|
Merge pull request #127 from stav/master
scrapy.contrib.spiders.Rule documentation indentation
|
2012-04-30 12:16:45 -07:00 |
|
stav
|
86dba76d1f
|
documentation indentation
|
2012-04-30 13:09:34 -05:00 |
|
Pablo Hoffman
|
f5b87dbef8
|
added NEWS file pointing to docs/news.rst
|
2012-04-28 23:32:51 -03:00 |
|
Pablo Hoffman
|
78185921c2
|
renamed and improved README to provide a more helpful github landing page
|
2012-04-28 23:30:00 -03:00 |
|
Daniel Graña
|
9d66d7cdf9
|
be consistent removing BOM from decoded bodies. #123
|
2012-04-27 00:31:03 -03:00 |
|
Daniel Graña
|
8cae228df1
|
do not treat input type "image" as form input. #111
|
2012-04-25 16:12:48 -03:00 |
|
Daniel Graña
|
5b9a7814a5
|
support TextResponse in open_in_browser util
|
2012-04-25 16:04:37 -03:00 |
|
Pablo Hoffman
|
7865fbf05a
|
replace "import Image" by more standard "from PIL import Image". closes #88
|
2012-04-20 19:05:55 -03:00 |
|