1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-24 04:44:26 +00:00

3034 Commits

Author SHA1 Message Date
Daniel Graña
8376d95ce8 add travis-ci build configuration file 2012-05-16 16:54:12 -03:00
Pablo Hoffman
b4f368c37e warn if Link objects are instantiated with unicode urls 2012-05-16 13:12:25 -03:00
Pablo Hoffman
30b6c77ce5 fixed typo in previous commit 2012-05-16 09:16:46 -03:00
Pablo Hoffman
b53bc66c76 added lxml/libxml2 versions to 'scrapy version' output 2012-05-16 09:12:02 -03:00
Daniel Graña
d74a067227 require w3lib 1.2 or greater 2012-05-15 17:11:51 -03:00
Daniel Graña
ae2ff4d33a update news file with 0.14.4 release notes 2012-05-15 16:16:02 -03:00
Pablo Hoffman
9686f97242 added precise to supported ubuntu distros 2012-05-12 19:54:36 -03:00
Pablo Hoffman
58e88ed246 scrapyd: do not set SCRAPY_FEED_URI/SCRAPY_LOG_FILE if items_dir/logs_dir settings are not set 2012-05-08 17:43:00 -03:00
Daniel Graña
43732e5042 Merge pull request #129 from saxicek/master
ImagesPipeline should not fail if Item['images'] is not defined.
2012-05-07 10:38:49 -07:00
Daniel Graña
69078368ab Merge pull request #130 from alexcepoi/lxml-fixes
fix `tail` issue when extracting nodes [lxml]
2012-05-07 08:37:36 -07:00
Alexandru Cepoi
045ff8e5a7 fix tail issue when extracting nodes [lxml] 2012-05-07 17:23:27 +02:00
Libor Nenadl
2b93b0a93c Do not try to set item['images'] if it is not defined in the Item. 2012-05-05 14:53:02 +02:00
Daniel Graña
43028876b5 Merge pull request #128 from dangra/cannonicalize-missing-url-path
handle missing paths in urls as /
2012-05-03 10:27:11 -07:00
Daniel Graña
72b1c2e88b handle missing paths in urls as / 2012-05-03 13:56:45 -03:00
Pablo Hoffman
9c3b9f2968 fixed bug in json-rpc webservice reported in https://groups.google.com/d/topic/scrapy-users/qgVBmFybNAQ/discussion. also removed no longer supported 'run' command from extras/scrapy-ws.py 2012-05-03 12:05:40 -03:00
Pablo Hoffman
abcac4fcbd updated maintainer to scrapinghub 2012-05-02 03:25:35 -03:00
Daniel Graña
2681be59f7 Merge pull request #127 from stav/master
scrapy.contrib.spiders.Rule documentation indentation
2012-04-30 12:16:45 -07:00
stav
86dba76d1f documentation indentation 2012-04-30 13:09:34 -05:00
Pablo Hoffman
f5b87dbef8 added NEWS file pointing to docs/news.rst 2012-04-28 23:32:51 -03:00
Pablo Hoffman
78185921c2 renamed and improved README to provide a more helpful github landing page 2012-04-28 23:30:00 -03:00
Daniel Graña
9d66d7cdf9 be consistent removing BOM from decoded bodies. #123 2012-04-27 00:31:03 -03:00
Daniel Graña
8cae228df1 do not treat input type "image" as form input. #111 2012-04-25 16:12:48 -03:00
Daniel Graña
5b9a7814a5 support TextResponse in open_in_browser util 2012-04-25 16:04:37 -03:00
Pablo Hoffman
7865fbf05a replace "import Image" by more standard "from PIL import Image". closes #88 2012-04-20 19:05:55 -03:00
Daniel Graña
66de3d1757 return trial status as bin/runtests.sh exit value. #118 2012-04-20 16:40:55 -03:00
Daniel Graña
db99ae3958 test more border cases for formrequest 2012-04-20 16:39:50 -03:00
Daniel Graña
28401fd47d Merge dev.scrapinghub.com:~/src/scrapy 2012-04-20 09:32:43 -03:00
Daniel Graña
29d6bcf0d1 Workaround bug in lxml for multiple select options
In lxml version pre 2.3.1 there is a bug that returns
all options elements instead of those selected for select tag

it is mentioned in 2.3.1 release notes
http://lxml.de/2.3/changes-2.3.1.html

and fixed by
57f49eed82 (L1R1139)
2012-04-20 12:24:30 +00:00
Pablo Hoffman
c7f33c534b removed redundant lines from release notes 2012-04-20 09:09:55 -03:00
Daniel Graña
15c8c01828 add 0.14.3 release notes 2012-04-19 23:25:15 -03:00
Daniel Graña
f34dd11c16 forgot to include pydispatch license. #118 2012-04-19 22:49:47 -03:00
Daniel Graña
771abd57c1 include egg files used by testsuite in source distribution. #118 2012-04-19 22:38:45 -03:00
Daniel Graña
8b45a00f36 Merge dangra/lxml-formrequest 2012-04-19 17:10:29 -03:00
Daniel Graña
72485128cb Add a test case to cover input elements as not direct child of form element. #111 #121 2012-04-19 16:58:01 -03:00
Daniel Graña
3b2458dbbb cleanup all FormRequest test cases. #111 #121 2012-04-19 16:48:39 -03:00
Daniel Graña
4340a13db5 More lxml FormRequest fixes. #111 #121
* test textarea elements
* handle odd cases for select elements like chrome and FF browsers does
* Remove test case already covered by per tag test cases
2012-04-19 16:48:31 -03:00
Daniel Graña
84d5f5ea53 test and fix each form input type with border cases. #111 #121 2012-04-19 15:40:07 -03:00
Daniel Graña
63e4355fba find form input elements using one xpath #111 #121 2012-04-19 14:11:13 -03:00
Daniel Graña
5c03c1df2d Merge pull request #121 from artem-dev/master
fix FormRequest for case when a form values are missed
2012-04-19 08:18:02 -07:00
Pablo Hoffman
97f362d64e removed deprecated/undocumented class: HTMLImageLinkExtractor 2012-04-19 12:07:46 -03:00
Artem Bogomyagkov
3a7c28f155 fixed FormRequest for a form missed values case 2012-04-19 18:07:38 +03:00
Pablo Hoffman
13ac6f63eb Merge pull request #119 from andrix/fix-htmlimagelinkextractor
Fix HTMLImageLinkExtractor to work with libxml2 and lxml selectors
2012-04-19 07:56:38 -07:00
Pablo Hoffman
1e521a3ff3 improved command line tool usage help which explains that more commands are available when run from project directory. refs #107 2012-04-19 02:54:57 -03:00
Pablo Hoffman
2fb5e62c39 doc: update overview page to point to the genspider command. refs #107 2012-04-19 02:37:22 -03:00
Pablo Hoffman
1c5294bee1 update docstring in project template to avoid confusion with genspider command, which may be considered as an advanced feature. refs #107 2012-04-19 02:35:48 -03:00
Pablo Hoffman
d567d8efbe added note to docs/topics/firebug.rst about google directory being shut down 2012-04-19 01:34:20 -03:00
Daniel Graña
21e03729a3 lxml is the new default selector backend. closes #120 2012-04-19 00:28:27 -03:00
Daniel Graña
6bb40fe5a8 use xpath to match img tags in one shot. #119 2012-04-19 00:03:04 -03:00
Andrés Moreira
e24107feb8 fix HTMLImageLinkExtractor to work with libxml2 and lxml selectors 2012-04-18 18:05:44 -03:00
Pablo Hoffman
30ddbf624e mention about some scrapy.xlib modules removed in the release notes 2012-04-17 12:31:18 -03:00