1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-26 18:44:12 +00:00

3653 Commits

Author SHA1 Message Date
Daniel Graña
155ea08ea1 use sel name for Selector's instances in docs, internals and shell 2013-10-15 15:58:42 -02:00
Daniel Graña
1abb1af0c6 fix typos and wording on selector's introduction 2013-10-15 10:13:43 -02:00
Daniel Graña
28999590fa update release notes 2013-10-14 16:41:04 -02:00
Daniel Graña
ab9462a251 remove more references to libxml2 2013-10-14 16:37:14 -02:00
Daniel Graña
4645f9e03c Updates docs to reflect unified selectors api 2013-10-14 16:31:20 -02:00
Daniel Graña
add3506928 remove internal references to old selector classes and api 2013-10-14 10:51:16 -02:00
Daniel Graña
4e94b38396 port tests to new Selector class 2013-10-14 10:35:02 -02:00
Daniel Graña
e4d6e2eb31 default xpath selector was html 2013-10-11 19:53:32 -02:00
Daniel Graña
a9eb0b74c8 port scrapy shell to unified selector api 2013-10-11 19:53:01 -02:00
Daniel Graña
681af6b258 Remove CSS*Selector classes and port its tests 2013-10-11 18:22:03 -02:00
Daniel Graña
c3d28cc412 working implementaion of unified api 2013-10-11 18:06:27 -02:00
Daniel Graña
bf37f78572 Drop libxml2 selectors backend 2013-10-11 18:02:35 -02:00
Daniel Graña
6d598f0d94 Update selectors docs 2013-10-10 18:24:00 -02:00
Paul Tremberth
ea579bca04 Support only ::text and ::attr(name) + add more CSS selector tests 2013-10-10 18:23:15 -02:00
Paul Tremberth
52e2eb6b70 Adapt to latest cssselect API supporting pseudo-elements 2013-10-10 18:23:15 -02:00
Daniel Graña
4e6967b854 extend css selectors with ":text" and :attribute(<name>) #176 2013-10-10 18:23:15 -02:00
Daniel Graña
b38ac27eee rename XPathSelectorList as SelectorList #176 2013-10-10 18:23:15 -02:00
Capi Etheriel
bc17e9d412 Adds HtmlCSSSelector and XmlCSSSelector classes, cssselect as optional dependency.
Ported .get() from _Element and .text_content() from HTMLMixin

Add CSS selectors to scrapy shell

Documenting CSS Selectors: Constructing selectors

Documenting CSS Selectors: Using Selectors

Make CSS Selectors a default feature.

Adds XPath powers to CSS Selectors and some syntactic sugar.

Removes methods copied over from lxml.html.HtmlMixin.

Updating docs to use new CSS Selector super powers.

Documenting CSS Selectors: Regular Expressions

Moving section after Nesting section, since it mentions it.

Documenting CSS Selectors: Nesting Selectors

Fix XPath specificity in lxml.selector.CSSSelectorMixin.text

Cleaning up unused stuff from cssel.py

Changing the behavior of lxml.selector.CSSSelectorMixin.text.

Concatenating all of the descendant text nodes is more useful
than returning it in pieces (there's xpath() if you need that).

Documenting CSS Selectors: CSS Selector objects

Documenting CSS Selectors: CSSSelectorList objects

Documenting CSS Selectors: HtmlCSSSelector objects

Documenting CSS Selectors: XmlCSSSelector objects

Fixing some documentations typos and errors

Enforcing the 80-char width lines

Tidying up CSS selectors and CSSSelectorMixin objects

Adding some missing references in documentation.

Fixing lxml.selector.CSSSelectorList.text
2013-10-10 18:23:15 -02:00
Daniel Graña
8bf3284ebf Merge pull request #380 from dangra/dont-log-ignorerequest
Dont log IgnoreRequest exception as download failure
2013-10-10 11:53:08 -07:00
Pablo Hoffman
1a2db089ff Merge pull request #418 from nramirezuy/engine-multispider
engine multispider support removed
2013-10-10 10:31:13 -07:00
Daniel Graña
5b5dd679b0 Add 0.18.4 release notes
Conflicts:
	docs/news.rst
2013-10-10 01:04:35 -02:00
Daniel Graña
aa6fb7daaa IPython refuses to update the namespace. fix #396
IPython embedding code borrowed from https://github.com/mitsuhiko/werkzeug/pull/85
2013-10-10 00:57:14 -02:00
Daniel Graña
7b1288ba54 Fix AlreadyCalledError replacing a request in shell command. closes #407 2013-10-09 23:50:35 -02:00
Daniel Graña
5eb429999e Fix start_requests lazyness and early hangs
- Removes new public methods added by #330 to Crawler and CrawlerProcess
- Add test for start_requests lazy evaluation
- Fix and test hangs when start_requests erroed before returning the generator
- Add test when start_requests fails while generating requests
- Simplify Crawler and CrawlerProcess implementation taking in count
  that only one spider can be attached per Crawler. As required by SEP-019
- "scrapy settings" command do not require starting a Crawler anymore
2013-10-09 22:58:10 -02:00
Pablo Hoffman
2bd5002fb4 Merge pull request #416 from redapple/attrnamespaces
.remove_namespaces(): remove namespaces on elements' attributes
2013-10-09 15:34:08 -07:00
Pablo Hoffman
e1683ddf9b fix doc typo 2013-10-09 17:24:12 -02:00
nramirezuy
a84e5f806c engine multispider support removed 2013-10-09 15:02:12 -02:00
Pablo Hoffman
ce82d22c9e Merge pull request #423 from nramirezuy/defaultheaders-multispider
defaultheaders multi spider support removed
2013-10-09 09:03:33 -07:00
nramirezuy
332bf3b6b7 defaultheaders multi spider support removed 2013-10-09 13:50:15 -02:00
Pablo Hoffman
7ec01799d5 Merge pull request #419 from nramirezuy/robotstxt-multispider
robotstxt mid multi spider support removed
2013-10-09 07:43:11 -07:00
Pablo Hoffman
9ad736d7ce Merge pull request #420 from nramirezuy/httpauth-multispider
httpauth mid multispider support removed
2013-10-08 21:06:49 -07:00
Pablo Hoffman
27d3cbb64f Merge pull request #421 from nramirezuy/useragent-multispider
useragent mid multi spider support removed
2013-10-08 21:06:13 -07:00
Pablo Hoffman
37b01a229c Merge pull request #422 from nramirezuy/downloadtimeout-multispider
downloadtimeout mid multi spider support removed
2013-10-08 21:05:06 -07:00
nramirezuy
1d09b6e1f2 downloadtimeout mid multi spider support removed 2013-10-08 19:03:26 -02:00
nramirezuy
fbb0ad6419 useragent mid multi spider support removed 2013-10-08 18:18:49 -02:00
nramirezuy
f56f547738 httpauth mid multispider support removed 2013-10-08 18:00:25 -02:00
nramirezuy
10646f1768 robotstxt mid multi spider support removed 2013-10-08 16:38:08 -02:00
Pablo Hoffman
3501c2025e remove no longer existent examples from doc_files used in bdist_rpm. closes GH-417 2013-10-08 15:18:45 -02:00
Paul Tremberth
d806184bb6 .remove_namespaces(): remove namespaces on elements' attributes 2013-10-08 00:01:44 +02:00
Pablo Hoffman
8b9526a8f6 Merge pull request #400 from irgmedeiros/patch-2
Update the second code example
2013-10-07 07:57:18 -07:00
Pablo Hoffman
86c6e9433f remove minor reference to 'scrapy server' command 2013-10-04 14:37:55 -03:00
Daniel Graña
aad90ec5a2 Add 0.18.3 release notes
Conflicts:
	docs/news.rst
2013-10-03 12:56:25 -03:00
Daniel Graña
479b4942bd fix regression on lazy evaluation of start requests 2013-10-03 12:49:16 -03:00
Pablo Hoffman
662eac11fa added pypi version badge to README 2013-10-03 12:47:22 -03:00
Pablo Hoffman
ba7b75d299 Merge pull request #410 from duendex/httpsMockServer
Adds HTTPS support to the MockServer.
2013-10-02 14:05:44 -07:00
duendex
2519b3031e Adds HTTPS support to the MockServer. 2013-10-02 17:35:28 -03:00
Daniel Graña
0c63c9539d Merge pull request #409 from kalessin/imagesfield
allow to set source and destination fields in files and images pipelines, and added tests
2013-10-02 13:31:26 -07:00
Martin Olveyra
e81120e683 allow to set source and destination fields in files and images pipelines, and added tests 2013-10-02 18:23:13 -02:00
Pablo Hoffman
37c24e01d7 document bindaddress request meta 2013-10-02 17:13:17 -03:00
Pablo Hoffman
a9c3519897 updated required twisted version to 10.0 2013-10-01 14:07:38 -03:00