1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-11 23:11:32 +00:00

7871 Commits

Author SHA1 Message Date
Marc Hernández
91bbc70bc1
fix E30X flake8 (#4355) 2020-02-21 06:05:31 +01:00
Mikhail Korobov
c4ee4b6075
Merge pull request #4347 from noviluni/deprecate_sel_shortcut
Remove deprecated `sel` shortcut in scrapy shell
2020-02-20 02:56:34 +05:00
Adrián Chaves
0f78a591f8
Fix Flake8-reported “Too many blank lines” 2020-02-19 19:09:39 +01:00
Adrián Chaves
6972a19707
Remove unused imports 2020-02-19 18:59:09 +01:00
Andrey Rahmatullin
88179027de
Merge pull request #4331 from Gallaecio/response-cb-kwargs
Implement Response.cb_kwargs
2020-02-19 22:40:14 +05:00
Marc Hernandez Cabot
eb21dae524 deprecare sel shortcut in scrapy shell 2020-02-19 17:49:42 +01:00
Andrey Rahmatullin
f558df2558
Merge pull request #4188 from elacuesta/logformatter-error-formatting
LogFormatter error formatting
2020-02-19 19:05:08 +05:00
Andrey Rahmatullin
528b894f28
Merge pull request #4321 from Gallaecio/link-extractor-encoding
Use safe_url_string in link extraction
2020-02-19 18:19:21 +05:00
Akshay Sharma
182445f9d9
Fix a spelling error: ie. → i.e. (#4338) 2020-02-18 17:58:31 +01:00
Mikhail Korobov
320cea62ff
Merge pull request #4309 from Gallaecio/virtualenv-doc
Update installation instructions regarding Python 3 and virtual environments
2020-02-18 19:56:35 +05:00
Adrián Chaves
a04dd13cd0
ie. → i.e. 2020-02-14 22:31:30 +01:00
Adrián Chaves
5ae3e1678f
ie. → i.e.
Co-Authored-By: elacuesta <elacuesta@users.noreply.github.com>
2020-02-14 22:30:36 +01:00
Adrián Chaves
43b43654a1 Add tests for meta and cb_kwargs not being available 2020-02-13 22:39:58 +01:00
Adrián Chaves
5ff9eb90ea Add a test for the copy of cb_kwargs from Request to Response 2020-02-13 22:36:18 +01:00
Adrián Chaves
df937d8280 Implement Response.cb_kwargs 2020-02-13 22:33:36 +01:00
Adrián Chaves
b4958358e8 Update tests to account for link extractors escaping spaces 2020-02-12 19:00:04 +01:00
Drew Seibert
2d6d4fb233
Deprecate overriding settings with SCRAPY-prefixed environment variables (#4300) 2020-02-11 10:35:23 +01:00
Mikhail Korobov
a6ef065eb5
Merge pull request #4271 from wRAR/asyncio-signals
async def support for signal handlers that already supported Deferreds
2020-02-11 02:05:45 +05:00
Adrián Chaves
61e74bac76 Extract links with safe_url_string
canonicalize_url changes links in undesirable ways.
2020-02-10 21:57:21 +01:00
Andrey Rakhmatullin
1f0f52cbf7 Improve async signal tests. 2020-02-11 01:05:45 +05:00
Abhishek Pratap Singh
4626e90df8
Allow updating flags in follow and follow_all (#4279) 2020-02-10 19:48:31 +01:00
Adrián Chaves
35723d76c0 Use canonicalize_url in link extraction 2020-02-07 22:59:53 +01:00
Adrián Chaves
59653ebac6 Update installation instructions regarding Python 3 and virtual environments 2020-02-07 21:07:57 +01:00
Mikhail Korobov
b0eaf114e5
Merge pull request #4197 from elacuesta/sphinx-twisted-api
[Docs] Fix Twisted links
2020-02-07 23:51:15 +05:00
Mikhail Korobov
bd7780277c
Merge pull request #4275 from abhishekh2001/master
Fixed artwork/README formatting
2020-02-07 23:44:15 +05:00
Mikhail Korobov
7e341e0f6b
Merge pull request #4291 from seregaxvm/master
add zsh -h autocomplete option
2020-02-07 23:42:10 +05:00
Mikhail Korobov
c3b690a5b5
Merge pull request #4290 from dekimsey/patch-1
FilesPipeline.file_path has optional arguments
2020-02-07 23:41:31 +05:00
Mikhail Korobov
957681bcfa
Merge pull request #4272 from elacuesta/spider-middleware
Spider middleware: catch spider callback exceptions early
2020-02-07 23:40:50 +05:00
Mikhail Korobov
afbaf9d430
Merge pull request #4303 from whalebot-helmsman/request_left_downloader_signal
request_left_downloader signal
2020-02-07 23:33:51 +05:00
Mikhail Korobov
0f62e44def
Merge pull request #4316 from wRAR/asyncio-parse-request-tests
Add a test for an async callbacks that returns requests.
2020-02-07 23:22:19 +05:00
Andrey Rakhmatullin
31f6c7112f Add a test for an async callbacks that returns requests. 2020-02-07 17:14:52 +05:00
Andrey Rakhmatullin
4a7c7340a0 Merge remote-tracking branch 'origin/master' into asyncio-signals 2020-02-07 16:58:59 +05:00
Vostretsov Nikita
153b78e53f
Update docs/topics/signals.rst
Co-Authored-By: elacuesta <elacuesta@users.noreply.github.com>
2020-02-07 11:08:55 +05:00
Vostretsov Nikita
8817b9e8e9
Update docs/topics/signals.rst
Co-Authored-By: Adrián Chaves <adrian@chaves.io>
2020-02-07 11:07:53 +05:00
Vostretsov Nikita
2f83f3e2cb
Update docs/topics/signals.rst
Co-Authored-By: Adrián Chaves <adrian@chaves.io>
2020-02-07 11:07:43 +05:00
Vostretsov Nikita
84b55b7364
Update docs/topics/signals.rst
Co-Authored-By: Adrián Chaves <adrian@chaves.io>
2020-02-07 11:07:35 +05:00
Joy Bhalla
4f31c3ce01
Document a backward incompatibility that may affect custom schedulers (#4274) 2020-02-06 22:21:33 +01:00
Lane Shaw
3263441fbc
Update RFPDupeFilter line separator for correct universal newlines mode usage (#4283) 2020-02-06 22:14:40 +01:00
Mikhail Korobov
042e71e2b8
Merge pull request #4311 from Gallaecio/metarefresh-ignore-tags
Make METAREFRESH_IGNORE_TAGS an empty list by default
2020-02-06 23:40:45 +05:00
Mikhail Korobov
8d2705f23c
Merge pull request #4305 from Respawnz/patch-1
fix a typo in devloper-tools.rst
2020-02-06 23:17:28 +05:00
elacuesta
35dafef7f1
Specify Twisted reactor (TWISTED_REACTOR setting) (#4294)
* Add the ability to install a specific reactor

* Add docs for the TWISTED_REACTOR setting

* Add tests for the TWISTED_REACTOR setting

* Update asyncio reactor test

* Ignore W503 globally

W503 is not PEP8-compliant:
c59c4376ad

* Line length adjustment

* Adjust asyncio reactor tests

* Merge ASYNCIO_ENABLED and TWISTED_REACTOR settings

* More docs about TWISTED_REACTOR

* Fix asyncio reactor test

* Docs: fix title

* Reword docs

* Check the TWISTED_REACTOR setting outside of the installing function

* Remove unrelated change

* Update scrapy/utils/log.py

Co-Authored-By: Adrián Chaves <adrian@chaves.io>

* Update docs/topics/settings.rst

Co-Authored-By: Adrián Chaves <adrian@chaves.io>

* Update docs/topics/settings.rst

Co-Authored-By: Adrián Chaves <adrian@chaves.io>

Co-authored-by: Adrián Chaves <adrian@chaves.io>
2020-02-06 22:42:34 +05:00
Andrey Rakhmatullin
489ffcda51 Add a test for an async item_scraped handler. 2020-02-06 22:40:11 +05:00
Vostretsov Nikita
4be19e443e name signla catcher in accord with signal name 2020-02-06 13:46:23 +00:00
Vostretsov Nikita
4bcc0933d9 Merge branch 'request_left_downloader_signal' of github.com:whalebot-helmsman/scrapy into request_left_downloader_signal 2020-02-06 13:45:00 +00:00
Vostretsov Nikita
4a91a5427d fix typo 2020-02-06 13:44:51 +00:00
Vostretsov Nikita
6733f4d976
Update docs/topics/signals.rst
Co-Authored-By: elacuesta <elacuesta@users.noreply.github.com>
2020-02-06 18:40:42 +05:00
Mikhail Korobov
bbbb8f1418
Merge pull request #4304 from elacuesta/remove-six-from-tox-ini
Remove elusive six occurrence from tox.ini
2020-02-06 17:09:57 +05:00
Adrián Chaves
576663e5a7 Make METAREFRESH_IGNORE_TAGS an empty list by default 2020-02-06 10:43:20 +01:00
Respawnz
c2cca36821
typo 2020-02-06 05:39:15 +08:00
Eugenio Lacuesta
11941c3244
Remove elusive six occurrence from tox.ini 2020-02-05 13:27:54 -03:00