1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-26 11:44:11 +00:00
Pablo Hoffman 409aaade0b Refactored close spider behaviour so that the engine now waits for all
downloading (and enqueued for download) requests to finish and their responses
to be processed in the scraper/spiders, before closing the spider.

This will be required in the future to avoid loosing requests when we add
scheduler persistence and it's also a more correct behaviour overall.

The closing process has also been refactored to remove unneeded closing state
from downloader and leave it only in the engine.

Finally, some unused methods has been removed too, like spider_is_open() for
engine and scheduler.
2011-07-08 11:40:19 -03:00
2009-07-25 15:21:22 -03:00
2011-06-12 01:42:30 -03:00
2011-04-07 02:04:42 -03:00
2009-09-17 02:01:40 -03:00

This is Scrapy, an opensource screen scraping framework written in Python.

For more info visit the project home page at http://scrapy.org

Description
Scrapy, a fast high-level web crawling & scraping framework for Python.
Readme BSD-3-Clause 128 MiB
Languages
Python 99.8%
HTML 0.1%