1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-25 18:24:30 +00:00
scrapy/docs/topics/scheduler-middleware.rst
2009-08-29 03:46:58 -03:00

35 lines
919 B
ReStructuredText

.. _topics-scheduler-middleware:
====================
Scheduler middleware
====================
.. _topics-scheduler-middleware-ref:
Built-in scheduler middleware reference
========================================
This page describes all scheduler middleware components that come with
Scrapy.
For a list of the components enabled by default (and their orders) see the
:setting:`SCHEDULER_MIDDLEWARES_BASE` setting.
DuplicatesFilterMiddleware
--------------------------
.. module:: scrapy.contrib.schedulermiddleware.duplicatesfilter
:synopsis: Duplicates Filter Scheduler Middleware
.. class:: DuplicatesFilterMiddleware
Filter out already visited urls.
The :class:`DuplicatesFilterMiddleware` can be configured through the following
settings (see the settings documentation for more info):
* :setting:`DUPEFILTER_CLASS` - The class used to detect and filter
duplicate requests.