1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-23 23:03:42 +00:00
scrapy/docs/index.rst

261 lines
5.8 KiB
ReStructuredText
Raw Normal View History

.. _topics-index:
2009-12-12 16:51:59 -02:00
==============================
Scrapy |version| documentation
==============================
This documentation contains everything you need to know about Scrapy.
Getting help
============
Having trouble? We'd like to help!
* Try the :doc:`FAQ <faq>` -- it's got answers to some common questions.
* Looking for specific information? Try the :ref:`genindex` or :ref:`modindex`.
* Search for information in the `archives of the scrapy-users mailing list`_, or
`post a question`_.
2009-08-21 21:54:10 -03:00
* Ask a question in the `#scrapy IRC channel`_.
* Report bugs with Scrapy in our `issue tracker`_.
.. _archives of the scrapy-users mailing list: https://groups.google.com/forum/#!forum/scrapy-users
.. _post a question: https://groups.google.com/forum/#!forum/scrapy-users
.. _#scrapy IRC channel: irc://irc.freenode.net/scrapy
.. _issue tracker: https://github.com/scrapy/scrapy/issues
First steps
===========
.. toctree::
:hidden:
intro/overview
intro/install
intro/tutorial
intro/examples
:doc:`intro/overview`
Understand what Scrapy is and how it can help you.
:doc:`intro/install`
Get Scrapy installed on your computer.
:doc:`intro/tutorial`
2009-08-21 21:54:10 -03:00
Write your first Scrapy project.
:doc:`intro/examples`
Learn more by playing with a pre-made Scrapy project.
.. _section-basics:
Basic concepts
==============
.. toctree::
:hidden:
topics/commands
topics/spiders
topics/selectors
topics/items
topics/loaders
topics/shell
topics/item-pipeline
topics/feed-exports
topics/request-response
topics/link-extractors
topics/settings
topics/exceptions
:doc:`topics/commands`
Learn about the command-line tool used to manage your Scrapy project.
:doc:`topics/spiders`
Write the rules to crawl your websites.
:doc:`topics/selectors`
Extract the data from web pages using XPath.
:doc:`topics/shell`
Test your extraction code in an interactive environment.
:doc:`topics/items`
Define the data you want to scrape.
:doc:`topics/loaders`
Populate your items with the extracted data.
:doc:`topics/item-pipeline`
Post-process and store your scraped data.
:doc:`topics/feed-exports`
Output your scraped data using different formats and storages.
:doc:`topics/request-response`
Understand the classes used to represent HTTP requests and responses.
:doc:`topics/link-extractors`
Convenient classes to extract links to follow from pages.
:doc:`topics/settings`
Learn how to configure Scrapy and see all :ref:`available settings <topics-settings-ref>`.
:doc:`topics/exceptions`
See all available exceptions and their meaning.
Built-in services
=================
.. toctree::
:hidden:
topics/logging
topics/stats
topics/email
topics/telnetconsole
topics/webservice
:doc:`topics/logging`
Understand the simple logging facility provided by Scrapy.
:doc:`topics/stats`
Collect statistics about your scraping crawler.
:doc:`topics/email`
Send email notifications when certain events occur.
:doc:`topics/telnetconsole`
Inspect a running crawler using a built-in Python console.
:doc:`topics/webservice`
Monitor and control a crawler using a web service.
Solving specific problems
=========================
.. toctree::
:hidden:
faq
2012-06-21 20:03:33 +02:00
topics/debug
topics/contracts
topics/practices
topics/broad-crawls
topics/firefox
topics/firebug
topics/leaks
2015-04-15 14:26:08 -03:00
topics/media-pipeline
topics/ubuntu
2015-04-09 17:39:01 -03:00
topics/deploy
topics/autothrottle
topics/benchmarking
topics/jobs
:doc:`faq`
Get answers to most frequently asked questions.
2012-06-21 20:03:33 +02:00
:doc:`topics/debug`
Learn how to debug common problems of your scrapy spider.
:doc:`topics/contracts`
Learn how to use contracts for testing your spiders.
:doc:`topics/practices`
Get familiar with some Scrapy common practices.
:doc:`topics/broad-crawls`
Tune Scrapy for crawling a lot domains in parallel.
:doc:`topics/firefox`
Learn how to scrape with Firefox and some useful add-ons.
:doc:`topics/firebug`
Learn how to scrape efficiently using Firebug.
:doc:`topics/leaks`
Learn how to find and get rid of memory leaks in your crawler.
2015-04-15 14:26:08 -03:00
:doc:`topics/media-pipeline`
Download files and/or images associated with your scraped items.
:doc:`topics/ubuntu`
Install latest Scrapy packages easily on Ubuntu
2015-04-09 17:39:01 -03:00
:doc:`topics/deploy`
Deploying your Scrapy spiders and run them in a remote server.
:doc:`topics/autothrottle`
Adjust crawl rate dynamically based on load.
:doc:`topics/benchmarking`
Check how Scrapy performs on your hardware.
:doc:`topics/jobs`
Learn how to pause and resume crawls for large spiders.
.. _extending-scrapy:
Extending Scrapy
================
.. toctree::
:hidden:
topics/architecture
topics/downloader-middleware
topics/spider-middleware
topics/extensions
topics/api
topics/signals
topics/exporters
:doc:`topics/architecture`
Understand the Scrapy architecture.
:doc:`topics/downloader-middleware`
Customize how pages get requested and downloaded.
:doc:`topics/spider-middleware`
Customize the input and output of your spiders.
:doc:`topics/extensions`
Extend Scrapy with your custom functionality
:doc:`topics/api`
Use it on extensions and middlewares to extend Scrapy functionality
:doc:`topics/signals`
See all available signals and how to work with them.
:doc:`topics/exporters`
Quickly export your scraped items to a file (XML, CSV, etc).
All the rest
============
.. toctree::
:hidden:
news
contributing
versioning
experimental/index
:doc:`news`
See what has changed in recent Scrapy versions.
:doc:`contributing`
Learn how to contribute to the Scrapy project.
:doc:`versioning`
2009-08-21 21:54:10 -03:00
Understand Scrapy versioning and API stability.
:doc:`experimental/index`
2009-08-21 21:54:10 -03:00
Learn about bleeding-edge features.