1
0
mirror of https://github.com/scrapy/scrapy.git synced 2025-02-24 07:03:52 +00:00

Do not commit on any HTML parsing speed assumption with BS4+lxml

This commit is contained in:
Paul Tremberth 2016-06-14 19:33:56 +02:00
parent 6cbd92fac1
commit 1ff9a4828b

View File

@ -35,8 +35,7 @@ for parsing HTML responses in Scrapy callbacks.
You just have to feed the response's body into a ``BeautifulSoup`` object
and extract whatever data you need from it.
Here's an example spider using BeautifulSoup API, with ``lxml`` as the HTML parser
(so you get the same parsing speed as with scrapy/parsel selectors)::
Here's an example spider using BeautifulSoup API, with ``lxml`` as the HTML parser::
from bs4 import BeautifulSoup