Scrapy shell is a command-line interface that helps to debug scripts without running the entire crawler. We have to provide a URL, and Scrapy shell will open up an interface to interact with objects that the spider handles in its callbacks, such as a response object.
Scrapy shell
How to do it...
We can go through some simple usage of Scrapy's interactive shell. The steps are as follows:
- Open up a Terminal window and type the following command:
$ Scrapy shell http://books.toscrape.com/
After loading the Scrapy shell, it will open up an interface to interact with the response object as follows:
data:image/s3,"s3://crabby-images/c1f69/c1f6978e37b0fa346d31899125e357a6a8947234" alt=""
- We can use this interface to debug the selectors for the response object:
>>> response.xpath('//ol/li/article...