Description
To execute your spider, run the following command from within the first_scrapy directory:
scrapy crawl first
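Here, first is the name that was given to the spider when it was created.
Once the spider has finished crawling, you should see output similar to the following: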
2016-08-09 18:13:07-0400 [scrapy] INFO: Scrapy started (bot: tutorial)
2016-08-09 18:13:07-0400 [scrapy] INFO: Optional features available: ...
2016-08-09 18:13:07-0400 [scrapy] INFO: Overridden settings: {}
2016-08-09 18:13:07-0400 [scrapy] INFO: Enabled extensions: ...
2016-08-09 18:13:07-0400 [scrapy] INFO: Enabled downloader middlewares: ...
2016-08-09 18:13:07-0400 [scrapy] INFO: Enabled spider middlewares: ...
2016-08-09 18:13:07-0400 [scrapy] INFO: Enabled item pipelines: ...
2016-08-09 18:13:07-0400 [scrapy] INFO: Spider opened
2016-08-09 18:13:08-0400 [scrapy] DEBUG: Crawled (200) (referer: None)
2016-08-09 18:13:09-0400 [scrapy] DEBUG: Crawled (200) (referer: None)
2016-08-09 18:13:09-0400 [scrapy] INFO: Closing spider (finished)
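As you can see in the output, there is one log line for each URL. The text (referer: None) indicates that the URL is a start URL and therefore has no referrer. After the run, you should also see two new files named Books.html and Resources.html in the first_scrapy directory.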
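If you want to confirm the result of a run without reading the log by eye, a small script can do the two checks described above. The following is only a minimal sketch using the Python standard library, not part of Scrapy itself. The helper names summarize_crawl and missing_files are hypothetical, and the regular expression assumes the log lines look like the sample output shown earlier.

```python
import re
from pathlib import Path

# Assumption: "Crawled" lines follow the format of the sample output, e.g.
# 2016-08-09 18:13:08-0400 [scrapy] DEBUG: Crawled (200) (referer: None)
CRAWLED = re.compile(r"DEBUG: Crawled \((\d{3})\).*\(referer: ([^)]*)\)")


def summarize_crawl(log_text):
    """Return one (status, referer) tuple per 'Crawled' line in the log text."""
    results = []
    for line in log_text.splitlines():
        match = CRAWLED.search(line)
        if match:
            results.append((int(match.group(1)), match.group(2)))
    return results


def missing_files(project_dir, expected=("Books.html", "Resources.html")):
    """Return the expected output files that are not present in project_dir."""
    return [name for name in expected if not (Path(project_dir) / name).is_file()]


# Hypothetical usage: the log text is pasted in here for illustration.
sample_log = (
    "2016-08-09 18:13:07-0400 [scrapy] INFO: Spider opened\n"
    "2016-08-09 18:13:08-0400 [scrapy] DEBUG: Crawled (200) (referer: None)\n"
    "2016-08-09 18:13:09-0400 [scrapy] DEBUG: Crawled (200) (referer: None)\n"
)
for status, referer in summarize_crawl(sample_log):
    # A referer of "None" marks a start URL.
    print(status, "start URL" if referer == "None" else "followed from " + referer)

# Lists whichever of the two files are absent from the current directory.
print("Missing files:", missing_files("."))
```

Run the script from the first_scrapy directory so that the relative path "." points at the place where the spider writes its files.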