class OnlineSpider(scrapy.Spider): name = "online" allowed_domains = ["www.onlinedown.net"] start_urls = ['http://www.onlinedown.net/new/android/','http://www.onlinedown.net/new/ios/','http://www.onlinedown.net/new/windows/']# model = Model('onlinedown') def start_request(self): for url in self.start_urls: for x in range(1, 100): detail_url = url + str(x) + '.html' print detail_url yield scrapy.Request(detail_url, callback = self.parse)
There are 35 items per page, and the result is 105 items. Why is this.
How to write the parse function
Where is your parse method?