Doesn’t RSS have a GUID? Save the latest GUID and make a judgment when crawling again. Whether or not RSS has been updated is the business of other people’s server programs and you can’t control it either
lz, please give me this program code! The final topic is this. I would like to ask the poster for help. I have zero basic knowledge and how to complete this project quickly. Crab
Theoretically, RSS should return a last-modified or etag (atom) in the http header, which can be judged by this
In python’s feedparser, you can use it like this
If there is no update, you will not get anything the second time
Doesn’t RSS have a GUID? Save the latest GUID and make a judgment when crawling again. Whether or not RSS has been updated is the business of other people’s server programs and you can’t control it either
lz, please give me this program code! The final topic is this. I would like to ask the poster for help. I have zero basic knowledge and how to complete this project quickly. Crab