I currently think that the other party may read the html through ajax-like technology, then modify it through regular rules and then output it, so is it possible to use a non-browser interception method to retrieve the data?
You can perform statistical analysis on the entrance IP of your website to see which IP they use to extract the content of your website, and then ban this IP.
I immediately thought of nginx redirection
Maybe some domain name providers can also do this
You can consider doing some tricks on the page, such as header detection and source checking
Give me an example
3003 is the original version
80 is redirected by nginx
You can look at the header of the response to see if there is any difference. It’s easy to catch clues
I currently think that the other party may read the html through ajax-like technology, then modify it through regular rules and then output it, so is it possible to use a non-browser interception method to retrieve the data?
You can perform statistical analysis on the entrance IP of your website to see which IP they use to extract the content of your website, and then ban this IP.