how to tell when a HTTP web page has changed when it is of type html/text?
问题 I'm trying to work out the algorithm to tell if non-binary files on the web have changed or not. I was going to go with: LastModified datetime from header, and then if these aren't present fallback to ContentLength from header I'm finding however that for alot of websites the LastModified for the HTML pages are actually just using the current DateTime, hence the approach doesn't work (i.e. would lead to an indication that the page is always changing) I think...? What would be a good algorithm