Fast algorithm to find a word in an HTML page
Question: I need to write a boolean function that returns true if a word appears in the text of an HTML page and false otherwise. I know this is easy to do with the lxml library by walking the whole page tree until the word is found, but iterating through every HTML block to check for the word seems inefficient. Any suggestions for a faster algorithm? (I need to run this search many times.)

Answer 1: As long as you're not worried about accidentally finding the word in an element attribute or something
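
A minimal sketch of the trade-off raised here, in Python (assumed, since the question mentions lxml). It uses the standard library's `html.parser` instead of lxml so it runs without extra dependencies. The names `word_in_raw_html`, `page_words`, and `word_in_page` are hypothetical. A plain substring check on the raw markup is the cheapest option but can match attribute values or script code. If many words are checked against the same page, extracting the visible text once and storing its words in a set makes each later lookup a constant-time set membership test on average.

```python
import re
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def word_in_raw_html(html, word):
    """Fastest check: substring search over the raw markup.

    Caveat: this also matches tag names, attribute values, and script code.
    """
    return word in html


def page_words(html):
    """Parse the page once and return the set of lowercase words in its text."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    text = " ".join(collector.chunks)
    return set(re.findall(r"\w+", text.lower()))


def word_in_page(words, word):
    """Case-insensitive lookup against a set built by page_words()."""
    return word.lower() in words
```

Build the set once per page with `words = page_words(html)` and reuse it for every query. This version only matches whole words, so it will not find a substring inside a longer word, and it will not find multi-word phrases.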