I want to find the percentage of similarity between two large texts. These texts are very long, and there are often more than 3,000 words ins