so i got a clustering project, which i\'ve decided to do a term-document clustering on Hamshahri newspaper (a persian dataset) and in the end of the project i want to do a b