I\'m doing an analysis of scientific publications from around 30,000 journals. My list has over 1.3 million records but with several duplicates (ex: a paper with more than one a