I have a log file (about 50K rows) in the format:
email1@gmail.com:address0:some_details0 email2@gmail.com:address1:some_details1 email1@yahoo.com:address2:so