Are there any standards at all in the way that the addresses are recorded? For example:
- Are there always commas or new-lines separating street1 from street2 from city from state from zip?
- Are address types (road, street, boulevard, etc.) always spelled out? Always abbreviated? Some of each?
- What exactly counts as an "error"? Please define it.
My general answer is a series of regular expressions, though how complex they need to be depends on the answers to the questions above. If there is no consistency at all, a regex may only get you partial success (e.g. pulling out the zip code and state), and you will have to do the rest by hand, or at least go through the rest very carefully to make sure you spot the errors.
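As a rough illustration of the "partial success" case, here is a minimal Python sketch that peels the state and zip code off the end of an address and leaves everything else untouched for manual review. It assumes US-style addresses that end in a two-letter uppercase state followed by a 5-digit zip (optionally ZIP+4); the pattern and the function name `split_state_zip` are my own, not anything from your data, so adjust them to whatever conventions you actually find.

```python
import re

# Assumption: the address ends with a two-letter uppercase state and a
# 5-digit zip code, optionally followed by a "-1234" ZIP+4 extension.
STATE_ZIP = re.compile(r"\b([A-Z]{2})[\s,]+(\d{5})(?:-(\d{4}))?\s*$")

def split_state_zip(address):
    """Return (rest, state, zip_code), or None if the tail does not match.

    `rest` is everything before the state (street lines and city),
    which is left unparsed for a later pass or for manual checking.
    """
    match = STATE_ZIP.search(address)
    if match is None:
        return None  # flag this record for review by hand
    rest = address[:match.start()].rstrip(" ,\n")
    zip_code = match.group(2)
    if match.group(3):
        zip_code += "-" + match.group(3)
    return rest, match.group(1), zip_code
```

Anything for which this returns `None` goes into the "check by hand" pile. Note that it only checks the shape of the state and zip, not whether they are real or belong together.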