Remove accented characters from string - Python

小蘑菇 2021-01-28 10:09

I get some data from a webpage and read it like this in Python:

original_doc = urllib2.urlopen(url).read()

Sometimes this url has characters su

2 answers
  •  广开言路
    2021-01-28 10:41

    This should work. It eliminates every character that is not ASCII, so accented letters are dropped entirely rather than replaced by their base letter.

        original_doc = original_doc.decode('unicode_escape').encode('ascii', 'ignore')
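
If the goal is to keep the base letter rather than drop the whole character, one possible approach is Unicode decomposition. This is only a minimal sketch, assuming Python 3 and text that has already been decoded to `str`; the helper names `strip_accents` and `to_ascii` are made up for illustration and are not part of the answer above.

```python
import unicodedata


def strip_accents(text):
    """Hypothetical helper: remove combining accent marks from text."""
    # NFKD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Keep every character except the combining marks themselves.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def to_ascii(text):
    """Hypothetical helper: strip accents, then drop anything still non-ASCII."""
    return strip_accents(text).encode("ascii", "ignore").decode("ascii")


print(strip_accents("café naïve"))  # cafe naive
print(to_ascii("Ærøskøbing"))       # rskbing
```

Letters such as "Æ" and "ø" have no decomposition, so the second call still loses them; handling those would need an explicit replacement table.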
    
