How to parse inbound GAE email manually?

谁都会走 提交于 2019-12-11 20:45:30

问题


Since I am having problems with standard GAE's e-mail functionality when e-mail subject is UTF-8 encoded, I am trying to handle it manually:

msg_encoding = self.request.headers['Content-Type'].split('charset=')[1] # message/rfc822; charset=UTF-8
msg = email.message_from_string(self.request.body)
if msg:
    logging.debug(msg.get_content_charset()) # None
    logging.debug(msg['to'])
    logging.debug(msg['from'])
    logging.debug(msg['Subject'].decode(msg_encoding))

Do I do it correctly? Should I decode Subject? Or is it done by email parser automatically?


回答1:


The following simplified code (text/plain only, no attachments) works well for me:

class InboundMailHandler(InboundMailHandler):
    def post(self):
        msg = email.message_from_string(self.request.body)
        if msg:
            charset = 'utf-8'
            from_realname, from_emailaddr = email.utils.parseaddr(msg['from'])
            to_realname, to_emailaddr = email.utils.parseaddr(msg['to'])
            # parse e-mail body, look for text/plain part only
            body = ''
            if msg.is_multipart():
                for part in msg.walk():
                    if part.get_content_type() and part.get_content_type()=='text/plain': # ignore text/html
                        charset = part.get_content_charset()
                        body = part.get_payload(decode=True).decode(part.get_content_charset())
            else:
                body = msg.get_payload(decode=True)
                body = body.decode('utf-8')
            subject, encoding = email.header.decode_header(msg['subject'])[0]
            if encoding != None:
                subject = subject.decode(encoding)
            else:
                subject = subject.decode('utf-8')


来源:https://stackoverflow.com/questions/16043780/how-to-parse-inbound-gae-email-manually

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!