Can't compare file.readline() line with a string [duplicate]

后端未结

关注

 3  390

鱼传尺愫

相关标签:

3条回答

甜味超标

2020-12-22 06:24
I think this is because its reading new line character in the string try:
```
for line in f:
    line = line.rstrip()
    if (line == '<TEXT>'):
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
不思量自难忘°

2020-12-22 06:35

Make sure 'line' object does not have spaces at the beginning and at the end. You can strip it and then compare.

Because,

x='<TEXT>' is not equal to y='<TEXT> '

Use line = line.strip() and then compare.

0 讨论(0)
发布评论:

提交评论
- 加载中...

日久生厌

2020-12-22 06:46

Instead of parsing the html by yourself, take a look at this built-in python html parser (or this for python 2).

It will probably be easier and more robust than any code you will write by your own.

The example from the python documentation:

from html.parser import HTMLParser

class MyHTMLParser(HTMLParser):
    def handle_starttag(self, tag, attrs):
        print("Encountered a start tag:", tag)

    def handle_endtag(self, tag):
        print("Encountered an end tag :", tag)

    def handle_data(self, data):
        print("Encountered some data  :", data)

parser = MyHTMLParser()
parser.feed('<html><head><title>Test</title></head>'
        '<body><h1>Parse me!</h1></body></html>')

To use this example just add a member to the class which keeps track of the content you have.

0 讨论(0)

热议问题