I am working on a link-checking script to monitor a domain I manage. I am getting an error when the 9th URL is run through the findLinks() function.
You cannot iterate over (use the in keyword to check the contents of) None, which is the default returned from get() when it fails to find the provided name, so using an empty list as the default (second argument) will prevent the error:
for link in soup.find_all('a'):
    # collect absolute hrefs that point to google.com
    if "google.com" in link.get('href', []):
        linksToCrawl.append(link.get('href'))
You may still wish to confirm that link.get('href') returns something truthy before getting this far into the function.
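For example, one way to apply that truthiness check is to fetch the href once, then guard on it before the substring test (a minimal sketch; the HTML fragment here is invented for illustration):

```python
from bs4 import BeautifulSoup

# Hypothetical page: one matching link, one <a> with no href, one other domain.
html = """
<a href="https://google.com/maps">Maps</a>
<a>no href at all</a>
<a href="https://example.com/">other site</a>
"""

soup = BeautifulSoup(html, "html.parser")
linksToCrawl = []
for link in soup.find_all("a"):
    href = link.get("href")
    # Skip tags whose href is missing (None) or empty before checking it.
    if href and "google.com" in href:
        linksToCrawl.append(href)

print(linksToCrawl)  # → ['https://google.com/maps']
```

Storing the result of get() in a local variable also avoids calling it twice per tag.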