How to download a text file or some objects from webpage using Python?

后端未结

关注

 2  1632

I am writing a function that downloads and stores the today\'s list of pre-release domains .txt file from http://www.namejet.com/pages/downloads.aspx. I am trying to achieve it

相关标签:

2条回答

星月不相逢

2021-01-28 11:32
I see a few problems with your approach:
1. The page doesn't return any json; so even if you were to access the page successfully, r.json will be empty:
```
>>> import requests
>>> r = requests.get('http://www.namejet.com/pages/downloads.aspx')
>>> r.json
```
2. The file that you are after, is hidden behind a postback link; which you cannot "execute" using requests as it will not understand javascript.
In light of the above, the better approach is to use mechanize or alternatives to emulate a browser. You could also ask the company to provide you with a direct link.
0 讨论(0)
发布评论:

提交评论
- 加载中...

天命终不由人

2021-01-28 11:37

I dont't think mechanize is much use for javascript, use selenium. Here's an example:

In [1]: from selenium import webdriver
In [2]: browser=webdriver.Chrome() # Select browser that you want to automate 
In [3]: browser.get('http://www.namejet.com/pages/downloads.aspx')
In [4]: element=browser.find_element_by_xpath(
            '//a[@id="ctl00_ContentPlaceHolder1_hlPreRelease1"]')

In [5]: element.click()

Now you can find prerelease_10-08-2012.txt in your download folder and you can open it in a usual way.

0 讨论(0)