发表新帖

发表新帖

Using PDFMiner (Python) with online pdf files. Encode the url?

前端未结

关注

 1  949

I am wishing to extract the content of pdf files available online using PDFMiner.

My code is based on the one available in the documentation used to ext

相关标签:

1条回答

悲哀的现实

2021-01-14 09:21
Well, I finally found out a solution,

I resorted on Request and StringIO and got rid off the open('my_file', 'rd') command
```
from urllib2 import Request
from StringIO import StringIO

url = 'my_url'

open = urllib2.urlopen(Request(url)).read()
memoryFile = StringIO(open)

parser = PDFParser(memoryFile)
```
That way Python considers the url as a file (to say so).
0 讨论(0)
发布评论:

提交评论
- 加载中...

热议问题