I am trying to save the output of the scrapy crawl command. I have tried:
scrapy crawl someSpider -o some.json -t json >> some.text
But it doesn't work.
You need to redirect stderr too; you are redirecting only stdout. You can redirect it like this:
scrapy crawl someSpider -o some.json -t json 2> some.text
The key is the number 2, which "selects" stderr as the source for the redirection.
If you would like to redirect both stderr and stdout into one file, you can use:
scrapy crawl someSpider -o some.json -t json &> some.text
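Note that &> is a bash shortcut; if your shell does not support it, the portable equivalent redirects stdout to the file first and then points stderr at the same place:
scrapy crawl someSpider -o some.json -t json > some.text 2>&1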
For more about output redirection: http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO-3.html
For all scrapy commands you can add --logfile NAME_OF_FILE to log to a file, e.g.
scrapy crawl someSpider -o some.json --logfile some.text
There are two other useful command line options for logging:
-L or --loglevel to control the logging level, e.g. -L INFO (the default is DEBUG)
--nolog to disable logging completely
These options are documented in the Scrapy command-line documentation.
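For example, to write an INFO-level log to a file while still exporting the scraped items (the file names here are just placeholders):
scrapy crawl someSpider -o some.json -L INFO --logfile some.text
Or, to disable logging entirely:
scrapy crawl someSpider -o some.json --nolog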
You can use nohup:
nohup scrapy crawl someSpider &
The log will be stored in nohup.out.
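If you would rather pick the file name yourself instead of using nohup.out, you can redirect both streams explicitly (the file name is just an example):
nohup scrapy crawl someSpider > some.text 2>&1 &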
You can add these lines to your settings.py:
LOG_STDOUT = True
LOG_FILE = '/tmp/scrapy_output.txt'
And then start your crawl normally:
scrapy crawl someSpider
You can save it as a log file:
scrapy crawl someSpider -s LOG_FILE=fileName.log -L <loglevel>
loglevel can be one of CRITICAL, ERROR, WARNING, INFO and DEBUG, or use --nolog for no log. For more information, read the docs.
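A concrete example, to capture only warnings and errors in a file (the file name is just a placeholder):
scrapy crawl someSpider -s LOG_FILE=fileName.log -L WARNING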
scrapy crawl someSpider --logfile some.text
This will do exactly what you are looking for: it saves the output of the command that you see on the screen to a text file.