Difference between BeautifulSoup and Scrapy crawler?

隐瞒了意图╮ 2020-12-12 08:52

I want to make a website that compares product prices between Amazon and eBay. Which of these will work better, and why? I am somewhat familiar with Beaut

8 Answers
  • 2020-12-12 09:32

    BeautifulSoup is a library that lets you extract information from a web page.
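
    As a rough illustration of that extraction step, here is a minimal sketch that parses an inline HTML string. The markup, class names, and prices are made up for the example; real Amazon or eBay pages are structured differently.

```python
from bs4 import BeautifulSoup

# Hypothetical product markup, used only for illustration.
html = """
<div class="product">
  <h2 class="title">USB-C Cable</h2>
  <span class="price">$7.99</span>
</div>
<div class="product">
  <h2 class="title">Wireless Mouse</h2>
  <span class="price">$19.50</span>
</div>
"""

# "html.parser" is the parser bundled with Python, so no extra install is needed.
soup = BeautifulSoup(html, "html.parser")

products = []
for node in soup.select("div.product"):
    # Pull the text out of the title and price elements of each product block.
    title = node.select_one("h2.title").get_text(strip=True)
    price_text = node.select_one("span.price").get_text(strip=True)
    products.append({"title": title, "price": float(price_text.lstrip("$"))})

print(products)
```

    Note that BeautifulSoup only parses HTML you already have; fetching the page is a separate step done with another library.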

    Scrapy, on the other hand, is a framework: it does the same extraction and also provides many other things you will probably need in a scraping project, such as pipelines for saving data.
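
    To give an idea of what a pipeline looks like, here is a minimal sketch of one that appends each scraped item to a JSON Lines file. A Scrapy item pipeline is an ordinary Python class; the class name `JsonLinesPipeline` and the file name `items.jl` are assumptions chosen for this example.

```python
import json


class JsonLinesPipeline:
    """Sketch of an item pipeline that writes one JSON object per line."""

    def __init__(self, path="items.jl"):
        # Hypothetical output path; Scrapy creates the pipeline without arguments.
        self.path = path
        self.file = None

    def open_spider(self, spider):
        # Called once when the spider starts.
        self.file = open(self.path, "w", encoding="utf-8")

    def close_spider(self, spider):
        # Called once when the spider finishes.
        self.file.close()

    def process_item(self, item, spider):
        # Called for every item the spider yields; must return the item
        # so that later pipelines can process it too.
        self.file.write(json.dumps(dict(item)) + "\n")
        return item
```

    In a Scrapy project a class like this would be enabled through the `ITEM_PIPELINES` setting, for example `{"myproject.pipelines.JsonLinesPipeline": 300}`, where `myproject` is a placeholder module path.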

    You can check this blog post to get started with Scrapy: https://www.inkoop.io/blog/web-scraping-using-python-and-scrapy/

  • 2020-12-12 09:34

    I think both are good. I'm working on a project right now that uses both. First I scrape all the pages using Scrapy and save them to a MongoDB collection using its pipelines, also downloading the images that exist on each page. After that I use BeautifulSoup4 for post-processing, where I have to change attribute values and extract some special tags.
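
    The post-processing step might look something like the sketch below. The stored HTML fragment and the `images/` target folder are invented for the example and are not taken from the actual project.

```python
from bs4 import BeautifulSoup

# Hypothetical fragment of a page that was saved earlier by the crawler.
stored_html = (
    '<div><img src="/img/a.jpg" alt="A">'
    '<a href="/p/1">Item</a>'
    '<img src="/img/b.jpg"></div>'
)

soup = BeautifulSoup(stored_html, "html.parser")

# Change an attribute value: point each image at a local copy of the file.
for img in soup.find_all("img"):
    filename = img["src"].rsplit("/", 1)[-1]
    img["src"] = "images/" + filename

# Collect specific tags: every link that actually has an href attribute.
links = [a["href"] for a in soup.find_all("a", href=True)]
image_sources = [img["src"] for img in soup.find_all("img")]

# str(soup) gives back the modified HTML, ready to be stored again.
print(str(soup))
```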

    If you don't know in advance which product pages you want, Scrapy is a good tool, since you can use its crawlers to walk through the whole Amazon/eBay website looking for products without writing an explicit for loop.

    Take a look at the Scrapy documentation; it's very simple to use.
