rcrawler

R data scraping / crawling with dynamic/multiple URLs

拟墨画扇 提交于 2019-12-24 20:12:20
问题 I try to get all decrees of the Federal Supreme Court of Switzerland available at: https://www.bger.ch/ext/eurospider/live/de/php/aza/http/index.php?lang=de&type=simple_query&query_words=&lang=de&top_subcollection_aza=all&from_date=&to_date=&x=12&y=12 Unfortunately, no API is provided. The CSS selectors of the data I want to retrieve is .para I am aware of http://relevancy.bger.ch/robots.txt. User-agent: * Disallow: /javascript Disallow: /css Disallow: /hashtables Disallow: /stylesheets