CSS selector issue with rvest and NHL statistics

礼貌的吻别

I want to scrape data from hockey-reference.com, specifically from this link:

https://www.hockey-reference.com/leagues/NHL_1991.html

I want the 4th table, called

1 answer

一整个雨季

2021-01-28 14:51

You can try using RSelenium. Saw a similar answer here: Web Scraping Basketball Reference using R.

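library(rvest)
library(RSelenium)

# startServer() is defunct in recent RSelenium releases. Start a Selenium
# server separately (it listens on port 4444 by default) before connecting.
remDr <- remoteDriver(browserName = "chrome")
remDr$open()

# Load the page in a real browser so its JavaScript runs, then parse the rendered HTML
remDr$navigate("https://www.hockey-reference.com/leagues/NHL_1991.html")
page <- read_html(remDr$getPageSource()[[1]])

# Parse every table on the rendered page and pick the wanted one by position
tables <- html_table(page, fill = TRUE)
tables[[28]]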
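
A positional index such as 28 will break whenever the site adds or removes a table, so it may be safer to select the table by its id attribute. Here is a minimal sketch on a made-up HTML snippet. The ids "standings" and "stats" are hypothetical, so you would need to look up the real id in the rendered page source.

```r
library(rvest)

# Made-up page with two tables; the ids are hypothetical, not taken from hockey-reference.com
demo_page <- read_html('<html><body>
<table id="standings"><tr><th>Team</th><th>W</th></tr><tr><td>AAA</td><td>40</td></tr></table>
<table id="stats"><tr><th>Team</th><th>GF</th></tr><tr><td>AAA</td><td>250</td></tr></table>
</body></html>')

# Select one table by its id attribute instead of by position
stats_node <- html_node(demo_page, "table#stats")

# html_table() on a single node returns one data frame
stats <- html_table(stats_node)
stats
```

With the Selenium approach, the same html_node() call should work on the page object built from remDr$getPageSource().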

Selenium is a pain to install, though. I would try to help with that too, but I installed it a while ago and don't really remember the steps. Good luck.
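
If installing Selenium is too much trouble, there may be an rvest-only route. Sports-reference sites are known to ship some tables inside HTML comments, which is why a plain CSS selector finds nothing. I have not checked that this holds for the 1991 page, so treat the following as a sketch under that assumption. The helper name extract_commented_tables is made up for illustration, and the demo runs on an invented snippet rather than the real site.

```r
library(rvest)
library(xml2)

# Hypothetical helper: parse tables that a page ships inside HTML comments
extract_commented_tables <- function(page) {
  # Collect the text of every comment node in the document
  comments <- xml_text(xml_find_all(page, "//comment()"))
  # Keep only the comments that contain table markup
  comments <- comments[grepl("<table", comments, fixed = TRUE)]
  # Re-parse each comment as HTML and extract the tables it holds
  tables <- lapply(comments, function(txt) html_table(read_html(txt)))
  # Flatten the list of lists into a single list of data frames
  unlist(tables, recursive = FALSE)
}

# Offline demo on an invented snippet (not real hockey-reference.com markup)
demo_html <- paste0(
  '<html><body><div><!-- <table id="stats">',
  '<tr><th>Team</th><th>GP</th></tr>',
  '<tr><td>AAA</td><td>80</td></tr>',
  '</table> --></div></body></html>'
)
demo_tables <- extract_commented_tables(read_html(demo_html))
demo_tables[[1]]
```

On the real page you would pass read_html("https://www.hockey-reference.com/leagues/NHL_1991.html") to the helper and then inspect the returned list to find the table you want.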

Follow-up from the person who posted the original question:

The above answer worked, but I had to go through Homebrew:

https://brew.sh/

And then I had to use the following code from here:

Using Selenium on Mac Chrome

# download selenium jar
curl -L0 https://selenium-release.storage.googleapis.com/3.9/selenium-server-standalone-3.9.1.jar -o selenium-server-standalone.jar

# install chromedriver
brew install chromedriver

# start chrome driver
brew services start chromedriver
#==> Successfully started `chromedriver` (label: homebrew.mxcl.chromedriver)

# start selenium server
java -jar selenium-server-standalone.jar
#14:38:20.684 INFO - Selenium build info: version: '3.9.1', revision: '63f7b50'
#14:38:20.685 INFO - Launching a standalone Selenium Server on port 4444
