Grabbing text from a webpage

南旧

I would like to write a program that will find bus stop times and update my personal webpage accordingly.

If I were to do this manually I would

Visit

8 Answers

无人及你

2021-01-03 10:16

Since you write in C, you may want to check out cURL; in particular, take a look at libcurl. It's great.

攒了一身酷

2021-01-03 10:20

You can use the mechanize library, which is available for Python: http://wwwsearch.sourceforge.net/mechanize/

生来不讨喜

2021-01-03 10:20

This is called Web scraping, and it even has its own Wikipedia article where you can find more information.

Also, you might find more details in this SO discussion.

情歌与酒

2021-01-03 10:22

Beautiful Soup is a Python library designed for parsing web pages. Between it and urllib2 (urllib.request in Python 3) you should be able to figure out what you need.

Happy的楠姐

2021-01-03 10:37

What you're asking about is called "web scraping." I'm sure if you google around you'll find some stuff, but the core notion is that you want to open a connection to the website, slurp in the HTML, parse it and identify the chunks you want.

The Python Wiki has a good lot of stuff on this.
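
As a rough illustration of those steps (fetch the HTML, parse it, pick out the chunks you want), here is a minimal sketch that uses only the Python standard library. The CSS class `departure`, the helper names `fetch_html` and `extract_texts`, and the sample markup are all hypothetical; a real timetable page will need its own selector logic.

```python
from html.parser import HTMLParser
from urllib.request import urlopen

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class ClassTextExtractor(HTMLParser):
    """Collect the text of every element that carries a given CSS class."""

    def __init__(self, wanted_class):
        super().__init__()
        self.wanted_class = wanted_class
        self.depth = 0      # > 0 while we are inside a matching element
        self.chunks = []    # text pieces of the element being collected
        self.results = []   # finished text of each matching element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.depth or self.wanted_class in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:
            self.results.append("".join(self.chunks).strip())
            self.chunks = []

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


def fetch_html(url, timeout=10):
    """Open a connection and slurp in the page as text."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_texts(html, css_class="departure"):
    """Parse the HTML and return the text of elements with css_class."""
    parser = ClassTextExtractor(css_class)
    parser.feed(html)
    parser.close()
    return parser.results


if __name__ == "__main__":
    # Hypothetical markup standing in for a real bus timetable page.
    sample = (
        '<ul><li class="departure">10:15</li>'
        '<li class="departure late">10:45 <b>delayed</b></li>'
        "<li>ignore</li></ul>"
    )
    print(extract_texts(sample))
```

To run it against a live page, you would pass the result of `fetch_html(url)` to `extract_texts` instead of the sample string. For messy real-world HTML, a dedicated parser such as Beautiful Soup is usually more forgiving than this hand-rolled approach.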

温柔的废话

2021-01-03 10:37
You can use Perl to help you complete your task.
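```
use strict;
use warnings;
use LWP::UserAgent;

my $browser = LWP::UserAgent->new;

# get() returns an HTTP::Response object
my $response = $browser->get("http://google.com");
die "Request failed: ", $response->status_line, "\n" unless $response->is_success;
print $response->content;
```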
The response object can tell you whether the request succeeded, as well as return the content of the page. You can also use the same library to POST to a page.

Here is some documentation. http://metacpan.org/pod/LWP::UserAgent