Question
I want to convert the data on an arbitrary HTML website to RDF. Do you know of any HTML-to-RDF converter? Or would you recommend using XQuery or GRDDL for this purpose?
Answer 1:
In general, a fully automatic conversion is not possible, since HTML says what something looks like, not what it means. If the HTML already contains semantic markup (for example RDFa or microformats), you could use Any23, short for "Anything To Triples" (http://developers.any23.org/), to get RDF out.
If it's just plain HTML, you have to write your own extraction rules somehow. GRDDL would work, but I would probably simply use Python + BeautifulSoup. It depends on what technology or language you already know!
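As a rough illustration of what "your own extraction rules" might look like, here is a minimal sketch. To stay dependency-free it uses the standard library's `html.parser` in place of BeautifulSoup, and it writes N-Triples by hand rather than through an RDF library. The sample page, the CSS class names, the subject URI, the choice of Dublin Core predicates, and the names `RuleExtractor` and `html_to_ntriples` are all assumptions made up for this example.

```python
from html.parser import HTMLParser

# Hypothetical sample page; a real page would be fetched from the site.
SAMPLE_HTML = """
<html><body>
  <h1 class="title">Example Article</h1>
  <span class="author">Jane Doe</span>
</body></html>
"""

# Hypothetical subject URI for the page being described.
PAGE_URI = "http://example.org/page"

# Hand-written extraction rules: CSS class -> RDF predicate URI.
RULES = {
    "title": "http://purl.org/dc/elements/1.1/title",
    "author": "http://purl.org/dc/elements/1.1/creator",
}


class RuleExtractor(HTMLParser):
    """Collects (predicate, text) pairs for elements whose class matches a rule.

    Simplification: a matched element is assumed to contain only text,
    with no nested child elements.
    """

    def __init__(self, rules):
        super().__init__()
        self.rules = rules
        self.current = None  # predicate of the element being read, if any
        self.buffer = []
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for cls in classes:
            if cls in self.rules:
                self.current = self.rules[cls]
                self.buffer = []
                break

    def handle_data(self, data):
        if self.current is not None:
            self.buffer.append(data)

    def handle_endtag(self, tag):
        if self.current is not None:
            text = "".join(self.buffer).strip()
            if text:
                self.pairs.append((self.current, text))
            self.current = None


def escape_literal(text):
    """Escape characters that are not allowed raw in an N-Triples literal."""
    return (text.replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
                .replace("\r", "\\r"))


def html_to_ntriples(html, subject_uri, rules):
    """Apply the rules to the HTML and return one N-Triples line per match."""
    parser = RuleExtractor(rules)
    parser.feed(html)
    parser.close()
    return [
        f'<{subject_uri}> <{predicate}> "{escape_literal(text)}" .'
        for predicate, text in parser.pairs
    ]


if __name__ == "__main__":
    for line in html_to_ntriples(SAMPLE_HTML, PAGE_URI, RULES):
        print(line)
```

The point of the sketch is that the meaning lives in the `RULES` table, which a person has to write for each site. With BeautifulSoup the parsing part would likely shrink to a few selector lookups, but the mapping from page structure to predicates would still be hand-made.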
Answer 2:
If the HTML contains embedded RDFa, then you can use an RDFa parser to extract the information. Parsers are available for various platforms and languages, so the choice depends on your development environment.
Source: https://stackoverflow.com/questions/4876179/converting-websites-in-html-xml-into-rdf