RDF list subjects with their objects in a single line

别说谁变了你拦得住时间么 提交于 2019-11-28 08:57:17

问题


I have an RDF file and I need to extract some information from it and write it to a file. I understood how it basically works, but I'm stuck with this:

String queryString = "select ?person ?children where { ?person ?hasChildren ?children}";
TupleQuery tupleQuery = conn.prepareTupleQuery(QueryLanguage.SPARQL, queryString);
TupleQueryResult result = tupleQuery.evaluate();     
while (result.hasNext()) {
    BindingSet bindingSet = result.next();
    Value p1 = bindingSet.getValue("person");
    Value p2 = bindingSet.getValue("child");
    println(p1 + " has children " + p2 +"");
}
result.close();

The output I get is like this:

http://example.org/people/person1 has children http://example.org/people/child1
http://example.org/people/person1 has children http://example.org/people/child2

I don't see how to list all the persons with their objects in this format:

person1 has children child1 and child2

How can this be done?


回答1:


You may find this answer, which describes SPARQL's group_concat, useful:

  • Aggregating results from SPARQL query

In SPARQL, when you have a result set of query solutions, you can group on one or more of the variables, merging solutions that have these variables in common. For instance, consider the data

@prefix : <http://example.org/people/>.

:person1 :hasChild :child1, :child2, :child3 .
:person2 :hasChild :child4, :child5 .
:person3 :hasChild :child6 .

If you run the following query on it

prefix : <http://example.org/people/>

select ?person ?child where { 
  ?person :hasChild ?child .
}

you get results like this:

$ arq --data data.n3 --query query.sparql
----------------------
| person   | child   |
======================
| :person3 | :child6 |
| :person2 | :child5 |
| :person2 | :child4 |
| :person1 | :child3 |
| :person1 | :child2 |
| :person1 | :child1 |
----------------------

Iterating through the results as you have in your question would produce the type of output that you're currently getting. What we'd like to do is to actually get results like:

$ arq --data data.n3 --query query.sparql
----------------------------------------
| person   | child                     |
========================================
| :person3 | :child6                   |
| :person2 | :child4, :child5          |
| :person1 | :child1, :child2, :child3 |
----------------------------------------

and that's exactly what group_by lets us do. A query like this:

prefix : <http://example.org/people/>

select ?person (group_concat(?child;separator=' and ') as ?children) where { 
  ?person :hasChild ?child .
}
group by ?person

produces (notice that the variable in the result is ?children, not ?child, because we've used group_concat(...) as ?children to create the new variable ?children):

$ arq --data data.n3 --query query.sparql
---------------------------------------------------------------------------------------------------------------------------
| person   | children                                                                                                     |
===========================================================================================================================
| :person3 | "http://example.org/people/child6"                                                                           |
| :person1 | "http://example.org/people/child3 and http://example.org/people/child2 and http://example.org/people/child1" |
| :person2 | "http://example.org/people/child5 and http://example.org/people/child4"                                      |
---------------------------------------------------------------------------------------------------------------------------

If you use a query like this and iterate through the results, printing them as you have, you'll get output like you want. If you do want to strip the leading http://example.org/people/ off from the persons and children, you'll need a bit more string processing. For instance, using STRAFTER to remove the http://example.org/people/ prefix, you can use a query like this:

prefix : <http://example.org/people/>

select
 (strafter(str(?personX),"http://example.org/people/") as ?person)
 (group_concat(strafter(str(?child),"http://example.org/people/");separator=' and ') as ?children)
where { 
  ?personX :hasChild ?child .
}
group by ?personX

to get results like:

$ arq --data data.n3 --query query.sparql
----------------------------------------------
| person    | children                       |
==============================================
| "person3" | "child6"                       |
| "person2" | "child5 and child4"            |
| "person1" | "child3 and child2 and child1" |
----------------------------------------------

which, when you do your printing, will give you results like

person3 has children child6
person2 has children child5 and child4
person1 has children child3 and child2 and child1


来源:https://stackoverflow.com/questions/18485461/rdf-list-subjects-with-their-objects-in-a-single-line

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!