Allow blank nodes to be reused across expressions #34

RubenVerborgh · 2019-08-29T15:43:26Z

Currently, a blank node resulting from one expression cannot be reused in another. Following the example of https://github.com/solid/query-ldflex/issues/33, the following snippets return different results:

const alice = "https://drive.verborgh.org/public/2019/blanks.ttl#Alice";
for await (const name of solid.data[alice].friends.name)
  console.log(`${name}`)

const alice = "https://drive.verborgh.org/public/2019/blanks.ttl#Alice";
for await (const friend of solid.data[alice].friends)
  console.log(`${await friend.name}`)

This happens because Alice's friends are identified by blank nodes, and they lose context across multiple expressions. If we retry both snippets with Alice2, which has IRI friends, we get the same results.

We could try to make blank nodes reusable across expressions by skolemizing them internally. Here is a sketch of how that could work:

When outputting blank nodes, Comunica assigns an internal identifier to them. For instance, _:b1 is still output as a BlankNode, but has a special internal field .skolemized that contains urn:skolem:1234.
When a SPARQL query is generated from such a skolemized blank node, the skolemized IRI is used instead of a blank node.
When returning results, any skolemized NamedNode is turned back into a skolemized BlankNode.
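To make the three steps above concrete, here is a minimal sketch using plain term objects. The helper names (`createBlankNode`, `toQueryTerm`, `fromResultTerm`) and the lookup map are hypothetical and are not part of Comunica or LDflex; only the `.skolemized` field and the `urn:skolem:` prefix come from the description above.

```javascript
// Hypothetical helpers illustrating the three steps; not an existing API.
const SKOLEM_PREFIX = 'urn:skolem:';
const labelsBySkolemIri = new Map();
let counter = 0;

// Step 1: output a BlankNode that carries an internal skolemized NamedNode.
function createBlankNode(label) {
  const iri = `${SKOLEM_PREFIX}${++counter}`;
  labelsBySkolemIri.set(iri, label);
  return {
    termType: 'BlankNode',
    value: label,
    skolemized: { termType: 'NamedNode', value: iri },
  };
}

// Step 2: when generating a SPARQL query, use the skolemized IRI
// instead of the blank node label.
function toQueryTerm(term) {
  if (term.termType === 'BlankNode')
    return term.skolemized ? `<${term.skolemized.value}>` : `_:${term.value}`;
  return `<${term.value}>`;
}

// Step 3: when returning results, turn a known skolemized NamedNode
// back into a (skolemized) BlankNode; other terms pass through unchanged.
function fromResultTerm(term) {
  if (term.termType === 'NamedNode' && labelsBySkolemIri.has(term.value)) {
    return {
      termType: 'BlankNode',
      value: labelsBySkolemIri.get(term.value),
      skolemized: term,
    };
  }
  return term;
}
```

With this shape, a friend returned by one expression still looks like a BlankNode to the caller, while a follow-up expression can address it through its skolemized IRI.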

The key is inserting skolemization and deskolemization processing in the right place, for which I need to ask @rubensworks for help.

We could simply skolemize upon parsing, and then deskolemize right before results are returned. This works in all cases, except when Comunica directly operates on a store (the contents of which it did not parse, so it can contain actual blank nodes).

An alternative approach is a skolemizing store wrapper. It takes a store as an argument and translates on the fly in its match and other methods.

Perhaps both approaches can be used in conjunction: skolemization in parsers for all cases, except when a store is passed, then we wrap it.
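A rough sketch of what such a store wrapper could look like follows. It assumes a simplified store whose `match(subject, predicate, object)` returns an array of quads and treats `null` as a wildcard; real RDF/JS stores return streams and richer term objects. The class name `SkolemizingStore` and the choice to embed the blank node label in the IRI are assumptions made for illustration.

```javascript
// Hypothetical wrapper; assumes store.match() returns an array of quads.
class SkolemizingStore {
  constructor(store, prefix = 'urn:skolem:') {
    this.store = store;
    this.prefix = prefix;
  }

  // Blank node -> skolem IRI, for terms leaving the store.
  skolemize(term) {
    return term && term.termType === 'BlankNode'
      ? { termType: 'NamedNode', value: this.prefix + term.value }
      : term;
  }

  // Skolem IRI -> blank node, for patterns entering the store.
  deskolemize(term) {
    return term && term.termType === 'NamedNode' && term.value.startsWith(this.prefix)
      ? { termType: 'BlankNode', value: term.value.slice(this.prefix.length) }
      : term;
  }

  // Translate the pattern on the way in and the results on the way out.
  match(subject, predicate, object) {
    return this.store
      .match(this.deskolemize(subject), predicate, this.deskolemize(object))
      .map(quad => ({
        subject: this.skolemize(quad.subject),
        predicate: quad.predicate,
        object: this.skolemize(quad.object),
      }));
  }
}
```

The wrapped store itself never sees skolem IRIs, so its contents can keep using actual blank nodes.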


rubensworks · 2019-08-29T15:51:58Z

Skolemization does indeed sound like the right way to go here.

This issue in Comunica would be a requirement for this: comunica/comunica#355
And (AFAICS) that's probably everything that needs to be done, as all requests go via the federated actor.

justinwb · 2020-03-09T18:53:41Z

Checking to see if this issue has been slated for work in the near term? We're running into some use cases that are blocked by this.

RubenVerborgh · 2020-03-09T19:02:52Z

@justinwb Will schedule it in @rubensworks' agenda.

RubenVerborgh · 2020-03-09T19:06:30Z

Or maybe also @joachimvh, they can decide who is most appropriate.

rubensworks · 2020-03-13T14:33:48Z

I've started working on the solution described in comunica/comunica#355.

This will mean that Comunica will not output any blank nodes anymore originating from sources (unless enforced by SPARQL via BNODE()). (The solution you describe would solve the problem here, but not the problem described in comunica/comunica#355)

As far as I know, this shouldn't be a problem for users downstream, and it should be SPARQL spec-compliant.
@RubenVerborgh Do you agree?

RubenVerborgh · 2020-03-13T22:51:59Z

> This will mean that Comunica will not output any blank nodes anymore originating from sources

That by itself seems too strong? Should at least be a switch (off by default to have correct query semantics)?

> SPARQL spec-compliant

Do we have evidence? Perhaps something @Dexagod might want to dive into?

rubensworks · 2020-03-16T08:13:18Z

Hmm, I just realized there is an isBlank function. If we'd skolemize everything, this would definitely change the semantics of the query, which is not what we want.
So skolemizing everything may indeed be too radical. The internal .skolemized field may be a better solution.
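The difference between the two options can be illustrated with a simplified stand-in for SPARQL's isBlank operating on plain term objects. This is only a sketch; the `isBlank` arrow function below is not Comunica's implementation, and the term values are made up.

```javascript
// Simplified stand-in for SPARQL's isBlank(), for illustration only.
const isBlank = term => term.termType === 'BlankNode';

// Option A: skolemize everything, so the blank node becomes a NamedNode.
const fullySkolemized = { termType: 'NamedNode', value: 'urn:skolem:1234' };

// Option B: keep the BlankNode and attach an internal .skolemized field.
const annotated = {
  termType: 'BlankNode',
  value: 'b1',
  skolemized: { termType: 'NamedNode', value: 'urn:skolem:1234' },
};

console.log(isBlank(fullySkolemized)); // false: query semantics changed
console.log(isBlank(annotated));       // true: query semantics preserved
```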

When federating over multiple sources, the blank nodes coming from each source will be mapped to distinct blank nodes, so that they cannot be joined across different sources, even if they have the same blank node label. A reverse translation also takes place for incoming queries with blank nodes, so that these blank nodes will only match if they come from that source.

Blank nodes coming from sources will receive a .skolemized field containing a named node. This named node can be queried again as an IRI, and Comunica will interpret it as a blank node of the corresponding source, assuming that the array of sources remains the same.

Closes #355
Required for LDflex/Query-Solid#34
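As a rough illustration of the scoping described above, the sketch below tags each blank node with the index of its source and reverses that translation for incoming terms. The function names, the label format, and the `urn:example:skolem:` prefix are all hypothetical; they are not the identifiers Comunica actually uses.

```javascript
// Hypothetical scoping helpers; label and IRI formats are illustrative only.
const SCOPE_PREFIX = 'urn:example:skolem:source_';

// Outgoing: give a blank node from source `sourceIndex` a source-specific
// label and a .skolemized named node that encodes the source.
function scopeBlankNode(term, sourceIndex) {
  if (term.termType !== 'BlankNode')
    return term;
  return {
    termType: 'BlankNode',
    value: `source_${sourceIndex}_${term.value}`,
    skolemized: {
      termType: 'NamedNode',
      value: `${SCOPE_PREFIX}${sourceIndex}:${term.value}`,
    },
  };
}

// Incoming: translate a skolemized IRI back to the original blank node,
// but only for the source it came from. Returns null when the term belongs
// to a different source and therefore cannot match there.
function unscopeForSource(term, sourceIndex) {
  if (term.termType !== 'NamedNode' || !term.value.startsWith(SCOPE_PREFIX))
    return term;
  const ownPrefix = `${SCOPE_PREFIX}${sourceIndex}:`;
  return term.value.startsWith(ownPrefix)
    ? { termType: 'BlankNode', value: term.value.slice(ownPrefix.length) }
    : null;
}
```

Because the source index is part of the identifier, this only stays valid while the array of sources keeps the same order, which matches the caveat above.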

justinwb · 2020-03-27T18:48:11Z

Saw a commit go in at comunica/comunica#624 - is the expectation that this will provide the full resolution or is there now additional work to do in ldflex to take advantage of the changes?

RubenVerborgh · 2020-03-29T13:20:41Z

@justinwb That should be it, mostly. We'd still need to add LDflex support for the .skolemized field for some cases, and test.

rubensworks · 2020-03-30T10:10:41Z

Comunica 1.11.0 has now been released with this new feature, so implementing .skolemized support into LDflex should be possible now.

justinwb · 2020-04-06T18:08:36Z

> Comunica 1.11.0 has now been released with this new feature, so implementing .skolemized support into LDflex should be possible now.

Awesome, thanks for the update! Is anyone slotted to add .skolemized to ldflex at this point, or is it still on the backlog?

RubenVerborgh · 2020-04-06T18:52:10Z

Still on backlog currently; will discuss with the team.

RubenVerborgh · 2020-04-25T22:21:44Z

@justinwb You can follow progress in #64
System test with the above example was added; presently hitting a skolemized issue.

RubenVerborgh added the enhancement New feature or request label Aug 29, 2019

RubenVerborgh changed the title ~~Allow blank nodes do be reused across expressions~~ Allow blank nodes to be reused across expressions Aug 29, 2019

RubenVerborgh mentioned this issue Aug 29, 2019

Cannot display properties of some iterable resources #33

Closed

RubenVerborgh mentioned this issue Jan 3, 2020

Fetch an RDF list and return it as a Javascript array #53

Closed

RubenVerborgh assigned rubensworks Mar 9, 2020

RubenVerborgh assigned joachimvh Mar 9, 2020

rubensworks mentioned this issue Mar 16, 2020

Scope blank nodes to each federated source comunica/comunica#624

Merged

RubenVerborgh assigned RubenVerborgh and unassigned rubensworks and joachimvh Apr 7, 2020

RubenVerborgh mentioned this issue Apr 25, 2020

Make blank nodes reusable across expressions #64

Closed

RubenVerborgh closed this as completed in fb07664 Jun 1, 2020

rubensworks mentioned this issue Nov 23, 2020

Queries involving blank nodes return Bad Request error. LDflex/LDflex-Comunica#22

Open

rubensworks mentioned this issue Mar 29, 2021

Fix blank node correlation (closes #795) comunica/comunica#803

Merged
