
Identifier as context #1505

Closed
wants to merge 23 commits into from

Conversation

@ghost ghost commented Dec 14, 2021

Fixes: a few issues (see below)

Proposed Changes

A rebase of the previously rebased PR #958, “Move Store API to work with identifiers, not graphs”.

This is a draft PR because the rebased #958 is basically just the start; a number of issues remain unresolved, even with the changes contained in these commits.

This change is the latest in a series of changes initiated in 2012 with #167 and reignited in 2013 by #301 (and friends). The subtle differences between the way the Dataset and ConjunctiveGraph models work have been discussed at length in various issue threads; in an attempt to get an overview, I collated all the early discussions in a gist.

Lest there be any misperception about the relative complexity of the exercise, gromgull summarised the, ahem, context in #307, including the assertion:

“For SPARQL, it is often interesting to on-the-fly define a new DataSet, i.e. selecting some subset of graphs from another DataSet, or even from several datasets. It would be nice to be able to do this without copying all the triples.”

but was moved to observe four years later in #698:

“A combination of https://github.com/RDFLib/rdflib/blob/master/rdflib/graph.py#L1698 and https://github.com/RDFLib/rdflib/blob/master/rdflib/graph.py#L2028 means you can pass an existing graph into the dataset. This will then be added directly. But there is no guarantee this new graph has the same store, and the triples will not be copied over. This is chaos, but wasn't flagged earlier since it's chaos all the way down :)”
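To make the hazard in that quote concrete, here is a toy sketch of the pattern it describes. These are hypothetical stand-in classes, not rdflib's actual implementation: a Dataset registers an externally created Graph object directly, without checking that it shares the dataset's store and without copying its triples, so the graph's contents are invisible to the dataset.

```python
# Toy model of the problem gromgull describes (stand-in classes, NOT rdflib's
# actual Store/Graph/Dataset): a graph backed by a foreign store is registered
# with the dataset, but its triples are never copied into the dataset's store.

class Store:
    """Minimal triple store keyed by context identifier."""
    def __init__(self):
        self.triples = {}  # context identifier -> set of triples

    def add(self, triple, context):
        self.triples.setdefault(context, set()).add(triple)

    def all_triples(self):
        return set().union(*self.triples.values()) if self.triples else set()


class Graph:
    def __init__(self, store, identifier):
        self.store = store
        self.identifier = identifier

    def add(self, triple):
        self.store.add(triple, self.identifier)


class Dataset:
    def __init__(self):
        self.store = Store()
        self.graphs = {}

    def add_graph(self, graph):
        # Registers the graph object directly: no check that it shares this
        # dataset's store, and no copying of its triples.
        self.graphs[graph.identifier] = graph
        return graph


ds = Dataset()
foreign = Graph(Store(), "urn:x-example:g1")  # backed by a *different* store
foreign.add(("s", "p", "o"))
ds.add_graph(foreign)

# The graph is registered, yet its triple is invisible to the dataset's store:
assert "urn:x-example:g1" in ds.graphs
assert ("s", "p", "o") not in ds.store.all_triples()
```

The "chaos all the way down" is exactly this silent divergence: queries against the dataset's store never see triples held by the foreign store.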

The issues list contains a couple of dozen issues related to the whole "identifier as context" business. I'm gradually working my way through them, establishing whether each remains extant or is fixed by this PR.

My modus operandi is to create a test (in test_dataset_anomaly_wip.py) and, once I've characterised the issue, migrate it to test_dataset_anomalies.py with annotations. I've also created a separate test module, test_identifier_as_context.py, which exercises all of the current Dataset docstrings.

Example: issue #939, “rdflib 4.2.2: ConjunctiveGraph.parse() return a Graph object”, is fixed by the changes in this PR, as attested by test_issue939_parse_return_inconsistent_type.
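The shape of the inconsistency reported in #939 can be sketched with hypothetical stand-in classes (not rdflib's implementation): parse() on the conjunctive graph hands back the plain context Graph it parsed into, so callers who expect the same type they called parse() on get a different one.

```python
# Toy sketch of the #939 return-type inconsistency (hypothetical classes,
# NOT rdflib's actual code). The "buggy" variant returns the per-context
# Graph; the "fixed" variant returns an object of the caller's own type.

class Graph:
    def parse(self, data):
        # ...parse data into this graph (elided in this sketch)...
        return self  # a plain Graph's parse returns the graph itself


class ConjunctiveGraph(Graph):
    def parse_buggy(self, data):
        context = Graph()           # per-context graph for the parsed data
        return context.parse(data)  # returns a Graph, not a ConjunctiveGraph

    def parse_fixed(self, data):
        context = Graph()
        context.parse(data)
        return self                 # consistent: same type as the caller


cg = ConjunctiveGraph()
assert type(cg.parse_buggy("...")) is Graph             # surprising to callers
assert type(cg.parse_fixed("...")) is ConjunctiveGraph  # consistent
```

The test named above presumably pins down exactly this: that parse() on the richer graph types no longer returns an object of a narrower type.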

A note to reviewers: in order to get the tests to pass without extensive skipping, I've been obliged to make a couple of changes in addition to the rebased PR, and for those I have provided a rationale, prefixed with DEVNOTE. I had to add three failing named-graph-related DAWG tests to the skippedtests list; SPARQL Update CLEAR remains an issue at the moment.


ghost commented Dec 15, 2021

The basics look quite good: serialization of the default graph and of the contexts looks sane and as expected.
Attached are results from the WIP, methodically iterating the serializers over the dataset object and then over its contexts.

anomalies-wip-results.txt


ghost commented Jan 2, 2022

This got too scrappy to be useful in its current state; closing, and will reopen when it's more coherent.

@ghost ghost closed this Jan 2, 2022