Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The dumps pipeline produces a file with may triples that ROBOT can not parse. #23

Open
dosumis opened this issue Jan 19, 2022 · 5 comments
Assignees

Comments

@dosumis
Copy link
Member

dosumis commented Jan 19, 2022

robot merge -i /out/raw/all.ttl \
	reason --reasoner ELK --axiom-generators "SubClass EquivalentClass ClassAssertion" --exclude-tautologies structural \
	relax \
	reduce --reasoner ELK --named-classes-only true \
	annotate --ontology-iri "http://virtualflybrain.org/data/VFB/OWL/raw/all.owl" \
	convert -f owl -o /out/raw/construct_all.owl | { grep -v 'OWLRDFConsumer\|InvalidReferenceViolation\|RDFParserRegistry' || true; }

=>

ERROR Input ontology contains 545373 triple(s) that could not be parsed:
 - <http://virtualflybrain.org/reports/VFBc_00101gc1> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:genid-nodeid-genid-d772490648ac4472932985646f1ab9c7-node1fp2rmh14x1797609.

Possible cause:
image
http://robot.obolibrary.org/errors

Could the pipeline be somehow producing axioms following rdf reification?

Investigating an example:

Here's one of the unparsed triples reported:

<http://virtualflybrain.org/reports/VFB_00029522> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:genid-nodeid-genid-4841781bfabd4a7b98ac303327e83a13-node1fpl1cvm3x508939 .

I expected this to be something to do with blank nodes used for reification used in axiom annotation, however, looking at that blank node in the triplestore it looks like a simple, unannotated type axiom:

image

Looking at PDB, this type restriction appears to be present

image

Any idea what's going on? Half a million unparsed triples is at least a bad smell, even if we're not sure of what the consequences might be.

@dosumis
Copy link
Member Author

dosumis commented Jan 19, 2022

I note that the text on the Robot doc says "this is often because", suggesting there are other possible causes.

@matentzn
Copy link
Contributor

If @hkir-dev can grep an example from the dump which is tiny and causes the ROBOT warning, we can easily find the reason.. A very typical problem I have encountered is complex "source" or "target" in reification, which must be atomic. But one look at a failing minimal example and we can tell..

@hkir-dev
Copy link
Contributor

hkir-dev commented Jan 19, 2022

Minimal example attached.
ROBOT version 1.8.3
Command: robot -vvv reason -i minimal.ttl reason --reasoner ELK -o minimal_reason.owl
Result is success, but we still see the error log
log:

ERROR Input ontology contains 3 triple(s) that could not be parsed:
 - <http://virtualflybrain.org/reports/VFB_00000001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:genid-nodeid-genid-674b1cc8101e47ceb845067892d7fd8e-node1fpb4lttix277296.
 - <http://virtualflybrain.org/reports/VFB_00000001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:genid-nodeid-genid-674b1cc8101e47ceb845067892d7fd8e-node1fpb4lttix277295.
 - <http://virtualflybrain.org/reports/VFB_00000001> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:genid-nodeid-genid-674b1cc8101e47ceb845067892d7fd8e-node1fpb4lttix277297.

minimal.zip
minimal2.zip

@matentzn
Copy link
Contributor

The output seems to be noise, see ontodev/robot#965

The triples parse and convert just fine.

@matentzn matentzn reopened this Jan 19, 2022
@matentzn
Copy link
Contributor

Sorry didnt mean to close it - reflex. Up to you :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants