A CLI for CARML
CARML jar is a CLI application for executing RML mappings with CARML.
This project produces two artifacts:
- CARML jar RDF4J - which outputs RDF using RDF4J
- CARML jar Jena - which outputs RDF using Apache Jena
Usage: map [-hVPv] [-F=<outputRdfFormat>] [-o=<outputPath>]
[-M=<prefixMappings>]... [-p=<prefixDeclarations>[,
<prefixDeclarations>...]]... [-b=<baseIri>] [-l=<limit>]
(-m=<mappingFiles> [-m=<mappingFiles>]...
[-f=<mappingFileRdfFormat>] [-r=<relativeSourceLocation>])
-h, --help Show this help message and exit.
-V, --version Print version information and exit.
-m, --mapping=<mappingFiles>
Mapping file path(s) and/or mapping file directory
path(s).
-f, --format=<mappingFileRdfFormat>
Mapping file RDF format:
ttl (text/turtle),
ttls (application/x-turtlestar),
nt (application/n-triples),
nq (application/n-quads),
rdf (application/rdf+xml),
jsonld (application/ld+json),
ndjsonld (application/x-ld+ndjson),
trig (application/trig),
trigs (application/x-trigstar),
n3 (text/n3),
trix (application/trix),
brf (application/x-binary-rdf),
rj (application/rdf+json).
-r, -rsl, --rel-src-loc=<relativeSourceLocation>
Path from which to relatively find the sources
specified in the mapping files.
-F, -of, --outformat=<outputRdfFormat>
Output RDF format. Default: nq.
Supported values are brf, jsonld, n3, ndjsonld,
nq, nt, rdf, rj, trig, trigs, ttl, ttls, xml
-o, --output=<outputPath> Output file path.
If path is directory, will default to fileName
`output`.
If left empty will output to console.
-M, -pm, --prefix-mapping=<prefixMappings>
File or directory path(s) containing prefix
mappings.
Files must be JSON or YAML files containing a map
of prefix declarations.
File names must have either .json or .yaml/.yml
file extensions.
-p, --prefixes=<prefixDeclarations>[,<prefixDeclarations>...]
Declares which prefixes to apply to the output.
Can be a prefix reference or an inline prefix
declaration.
A prefix reference will be resolved against the
provided prefix mapping (`-pm`), or the default
prefix mapping.
An inline prefix declaration can be provided as
'prefix=iri. For example: ex=http://example.com/'
Multiple declarations can be separated by ','. For
example: ex=http://example.com/,foo,bar
-P, --pretty Serialize pretty printed output. (Caution: will
cause in-memory output collection).
-b, --base-iri=<baseIri> Base IRI to use to expand relative IRIs in the
generated output.
If not specified the default `http://example.
com/base/` is used.
-l, --limit=<limit> Limit the number of statements generated by the
amount provided.
-v, --verbose Specify multiple -v or --verbose options to
increase verbosity.
For example `-v -v`, or `-vv` or `--verbose
--verbose`
For example, the following command:
java -jar carml-jar-X.jar map -m rml -rsl input -of ttl -P
- maps (
map
) all mapping files - under the 'rml' directory (
-m rml
) - using the relative source location 'input' (
-rsl input
) - with output format Turtle (
-of ttl
) - pretty printed (
-P
) - to stdout.
If an output path is provided (via -o
) the RDF result is output to the specified path.
By default, if no output path is provided the plain RDF result will be output to stdout
.
To get more information on what is going on during execution, the verbosity level can be increased via -v
.
You can get more verbosity by adding more v
s, for example -vvv
, which is the highest level of verbosity and will
give TRACE
level logging.
Errors logs are output to stderr
.
To generate output with prefixed IRIs, namespace prefixes can be specified via -p
.
The prefix definition can be provided inline by using the format prefix=iri
. For example:
-p ex=http://example.org/
Multiple prefix definitions can be separated by comma's.
The prefix can also reference a prefix defined in a prefix mapping document provided via -pm
. For example:
-p foo,bar -pm /path/to/prefix/mapping.json
A prefix mapping file is a JSON or a YAML file that provides a map of key values, where the key is the prefix and
the value is the name (the IRI). JSON files must have file name with a .json
file extension and YAML files **
must**
have a .yaml
or .yml
file extension.
Example contents of a JSON prefix mapping file:
{
"foo": "http://foo.org/",
"bar": "http://bar.org#"
}
Example contents of a YAML prefix mapping file:
foo: http://foo.org/
bar: http://bar.org#
If no prefix mapping document is provided via -pm
the default prefix.cc mappings will be used.
The following exit codes are returned on exit.
0
: success1
: "failure" when running the command2
: command line usage error
It is possible to map a data source provided via stdin
. In this case the
CARML input stream extension can be used.
stdin
is available as the default unnamed stream, and can be reference from a mapping as follows.
@prefix carml: <http://carml.taxonic.com/carml/> .
...
:SomeLogicalSource
rml:source [
a carml:Stream ;
];
...
.
This makes it possible to pipe input into a mapping process. For example:
cat some/input | java -jar carml-jar-X.jar map -m rml/mapping.ttl
The CARML jar RDF4J artifact supports the same output formats (-of
) that are supported for the mapping file format
(-f
).
See the map
help-description for details:
Usage: map [-hVPv] [-F=<outputRdfFormat>] [-o=<outputPath>]
[-M=<prefixMappings>]... [-p=<prefixDeclarations>[,
<prefixDeclarations>...]]... [-b=<baseIri>] [-l=<limit>]
(-m=<mappingFiles> [-m=<mappingFiles>]...
[-f=<mappingFileRdfFormat>] [-r=<relativeSourceLocation>])
-h, --help Show this help message and exit.
-V, --version Print version information and exit.
-m, --mapping=<mappingFiles>
Mapping file path(s) and/or mapping file directory
path(s).
-f, --format=<mappingFileRdfFormat>
Mapping file RDF format:
ttl (text/turtle),
ttls (application/x-turtlestar),
nt (application/n-triples),
nq (application/n-quads),
rdf (application/rdf+xml),
jsonld (application/ld+json),
ndjsonld (application/x-ld+ndjson),
trig (application/trig),
trigs (application/x-trigstar),
n3 (text/n3),
trix (application/trix),
brf (application/x-binary-rdf),
rj (application/rdf+json).
-r, -rsl, --rel-src-loc=<relativeSourceLocation>
Path from which to relatively find the sources
specified in the mapping files.
-F, -of, --outformat=<outputRdfFormat>
Output RDF format. Default: nq.
Supported values are brf, jsonld, n3, ndjsonld,
nq, nt, rdf, rj, trig, trigs, ttl, ttls, xml
-o, --output=<outputPath> Output file path.
If path is directory, will default to fileName
`output`.
If left empty will output to console.
-M, -pm, --prefix-mapping=<prefixMappings>
File or directory path(s) containing prefix
mappings.
Files must be JSON or YAML files containing a map
of prefix declarations.
File names must have either .json or .yaml/.yml
file extensions.
-p, --prefixes=<prefixDeclarations>[,<prefixDeclarations>...]
Declares which prefixes to apply to the output.
Can be a prefix reference or an inline prefix
declaration.
A prefix reference will be resolved against the
provided prefix mapping (`-pm`), or the default
prefix mapping.
An inline prefix declaration can be provided as
'prefix=iri. For example: ex=http://example.com/'
Multiple declarations can be separated by ','. For
example: ex=http://example.com/,foo,bar
-P, --pretty Serialize pretty printed output. (Caution: will
cause in-memory output collection).
-b, --base-iri=<baseIri> Base IRI to use to expand relative IRIs in the
generated output.
If not specified the default `http://example.
com/base/` is used.
-l, --limit=<limit> Limit the number of statements generated by the
amount provided.
-v, --verbose Specify multiple -v or --verbose options to
increase verbosity.
For example `-v -v`, or `-vv` or `--verbose
--verbose`
The CARML jar Jena artifact transforms the output to Jena datastructures and uses Jena to generate the output.
This transformation has negligible performance impact and, since Jena is used as the output handler, the available
output formats (-of
) are those that Jena supports.
See the map
help-description for details:
Usage: map [-hVPv] [-F=<outputRdfFormat>] [-o=<outputPath>]
[-M=<prefixMappings>]... [-p=<prefixDeclarations>[,
<prefixDeclarations>...]]... [-b=<baseIri>] [-l=<limit>]
(-m=<mappingFiles> [-m=<mappingFiles>]...
[-f=<mappingFileRdfFormat>] [-r=<relativeSourceLocation>])
-h, --help Show this help message and exit.
-V, --version Print version information and exit.
-m, --mapping=<mappingFiles>
Mapping file path(s) and/or mapping file directory
path(s).
-f, --format=<mappingFileRdfFormat>
Mapping file RDF format:
ttl (text/turtle),
ttls (application/x-turtlestar),
nt (application/n-triples),
nq (application/n-quads),
rdf (application/rdf+xml),
jsonld (application/ld+json),
ndjsonld (application/x-ld+ndjson),
trig (application/trig),
trigs (application/x-trigstar),
n3 (text/n3),
trix (application/trix),
brf (application/x-binary-rdf),
rj (application/rdf+json).
-r, -rsl, --rel-src-loc=<relativeSourceLocation>
Path from which to relatively find the sources
specified in the mapping files.
-F, -of, --outformat=<outputRdfFormat>
Output RDF format. Default: nq.
Supported values are jsonld, jsonld10, jsonld11,
n3, nq, nt, owl, pbrdf, rdf, rj, rpb, rt,
shaclc, shc, trdf, trig, trix, ttl, xml
-o, --output=<outputPath> Output file path.
If path is directory, will default to fileName
`output`.
If left empty will output to console.
-M, -pm, --prefix-mapping=<prefixMappings>
File or directory path(s) containing prefix
mappings.
Files must be JSON or YAML files containing a map
of prefix declarations.
File names must have either .json or .yaml/.yml
file extensions.
-p, --prefixes=<prefixDeclarations>[,<prefixDeclarations>...]
Declares which prefixes to apply to the output.
Can be a prefix reference or an inline prefix
declaration.
A prefix reference will be resolved against the
provided prefix mapping (`-pm`), or the default
prefix mapping.
An inline prefix declaration can be provided as
'prefix=iri. For example: ex=http://example.com/'
Multiple declarations can be separated by ','. For
example: ex=http://example.com/,foo,bar
-P, --pretty Serialize pretty printed output. (Caution: will
cause in-memory output collection).
-b, --base-iri=<baseIri> Base IRI to use to expand relative IRIs in the
generated output.
If not specified the default `http://example.
com/base/` is used.
-l, --limit=<limit> Limit the number of statements generated by the
amount provided.
-v, --verbose Specify multiple -v or --verbose options to
increase verbosity.
For example `-v -v`, or `-vv` or `--verbose
--verbose`
The project can be built by running:
mvn clean package
The runnable jars will be generated in the /carml-app/*/target
dirs.
One way of customizing the mapper without modifying any code is via the RmlMapperConfigurer
interface.
Using Spring, it is very easy to create your own Spring component to configure the RmlMapper builder.
This can be used to add functions to the builder for example.
package foo.bar;
import io.carml.engine.rdf.RdfRmlMapper;
import io.carml.jar.runner.RmlMapperConfigurer;
import org.springframework.stereotype.Component;
@Component
public class FunctionRmlMapperConfigurer implements RmlMapperConfigurer {
@Override
public void configureMapper(RdfRmlMapper.Builder builder) {
builder.addFunctions(new MyCustomFunctions());
}
}
Now you can create your own Spring application which will detect and wire the component.
Make sure to add the package of your component to the @ComponentScan
annotation.
@SpringBootApplication
@ComponentScan({"io.carml.jar", "foo.bar"}) // Add your component's package
public class MyCustomizedCarmlJarApplication {
public static void main(String... args) {
System.exit(SpringApplication.exit(SpringApplication.run(MyCustomizedCarmlJarApplication.class, args)));
}
}