C2. Pipeline for semantic annotation of relational DB and triples ...

1 downloads 245 Views 572KB Size Report
Highlighted features: Pipeline for a) the semantic OBOE-based annotation of data managed in (postgreSQL) relational DB a
C2. Pipeline for semantic annotation of relational DB and triples generation Relation to the data lifecycle: data processing and data use Data for Science service pillar: processing, provenance Available Date: 2018 Main contributor: INRA Role of contributor: developer Contact: Christian Pichot

H2020 Project

Project Number: 654182

Technical specification   Highlighted features:   Pipeline for a) the semantic OBOE-based annotation of data managed in (postgreSQL) relational DB and b) the generation of rdf triples.   Steps: graph modeling (yEd), data annotation/ triples generation (ontop), triples inferences (corese), SPARQL endpoint (BlazeGraph)   Genericity through RBD connection parameters and a variable pattern approach.   Target users:   RI data scientists and data managers,   e-Infrastructure semantic operators for pipeline deployment   Technology readiness level:   6–7 demonstrated and operational on AnaEE-France environment (OBOE-based ontology & postgres RDB)   Accessibility:   Still under development for genericty extension   Open Source   Supported standards:   Semantic Web W3C   Required platform:   Linux environment, java   Known bugs:

How to use? Dat aB as e ma na ger

Dat a sci ent ist

variable semantic description

yEd based processing

odba mapping

graph pattern RDB raw data

Semantic portals

raw data

Ontology (OBOE-based)

Metadata generation Data set generation

End Point

raw data with inferered triples