Provenance ontology

The provenance ontology supports data management and auditing tasks. It is used to define the different types of named graphs we used in the store (quad store) and enables their association with metadata that allow us to manage, validate and expose data to our services.

Introduction

The provenance ontology supports data management and auditing tasks. It is used to define the different types of named graphs we used in the store (quad store) and enables their association with metadata that allow us to manage, validate and expose data to our services.

The ontology enables the recording of information about the provenance of data in an RDF Triple Store. This provenance is focused on the immediate providers and not the ultimate source, so for example, this would record that geodata was provided by the BBC Locator team, and not geonames. In the Linked Data Platform, this data is applied to contexts or named graphs. A named graph is, in effect, a 'fourth part' to a triple, hence the term 'quad store'.

A nodes-and-edges diagram with provenance:Graph as the central node and example subclasses & properties as connected nodes
A provenance:Graph-centric example of properties & subclasses in the Provenance ontology
# Some example triples for a dataset of English football teams, in Turtle format @prefix provenance: <http://www.bbc.co.uk/ontologies/provenance/> .
@prefix bbc: <http://www.bbc.co.uk/ontologies/bbc/> .
<http://www.bbc.co.uk/contexts/a-b-c-d#id> a provenance:Dataset ; rdfs:label "English Football Teams"^^xsd:string ; provenance:changeReason "Initial creation of dataset"^^xsd:string ; provenance:version "1"^^xsd:int ; provenance:product bbc:Sport ; provenance:provided "20121010T16:34Z"^^xsd:datetime ; provenance:provider <mailto:[email protected]> .

Metadata

Created2012-12-01
Last modified2023-02-28
Current version1.13
Authors[email protected]
Licencehttp://creativecommons.org/licenses/by/4.0/
DownloadProvenance ontology(Turtle format)

 

Classes

 

CreativeWorkGraph

URIhttp://www.bbc.co.uk/ontologies/provenance/CreativeWorkGraph
DescriptionThe class of named graphs which contain the metadata for the CreativeWorks. For example all the metadata for a news story provided by a CMS will be contained in a single CreativeWorkGraph instance.
SuperclassesGraph

 

Dataset

URIhttp://www.bbc.co.uk/ontologies/provenance/Dataset
DescriptionDataset is a single graph which contains a collection of things that is managed together, e.g., "the dataset of the BBC's News Regions". The management of datasets involves editing RDF files and publishing them to the triplestore in a single action. This requires a good knowledge of semantic technologies therefore datasets are managed by experts.
SuperclassesGraph

 

DeletedGraph

URIhttp://www.bbc.co.uk/ontologies/provenance/DeletedGraph
DescriptionA DeletedGraph is a single graph describing an archived ThingGraph.
SuperclassesGraph
Domain fordescribed

 

Graph

URIhttp://www.bbc.co.uk/ontologies/provenance/Graph
DescriptionThe class of named graphs. All our data is stored in named graphs of different types depending on its provenance and management. These are specified by the subclasses of Graph.
SubclassesCreativeWorkGraph, Dataset, DeletedCreativeWorkGraph, DeletedGraph, Ontology, ThingGraph, TransitionOntology
Domain forcanonicalLocation, changeReason, owner, product, provided, provider, public, version

 

Ontology

URIhttp://www.bbc.co.uk/ontologies/provenance/Ontology
DescriptionGraphs that contain ontologies in the store are declared to be of type provenance:Ontology. The graph name in the store is identical to the ontology IRI.
SuperclassesGraph
SubclassesTransitionOntology
Domain forslug

 

ThingGraph

URIhttp://www.bbc.co.uk/ontologies/provenance/ThingGraph
DescriptionThing graphs hold metadata about a single thing. Note the difference with Datasets, which hold metadata about many things. Thing graphs are usually exposed to user friendly tools which allow users with little on no RDF skills to specifically edit a metadata about a thing in a form. For example a thing graph can contain metadata about a football team such as its various names, LOD links for the team, a short disambiguation hint, e.t.c. Thing graphs are also used to store metadata about things obtained from a feed, for example store metadata about locations obtained from a Location Services feed.
SuperclassesGraph
Domain formanagedBy, managedThing

 

Transition Ontology

URIhttp://www.bbc.co.uk/ontologies/provenance/TransitionOntology
DescriptionAn ontology that is used to provide a transition feature for a parent ontology
SuperclassesGraph, Ontology

 

Properties

 

canonicalLocation

URIhttp://www.bbc.co.uk/ontologies/provenance/canonicalLocation
DescriptionThe canonical location where all the triples contained within the context (named graph) are stored in and can be referred to outside the triplestore.
DomainGraph
Rangew3.org/2001/XMLSchema#string

 

changeReason

URIhttp://www.bbc.co.uk/ontologies/provenance/changeReason
DescriptionThe reason for the most recent change applied to the graph.
DomainGraph
Rangew3.org/2001/XMLSchema#string

 

created

URIhttp://www.bbc.co.uk/ontologies/provenance/created
DescriptionThe date when a Thing was created.
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2001/XMLSchema#dateTime

 

deprecated

URIhttp://www.bbc.co.uk/ontologies/provenance/deprecated
DescriptionA predicate to allow hiding of Things so as to not delete them and lose the reference object.
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2001/XMLSchema#boolean

 

deprecates

URIhttp://www.bbc.co.uk/ontologies/provenance/deprecates
DescriptionThe thing that this is superseding.
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2000/01/rdf-schema#Resource

 

deprecationComment

URIhttp://www.bbc.co.uk/ontologies/provenance/deprecationComment
DescriptionInformation pertinent to a resource's deprecation status
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2001/XMLSchema#string

 

deprecationDate

URIhttp://www.bbc.co.uk/ontologies/provenance/deprecationDate
DescriptionThe date when this thing will be deprecated and potentially no longer available.
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2001/XMLSchema#dateTime

 

described

URIhttp://www.bbc.co.uk/ontologies/provenance/described
DescriptionThe Thing which was described by a Thing Graph prior to archival.
DomainDeletedGraph
Rangew3.org/2000/01/rdf-schema#Resource

 

isDeprecatedBy

URIhttp://www.bbc.co.uk/ontologies/provenance/isDeprecatedBy
DescriptionThe thing that is superseding this deprecated thing.
Domainw3.org/2000/01/rdf-schema#Resource
Rangew3.org/2000/01/rdf-schema#Resource

 

managedBy

URIhttp://www.bbc.co.uk/ontologies/provenance/managedBy
DescriptionAssociates the thing graph of a cms:ManagedThing with its managing system. This is needed to solve cases where different systems can manage metadata about a shared thing. E.g., one thing graph from BBC News may have all the politics related metadata for David Cameron, while another thing graph managed by Knowledge and Learning may contain information about the historical role David Cameron has played as a conservative leader in British politics. Separating metadata into thing graphs is useful for managing information about shared things.
DomainThingGraph
RangeSystem

 

owner

URIhttp://www.bbc.co.uk/ontologies/provenance/owner
DescriptionThis property indicates the owner of the data, i.e., the person or CMS who's responsible for managing and governing the data.
DomainGraph
Rangew3.org/2001/XMLSchema#anyURI

 

product

URIhttp://www.bbc.co.uk/ontologies/provenance/product
DescriptionThe reference to the product from which the dataset, graph was created. This is needed for dataset management permissions. Not to be confused with bbc:product.
DomainGraph
RangeProduct

 

provided

URIhttp://www.bbc.co.uk/ontologies/provenance/provided
DescriptionThe date and time at which the data was provided to the triplestore.
DomainGraph
Rangew3.org/2001/XMLSchema#dateTime

 

provider

URIhttp://www.bbc.co.uk/ontologies/provenance/provider
DescriptionThe email address extracted from the BBC SSL Certificate, indicating the person or system which directly interacted with the writer service to put the data in the store.
DomainGraph
Rangew3.org/2001/XMLSchema#anyURI

 

public

URIhttp://www.bbc.co.uk/ontologies/provenance/public
DescriptionThis predicate allows the distinction of ontologies that are appropriate for publishing from those ones that are currently kept internal, e.g., for testing purposes.
DomainGraph
Rangew3.org/2001/XMLSchema#boolean

 

slug

URIhttp://www.bbc.co.uk/ontologies/provenance/slug
DescriptionThis is a short human readable identifier for the ontology.
DomainOntology
Rangew3.org/2001/XMLSchema#string

 

transitions

URIhttp://www.bbc.co.uk/ontologies/provenance/transitions
DescriptionThe property that this transition property is providing a transition for.
Domainw3.org/2000/01/rdf-schema#Property
Rangew3.org/2000/01/rdf-schema#Property

 

version

URIhttp://www.bbc.co.uk/ontologies/provenance/version
DescriptionThe version of the graph.
DomainGraph
Rangew3.org/2001/XMLSchema#string

Rebuild Page

The page will automatically reload. You may need to reload again if the build takes longer than expected.

Useful links

Demo mode

Hides preview environment warning banner on preview pages.

Theme toggler

Select a theme and theme mode and click "Load theme" to load in your theme combination.

Theme:
Theme Mode: