Data provenance

Data provenance is metadata paired with records that describes where the data came from and how much confidence can be placed in its truth. It is important for tracing errors in data back to the sources that introduced them. It is also useful for reporting on and auditing business processes.
In a sense, data provenance consists of “facts about facts.” In Diffbot’s Knowledge Graph™, a confidence score is calculated for every fact as the sources for that fact are compared and integrated into records.
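
To make the idea of “facts about facts” concrete, here is a minimal sketch of a record that carries its own provenance. The class names (`SourceClaim`, `Fact`), the `reliability` field, and the scoring rule are all hypothetical and chosen for illustration. They are not Diffbot’s data model or scoring method, which this entry does not describe.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class SourceClaim:
    """One source's assertion of a value (hypothetical structure)."""
    origin: str         # where the value was found, e.g. a URL
    value: str          # the value this source asserts
    reliability: float  # assumed trust in the source, between 0 and 1


@dataclass
class Fact:
    """A record field together with its provenance metadata."""
    subject: str
    attribute: str
    claims: list[SourceClaim] = field(default_factory=list)

    def best_value(self) -> tuple[str, float]:
        """Return the best-supported value and a confidence score.

        Illustrative rule only: a value's support is the sum of the
        reliabilities of the sources asserting it, and the confidence
        is the winning value's share of the total support.
        """
        support: dict[str, float] = {}
        for claim in self.claims:
            support[claim.value] = support.get(claim.value, 0.0) + claim.reliability
        total = sum(support.values())
        if total == 0:
            raise ValueError("no usable claims for this fact")
        value = max(support, key=lambda v: support[v])
        return value, support[value] / total

    def origins(self, value: str) -> list[str]:
        """List the sources that asserted a given value (for auditing)."""
        return [c.origin for c in self.claims if c.value == value]


# Example data is invented for illustration.
fact = Fact("Acme Corp", "headquarters", [
    SourceClaim("https://example.com/a", "Berlin", 0.5),
    SourceClaim("https://example.org/b", "Berlin", 0.25),
    SourceClaim("https://example.net/c", "Munich", 0.25),
])
value, confidence = fact.best_value()
print(value, confidence, fact.origins("Munich"))
```

Because each claim keeps its origin, a value that later turns out to be wrong can be attributed to the sources that asserted it, which is the error-tracing use described above.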
See also transparency and explainability in AI