Explore open access research and scholarly works from NERC Open Research Archive

Advanced Search

A geodata warehouse: using denormalisation techniques as a tool for delivering spatially enabled integrated geological information to geologists

Kingdon, A. ORCID: https://orcid.org/0000-0003-4979-588X; Nayembil, M.L.; Richardson, A.E.; Smith, A.G.. 2016 A geodata warehouse: using denormalisation techniques as a tool for delivering spatially enabled integrated geological information to geologists. Computers and Geosciences, 96. 87-97. 10.1016/j.cageo.2016.07.016

Abstract
New requirements to understand geological properties in three dimensions have led to the development of PropBase, a data structure and delivery tools to deliver this. At the BGS, relational database management systems (RDBMS) has facilitated effective data management using normalised subject-based database designs with business rules in a centralised, vocabulary controlled, architecture. These have delivered effective data storage in a secure environment. However, isolated subject-oriented designs prevented efficient cross-domain querying of datasets. Additionally, the tools provided often did not enable effective data discovery as they struggled to resolve the complex underlying normalised structures providing poor data access speeds. Users developed bespoke access tools to structures they didn’t fully understand sometimes delivering them incorrect results. Therefore, BGS has developed PropBase, a generic denormalised data structure within an RDBMS to store property data, to facilitate rapid and standardised data discovery and access, incorporating 2D and 3D physical and chemical property data, with associated metadata. This includes scripts to populate and synchronise the layer with its data sources through structured input and transcription standards. A core component of the architecture includes, an optimised query object, to deliver geoscience information from a structure equivalent to a data warehouse. This enables optimised query performance to deliver data in multiple standardised formats using a web discovery tool. Semantic interoperability is enforced through vocabularies combined from all data sources facilitating searching of related terms. PropBase holds 28.1 million spatially enabled property data points from 10 source databases incorporating over 50 property data types with a vocabulary set that includes 557 property terms. By enabling property data searches across multiple databases PropBase has facilitated new scientific research, previously considered impractical. PropBase is easily extended to incorporate 4D data (time series) and is providing a baseline for new “big data” monitoring projects.
Documents
514031:101653
[thumbnail of PropBase_Paper_submitted_version.pdf]
Preview
PropBase_Paper_submitted_version.pdf - Accepted Version

Download (1MB) | Preview
Information
Programmes:
BGS Programmes 2013 > Environmental Modelling
Library
Statistics

Downloads per month over past year

More statistics for this item...

Metrics

Altmetric Badge

Dimensions Badge

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email
View Item