Technical Manual for Data Injection into the GloSIS Database
2025-03-17
1 Introduction
The Global Soil Information System (GloSIS) is a key initiative under the Global Soil Partnership (GSP), designed to integrate and standardize soil data globally. However, soil information is often scattered across paper reports, scientific articles, and national archives, making it difficult to access, compare, and utilize effectively. Additionally, variability in laboratory analytical procedures, sampling methods, and classification systems leads to inconsistencies that hinder accurate global assessments. Harmonization is essential to standardize data formats, analytical methods, and reporting frameworks, ensuring comparability, accessibility, and reliability.
To address these challenges, the GloSIS database is structured as a harmonized relational database in PostgreSQL, following ISO 28258 standards for soil data storage. However, its complexity can pose difficulties for users unfamiliar with SQL or PostgreSQL programming, making it challenging to convert existing soil data into this standardized format.
This Technical Manual provides guidance on implementing a PostgreSQL database aligned with the GloSIS schema and introduces the glosis-shiny
service—a tool designed to simplify the conversion of soil data into the GloSIS format. By automating and harmonizing data integration, this application streamlines the workflow, making the process accessible to users without advanced SQL expertise. This step is crucial for building the GloSIS infrastructure, ensuring that soil data from various sources can be shared, compared, and utilized effectively on a global scale.
This document provides a detailed technical manual on setting up two Docker-based services:
- GloSIS PostgreSQL database (
glosis-db
). - R Shiny application for soil data injection and visualization of soil data (
glosis-shiny
).
It covers the installation and deployment of the applications using Docker, the process of data preparation for data injection workflow from Excel (xlsx) files and the visualization of the data.
This document uses three soils profiles from undisturbed forests from the PhD dissertation Soils, water and nutrients in a forest ecosystem in Suriname (Poels 1987), available at https://edepot.wur.nl/202240, as an exercise for soil data harmonization.