This session will present to the OBPS audience the Big Data Workbenches being developed under the umbrella of the Blue-Cloud 2026 initiative with marine data providers such as SeaDataNet, EMODnet Chemistry, CMEMS, EcoTaxa and more. All together, utilising best practices and standards, they are working to develop, validate, and document new analytical Big Data WorkBenches, which can be adopted by EMODnet, CMEMS, and selected RIs for producing at a regular interval a set of harmonised and validated data collections for a selection of Essential Ocean Variables (EOVs) in physics, chemistry, and biology, highly relevant for analysing the state of the environment and numerical simulations useful to deploy Digital Twins of the Ocean applications.
The session will give a general presentation of the workflows that are being designed and implemented by Ocean and data scientists from European marine infrastructures (Blue-Cloud 2026, SeaDataNet, EMODnet Chemistry, CMEMS, EcoTaxa) to deploy data-intensive Workbenches for selected Essential Ocean Variables (EOVs). Thanks to the Workbenches researchers harmonise, validate and qualify large and various in situ data sources, exploiting the blue analytical services available in the Blue-Cloud Virtual Research Environment. One of the first and most important step is to analyse and select the most useful metadata to retain between several input datasets, so that they can be compared. This semantic task will improve metadata used in different communities and hopefully will lead to a common choice. Other elements still under development can also be showcased and presented for community feedback, such as the semantic analyser, the semantic brokerage and the DAB components, Beacon as a powerful data lake that enables subsetting, and how all these elements are crucial for the workbenches.