SMOS Fast Reprocessing Platform at ESAC
The Data Processing Ground Segment (DPGS) of SMOS mission for the study of the water cycle is located at the European Space Astronomy Center (ESAC), in Villafranca del Castillo (Madrid).
The satellite was launched into space in November 2009 with an expected initial life time of 3 years, but its excellent technical and scientific status has allowed it to expand until 2019 and its continuity is currently being evaluated in order to be extended until 2024.
Indra’s role in DPGS
Indra, as the company that led the development of the Data Processing Ground Segment of SMOS, has been in charge of its corrective and preventive maintenance since the beginning of operations phase in June 2010. Besides this work, Indra is in charge of the engineering tasks carried out for supporting the operation of the DPGS Fast Processing Centre platform (FPC, for routine operational processing to L1 and L2 of satellite downlinked data) as well as for the execution of tests of new DPGS subsystems versions in the Integration and Maintenance Platform (IMP) hosted at Indra’s facilities in San Fernando de Henares (Madrid), before their deployment in DPGS’ FPC at Villafranca.
Reprocessing Platform
Being SMOS’ instrument MIRAS the first of its kind, in the nine years of the current Mission duration the processing algorithms have had to be evolved constantly by experts in seek of the best SMOS data scientific value for the international climate research community.
For such purpose, it has been fundamental the use of the specific SMOS DPGS Fast Reprocessing Platform (FRP), developed by Indra in 2010 under ESA contract and evolved in 2012 and 2015. The intensive processing capacity of this platform, along with its production orchestration capabilities, has allowed supporting the scientific validation of the processing baselines (to evaluate long-term trends, seasonal effects, etc.) before the complete reprocessing campaigns were launched. This, without affecting at all the operation of the FPC, as they are separate platforms.
When a processing baseline is considered validated and stable, it is deployed in the nominal platform for routine production, and the SMOS data since the start of the Mission up to the routine production starting date is reprocessed to L1 and L2 levels in order to achieve a consistent and homogeneous dataset that can be used by experts for scientific climate research. Currently, the processing baseline V7 has been integrated in the IMP, and several metrics exercises have been executed in FRP to support its scientific validation. When the experts consider it stable enough, the full 9 years dataset will be reprocessed.
FRP Scalability
This Fast Reprocessing Platform is based on the same software used by the nominal centre, i.e. the PDPC Core developed by Indra for FPC. However Indra provided critical improvements of this SW that make it possible to reprocess SMOS data up to 36 times faster than FPC production, depending on the level of the final product required.
In fact the speed is currently limited by the orchestration requirements and the number of dedicated processing machines, not by the FRP’s PDPC-Core SW scalability. For example, the orchestration requests consolidating and post-processing all L2 data of one day to generate a set of daily auxiliary files to be used as input for the L2 processing of the next day (i.e. the daily production is forced to be sequential). The reprocessing instance triggers in parallel the L2 processing tasks for all products from one day (29, one for each semiorbit), then consolidates all the outputs and provides them as input dataset to L2 post-processors, that generate the auxiliary daily files for the next day. The reprocessing speed is limited by the time that the slowest L2 processor task from each day takes to process L1 data to generate the L2 products, and no further parallelisation can be applied in that reprocessing instance.
Indra has overcome this difficulty by allowing to set-up several reprocessing instances in the FRP, each with its own processing machines farm, but managed coherently. The SMOS data timeline is divided in several segments, and each is processed in a different reprocessing instance. This compensates the limitations imposed by the orchestration to each instance, and allows achieving reprocessing speeds of e.g. 36:1 for an FRP with 3 reprocessing instances, each with a speed of 12:1 thanks to the L2 processor speed.
The platform is also equipped with a set of tools to simplify management and analysis of results of different instances of Fast Reprocessing Platform (FRP). This set of tools allows parallel execution of different periods for same Reprocessing Campaign by handling of Baselines (orchestration, processing, static auxiliary data, etc.).
Evolution to G-CORE
As part of the preparation for the mission extension, the evolution of DPGS to Indra’s G-CORE product is being analysed. G-CORE is an evolution of the SMOS PDPC CORE software that was developed for SMOS mission (based on the heritage of Helios I Data Processing Centre also developed by Indra), to become a flexible and easily customizable software solution intended to cover many of the processing management related functions of the GS PDGS.
- G-CORE provides high customization and configuration capability. The G-CORE can be configured to manage the data of several missions; ready to support the multi-mission concept.
- G-CORE is ready to support highly complex orchestration schemes.
- Its configurability capability allows the Operations team the easy integration of new versions of current processors or new processors.
- Reliable solution well proven in several scenarios: G-CORE is currently deployed on PAZ Mission GS (already operational) and INGENIO GS (in development), and other Indra products such as Multimission DAT for PAZ, TerraSAR-X and TamDEM-X, Land Analytics EO Platform and PAZ Users Services Provider.
- G-CORE will also allow the deployment of the reprocessing platform in a cloud including the capability of auto-diagnostic for auto-provisioning and additional activities to monitor and procure additional resources in different types of CSP; public, private or hybrid. The inclusion of the capacity to reduce and increase automatically the processing capacities (elasticity) according to the necessities of the moment (satellite pass, for example) can reduce the cost of the service and allow to offer a best price to the client.