Line 28: |
Line 28: |
| == The Solution == | | == The Solution == |
| Data-driven analysis approaches are better suited for such type of analysis. AI has the ability to sift through the data piles of ocean data to find the complex relation between the ocean observations. | | Data-driven analysis approaches are better suited for such type of analysis. AI has the ability to sift through the data piles of ocean data to find the complex relation between the ocean observations. |
| + | |
| [[File:Ocean Profile Classification Model.png|alt=Data-driven approach for analyzing ocean data|thumb|478x478px|<small>'''Profile classification model: a data-driven approach for analyzing ocean data'''</small> <ref>Guillaume Maze et al, Coherent heat patterns revealed by unsupervised classification of Argo temperature profiles in the North Atlantic Ocean, Progress in Oceanography, Volume 151, 2017, Pages 275-292.</ref>]] | | [[File:Ocean Profile Classification Model.png|alt=Data-driven approach for analyzing ocean data|thumb|478x478px|<small>'''Profile classification model: a data-driven approach for analyzing ocean data'''</small> <ref>Guillaume Maze et al, Coherent heat patterns revealed by unsupervised classification of Argo temperature profiles in the North Atlantic Ocean, Progress in Oceanography, Volume 151, 2017, Pages 275-292.</ref>]] |
| | | |
− | Supported by the 2020 – 2021 Results Fund, a Proof of Concept (POC) was developed for a predictive model to find (dis-)similarities between in situ multidimensional profiles of oceanographic data of the Pacific Ocean. The in situ Conductivity-Temperature-Depth (CTD) profiles are classified using the profile classification model <ref name=":0">https://pyxpcm.readthedocs.io/en/latest/index.html</ref>. The ocean profile classification model allows to automatically assemble ocean profiles in clusters according to their vertical structure similarities. The geospatial properties of these clusters can be used to address a large variety of oceanographic problems, E.g. front detection, water mass identification, natural region contouring (gyres, eddies), reference profile selection for QC validation, etc. The vertical structure of these clusters furthermore provides a highly synthetic representation of large ocean areas that can be used for dimensionality reduction and coherent intercomparisons of ocean data (re)-analysis or simulations <ref name=":0" />. | + | Supported by the 2020 – 2021 Results Fund, a Proof of Concept (POC) was developed for a predictive model to find (dis-)similarities between in situ multidimensional profiles of oceanographic data of the Pacific Ocean. The in situ Conductivity-Temperature-Depth (CTD) profiles are classified using the profile classification model <ref name=":0">https://pyxpcm.readthedocs.io/en/latest/index.html</ref>. The ocean profile classification model allows to automatically assemble ocean profiles in clusters according to their vertical structure similarities. The geospatial properties of these clusters can be used to address a large variety of oceanographic problems, E.g. front detection, water mass identification, natural region contouring (gyres, eddies), reference profile selection for QC validation, etc. The vertical structure of these clusters furthermore provides a highly synthetic representation of large ocean areas that can be used for dimensionality reduction and coherent intercomparisons of ocean data (re)-analysis or simulations <ref name=":0" />. |
− | | |
− | Two sets of experiments where conducted. The first experiment clusters a total of 3602 CTD profiles, where the final data point is less than 1,000 dbar, according to their temperature value only. Results have shown that the dataset of temperature profiles contains 9 groups of vertically coherent heat patterns, or classes. Each of the temperature profile classes reveals unique and physically coherent heat distributions along the vertical axis. When mapped in space, each of the 9 classes is found to define an oceanic region, even if no spatial information was used in the model determination.
| |
| | | |
− | In the second experiment, both temperature and salinity observations are considered. Results have shown that depending on the season, up to 15 classes are observed, with occasional classes that correspond to sensor failures.
| + | We have applied the predictive model to cluster a total of 3602 CTD profiles, according to their temperature value only, where the final data point is less than 1,000 dbar. Results have shown that the dataset of temperature profiles contains 9 groups of vertically coherent heat patterns, or classes. Each of the temperature profile classes reveals unique and physically coherent heat distributions along the vertical axis. When mapped in space, each of the 9 classes is found to define an oceanic region, even if no spatial information was used in the model determination. The model is also able to show natural phenomenas such as ocean inlets, shown as purple observations in the figure below. |
| + | [[File:Ocean data before after.png|alt=The ocean profiles before and after applying the predictive model|center|thumb|907x907px|'''<small>The ocean profiles before and after applying the predictive model</small>''']] |
| + | In addition, there is a posibiliy of identifying anomlous profiles by examing how much an ocean profile deviate from its own cluster. In the figure below, results have shown that there is a total of 4 profiles that may be anomalous. Those identified profiles belon to clusters 0, 4, and 6. |
| + | [[File:Anomlous profiles.png|center|thumb|531x531px|<small>'''Applying the predictive model to identify anomalous profiles.'''</small>]] |
| + | In the end, the insights gained from the model can provide ocean scientists with a data-driven tool to analyze ocean data. |
| | | |
| == References == | | == References == |