Status of Machine Learning for Argo QC #6

gmaze · 2019-12-04T10:20:59Z

I'd like to open a discussion thread to get the status of developments with regard to the use of Machine Learning techniques in Argo QC procedures.

Different groups may have started to explore this possibility, and it would be useful to collect the status of these efforts here, both to avoid duplicated work and to gather feedback.

This could include a description of:

the target variables (e.g. QC flag for one TEMP measurement, QC flag for one PSAL profile, ...)
the choice of features (explanatory variables)
the ML method (e.g. random forest)
the dataset used
the overall performance or difficulties encountered
anything you think relevant wrt this topic

gmaze · 2019-12-04T10:39:10Z

At Ifremer/LOPS, we've tried the following:

Target variables:

Alarm status (True, False) of the ISAS13 test against climatology for one PSAL measurement

Features:

A "patch" of variables taken from the same profile as the target and from the two profiles before and after it (±2). Variables used: TEMP, PSAL, SIG0 and PRES.
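To make the idea of a patch concrete, here is a minimal sketch of how such a feature vector might be assembled. It assumes the profiles of one float are already interpolated onto common levels and stored as dictionaries of lists; the function name `build_patch`, the single-level patch and the clamping at the ends of the series are illustrative assumptions, not a description of the code actually used.

```python
# Variables named in the post; everything else here is hypothetical.
VARIABLES = ("TEMP", "PSAL", "SIG0", "PRES")


def build_patch(profiles, profile_index, level_index, half_width=2):
    """Flatten VARIABLES at one level for the target profile and its neighbours.

    profiles: list of dicts, one per profile in time order, each mapping a
    variable name to a list of values on common levels (an assumption).
    """
    n = len(profiles)
    features = []
    for offset in range(-half_width, half_width + 1):
        # Clamp at the ends of the series (an assumption; padding with
        # missing values would be another option).
        j = min(max(profile_index + offset, 0), n - 1)
        for name in VARIABLES:
            features.append(profiles[j][name][level_index])
    return features
```

With `half_width=2` and four variables, each target measurement gets a vector of 20 features.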

ML method

Random forest

Dataset used

Argo snapshot from 2016/02 and ISAS team QC logs.

Overall performance or difficulties encountered

Performance is not stable. We wanted to use a "balanced" training set with as many True as False samples. But because there are many more False than True samples, we need to sub-sample the False alarm set. We then run into the difficulty of selecting statistically "similar" sub-samples. Overall performance is highly sensitive to this sub-sampling.
The True/False alarms training set is highly imbalanced simply because the ISAS13 test against climatology is not an effective test and raises too many False alarms.
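One way to quantify this sensitivity is to repeat the balanced sub-sampling with different random seeds and look at the spread of the resulting scores. The sketch below only illustrates that idea: `balanced_indices` and `score_spread` are hypothetical names, and `score_fn` stands for whatever routine trains and evaluates a model on a given subset.

```python
import random
from statistics import mean, pstdev


def balanced_indices(labels, seed):
    """Return sorted indices of a subset with as many True as False labels."""
    rng = random.Random(seed)
    true_idx = [i for i, y in enumerate(labels) if y]
    false_idx = [i for i, y in enumerate(labels) if not y]
    # Down-sample the larger class to the size of the smaller one.
    n = min(len(true_idx), len(false_idx))
    return sorted(rng.sample(true_idx, n) + rng.sample(false_idx, n))


def score_spread(labels, score_fn, seeds):
    """Mean and standard deviation of score_fn over several balanced draws.

    score_fn is a hypothetical callable that takes the selected indices and
    returns a single performance number.
    """
    scores = [score_fn(balanced_indices(labels, seed)) for seed in seeds]
    return mean(scores), pstdev(scores)
```

A standard deviation that is large compared with the differences between candidate models would confirm that the sub-sampling, rather than the model, dominates the result.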

gaelforget · 2020-01-24T01:57:30Z

> The True/False alarms training set is highly imbalanced simply because the ISAS13 test against climatology is not an effective test and raises too many False alarms.

Not sure if that helps or if I totally understand but would it make sense to consider using several climatology products and e.g. counting the # of alarms (e.g. 0/6 vs 6/6) and setting a threshold? I used to do something like that in the MITprof QC for ECCO (I was using the min of cost functions if I recall).
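A minimal sketch of the alarm-counting idea, assuming each climatology product provides a local mean and standard deviation and that an alarm is a simple n-sigma exceedance. That test, the names `count_alarms` and `is_suspect`, and the default `n_sigma` are illustrative assumptions; this is neither the ISAS13 test nor the MITprof cost-function approach.

```python
def count_alarms(value, references, n_sigma=3.0):
    """Count how many climatologies raise an alarm for one measurement.

    references: list of (mean, std) pairs, one per climatology product.
    """
    return sum(1 for m, s in references if abs(value - m) > n_sigma * s)


def is_suspect(value, references, min_alarms, n_sigma=3.0):
    """Flag the measurement when at least min_alarms climatologies disagree."""
    return count_alarms(value, references, n_sigma) >= min_alarms
```

With six products, `min_alarms=6` would flag only values that every climatology rejects, while `min_alarms=1` reproduces the behaviour of the most sensitive single test.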

gmaze · 2020-01-25T11:06:47Z

@gaelforget this is a good suggestion, and one we have started to experiment with as well: taking the final decision on the basis of several QC test outcomes.
But the choice of the acceptable distance to the climatology is as important as the climatology value itself. One would need an "optimization" approach where, based on the historical dataset, we determine the best combination of distance and reference to detect bad data.
This, however, points to another problem: the distance beyond which a measurement is declared "bad" depends in practice on the user application. This is particularly true for data assimilation, where the data need to be reasonably compatible with the numerical ocean simulation produced by the model.
This finally leads us to the conclusion that the best we could do is to compute a goodness probability for the data and leave it to the user to define a threshold.
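As one possible reading of a "goodness probability", the sketch below scores a measurement by the two-sided Gaussian tail probability of its deviation from a climatological mean, and leaves the acceptance threshold to the caller. The Gaussian assumption and the names `goodness_probability` and `accept` are illustrative only; a trained classifier's output probability could play the same role.

```python
import math


def goodness_probability(value, clim_mean, clim_std):
    """Probability of a deviation at least this large under a Gaussian model.

    Returns 1.0 for a value equal to the climatological mean and tends to 0
    as the value moves away from it.
    """
    z = abs(value - clim_mean) / clim_std
    return math.erfc(z / math.sqrt(2.0))


def accept(value, clim_mean, clim_std, min_probability):
    """Apply a user-defined threshold to the goodness probability."""
    p = goodness_probability(value, clim_mean, clim_std)
    return p >= min_probability
```

Two users can then apply different thresholds to the same score: a strict application might require `min_probability=0.1`, while a more permissive one might accept anything above `0.01`.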

gmaze added the "enhancement" (New feature or request) and "procedure" (About a specific procedure) labels · Dec 4, 2019
