<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
<record>
  <controlfield tag="001">7013</controlfield>
  <controlfield tag="005">20240807164554.0</controlfield>
  <datafield tag="037" ind1=" " ind2=" ">
    <subfield code="a">POSTER-2024-0072</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Bamahry, Fikri</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Application of machine learning to identify outliers in GNSS position time series based on observation’s data quality indicators</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2024</subfield>
  </datafield>
  <datafield tag="269" ind1=" " ind2=" ">
    <subfield code="c">2024-05-09</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">A dense network of GNSS (Global Navigation Satellite Systems) stations has the potential to monitor ground deformations accurately. However, positions may exhibit artificial variations or outliers unrelated to the actual ground motion and can degrade their geodynamic interpretation. We know that data quality degradation is one of the key factors influencing the outliers on GNSS position estimates. Currently, the interpretation process is performed manually by looking at data quality indicator plots, which is time-consuming and subjective to the analyst who interprets the GNSS time series. To have a more robust and efficient system and find the complex correlation between degraded data quality indicators and outliers in daily GNSS position time, we implemented a random forest algorithm to develop a model that can identify degraded GNSS quality that might be causing outliers in position time series. We used the daily position and daily data quality indicators time series of GNSS stations belonging to EUREF GNSS Permanent Network (EPN) to assess the most relevant GNSS data quality indicators that can explain the possible outliers in the GNSS position time series. Furthermore, to understand how the machine learning model predicts the possible outliers, we used the SHapley Additive exPlanations (SHAP) algorithm to explain the prediction that we got from our model. In this presentation, we will present the status of our investigations, the challenges we faced, and the preliminary outcome of this work.</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="a">ESFRIFED/</subfield>
    <subfield code="c"> EF/</subfield>
    <subfield code="f">211/SERVE / SERVE</subfield>
  </datafield>
  <datafield tag="594" ind1=" " ind2=" ">
    <subfield code="a">NO</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">GNSS quality metrics</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">GNSS position time series</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">Machine learning</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Legrand, Juliette</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Bruyninx, Carine</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Pottiaux, Eric</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="t">4th ESA-ECWMF Workshop on Machine Learning for Earth Observation and Prediction, ESA/ESRIN, Frascati, Italy</subfield>
  </datafield>
  <datafield tag="856" ind1="0" ind2=" ">
    <subfield code="f">fikri.bamahry@ksb-orb.be</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">CPOSTER</subfield>
  </datafield>
</record>
</collection>