Add “data quality” indication for collected and processed data

Created on Tuesday 13 April 2021, 08:20

Back to task list
  • ID
    982104
  • Project
    Metabolism of Cities Data Hub
  • Status
    Open
  • Priority
    Medium
  • Type
    Programming work
  • Tags
    Data Hub Priority Plan 2021 General data hub improvements
  • Assigned to
    No one yet
  • Subscribers
    Carolin Bellstedt

You are not logged in

Log In Register

Please join us and let's build things, together!

Description

In a previous iteration of this site and as part of MultipliCity, we had a very nice pedrigree matrix for data quality.

We have previously discussed to bring it back at some point and now the need seems to arise more for it, as we will be introducing various levels of accuracy due to the downscaling calculations, as well as having access to data where we better know if it only has been estimated even if already at city level, for example. To have an indication / indicator for the data quality would therefore be quite nice. Eventually, this indication could also show up in different places, even as part of visualisations for example.

There are two places where such an indication could be made and makes sense right now:

  1. Data collection, e.g. if data you collected which is at the level of the city has been estimated and you want to mark that it is not accurate.
  2. Data processing, e.g. if a processor is downscaling data, it could be marked as of "less quality" automatically and even been given a certain degree depending on the proxy or type of downscaling.

In order to support this task and make it happen, it very likely also requires some discussion on what those degrees of quality, such as on reliability, accuracy, completeness etc. are, plus some front-end work, but we can also use this space to get this started.

Discussion and updates


New task was created