Guidelines for graph rankings #146
Replies: 3 comments 5 replies
-
Ideas for class-based rankingGranularity of the data model
Interoperability of the data model
Data quality
|
Beta Was this translation helpful? Give feedback.
-
|
In terms of secondary source data, we could also consider whether the data was first entered by the presenter (through a smart form on the third party's website, for example, or through some sort of automated data flow from the presenter's website) or whether it was completely entered manually or scraped by the third party without any input at all from the presenter. On a more micro level, we could also consider whether or not the source yields consistent errors across their data, indicating a systemic problem (ex: the wrong timezone value in their event structured data; automatically assigning all artists in a directory @ type Person when the directory contains a mixture of solo artists, groups and companies). |
Beta Was this translation helpful? Give feedback.
-
|
A draft ranking algorithm is described in this spreadsheet and this companion document. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Here are a few considerations for assigning a ranking to a given graph, for the purpose of syncing data to Artsdata's core graph or determining which data source is passed on to a data consumer:
Primary vs secondary source
Presence of an open data policy and consent
If the user consented to release the data as open data, this implies an understanding that the data will be reused, and therefore an assumption that the user exercised greater care in data population than if they were simply populating one-time disposable data.
Presence of data quality processes
Differentiated ranking by class or by property
Ideally, a system should consider each source not as simple, monolithic artifact, but as a complex artifact with multiple facets. Indeed, a given source may have a higher level of authority for entities of a given class or a for a given property.
Organizationentities. However, it is a slightly less reliable source forPersonandPlaceentities.offersdata than the official organizer's website. Or a given calendar might have more exhaustive data about nestedperformerentities.If you think of additional criterias or considerations, please add them below.
Beta Was this translation helpful? Give feedback.
All reactions