Add process to check for major drops in data between updates #16

@andrewtavis

Description

Based on scribe-org/Scribe-Data#68, we need to keep in mind that a property on Wikidata can change in a way that causes a large drop in data. In the referenced issue, Portuguese verbs use a non-standard past perfect PID that could be merged with the more widely used one at some point.

This issue would look into ways of diffing the current data coverage against the new data coming in. The check could be as simple as comparing total keys and total non-null values of keys of sub-objects. We could then discuss a viable cutoff, and trigger some kind of warning or a Scribe-Data issue if coverage drops too far 😊
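
To make the idea concrete, here's a rough sketch of what such a check might look like in Go. Everything in it is an assumption for discussion and not existing Scribe code: the `Dataset` shape (top-level keys mapping to sub-objects whose values may be null), the names `Coverage`, `Measure`, `DropRatio` and `CheckDrop`, and the cutoff value would all need to be agreed on.

```go
package main

import "fmt"

// Dataset is a hypothetical shape for one language's data:
// top-level key -> sub-object, where a nil value means "no data".
type Dataset map[string]map[string]any

// Coverage holds the two simple counts suggested in this issue.
type Coverage struct {
	Keys    int // total top-level keys
	NonNull int // total non-nil values across all sub-objects
}

// Measure counts the top-level keys and the non-nil sub-object values.
func Measure(data Dataset) Coverage {
	c := Coverage{Keys: len(data)}
	for _, sub := range data {
		for _, v := range sub {
			if v != nil {
				c.NonNull++
			}
		}
	}
	return c
}

// DropRatio returns the fractional decrease from oldCount to newCount.
// It returns 0 when there was nothing before or when nothing was lost.
func DropRatio(oldCount, newCount int) float64 {
	if oldCount == 0 || newCount >= oldCount {
		return 0
	}
	return float64(oldCount-newCount) / float64(oldCount)
}

// CheckDrop compares current and incoming coverage and returns one
// warning per count whose drop exceeds cutoff (e.g. 0.2 for 20%).
func CheckDrop(current, incoming Coverage, cutoff float64) []string {
	var warnings []string
	if r := DropRatio(current.Keys, incoming.Keys); r > cutoff {
		warnings = append(warnings, fmt.Sprintf(
			"keys dropped by %.0f%% (%d -> %d)", r*100, current.Keys, incoming.Keys))
	}
	if r := DropRatio(current.NonNull, incoming.NonNull); r > cutoff {
		warnings = append(warnings, fmt.Sprintf(
			"non-null values dropped by %.0f%% (%d -> %d)", r*100, current.NonNull, incoming.NonNull))
	}
	return warnings
}
```

A caller could run `CheckDrop(Measure(current), Measure(incoming), cutoff)` before accepting an update, and log the warnings or open a Scribe-Data issue when the returned slice is non-empty. Counting per sub-object key instead of in total would likely catch the single-PID case from the referenced issue more reliably, but that's a design choice to discuss.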

Contribution

Would be happy to discuss! Could also help implement, but might be better if others get to this eventually as I'm a long way off on Go :)

Metadata

Assignees

No one assigned

    Labels

    blocked (Another issue is blocking); help wanted (Extra attention is needed); question (Further information is requested)

    Projects

    Status

    Todo
