Progress in lexicographic data in Wikidata 2023

Revision as of 08:57, 5 January 2024 by Denny (talk | contribs)

Here are some highlights of the progress in lexicographic data in Wikidata in 2023.

What does coverage mean? Given a text (usually Wikipedia in that language, but in some cases a corpus from the Leipzig Corpora Collection), coverage is the share of word occurrences in that text that are already represented as forms in Wikidata's lexicographic data. Note that each additional percentage point is much harder to gain than the previous one: an increase from 1% to 2% usually needs far less work than an increase from 91% to 92%.
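To make the definition concrete, here is a minimal sketch of how such a coverage number could be computed. The function name `token_coverage`, the toy sentence, and the whitespace tokenization are assumptions made for illustration; this is not the actual pipeline used for the numbers on this page.

```python
from collections import Counter


def token_coverage(tokens, known_forms):
    """Return the share of token occurrences whose form is in known_forms.

    tokens: iterable of word occurrences from a corpus (hypothetical input).
    known_forms: set of form strings, standing in for the forms available
    in Wikidata's lexicographic data (hypothetical input).
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return 0.0  # an empty text has no occurrences to cover
    covered = sum(n for form, n in counts.items() if form in known_forms)
    return covered / total


# Toy example: 11 occurrences, 6 of which match a known form.
tokens = "the cat sat on the mat and the dog sat too".split()
known_forms = {"the", "sat", "on"}
coverage = token_coverage(tokens, known_forms)
print(f"{coverage:.1%}")
```

Because the measure counts occurrences rather than distinct words, a form that appears often in the text contributes more to coverage than a form that appears once.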
