<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://simia.net/index.php?action=history&amp;feed=atom&amp;title=Wikidata_lexicographic_data_coverage_for_Croatian_in_2024</id>
	<title>Wikidata lexicographic data coverage for Croatian in 2024 - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://simia.net/index.php?action=history&amp;feed=atom&amp;title=Wikidata_lexicographic_data_coverage_for_Croatian_in_2024"/>
	<link rel="alternate" type="text/html" href="http://simia.net/index.php?title=Wikidata_lexicographic_data_coverage_for_Croatian_in_2024&amp;action=history"/>
	<updated>2026-05-05T18:56:49Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.32.0</generator>
	<entry>
		<id>http://simia.net/index.php?title=Wikidata_lexicographic_data_coverage_for_Croatian_in_2024&amp;diff=2658&amp;oldid=prev</id>
		<title>Denny: Created page with &quot;{{pubdate|{{subst:CURRENTDAY}}|{{subst:CURRENTMONTHNAME}}|{{subst:CURRENTYEAR}}}}  For last year I picked up Wikidata lexicographic data coverage for Croatian in 2023|an amb...&quot;</title>
		<link rel="alternate" type="text/html" href="http://simia.net/index.php?title=Wikidata_lexicographic_data_coverage_for_Croatian_in_2024&amp;diff=2658&amp;oldid=prev"/>
		<updated>2025-01-21T07:45:32Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;{{pubdate|{{subst:CURRENTDAY}}|{{subst:CURRENTMONTHNAME}}|{{subst:CURRENTYEAR}}}}  For last year I picked up Wikidata lexicographic data coverage for Croatian in 2023|an amb...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{pubdate|21|January|2025}}&lt;br /&gt;
&lt;br /&gt;
For last year I picked up [[Wikidata lexicographic data coverage for Croatian in 2023|an ambitious goal for growing the lexicographic data for Croatian]] in 2024. And, just like last year, I missed again.&lt;br /&gt;
&lt;br /&gt;
My goal was to grow the coverage to 50% - i.e. half of all the words in a Croatian corpus would be found in Wikidata. Instead, we grew from 45.5% to 47.9%. The number of forms grew from 4115 to 5506, more than a thousand new forms, a far bigger growth in forms than last year. So, even though the goal was missed, the speed of growth in Croatian is accelerating.&lt;br /&gt;
&lt;br /&gt;
Part of that growth in forms is due to [https://github.com/google-research-datasets/WordGraph Google's Wordgraph release], a free dataset with words in about 40 languages which describe people - both demonyms and professions.&lt;br /&gt;
&lt;br /&gt;
Do I want to set again a goal? After missing it twice, I am hesitant. Would I again reduce the goal further? But less than 50% sounds defeatist. But back to 60% is obviously too much. So, yes, let's go for 50% again. Let's see where it will take us this time. It's only 2.1% of coverage away from 50%, so that should be doable.&lt;br /&gt;
&lt;br /&gt;
* [https://www.wikidata.org/w/index.php?title=Wikidata%3ALexicographical_coverage%2Fhr%2FStatistics&amp;amp;diff=2296112426&amp;amp;oldid=2044020185 Changes in Croatian coverage in 2024]&lt;br /&gt;
* [https://www.wikidata.org/wiki/Wikidata:Lexicographical_coverage/hr/Missing Top 1000 missing words in Croatian]&lt;br /&gt;
&lt;br /&gt;
{{tag|Simia}}&lt;br /&gt;
&amp;lt;noinclude&amp;gt;{{simiapost|english}}&amp;lt;/noinclude&amp;gt;&lt;/div&gt;</summary>
		<author><name>Denny</name></author>
		
	</entry>
</feed>