This page lists how similar 260 languages are to each other. Hover over a language code for the language name. Click to see more explanation.
English | Simple English Scots Novial Zhuang Anglo-Saxon Siswati all |
German | Bavarian Alemannic Luxembourgish Pennsylvania German Ripuarian Low Saxon all |
French | Occitan Catalan Asturian Interlingue Ligurian Interlingua all |
Dutch | Dutch Low Saxon West Flemsih Limburgish Afrikaans Low Saxon Zealandic all |
Italian | Venetian Ligurian Corsican Interlingua Neapolitan Asturian all |
Polish | Silesian Cassubian Lower Sorbian Slovak Upper Sorbian Slovene all |
Spanish | Chavacano Asturian Galician Portuguese Aragonese Extremaduran all |
Russian | Bulgarian Serbian Ukrainian Rusyn Macedonian Moldavian all |
Portuguese | Galician Spanish Chavacano Mirandese Asturian Venetian all |
Swedish | Nynorsk Norwegian Danish Novial Dutch Low Saxon Anglo-Saxon all |
Catalan | Asturian Occitan Spanish Chavacano Aragonese Galician all |
Ukrainian | Rusyn Russian Serbian Bulgarian Macedonian Belarusian all |
Norwegian | Danish Nynorsk Swedish Dutch Low Saxon Frisian Dutch all |
Finnish | Estonian Voro Oromo Nynorsk Italian Norwegian all |
Vietnamese | Central Bicolano Min Nan Zhuang Wu Tok Pisin Bislama all |
Czech | Slovak Slovene Serbocroatian Croatian Bosnian Esperanto all |
Hungarian | Norwegian Nynorsk Danish Italian Swedish Ligurian all |
Korean | Cantonese Wu Novial English Simple English Scots all |
Indonesian | Malay Banjar Banyumasan Javanese Sundanese Tagalog all |
Turkish | Gagauz Crimean Tatar Karakalpak Azeri Turkmen Nynorsk all |
Romanian | Venetian Latin Sardinian Interlingua Corsican Italian all |
Farsi | Gilaki Mazandarani Western Panjabi Pashto Sorani Urdu all |
Arabic | Egyptian Arabic Pashto Sindhi Farsi Urdu Gilaki all |
Danish | Norwegian Nynorsk Swedish Dutch Low Saxon Anglo-Saxon German all |
Esperanto | Ladino Venetian Sardinian Chavacano Spanish Galician all |
Serbian | Macedonian Bulgarian Russian Ukrainian Moldavian Rusyn all |
Lithuanian | Latvian Esperanto Ladino Serbocroatian Bosnian Slovene all |
Slovene | Serbocroatian Croatian Bosnian Slovak Czech Esperanto all |
Slovak | Czech Slovene Serbocroatian Croatian Bosnian Esperanto all |
Malay | Indonesian Banjar Banyumasan Javanese Sundanese Tagalog all |
Hebrew | Yiddish Cantonese Wu Korean Novial English all |
Bulgarian | Macedonian Russian Serbian Moldavian Ukrainian Rusyn all |
Kazakh | Kirghiz Tatar Karachay-Balkar Bashkir Sakha Belarusian all |
Baque | Sardinian Papiamentu Ladino Romanian Catalan Occitan all |
Volapük | Papiamentu Novial Occitan French Ladino Bambara all |
Waray-Waray | Cebuano Central Bicolano Occitan Ladino Sundanese Portuguese all |
Croatian | Bosnian Serbocroatian Slovene Slovak Czech Esperanto all |
Hindi | Nepali Sanskrit Marathi Bihari Nepal Bhasa Pali all |
Estonian | Voro Finnish Esperanto Latin Sardinian Ladino all |
Azeri | Gagauz Turkish Crimean Tatar Karakalpak Turkmen Uzbek all |
Galician | Spanish Chavacano Portuguese Asturian Aragonese Mirandese all |
Simple English | English Scots Novial Zhuang Anglo-Saxon Siswati all |
Nynorsk | Norwegian Danish Swedish Faroese Icelandic Dutch Low Saxon all |
Thai | Wu English Simple English Novial Scots Cantonese all |
Nepal Bhasa | Sanskrit Nepali Hindi Marathi Bihari Pali all |
Greek | Pontic Wu Cantonese Novial English Korean all |
Latin | Interlingua Corsican Interlingue Sardinian Extremaduran Venetian all |
Aromanian | Romanian Estonian Sardinian Venetian Interlingue Latin all |
Occitan | Catalan French Chavacano Aragonese Spanish Asturian all |
Tagalog | Banjar Central Bicolano Javanese Kapampangan Banyumasan Indonesian all |
Macedonian | Bulgarian Serbian Russian Moldavian Ukrainian Rusyn all |
Georgian | Mingrelian Wu Cantonese Korean Simple English English all |
Haitian | Papiamentu Banyumasan Tagalog Banjar Central Bicolano Ladino all |
Serbocroatian | Bosnian Croatian Slovene Slovak Czech Esperanto all |
Piedmontese | Venetian Occitan Ligurian Friulian French Italian all |
Telugu | Wu Cantonese English Simple English Scots Novial all |
Tamil | Wu Cantonese Korean Novial English Simple English all |
Cebuano | Tagalog Waray-Waray Central Bicolano Banjar Javanese Indonesian all |
Belarusian | Belarusian Rusyn Russian Ukrainian Serbian Bulgarian all |
Breton | Sardinian Norwegian Venetian Italian Nynorsk French all |
Albanian | Novial Venetian Ladino Esperanto Papiamentu Italian all |
Latvian | Lithuanian Latgalian Ladino Esperanto Serbocroatian Bosnian all |
Javanese | Kapampangan Banyumasan Banjar Indonesian Malay Sundanese all |
Belarusian | Belarusian Rusyn Russian Ukrainian Serbian Bulgarian all |
Welsh | Cornish Breton Sardinian Nynorsk Italian Venetian all |
Luxembourgish | Alemannic Pennsylvania German German Ripuarian Low Saxon Bavarian all |
Malagasy | Bambara Banjar Sundanese Papiamentu Hausa Javanese all |
Marathi | Nepali Sanskrit Hindi English Simple English Scots all |
Icelandic | Faroese Nynorsk Swedish Norwegian Anglo-Saxon Danish all |
Bosnian | Serbocroatian Croatian Slovene Slovak Czech Esperanto all |
Yoruba | Novial Igbo English Siswati Romanian Simple English all |
Aragonese | Spanish Chavacano Asturian Galician Extremaduran Catalan all |
Bishnupriya Manipuri | Bengali Assamese English Simple English Novial Scots all |
Lombard | Venetian Italian Catalan Ligurian Occitan Interlingue all |
Frisian | Dutch Low Saxon West Flemsih Norwegian Saterland Frisian Limburgish Dutch all |
Swahili | Shona Hausa Zulu Banjar Javanese Tsonga all |
Bengali | Assamese Bishnupriya Manipuri English Simple English Novial Scots all |
Ido | Esperanto Norfolk Ladino Papiamentu Venetian Sardinian all |
Malayalam | Wu Cantonese Korean Novial English Simple English all |
Gujarati | English Simple English Scots Wu Cantonese Novial all |
Western Panjabi | Urdu Farsi Gilaki Mazandarani Pashto Sorani all |
Afrikaans | Dutch Dutch Low Saxon Limburgish West Flemsih Zealandic German all |
Armenian | Cantonese Wu Korean English Novial Simple English all |
Nepali | Sanskrit Hindi Marathi Bihari Nepal Bhasa English all |
Low Saxon | Ripuarian Luxembourgish German Dutch Low Saxon Alemannic Dutch all |
Sicilian | Corsican Sardinian Neapolitan Ligurian Latin Italian all |
Urdu | Western Panjabi Farsi Pashto Gilaki Mazandarani Kashmiri all |
Kurdish | Zazaki Romani Hausa Indonesian Malay Kirundi all |
Cantonese | Wu English Simple English Novial Scots Zhuang all |
Quechua | Aymara Hausa Corsican Swahili Banjar Aragonese all |
Sundanese | Banjar Javanese Banyumasan Indonesian Malay Tagalog all |
Zazaki | Kurdish Romani Hausa Ladino Papiamentu Indonesian all |
Asturian | Spanish Chavacano Galician Extremaduran Catalan Aragonese all |
Tatar | Kirghiz Kazakh Karachay-Balkar Sakha Bashkir Eastern Mari all |
Irish | Scots Gaelic Scots Bavarian Simple English English Wu all |
Neapolitan | Italian Ligurian Venetian Corsican Sardinian Sicilian all |
Chuvash | Moldavian Tatar Bulgarian Russian Macedonian Serbian all |
Samogitian | Lithuanian Latgalian Latvian Esperanto Finnish Slovene all |
Interlingua | Interlingue Venetian Italian Chavacano Catalan Latin all |
Walloon | French Picard Occitan Catalan Asturian Arpitan all |
Amharic | Tigrinya Wu English Novial Cantonese Simple English all |
Kannada | Wu Cantonese Korean English Simple English Novial all |
Alemannic | Bavarian Palatinate Pennsylvania German Ripuarian German Luxembourgish all |
Banyumasan | Indonesian Javanese Banjar Malay Sundanese Tagalog all |
Buginese | Occitan Hausa Venetian Malay Neapolitan Corsican all |
Burmese | English Simple English Novial Scots Interlingua Interlingue all |
Min Nan | Hakka Shona Igbo Vietnamese Min Dong Sicilian all |
Tajik | Serbian Russian Bulgarian Kirghiz Moldavian Ukrainian all |
Venetian | Italian Ligurian Chavacano Spanish Friulian Portuguese all |
Yiddish | Hebrew Wu Cantonese Korean Ladino Novial all |
Tarantino | Neapolitan Italian Ligurian Venetian Sardinian Corsican all |
Scots Gaelic | Irish Bavarian Norman Wu Icelandic Welsh all |
Ossetic | Tajik Karachay-Balkar Kirghiz Russian Moldavian Kazakh all |
Egyptian Arabic | Arabic Pashto Sindhi Farsi Urdu Gilaki all |
Nahuatl | Spanish Chavacano Aragonese Asturian Extremaduran Galician all |
Sakha | Kirghiz Tatar Buryat Kazakh Chechen Karachay-Balkar all |
Scots | Simple English English Novial Zhuang Siswati Wu all |
Uzbek | Hausa Javanese Ladino Central Bicolano Karakalpak Kapampangan all |
Kapampangan | Javanese Tagalog Banyumasan Banjar Indonesian Malay all |
Fiji Hindi | Novial Interlingue Interlingua Latin Scots English all |
Sanskrit | Nepali Hindi Marathi Pali Nepal Bhasa Bihari all |
Mongolian | Buryat Kalmyk Kirghiz Sakha Chechen Tajik all |
Upper Sorbian | Lower Sorbian Silesian Slovene Bosnian Croatian Serbocroatian all |
Maori | Tongan Javanese Banjar Sundanese Malay Tagalog all |
Limburgish | Dutch Low Saxon Dutch West Flemsih Afrikaans Zealandic German all |
Bashkir | Kazakh Tatar Kirghiz Karachay-Balkar Sakha Eastern Mari all |
Corsican | Sicilian Sardinian Italian Venetian Ligurian Neapolitan all |
Sinhala | English Simple English Scots Novial Zhuang Wu all |
Gan | Cantonese Wu English Simple English Novial Scots all |
Gilaki | Farsi Mazandarani Pashto Western Panjabi Urdu Sorani all |
Faroese | Icelandic Nynorsk Swedish Norwegian Danish Estonian all |
Central Bicolano | Tagalog Javanese Banjar Indonesian Malay Sundanese all |
Sorani | Farsi Gilaki Western Panjabi Mazandarani Urdu Pashto all |
Bavarian | Alemannic German Pennsylvania German Palatinate Ripuarian Luxembourgish all |
Tibetan | Dzongkha Norfolk Turkish English Novial Wu all |
Western Mari | Eastern Mari Kirghiz Russian Tatar Moldavian Bulgarian all |
Voro | Estonian Finnish Esperanto Sardinian Nynorsk Oromo all |
Dutch Low Saxon | West Flemsih Limburgish Dutch Afrikaans Zealandic Low Saxon all |
Ilokano | Central Bicolano Sundanese Banjar Javanese Malay Indonesian all |
Kirghiz | Kazakh Tatar Karachay-Balkar Sakha Eastern Mari Tajik all |
Turkmen | Turkish Gagauz Crimean Tatar Karakalpak Azeri Indonesian all |
West Flemsih | Dutch Low Saxon Dutch Limburgish Zealandic Afrikaans German all |
Northern Sami | Finnish Estonian Voro Nynorsk Swedish Italian all |
Manx | Cornish Nynorsk Aromanian Italian Finnish Sardinian all |
Divehi | Wu Cantonese English Simple English Scots Novial all |
Norman | French Picard Arpitan Occitan Ligurian Sardinian all |
Rusyn | Ukrainian Russian Serbian Bulgarian Belarusian Belarusian all |
Pangasian | Sundanese Banjar Javanese Tagalog Swahili Bambara all |
Punjabi | English Simple English Scots Novial Wu Zhuang all |
Romansh | Sardinian Venetian Ligurian Italian Corsican Occitan all |
Mazandarani | Farsi Gilaki Pashto Western Panjabi Urdu Sorani all |
Pashto | Gilaki Farsi English Mazandarani Simple English Sindhi all |
Khmer | English Simple English Scots Novial Zhuang Wu all |
Udmurt | Komi Russian Komi-Permyak Ukrainian Moldavian Kirghiz all |
Friulian | Venetian Ligurian Italian Sardinian Corsican Occitan all |
Cassubian | Polish Silesian Lower Sorbian Slovak Serbocroatian Upper Sorbian all |
Wu | Cantonese English Simple English Novial Scots Zhuang all |
Maltese | Corsican Italian Sicilian Sardinian Ligurian Ladino all |
Uyghur | Pashto Egyptian Arabic Farsi Gilaki Arabic Sorani all |
Ligurian | Venetian Italian Sardinian Neapolitan Corsican Portuguese all |
Komi-Permyak | Komi Udmurt Russian Moksha Ukrainian Tatar all |
Pali | Bihari Sanskrit Hindi Nepali Nepal Bhasa Marathi all |
Eastern Mari | Western Mari Kirghiz Russian Tatar Moldavian Bulgarian all |
Komi | Komi-Permyak Udmurt Russian Moldavian Kirghiz Serbian all |
Anglo-Saxon | English Simple English Danish Novial Nynorsk Norwegian all |
Ladino | Chavacano Spanish Asturian Venetian Portuguese Galician all |
Bihari | Pali Hindi Nepali Sanskrit Marathi English all |
Sardinian | Corsican Venetian Ligurian Italian Chavacano Portuguese all |
Novial | English Simple English Scots Interlingue Interlingua Venetian all |
Classical Chinese | Cantonese Wu Novial English Simple English Danish all |
Ripuarian | Alemannic Pennsylvania German Palatinate Bavarian Low Saxon Luxembourgish all |
Chavacano | Spanish Asturian Galician Portuguese Extremaduran Aragonese all |
Somali | Oromo Wolof Hausa Banjar Bambara Malay all |
Hakka | Min Nan Min Dong Venda Vietnamese Zulu Setswana all |
Navajo | Somali Wolof Oromo Estonian Swahili Voro all |
Cornish | Norwegian Breton Nynorsk Danish Novial Ladino all |
Arpitan | Occitan French Extremaduran Asturian Catalan Aragonese all |
Saterland Frisian | Dutch Low Saxon West Flemsih Frisian North Frisian Dutch German all |
Silesian | Polish Lower Sorbian Upper Sorbian Czech Slovene Croatian all |
Extremaduran | Asturian Chavacano Spanish Aragonese Galician Catalan all |
Oriya | English Simple English Scots Novial Zhuang Anglo-Saxon all |
Interlingue | Interlingua Venetian Italian Occitan Catalan Novial all |
Picard | French Occitan Walloon Friulian Ligurian Venetian all |
Kalmyk | Mongolian Buryat Kirghiz Russian Tajik Karachay-Balkar all |
Hawai'ian | Bambara Swahili Tsonga Aromanian Javanese Romani all |
Mingrelian | Georgian Wu Cantonese Korean Novial Simple English all |
Kinyarwanda | Kirundi Swahili Hausa Indonesian Malay Banjar all |
Lingala | Kongo Zulu Javanese Swahili Tagalog Indonesian all |
North Frisian | Dutch Low Saxon Dutch Low Saxon Limburgish Ripuarian Saterland Frisian all |
Pennsylvania German | Alemannic Palatinate Bavarian Ripuarian Luxembourgish German all |
Palatinate | Alemannic Pennsylvania German Bavarian Ripuarian Luxembourgish German all |
Aymara | Quechua Hausa Swahili Banjar Bambara Shona all |
Karachay-Balkar | Kirghiz Kazakh Tatar Sakha Tajik Russian all |
Tongan | Maori Sundanese Javanese Zulu Siswati Tsonga all |
Acehnese | Sundanese Banjar Tagalog Banyumasan Indonesian Javanese all |
Emilian-Romagnol | Friulian Venetian Ligurian Romansh Neapolitan Corsican all |
Crimean Tatar | Turkish Gagauz Karakalpak Azeri Turkmen Zazaki all |
Chechen | Avar Sakha Lak Abkhazian Kirghiz Mongolian all |
Guarani | Spanish Chavacano Portuguese Galician Ladino Asturian all |
Erzya | Moksha Russian Moldavian Rusyn Ukrainian Bulgarian all |
Zealandic | West Flemsih Dutch Low Saxon Limburgish Dutch Afrikaans Luxembourgish all |
Aramaic | Wu Cantonese Korean Interlingue Chavacano Novial all |
Greenlandic | Finnish Oromo Estonian Voro Sicilian Corsican all |
Papiamentu | Ladino Chavacano Spanish Venetian Portuguese Galician all |
Gagauz | Turkish Crimean Tatar Azeri Karakalpak Turkmen Hausa all |
Banjar | Indonesian Malay Banyumasan Javanese Sundanese Tagalog all |
Lak | Avar Chechen Karachay-Balkar Tajik Sakha Kirghiz all |
Tok Pisin | Bislama Central Bicolano Novial Siswati Norfolk Javanese all |
Wolof | Oromo Somali North Frisian Estonian Finnish Bambara all |
Lojban | Estonian Extremaduran Corsican Venetian Sardinian Serbocroatian all |
Assamese | Bengali English Simple English Scots Novial Bishnupriya Manipuri all |
Moksha | Erzya Russian Rusyn Tajik Ukrainian Serbian all |
Avar | Lak Chechen Kirghiz Karachay-Balkar Tajik Buryat all |
Kabyle | Nynorsk Italian Finnish Norwegian Corsican Occitan all |
Lower Sorbian | Upper Sorbian Silesian Slovene Serbocroatian Croatian Bosnian all |
Tahitian | Ilokano Samoan Tetum Baque Corsican Interlingua all |
Shona | Swahili Kirundi Kinyarwanda Tsonga Romani Siswati all |
Sranan | Banjar Banyumasan Papiamentu Indonesian Javanese Sundanese all |
Laotian | English Simple English Scots Novial Wu Cantonese all |
Abkhazian | Kirghiz Chechen Sakha Karachay-Balkar Tatar Kazakh all |
Igbo | English Simple English Novial Scots Anglo-Saxon Siswati all |
Nauruan | Sundanese Indonesian Malay Javanese Banyumasan Tsonga all |
Tetum | Ladino Portuguese Papiamentu Sardinian Galician Mirandese all |
Kongo | Zulu Lingala Tsonga Swahili Javanese Siswati all |
Mirandese | Portuguese Galician Spanish Chavacano Asturian Extremaduran all |
Karakalpak | Turkish Gagauz Crimean Tatar Azeri Turkmen Uzbek all |
Latgalian | Latvian Lithuanian Romanian Esperanto Estonian Serbocroatian all |
Northern Sotho | Setswana Siswati Zulu Tsonga Samoan Novial all |
Romani | Novial Hausa Siswati Anglo-Saxon Indonesian Ladino all |
Old Church Slavonic | Rusyn Russian Serbian Bulgarian Ukrainian Belarusian all |
Karbadian | Karachay-Balkar Ossetic Avar Buryat Chechen Lak all |
Setswana | Northern Sotho Siswati Tsonga Zulu Malay Indonesian all |
Samoan | Sundanese Banjar Tongan Bambara Estonian Swahili all |
Moldavian | Bulgarian Russian Serbian Macedonian Ukrainian Rusyn all |
Sindhi | Pashto Egyptian Arabic Arabic Farsi Gilaki Western Panjabi all |
Bislama | Tok Pisin Central Bicolano Interlingue Norfolk Esperanto Tongan all |
Bambara | Hausa Banjar Akan Central Bicolano Swahili Javanese all |
Inupiak | Estonian Sicilian Corsican Latin Interlingue Interlingua all |
Siswati | Zulu English Simple English Novial Scots Tsonga all |
Inuktitut | Inupiak Greenlandic Sicilian Latin Novial Corsican all |
Norfolk | Novial Ladino Chavacano Spanish Extremaduran Asturian all |
Zhuang | English Simple English Scots Novial German Anglo-Saxon all |
Cherokee | Novial Interlingua Latin Interlingue English Venetian all |
Pontic | Greek Wu Cantonese Novial Korean Venetian all |
Gothic | Nynorsk Hausa Faroese Papiamentu Estonian Icelandic all |
Min Dong | Min Nan Hakka Vietnamese Kapampangan Javanese Zhuang all |
Ewe | Siswati Igbo Akan Novial Swahili Zulu all |
Hausa | Banjar Bambara Swahili Sundanese Kirundi Kinyarwanda all |
Zulu | Siswati Tsonga Javanese Swahili Kirundi Kinyarwanda all |
Tigrinya | Amharic Alemannic Anglo-Saxon Bavarian Palatinate Pennsylvania German all |
Kashmiri | Urdu Western Panjabi Farsi Gilaki Pashto Mazandarani all |
Buryat | Mongolian Kalmyk Kirghiz Sakha Tajik Karachay-Balkar all |
Oromo | Somali Wolof Finnish Bambara Hausa Banjar all |
Venda | Tsonga Zulu Shona Siswati Swahili Setswana all |
Tsonga | Zulu Siswati Swahili Shona Javanese Kongo all |
Sangro | Tsonga Kurdish Kongo Banyumasan Lingala Indonesian all |
Kirundi | Kinyarwanda Shona Swahili Hausa Zulu Banjar all |
Cree | Swahili Shona Romani Central Bicolano Siswati Hausa all |
Dzongkha | Tibetan English Simple English Scots Norfolk Novial all |
Akan | Bambara Papiamentu Hausa Sundanese Banjar Oromo all |
The results on this page are based on the comparison of letter trigram frequency in the given languages. This means, we took the text of 262 language editions of Wikipedia, counted how often three letters in a row appear, and compared the result with each other to figure out how similar the languages are — in this respect.
The alphabets have not been normalized, which leads to a great difference in some languages where you would not expect them, for example between Serbian and Croatian. Chinese and Japanese have been skipped due to their huge number of ngrams (for the raw data, see the letter frequency corpus).
Note that this similarity does not mean that the languages are indeed similar in any other sense of similarity besides their simple letter frequency. This page makes no direct historical, cultural, or political statement.
There are also results based on the letter pairs and single letters. For comparison and evaluation, there is some scarce data collected from Ethnologue.
There is a separate page with more information on the underlying trigram, bigram, and unigram data.
You can download the complete similarity dataset (2 MB).
The Wikipedia dumps used where the most recent ones as downloaded on October 9 2012.
Created November 4 2012, Denny Vrandečić.