Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 02 Jul '18 - 09 Jul '18

Top 10

Wiki Total Female Female (%)
0 Egyptian Arabic 106 89 83.96
1 Welsh 60 41 68.33
2 Hebrew 117 44 37.61
3 Asturian 210 72 34.29
4 Korean 96 32 33.33
5 French 545 177 32.48
6 English 535 147 27.48
7 Italian 1272 336 26.42
8 Spanish 433 114 26.33
9 Catalan 176 45 25.57

Bottom 10

Wiki Total Female Female (%)
49 Belarusian (Taraškievica) 45 0 0.00
48 Kazakh 48 0 0.00
47 Serbian 90 2 2.22
46 Vietnamese 1399 38 2.72
45 Esperanto 74 4 5.41
44 Arabic 647 45 6.96
43 Uzbek 36 3 8.33
42 Serbo-Croatian 45 4 8.89
41 Latvian 42 4 9.52
40 Belarusian 50 5 10.00

All time, as of 16 Jul '18

Top 10

Wiki Total Female Female (%)
0 Welsh 21685 11561 53.31
1 Tagalog 20840 5293 25.40
2 South Azerbaijani 30311 7653 25.25
3 Korean 84131 19831 23.57
4 Norwegian (Bokmål) 150088 33561 22.36
5 Japanese 272000 60721 22.32
6 Swedish 213386 43958 20.60
7 Spanish 345675 68836 19.91
8 Persian 124737 24707 19.81
9 Simple English 37119 7147 19.25

Bottom 10

Wiki Total Female Female (%)
49 Tajik 47517 477 1.00
48 Malagasy 29908 1961 6.56
47 Latin 26977 2982 11.05
46 Belarusian 25929 3063 11.81
45 Greek 43147 5811 13.47
44 Hungarian 96944 13219 13.64
43 Slovak 27566 3869 14.04
42 Ukrainian 151109 21290 14.09
41 Russian 389932 55851 14.32
40 Slovenian 41820 6052 14.47