Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 29 Sep '20 - 20 Oct '20

Top 75

Wiki Total Female Female (%)
0 Macedonian 353 320 90.65
1 Igbo 41 37 90.24
2 Yoruba 38 31 81.58
3 Kinyarwanda 20 14 70.00
4 Limburgish 3 2 66.67
5 Crimean Tatar 3 2 66.67
6 Scottish Gaelic 8 5 62.50
7 Catalan 733 448 61.12
8 Yiddish 2 1 50.00
9 Walloon 2 1 50.00
10 Egyptian Arabic 26685 12770 47.85
11 Sorani 61 29 47.54
12 Haitian 7 3 42.86
13 Basque 204 86 42.16
14 Bosnian 19 8 42.11
15 Lithuanian 51 21 41.18
16 Zazaki 71 29 40.85
17 Turkish 499 203 40.68
18 Hausa 42 17 40.48
19 Nepali 15 6 40.00
20 Welsh 35 14 40.00
21 Tajik 50 19 38.00
22 Malayalam 58 22 37.93
23 Marathi 152 57 37.50
24 Indonesian 586 219 37.37
25 Malay 97 36 37.11
26 Portuguese 282 102 36.17
27 Chuvash 31 11 35.48
28 French 1719 592 34.44
29 Afrikaans 509 172 33.79
30 Guarani 3 1 33.33
31 Alemannic 3 1 33.33
32 Shona 3 1 33.33
33 Somali 27 9 33.33
34 Lower Sorbian 3 1 33.33
35 Piedmontese 12 4 33.33
36 Zulu 3 1 33.33
37 Albanian 36 12 33.33
38 Romanian 100 32 32.00
39 Italian 1564 494 31.59
40 Vietnamese 496 150 30.24
41 Sindhi 10 3 30.00
42 Spanish 1532 450 29.37
43 Assamese 35 10 28.57
44 Armenian 377 106 28.12
45 Swedish 562 158 28.11
46 Dutch 691 194 28.08
47 Georgian 137 37 27.01
48 Bengali 344 91 26.45
49 Luxembourgish 42 11 26.19
50 Greek 311 81 26.05
51 English 4902 1277 26.05
52 Persian 760 196 25.79
53 Galician 126 32 25.40
54 Kabyle 4 1 25.00
55 Interlingua 12 3 25.00
56 Mazandarani 4 1 25.00
57 Slovenian 41 10 24.39
58 Belarusian 182 44 24.18
59 Slovak 29 7 24.14
60 Kurdish 50 12 24.00
61 Russian 1762 418 23.72
62 Korean 266 61 22.93
63 Czech 450 103 22.89
64 Azerbaijani 114 26 22.81
65 Hungarian 275 59 21.45
66 Aragonese 14 3 21.43
67 Bulgarian 146 31 21.23
68 Serbo-Croatian 19 4 21.05
69 Finnish 319 67 21.00
70 West Frisian 24 5 20.83
71 Mingrelian 10 2 20.00
72 Western Panjabi 249 49 19.68
73 Hindi 46 9 19.57
74 Hebrew 763 147 19.27

Bottom 75

Wiki Total Female Female (%)
149 Gujarati 3 -1 -33.33
148 Central_Bicolano 5 -1 -20.00
147 West Flemish 5 -1 -20.00
146 Faroese 5 -1 -20.00
145 Occitan 16 -3 -18.75
144 Min Nan 10 -1 -10.00
143 South Azerbaijani 14 -1 -7.14
142 Waray-Waray 16 -1 -6.25
141 Acehnese 8 0 0.00
140 Franco-Provençal/Arpitan 4 0 0.00
139 Banyumasan 2 0 0.00
138 Malagasy 6 0 0.00
137 Minangkabau 3 0 0.00
136 Bihari 5 0 0.00
135 Bavarian 5 0 0.00
134 Hakka 7 0 0.00
133 Ossetian 5 0 0.00
132 Northern Luri 3 0 0.00
131 Sardinian 6 0 0.00
130 Rusyn 3 0 0.00
129 Gova Konknni 3 0 0.00
128 North Frisian 3 0 0.00
127 Javanese 15 0 0.00
126 Kannada 24 0 0.00
125 Oriya 28 0 0.00
124 Swati 2 0 0.00
123 Turkmen 2 0 0.00
122 Maltese 3 0 0.00
121 Venetian 6 0 0.00
120 Kazakh 114 3 2.63
119 Chechen 32 1 3.12
118 Kirghiz 18 1 5.56
117 Lezgian 17 1 5.88
116 Irish 65 4 6.15
115 Classical Chinese 16 1 6.25
114 Latvian 79 5 6.33
113 Punjabi 14 1 7.14
112 Belarusian (Taraškievica) 27 2 7.41
111 Bashkir 302 23 7.62
110 Quechua 13 1 7.69
109 Estonian 109 9 8.26
108 Telugu 22 2 9.09
107 Japanese 1546 154 9.96
106 Mongolian 10 1 10.00
105 Cebuano 10 1 10.00
104 Cantonese 159 16 10.06
103 Ukrainian 937 98 10.46
102 Latin 38 4 10.53
101 Arabic 1588 179 11.27
100 Volapük 123 14 11.38
99 Thai 59 7 11.86
98 Esperanto 171 21 12.28
97 Wu 256 32 12.50
96 Interlingue 8 1 12.50
95 Low Saxon 8 1 12.50
94 Ido 23 3 13.04
93 Chinese 1743 246 14.11
92 Pashto 7 1 14.29
91 Danish 147 21 14.29
90 Tatar 47 7 14.89
89 Tuvan 67 10 14.93
88 Urdu 112 17 15.18
87 Lombard 13 2 15.38
86 Serbian 154 24 15.58
85 Icelandic 32 5 15.62
84 Akan 32 5 15.62
83 Norwegian (Nynorsk) 174 28 16.09
82 Simple English 519 85 16.38
81 Burmese 30 5 16.67
80 Breton 18 3 16.67
79 Croatian 53 9 16.98
78 Tamil 191 34 17.80
77 Corsican 11 2 18.18
76 Polish 676 124 18.34
75 German 2416 458 18.96

All time, as of 20 Oct '20

Top 75

Wiki Total Female Female (%)
0 Tuvan 1144 651 56.91
1 Maithili 3340 1801 53.92
2 Asturian 29051 14745 50.76
3 Welsh 25512 12870 50.45
4 Nepali 4691 1926 41.06
5 Punjabi 9807 3861 39.37
6 Emilian-Romagnol 1369 528 38.57
7 Afrikaans 38067 13884 36.47
8 Bihari 1087 383 35.23
9 Malayalam 15543 5238 33.70
10 Oriya 3090 1015 32.85
11 Assamese 1554 482 31.02
12 Interlingue 1122 344 30.66
13 Central_Bicolano 1795 543 30.25
14 Cebuano 1924 553 28.74
15 Scots 9184 2544 27.70
16 Urdu 15048 4142 27.53
17 Cantonese 19266 5287 27.44
18 Sundanese 1472 403 27.38
19 Sinhalese 1829 497 27.17
20 Bashkir 7807 2073 26.55
21 Vietnamese 63456 16740 26.38
22 Vepsian 820 216 26.34
23 Faroese 2456 645 26.26
24 Breton 13111 3372 25.72
25 Thai 24463 6271 25.63
26 Telugu 5992 1533 25.58
27 Korean 111499 28424 25.49
28 Kannada 4109 1036 25.21
29 Haitian 10820 2666 24.64
30 Sindhi 1140 278 24.39
31 Bengali 26798 6481 24.18
32 Norwegian (Bokmål) 171864 39947 23.24
33 Hindi 19642 4544 23.13
34 Armenian 49647 11327 22.82
35 West Frisian 11039 2502 22.67
36 Western Panjabi 7037 1514 21.51
37 Spanish 408182 87524 21.44
38 Min Nan 8549 1824 21.34
39 Swedish 239428 50900 21.26
40 Albanian 15206 3204 21.07
41 Japanese 342244 72083 21.06
42 Indonesian 83304 17090 20.52
43 Persian 164697 33530 20.36
44 Simple English 50488 10144 20.09
45 Finnish 156516 31127 19.89
46 Javanese 6280 1247 19.86
47 Hebrew 85383 16858 19.74
48 Interlingua 1690 330 19.53
49 Turkish 91922 17937 19.51
50 Zazaki 2289 446 19.48
51 Georgian 22023 4277 19.42
52 Estonian 49398 9585 19.40
53 Hausa 2219 430 19.38
54 Serbian 46257 8766 18.95
55 Mingrelian 2315 438 18.92
56 Alemannic 4181 789 18.87
57 Chinese 184734 34721 18.80
58 Sakha 2419 454 18.77
59 Azerbaijani 32741 6130 18.72
60 English 1757656 327087 18.61
61 Basque 42239 7832 18.54
62 Ilokano 1425 264 18.53
63 French 610225 113034 18.52
64 Romanian 57199 10585 18.51
65 Tagalog 16304 3009 18.46
66 Portuguese 233772 42831 18.32
67 Icelandic 9827 1798 18.30
68 Tamil 24771 4524 18.26
69 Galician 37707 6872 18.22
70 Kurdish 3391 616 18.17
71 Sardinian 1574 286 18.17
72 Samogitian 1240 225 18.15
73 Sorani 4773 866 18.14
74 Greek 55489 10049 18.11

Bottom 75

Wiki Total Female Female (%)
149 Tajik 26409 777 2.94
148 Min Dong 813 38 4.67
147 Classical Chinese 2570 138 5.37
146 Malagasy 30173 1986 6.58
145 Hakka 884 60 6.79
144 Piedmontese 3605 297 8.24
143 Chuvash 6432 560 8.71
142 Uzbek 6907 612 8.86
141 Yiddish 2738 264 9.64
140 Walloon 1220 120 9.84
139 Amharic 1494 150 10.04
138 Latin 28325 3122 11.02
137 Belarusian (Taraškievica) 11869 1310 11.04
136 Mazandarani 2369 263 11.10
135 Tatar 11181 1244 11.13
134 Sicilian 2872 357 12.43
133 Belarusian 31040 3935 12.68
132 Venetian 1263 163 12.91
131 Quechua 4912 643 13.09
130 Yoruba 7590 999 13.16
129 Kazakh 20308 2748 13.53
128 Sanskrit 1488 204 13.71
127 Egyptian Arabic 619793 85018 13.72
126 Esperanto 60115 8301 13.81
125 Lombard 2455 340 13.85
124 Waray-Waray 3744 526 14.05
123 Low Saxon 7333 1031 14.06
122 Ido 7831 1103 14.09
121 Scottish Gaelic 2350 335 14.26
120 Bavarian 1617 231 14.29
119 Slovak 29176 4255 14.58
118 Slovenian 43737 6467 14.79
117 Kirghiz 4666 690 14.79
116 Norwegian (Nynorsk) 30525 4536 14.86
115 Volapük 7119 1058 14.86
114 Hungarian 111336 16819 15.11
113 Russian 456343 69134 15.15
112 Mongolian 6057 918 15.16
111 Corsican 1050 162 15.43
110 West Flemish 1198 185 15.44
109 Kapampangan 1018 160 15.72
108 Occitan 9585 1510 15.75
107 Swahili 8016 1266 15.79
106 Italian 410005 65253 15.92
105 Pashto 1512 241 15.94
104 Croatian 34182 5475 16.02
103 South Azerbaijani 82005 13232 16.14
102 German 786244 126964 16.15
101 Arabic 461226 74588 16.17
100 Wu 7763 1258 16.21
99 Bosnian 7369 1201 16.30
98 Macedonian 16280 2656 16.31
97 Polish 366890 60202 16.41
96 Ukrainian 196462 32340 16.46
95 Limburgish 1204 201 16.69
94 Serbo-Croatian 34375 5759 16.75
93 Maltese 1101 185 16.80
92 Bulgarian 75904 12763 16.81
91 Aymara 1042 177 16.99
90 Upper Sorbian 1386 236 17.03
89 Danish 80677 13757 17.05
88 Aragonese 5554 951 17.12
87 Czech 121792 20958 17.21
86 Chechen 1354 234 17.28
85 Mirandese 1025 178 17.37
84 Gujarati 1857 325 17.50
83 Dutch 227037 40116 17.67
82 Lithuanian 33725 5966 17.69
81 Irish 10966 1946 17.75
80 Malay 27420 4890 17.83
79 Burmese 2696 481 17.84
78 Marathi 13481 2405 17.84
77 Latvian 24238 4339 17.90
76 Luxembourgish 13378 2400 17.94
75 Catalan 165596 29932 18.08