Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 28 Jul '20 - 04 Aug '20

Top 75

Wiki Total Female Female (%)
0 Hausa 2 2 100.00
1 Malagasy 2 2 100.00
2 Kapampangan 7 7 100.00
3 Pangasinan 5 5 100.00
4 Malayalam 48 46 95.83
5 Cantonese 1295 1156 89.27
6 Vietnamese 530 433 81.70
7 Azerbaijani 218 168 77.06
8 Cornish 4 3 75.00
9 Korean 602 410 68.11
10 Minangkabau 3 2 66.67
11 Khmer 3 2 66.67
12 Sinhalese 3 2 66.67
13 Assamese 7 4 57.14
14 Waray-Waray 18 10 55.56
15 Scots 36 20 55.56
16 Punjabi 13 7 53.85
17 Sakha 2 1 50.00
18 Emilian-Romagnol 2 1 50.00
19 Occitan 2 1 50.00
20 Chichewa 4 2 50.00
21 Ligurian 2 1 50.00
22 Banjar 2 1 50.00
23 Kabyle 2 1 50.00
24 Scottish Gaelic 2 1 50.00
25 Nepali 16 8 50.00
26 Haitian 29 14 48.28
27 Aragonese 15 7 46.67
28 Sundanese 130 59 45.38
29 Spanish 532 233 43.80
30 Bengali 147 64 43.54
31 Yiddish 7 3 42.86
32 Danish 22 9 40.91
33 Galician 66 27 40.91
34 Greek 62 24 38.71
35 Indonesian 311 118 37.94
36 Catalan 201 76 37.81
37 Welsh 8 3 37.50
38 Albanian 8 3 37.50
39 Telugu 14 5 35.71
40 Malay 48 17 35.42
41 Italian 395 137 34.68
42 Lombard 3 1 33.33
43 Central_Bicolano 3 1 33.33
44 Basque 51 17 33.33
45 Irish 31 10 32.26
46 Yoruba 16 5 31.25
47 Latvian 10 3 30.00
48 Interlingua 10 3 30.00
49 Tatar 7 2 28.57
50 Latin 11 3 27.27
51 English 1974 506 25.63
52 Turkish 143 36 25.17
53 Asturian 4 1 25.00
54 Swedish 238 59 24.79
55 French 477 117 24.53
56 Wu 45 11 24.44
57 Hebrew 140 34 24.29
58 Arabic 302 73 24.17
59 Macedonian 35 8 22.86
60 Dutch 242 54 22.31
61 Urdu 27 6 22.22
62 Norwegian (Nynorsk) 41 9 21.95
63 Hindi 23 5 21.74
64 Japanese 576 122 21.18
65 Zazaki 38 8 21.05
66 Georgian 19 4 21.05
67 Chinese 173 36 20.81
68 Hungarian 134 27 20.15
69 Slovenian 10 2 20.00
70 Faroese 5 1 20.00
71 West Frisian 5 1 20.00
72 Swahili 20 4 20.00
73 Thai 30 6 20.00
74 Armenian 104 20 19.23

Bottom 75

Wiki Total Female Female (%)
149 Kirghiz 4 -1 -25.00
148 South Azerbaijani 7 -1 -14.29
147 Maori 1 0 0.00
146 Hawaiian 2 0 0.00
145 Venetian 2 0 0.00
144 Rusyn 2 0 0.00
143 Ido 12 0 0.00
142 Mongolian 3 0 0.00
141 Amharic 8 0 0.00
140 Dutch Low Saxon 3 0 0.00
139 Low Saxon 8 0 0.00
138 Sindhi 3 0 0.00
137 Gujarati 3 0 0.00
136 Javanese 8 0 0.00
135 Breton 3 0 0.00
134 Somali 7 0 0.00
133 Belarusian (Taraškievica) 3 0 0.00
132 Western Panjabi 3 0 0.00
131 Bosnian 6 0 0.00
130 Kazakh 5 0 0.00
129 Lao 4 0 0.00
128 Chechen 5 0 0.00
127 Chuvash 20 0 0.00
126 Limburgish 5 0 0.00
125 West Flemish 2 0 0.00
124 Tok Pisin 1 0 0.00
123 Pennsylvania German 1 0 0.00
122 Norman 1 0 0.00
121 Papiamentu 2 0 0.00
120 Cebuano 2 0 0.00
119 Uyghur 2 0 0.00
118 Turkmen 2 0 0.00
117 Mazandarani 2 0 0.00
116 Erzya 2 0 0.00
115 Friulian 2 0 0.00
114 Võro 2 0 0.00
113 Classical Chinese 2 0 0.00
112 Walloon 2 0 0.00
111 Northern Luri 2 0 0.00
110 Kinyarwanda 2 0 0.00
109 Kannada 2 0 0.00
108 Meadow Mari 2 0 0.00
107 Marathi 38 1 2.63
106 Afrikaans 170 6 3.53
105 Norwegian (Bokmål) 302 16 5.30
104 Uzbek 17 1 5.88
103 Croatian 14 1 7.14
102 Quechua 12 1 8.33
101 Estonian 34 3 8.82
100 Serbo-Croatian 11 1 9.09
99 Serbian 65 6 9.23
98 Bulgarian 54 5 9.26
97 Sorani 10 1 10.00
96 Belarusian 39 4 10.26
95 Lithuanian 9 1 11.11
94 Luxembourgish 9 1 11.11
93 Slovak 17 2 11.76
92 Ukrainian 364 47 12.91
91 Volapük 22 3 13.64
90 Egyptian Arabic 25021 3442 13.76
89 Finnish 84 12 14.29
88 Kurdish 7 1 14.29
87 Tajik 14 2 14.29
86 Esperanto 35 5 14.29
85 Russian 460 67 14.57
84 German 1101 167 15.17
83 Oriya 13 2 15.38
82 Burmese 19 3 15.79
81 Tamil 55 9 16.36
80 Portuguese 138 23 16.67
79 Romanian 30 5 16.67
78 Czech 133 23 17.29
77 Polish 259 45 17.37
76 Persian 286 51 17.83
75 Simple English 163 31 19.02

All time, as of 04 Aug '20

Top 75

Wiki Total Female Female (%)
0 Tuvan 1065 637 59.81
1 Maithili 3332 1800 54.02
2 Welsh 25359 12823 50.57
3 Asturian 34720 14959 43.08
4 Nepali 4616 1903 41.23
5 Punjabi 9727 3821 39.28
6 Emilian-Romagnol 1363 528 38.74
7 Afrikaans 36417 13380 36.74
8 Bihari 1077 381 35.38
9 Oriya 2999 1009 33.64
10 Malayalam 15197 5037 33.14
11 Assamese 1481 461 31.13
12 Central_Bicolano 1784 540 30.27
13 Interlingue 1084 326 30.07
14 Bashkir 6779 1974 29.12
15 Cebuano 1904 549 28.83
16 Urdu 14760 4091 27.72
17 Sundanese 1463 404 27.61
18 Sinhalese 1817 496 27.30
19 Cantonese 18055 4918 27.24
20 Vietnamese 61109 16378 26.80
21 Vepsian 818 216 26.41
22 Faroese 2449 645 26.34
23 Telugu 5900 1520 25.76
24 Breton 13053 3357 25.72
25 Korean 107297 27464 25.60
26 Thai 24430 6246 25.57
27 Kannada 4068 1031 25.34
28 Sindhi 1109 273 24.62
29 Haitian 10732 2618 24.39
30 Bengali 25715 6146 23.90
31 Norwegian (Bokmål) 167903 39552 23.56
32 Hindi 19413 4492 23.14
33 West Frisian 10971 2486 22.66
34 Armenian 47497 10749 22.63
35 Western Panjabi 6769 1459 21.55
36 Scots 14638 3132 21.40
37 Spanish 402385 85847 21.33
38 Min Nan 8517 1815 21.31
39 Swedish 237528 50353 21.20
40 Tagalog 17771 3760 21.16
41 Japanese 336997 71168 21.12
42 Albanian 15062 3164 21.01
43 Persian 161955 32923 20.33
44 Simple English 48780 9755 20.00
45 Finnish 155142 30879 19.90
46 Javanese 6240 1242 19.90
47 Hebrew 83530 16509 19.76
48 Indonesian 79072 15541 19.65
49 Estonian 48901 9525 19.48
50 Interlingua 1661 322 19.39
51 Turkish 89217 17264 19.35
52 Georgian 21594 4170 19.31
53 Serbian 45775 8701 19.01
54 Chinese 177798 33802 19.01
55 Zazaki 1832 345 18.83
56 Hausa 2114 397 18.78
57 Sakha 2413 453 18.77
58 Mingrelian 2275 427 18.77
59 English 1738627 322341 18.54
60 Azerbaijani 31864 5887 18.48
61 Ilokano 1424 263 18.47
62 Romanian 56839 10481 18.44
63 French 604810 111429 18.42
64 Icelandic 9747 1782 18.28
65 Basque 41712 7626 18.28
66 Alemannic 4137 755 18.25
67 Portuguese 232312 42369 18.24
68 Sardinian 1566 285 18.20
69 Samogitian 1237 225 18.19
70 Tamil 24191 4395 18.17
71 Galician 37188 6690 17.99
72 Latvian 23960 4309 17.98
73 Luxembourgish 13231 2372 17.93
74 Irish 10717 1920 17.92

Bottom 75

Wiki Total Female Female (%)
149 Tajik 26282 748 2.85
148 Min Dong 816 39 4.78
147 Classical Chinese 2527 137 5.42
146 Malagasy 30142 1983 6.58
145 Hakka 875 60 6.86
144 Piedmontese 3587 290 8.08
143 Uzbek 7010 580 8.27
142 Chuvash 6333 533 8.42
141 Yiddish 2650 247 9.32
140 Amharic 1492 150 10.05
139 Walloon 1183 119 10.06
138 Tatar 10643 1140 10.71
137 Belarusian (Taraškievica) 11809 1300 11.01
136 Latin 28246 3115 11.03
135 Mazandarani 2363 262 11.09
134 Egyptian Arabic 544328 63589 11.68
133 Belarusian 30310 3754 12.39
132 Sicilian 2870 357 12.44
131 Yoruba 7536 965 12.81
130 Quechua 4806 631 13.13
129 Venetian 1240 163 13.15
128 Sanskrit 1484 202 13.61
127 Kazakh 20012 2737 13.68
126 Esperanto 59477 8211 13.81
125 Lombard 2426 337 13.89
124 Scottish Gaelic 2336 326 13.96
123 Swahili 8554 1202 14.05
122 Low Saxon 7303 1028 14.08
121 Waray-Waray 3697 522 14.12
120 Ido 7761 1097 14.13
119 Bavarian 1609 231 14.36
118 Slovak 29045 4216 14.52
117 Macedonian 15797 2324 14.71
116 Slovenian 43601 6428 14.74
115 Norwegian (Nynorsk) 30180 4468 14.80
114 Kirghiz 4628 687 14.84
113 Hungarian 110370 16613 15.05
112 Volapük 6826 1029 15.07
111 Russian 451054 68113 15.10
110 Mongolian 6032 912 15.12
109 Corsican 1035 158 15.27
108 Pashto 1460 224 15.34
107 West Flemish 1184 182 15.37
106 Kurdish 3112 482 15.49
105 Kapampangan 1015 159 15.67
104 Italian 404913 63922 15.79
103 Occitan 9551 1509 15.80
102 Croatian 33982 5447 16.03
101 German 776025 125093 16.12
100 South Azerbaijani 81968 13229 16.14
99 Arabic 456864 73786 16.15
98 Bosnian 7322 1183 16.16
97 Polish 364363 59625 16.36
96 Wu 7055 1158 16.41
95 Limburgish 1194 197 16.50
94 Ukrainian 192762 31852 16.52
93 Serbo-Croatian 34290 5747 16.76
92 Bulgarian 75341 12665 16.81
91 Maltese 1097 185 16.86
90 Aymara 1035 176 17.00
89 Danish 80239 13665 17.03
88 Czech 120287 20574 17.10
87 Aragonese 5512 944 17.13
86 Upper Sorbian 1378 236 17.13
85 Mirandese 1020 177 17.35
84 Dutch 224412 39471 17.59
83 Lithuanian 33592 5913 17.60
82 Gujarati 1839 324 17.62
81 Sorani 4616 816 17.68
80 Malay 27126 4796 17.68
79 Catalan 162711 28792 17.70
78 Chechen 1295 230 17.76
77 Marathi 13112 2332 17.79
76 Burmese 2579 459 17.80
75 Greek 54377 9691 17.82