Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 07 Oct '19 - 14 Oct '19

Top 75

Wiki Total Female Female (%)
0 Luxembourgish 3 3 100.00
1 Low Saxon 1 1 100.00
2 Sesotho 2 2 100.00
3 Aragonese 1 1 100.00
4 Kurdish 1 1 100.00
5 Volapük 19 19 100.00
6 Punjabi 1 1 100.00
7 Icelandic 62 60 96.77
8 Crimean Tatar 8 7 87.50
9 Tajik 32 27 84.38
10 Chuvash 4 3 75.00
11 Welsh 13 9 69.23
12 Haitian 3 2 66.67
13 Interlingue 3 2 66.67
14 Samogitian 3 2 66.67
15 Greek 206 129 62.62
16 Khmer 5 3 60.00
17 Macedonian 27 15 55.56
18 Azerbaijani 9 5 55.56
19 Kannada 2 1 50.00
20 Slovak 14 7 50.00
21 Faroese 2 1 50.00
22 Breton 8 4 50.00
23 South Azerbaijani 4 2 50.00
24 Bosnian 2 1 50.00
25 Egyptian Arabic 16 7 43.75
26 Ukrainian 269 115 42.75
27 Malay 68 29 42.65
28 Korean 203 85 41.87
29 Galician 46 19 41.30
30 Polish 228 84 36.84
31 Basque 45 16 35.56
32 Scots 20 7 35.00
33 Arabic 340 118 34.71
34 Malayalam 30 10 33.33
35 Tatar 3 1 33.33
36 Burmese 6 2 33.33
37 Marathi 25 8 32.00
38 Georgian 22 7 31.82
39 Norwegian (Nynorsk) 19 6 31.58
40 Japanese 403 126 31.27
41 English 1642 509 31.00
42 Hungarian 149 46 30.87
43 Catalan 202 60 29.70
44 Chinese 441 130 29.48
45 Assamese 7 2 28.57
46 Finnish 107 30 28.04
47 French 518 145 27.99
48 Latvian 11 3 27.27
49 Italian 518 140 27.03
50 Vietnamese 26 7 26.92
51 Hindi 19 5 26.32
52 Romanian 31 8 25.81
53 Indonesian 105 27 25.71
54 Armenian 74 19 25.68
55 Swedish 157 40 25.48
56 Tamil 8 2 25.00
57 Ido 20 5 25.00
58 Northern Sami 4 1 25.00
59 Zulu 4 1 25.00
60 Spanish 400 96 24.00
61 Afrikaans 34 8 23.53
62 Portuguese 53 12 22.64
63 Hebrew 100 21 21.00
64 Norwegian (Bokmål) 158 33 20.89
65 Slovenian 15 3 20.00
66 Thai 27 5 18.52
67 Estonian 56 10 17.86
68 Danish 118 21 17.80
69 Simple English 90 16 17.78
70 Belarusian 18 3 16.67
71 Bulgarian 66 11 16.67
72 Irish 6 1 16.67
73 German 810 133 16.42
74 Persian 293 46 15.70

Bottom 75

Wiki Total Female Female (%)
149 Fula 0 0 0.00
148 Lingala 0 0 0.00
147 Karachay-Balkar 0 0 0.00
146 Samoan 0 0 0.00
145 Limburgish 1 0 0.00
144 Javanese 1 0 0.00
143 Bislama 1 0 0.00
142 Sakha 1 0 0.00
141 Latin 2 0 0.00
140 Waray-Waray 6 0 0.00
139 Somali 2 0 0.00
138 Saterland Frisian 2 0 0.00
137 North Frisian 2 0 0.00
136 Zazaki 2 0 0.00
135 Mingrelian 2 0 0.00
134 Asturian 23 0 0.00
133 Yiddish 2 0 0.00
132 Yoruba 2 0 0.00
131 Oriya 2 0 0.00
130 Akan 2 0 0.00
129 Telugu 3 0 0.00
128 Silesian 3 0 0.00
127 Kabyle 3 0 0.00
126 Hausa 12 0 0.00
125 Belarusian (Taraškievica) 4 0 0.00
124 Malagasy 4 0 0.00
123 Cebuano 4 0 0.00
122 Sorani 8 0 0.00
121 Newar / Nepal Bhasa 1 0 0.00
120 Uyghur 1 0 0.00
119 Kirghiz 2 0 0.00
118 Venetian 1 0 0.00
117 Rusyn 1 0 0.00
116 Gujarati 1 0 0.00
115 Lezgian 1 0 0.00
114 Sinhalese 1 0 0.00
113 Tetum 1 0 0.00
112 Gothic 1 0 0.00
111 Shona 1 0 0.00
110 Mongolian 1 0 0.00
109 Sindhi 1 0 0.00
108 Avar 1 0 0.00
107 Xhosa 1 0 0.00
106 Central_Bicolano 1 0 0.00
105 Lao 1 0 0.00
104 Maori 1 0 0.00
103 Bihari 1 0 0.00
102 Minangkabau 1 0 0.00
101 Ligurian 1 0 0.00
100 Pashto 1 0 0.00
99 Nauruan 1 0 0.00
98 Erzya 1 0 0.00
97 Min Nan 1 0 0.00
96 Albanian 18 1 5.56
95 Uzbek 18 1 5.56
94 Cantonese 17 1 5.88
93 Croatian 17 1 5.88
92 Wu 32 2 6.25
91 Western Panjabi 99 7 7.07
90 Kazakh 26 2 7.69
89 Alemannic 12 1 8.33
88 Lithuanian 12 1 8.33
87 Bengali 75 7 9.33
86 Serbian 51 5 9.80
85 Tagalog 10 1 10.00
84 Nepali 10 1 10.00
83 Czech 106 11 10.38
82 Quechua 9 1 11.11
81 Esperanto 41 5 12.20
80 Swahili 8 1 12.50
79 Bashkir 8 1 12.50
78 Urdu 8 1 12.50
77 Turkish 75 10 13.33
76 Dutch 160 24 15.00
75 Russian 395 62 15.70

All time, as of 14 Oct '19

Top 75

Wiki Total Female Female (%)
0 Maithili 3205 1749 54.57
1 Asturian 28010 14679 52.41
2 Welsh 24691 12528 50.74
3 Nepali 4308 1764 40.95
4 Punjabi 8316 3345 40.22
5 Egyptian Arabic 9114 3652 40.07
6 Emilian-Romagnol 1298 500 38.52
7 Bihari 1025 372 36.29
8 Afrikaans 31520 11222 35.60
9 Oriya 2579 914 35.44
10 Interlingue 919 284 30.90
11 Malayalam 14033 4330 30.86
12 Urdu 12964 3608 27.83
13 Assamese 1095 296 27.03
14 Sinhalese 1777 480 27.01
15 Vepsian 807 213 26.39
16 Faroese 2418 638 26.39
17 Thai 23140 6035 26.08
18 Telugu 5494 1420 25.85
19 Vietnamese 55773 14361 25.75
20 Breton 12631 3220 25.49
21 Cebuano 1743 442 25.36
22 Korean 93573 22932 24.51
23 Central_Bicolano 1580 381 24.11
24 Norwegian (Bokmål) 161717 37450 23.16
25 Hindi 17366 4017 23.13
26 Tagalog 18310 4221 23.05
27 West Frisian 10687 2418 22.63
28 Kannada 3737 834 22.32
29 Japanese 296845 66255 22.32
30 Western Panjabi 5034 1119 22.23
31 Bengali 20625 4549 22.06
32 Haitian 9119 1949 21.37
33 Cantonese 13244 2806 21.19
34 Min Nan 8341 1767 21.18
35 Swedish 225354 47587 21.12
36 Spanish 379083 78307 20.66
37 Albanian 14378 2960 20.59
38 Bashkir 5002 1023 20.45
39 Armenian 41240 8267 20.05
40 Scots 13333 2639 19.79
41 Persian 144886 28565 19.72
42 Javanese 5981 1179 19.71
43 Finnish 147204 28966 19.68
44 Hebrew 77147 15166 19.66
45 Sundanese 1043 204 19.56
46 Estonian 46779 9073 19.40
47 Zazaki 1440 279 19.38
48 Chinese 163347 31529 19.30
49 Simple English 42538 8189 19.25
50 South Azerbaijani 54519 10364 19.01
51 Serbian 42689 8055 18.87
52 Mingrelian 2157 407 18.87
53 Turkish 82521 15331 18.58
54 Indonesian 70568 13111 18.58
55 Romanian 54968 10007 18.21
56 Portuguese 223878 40486 18.08
57 English 1664081 300759 18.07
58 Interlingua 1472 265 18.00
59 Icelandic 9478 1704 17.98
60 French 578771 103599 17.90
61 Latvian 23016 4120 17.90
62 Azerbaijani 29743 5319 17.88
63 Irish 9901 1757 17.75
64 Sakha 2237 397 17.75
65 Lithuanian 32933 5787 17.57
66 Marathi 12410 2178 17.55
67 Ilokano 1385 243 17.55
68 Samogitian 1210 212 17.52
69 Luxembourgish 12833 2242 17.47
70 Mirandese 1010 176 17.43
71 Dutch 213923 37012 17.30
72 Aymara 979 169 17.26
73 Georgian 19681 3384 17.19
74 Galician 35068 6018 17.16

Bottom 75

Wiki Total Female Female (%)
149 Tajik 25695 592 2.30
148 Min Dong 807 38 4.71
147 Classical Chinese 2398 132 5.50
146 Hausa 1514 89 5.88
145 Malagasy 30019 1973 6.57
144 Hakka 861 60 6.97
143 Chuvash 6101 475 7.79
142 Piedmontese 3541 285 8.05
141 Yiddish 2469 217 8.79
140 Newar / Nepal Bhasa 802 72 8.98
139 Uzbek 5382 511 9.49
138 Amharic 1467 144 9.82
137 Tatar 10268 1012 9.86
136 Walloon 1101 109 9.90
135 Belarusian (Taraškievica) 11555 1259 10.90
134 Mazandarani 2353 259 11.01
133 Latin 27796 3063 11.02
132 Chechen 1161 138 11.89
131 Belarusian 28006 3344 11.94
130 Yoruba 7303 875 11.98
129 Sicilian 2862 356 12.44
128 Quechua 4186 530 12.66
127 Venetian 1212 156 12.87
126 Maltese 1026 134 13.06
125 Lombard 2257 307 13.60
124 Scottish Gaelic 2219 302 13.61
123 Sanskrit 1475 201 13.63
122 Esperanto 56754 7804 13.75
121 Kazakh 19379 2679 13.82
120 Sorani 4147 575 13.87
119 Waray-Waray 3571 497 13.92
118 Low Saxon 7093 1002 14.13
117 Ido 7226 1026 14.20
116 Slovak 28447 4043 14.21
115 Bavarian 1620 231 14.26
114 Macedonian 15037 2155 14.33
113 Kurdish 2788 402 14.42
112 Norwegian (Nynorsk) 29292 4243 14.49
111 Slovenian 42677 6213 14.56
110 Hungarian 104646 15327 14.65
109 Russian 424426 62445 14.71
108 Kirghiz 4510 665 14.75
107 Kapampangan 1000 148 14.80
106 Wu 3987 590 14.80
105 Greek 49622 7409 14.93
104 Mongolian 5948 893 15.01
103 West Flemish 1166 175 15.01
102 Ukrainian 174167 26919 15.46
101 Volapük 5489 850 15.49
100 Arabic 412805 64193 15.55
99 Swahili 6695 1044 15.59
98 Italian 382923 59784 15.61
97 Pashto 1349 211 15.64
96 Malay 24401 3834 15.71
95 Bosnian 7104 1122 15.79
94 Occitan 9287 1466 15.79
93 German 735544 116477 15.84
92 Gujarati 1664 267 16.05
91 Polish 346051 55563 16.06
90 Croatian 33151 5329 16.07
89 Corsican 949 154 16.23
88 Alemannic 3834 625 16.30
87 Catalan 152150 24981 16.42
86 Tamil 21994 3619 16.45
85 Aragonese 5341 881 16.50
84 Sardinian 1494 247 16.53
83 Danish 77673 12887 16.59
82 Basque 38812 6450 16.62
81 Czech 113686 18907 16.63
80 Bulgarian 72450 12082 16.68
79 Serbo-Croatian 33866 5666 16.73
78 Limburgish 1057 177 16.75
77 Burmese 2350 398 16.94
76 Upper Sorbian 1334 227 17.02
75 Sindhi 920 157 17.07