Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 16 Apr '18 - 23 Apr '18

Top 10

Wiki Total Female Female (%)
0 Georgian 30 24 80.00
1 Finnish 262 193 73.66
2 Basque 30 17 56.67
3 Azerbaijani 50 24 48.00
4 Punjabi 45 19 42.22
5 Portuguese 192 79 41.15
6 Turkish 49 20 40.82
7 Catalan 131 53 40.46
8 Arabic 543 218 40.15
9 Thai 54 21 38.89

Bottom 10

Wiki Total Female Female (%)
49 Luxembourgish 27 -1 -3.70
48 Scots 356 5 1.40
47 Volapük 261 9 3.45
46 Telugu 40 2 5.00
45 Japanese 275 15 5.45
44 Urdu 29 2 6.90
43 Tatar 178 13 7.30
42 Danish 34 3 8.82
41 Dutch 206 24 11.65
40 Indonesian 192 25 13.02

All time, as of 23 Apr '18

Top 10

Wiki Total Female Female (%)
0 Welsh 21465 11461 53.39
1 South Azerbaijani 27867 7328 26.30
2 Tagalog 21121 5511 26.09
3 Korean 83161 19399 23.33
4 Japanese 269745 60310 22.36
5 Norwegian (Bokmål) 148958 33114 22.23
6 Persian 115938 24052 20.75
7 Swedish 211368 43495 20.58
8 Spanish 339902 67209 19.77
9 Simple English 36208 6989 19.30

Bottom 10

Wiki Total Female Female (%)
49 Tajik 47511 475 1.00
48 Malagasy 29751 1954 6.57
47 Latin 26813 2960 11.04
46 Belarusian 25650 3038 11.84
45 Greek 42114 5537 13.15
44 Hungarian 96063 13061 13.60
43 Slovak 27507 3846 13.98
42 Ukrainian 147743 20684 14.00
41 Russian 383351 54618 14.25
40 Slovenian 41618 6011 14.44