Gender by language

This plot shows the top 40 Wikipedia languages, ranked by number of gendered biographies, and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff is arbitrary and was chosen so that the distribution across major Wikipedia languages is clearly visible. For comprehensive coverage, the complete data can be fetched from the data repository.
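As a minimal sketch of the two quantities being compared, the snippet below computes the female percentage from a total and a female count, then applies a top-N cutoff by number of gendered biographies. The rows are copied from the All time tables on this page; the function names (`female_pct`, `top_by_total`, `rank_by_female_pct`) and the cutoff of 3 are illustrative assumptions, not the site's actual code.

```python
# Rows copied from the "All time" tables on this page:
# (wiki, total gendered biographies, female biographies)
ROWS = [
    ("Welsh", 18095, 9384),
    ("Korean", 74023, 17597),
    ("Japanese", 257755, 57617),
    ("Spanish", 309060, 56749),
    ("Malagasy", 28977, 1927),
]


def female_pct(total, female):
    """Female biographies as a percentage of all gendered biographies."""
    return 100.0 * female / total if total else 0.0


def top_by_total(rows, n):
    """Keep the n languages with the most gendered biographies (the cutoff)."""
    return sorted(rows, key=lambda r: r[1], reverse=True)[:n]


def rank_by_female_pct(rows):
    """Order languages from highest to lowest female percentage."""
    return sorted(rows, key=lambda r: female_pct(r[1], r[2]), reverse=True)


# Hypothetical cutoff of 3 for illustration; the plot uses a larger one.
for wiki, total, female in rank_by_female_pct(top_by_total(ROWS, 3)):
    print(f"{wiki:<10} {total:>7} {female:>6} {female_pct(total, female):6.2f}")
```

Applying the cutoff before ranking matters: a small wiki with a high female percentage (Welsh in this sample) drops out when only the largest languages are kept.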

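The changes show the number of biographies added and the change in female percentage between the two latest snapshots. In the Changes tables below, Female (%) is the female share of the biographies added during that interval.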
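A rough sketch of how such a change could be derived from two snapshots follows. The `Snapshot` class and `changes` function are hypothetical names introduced for illustration. The later figures are the Norwegian (Bokmål) All time row; the earlier figures are an assumption, back-computed by subtracting that language's Changes row on the premise that the All time table reflects the latest snapshot.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    total: int   # gendered biographies in this snapshot
    female: int  # female biographies in this snapshot

    @property
    def female_pct(self) -> float:
        return 100.0 * self.female / self.total if self.total else 0.0


def changes(earlier: Snapshot, later: Snapshot) -> dict:
    """Biographies added between two snapshots and the resulting shifts."""
    added_total = later.total - earlier.total
    added_female = later.female - earlier.female
    return {
        "added_total": added_total,
        "added_female": added_female,
        # Female share of the newly added biographies only.
        "added_female_pct": 100.0 * added_female / added_total if added_total else 0.0,
        # Shift in the overall female percentage, in percentage points.
        "pct_point_change": later.female_pct - earlier.female_pct,
    }


# Norwegian (Bokmål). ASSUMPTION: the "All time" row is the later snapshot,
# so the earlier snapshot is that row minus the "Changes" row (152 total, 14 female).
later = Snapshot(total=143524, female=31718)
earlier = Snapshot(total=143524 - 152, female=31718 - 14)

result = changes(earlier, later)
print(result)
```

The sketch keeps the two percentages separate because they answer different questions: the share among added biographies describes one week of activity, while the percentage-point change describes how much that activity moved the overall figure.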

As of October 2014, about 99% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Changes 11 Jun '17 - 18 Jun '17

Top 10

#   Wiki                  Total  Female  Female (%)
0   Punjabi                 151     135       89.40
1   Hindi                    60      42       70.00
2   Czech                   162     101       62.35
3   Basque                   62      27       43.55
4   Bengali                  24      10       41.67
5   Romanian                 36      14       38.89
6   Mirandese                21       8       38.10
7   Scots                    54      19       35.19
8   Turkish                  79      25       31.65
9   Esperanto                38      12       31.58

Bottom 10

#   Wiki                  Total  Female  Female (%)
49  West Frisian            362       0        0.00
48  Serbian                 122       4        3.28
47  Macedonian               23       1        4.35
46  Belarusian               33       2        6.06
45  Min Nan                 109       7        6.42
44  Amharic                  44       3        6.82
43  Tamil                    36       3        8.33
42  Norwegian (Bokmål)      152      14        9.21
41  Ukrainian               234      22        9.40
40  Catalan                 211      20        9.48

All time

Top 10

#   Wiki                  Total  Female  Female (%)
0   Welsh                 18095    9384       51.86
1   Korean                74023   17597       23.77
2   Japanese             257755   57617       22.35
3   Norwegian (Bokmål)   143524   31718       22.10
4   Swedish              202070   41255       20.42
5   Persian              106070   21572       20.34
6   Simple English        33121    6351       19.18
7   Vietnamese            24316    4567       18.78
8   Chinese              134408   24950       18.56
9   Spanish              309060   56749       18.36

Bottom 10

#   Wiki                  Total  Female  Female (%)
49  Malagasy              28977    1927        6.65
48  Latin                 26262    2866       10.91
47  Slovenian             39428    4458       11.31
46  Belarusian            23771    2715       11.42
45  Greek                 36422    4554       12.50
44  Tamil                 17323    2230       12.87
43  Hungarian             89526   11948       13.35
42  Ukrainian            127385   17261       13.55
41  Slovak                26803    3675       13.71
40  Galician              28687    3948       13.76