Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 12 Feb '18 - 19 Feb '18

Top 10

Wiki Total Female Female (%)
0 Western Panjabi 31 22 70.97
1 Catalan 331 173 52.27
2 Turkish 212 96 45.28
3 Indonesian 367 162 44.14
4 Lithuanian 32 12 37.50
5 Swedish 289 104 35.99
6 Scots 45 16 35.56
7 Spanish 913 319 34.94
8 Galician 52 18 34.62
9 Armenian 80 27 33.75

Bottom 10

Wiki Total Female Female (%)
49 Macedonian 35 -2 -5.71
48 Low Saxon 47 0 0.00
47 Tatar 223 9 4.04
46 Greek 157 7 4.46
45 Volapük 101 5 4.95
44 Chinese 766 44 5.74
43 Slovak 78 8 10.26
42 Thai 58 6 10.34
41 Japanese 251 30 11.95
40 Esperanto 57 8 14.04

All time, as of 19 Feb '18

Top 10

Wiki Total Female Female (%)
0 South Azerbaijani 27811 7330 26.36
1 Tagalog 20952 5429 25.91
2 Korean 82204 19003 23.12
3 Japanese 267601 59769 22.34
4 Norwegian (Bokmål) 146988 32466 22.09
5 Swedish 209583 42983 20.51
6 Persian 114040 23347 20.47
7 Simple English 35527 6840 19.25
8 Spanish 331579 63049 19.01
9 Serbian 37096 6943 18.72

Bottom 10

Wiki Total Female Female (%)
49 Tajik 47116 412 0.87
48 Malagasy 29151 1942 6.66
47 Latin 26727 2950 11.04
46 Belarusian 25130 2913 11.59
45 Greek 40831 5306 13.00
44 Hungarian 94729 12766 13.48
43 Ukrainian 143833 19932 13.86
42 Slovak 27355 3810 13.93
41 Russian 375941 52852 14.06
40 Malay 20171 2839 14.07