Gender by language

This plot shows the top 40 Wikipedia languages by number of gendered biographies and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff of 40 is arbitrary and was chosen so that the distribution across major Wikipedia languages stays readable. For comprehensive coverage, fetch the complete data from the data repository.
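The selection described above can be sketched in a few lines of Python. This is only an illustration, not the site's actual code: the names `LanguageStats` and `top_languages` are hypothetical, and the sample rows are copied from the "All time" tables on this page rather than fetched from the data repository.

```python
from typing import NamedTuple


class LanguageStats(NamedTuple):
    """One row per Wikipedia language (hypothetical structure)."""
    wiki: str
    total: int   # number of gendered biographies
    female: int  # number of those biographies marked female

    @property
    def female_pct(self) -> float:
        # Female share of gendered biographies, as a percentage.
        if self.total == 0:
            return 0.0
        return round(100 * self.female / self.total, 2)


def top_languages(rows, n=40):
    """Return the n languages with the most gendered biographies."""
    return sorted(rows, key=lambda r: r.total, reverse=True)[:n]


# Sample rows copied from the "All time" tables on this page.
SAMPLE = [
    LanguageStats("Tagalog", 20306, 5270),
    LanguageStats("Korean", 80725, 18666),
    LanguageStats("Japanese", 264749, 59391),
    LanguageStats("Tajik", 46957, 401),
]

# Each selected language gives one point: (total, female_pct).
for row in top_languages(SAMPLE, n=3):
    print(f"{row.wiki:<10} {row.total:>7} {row.female_pct:>6.2f}")
```

With the real data, `n=40` would reproduce the cutoff used for the plot; a smaller `n` is used here only because the sample has four rows.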

The changes show the number of biographies added between the two latest snapshots and the female percentage of those added biographies.
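As a rough sketch of how such a comparison could be computed, the Python below subtracts one snapshot from the next. It assumes each snapshot maps a language to a (total, female) pair; that layout, the function name `snapshot_changes`, and the snapshot totals are all made up for illustration. Only the resulting difference (207 added, 169 of them female) matches the Slovenian row in the Changes table, whose Female (%) figure is consistent with Female divided by Total.

```python
def snapshot_changes(old, new):
    """Compare two snapshots of {wiki: (total, female)} counts.

    Returns {wiki: (added_total, added_female, female_pct_of_added)}.
    The percentage is None when no biographies were added.
    """
    changes = {}
    for wiki, (new_total, new_female) in new.items():
        # A language missing from the older snapshot counts as zero.
        old_total, old_female = old.get(wiki, (0, 0))
        added_total = new_total - old_total
        added_female = new_female - old_female
        pct = (
            round(100 * added_female / added_total, 2)
            if added_total
            else None
        )
        changes[wiki] = (added_total, added_female, pct)
    return changes


# Hypothetical snapshot totals; only the differences mirror the table.
older = {"Slovenian": (1000, 200)}
newer = {"Slovenian": (1207, 369)}

print(snapshot_changes(older, newer))
```

Returning `None` when nothing was added avoids a division by zero and keeps "no change" distinct from "0% female".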

As of October 2014, about 99% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over a language's circle to view its details.

Changes 03 Dec '17 - 11 Dec '17

Top 10

#  Wiki       Total  Female  Female (%)
0  Slovenian    207     169       81.64
1  English     4097    2033       49.62
2  Bengali      115      44       38.26
3  Korean       684     257       37.57
4  Persian      256      94       36.72
5  Spanish     1961     700       35.70
6  Estonian     105      36       34.29
7  Tagalog       83      28       33.73
8  Turkish      197      61       30.96
9  Serbian      954     294       30.82

Bottom 10

#   Wiki                Total  Female  Female (%)
49  Amharic               226       3        1.33
48  Arabic                737      57        7.73
47  Kazakh               1553     148        9.53
46  Tatar                 307      35       11.40
45  Serbo-Croatian        226      29       12.83
44  Norwegian (Bokmål)    302      42       13.91
43  Afrikaans             163      23       14.11
42  Hungarian             273      39       14.29
41  Portuguese            559      80       14.31
40  Bulgarian            1018     149       14.64

All time

Top 10

#  Wiki                 Total  Female  Female (%)
0  Tagalog              20306    5270       25.95
1  Korean               80725   18666       23.12
2  Japanese            264749   59391       22.43
3  Norwegian (Bokmål)  146270   32248       22.05
4  Persian             112278   22838       20.34
5  Swedish             205968   41894       20.34
6  Simple English       34841    6696       19.22
7  Chinese             138949   26120       18.80
8  Vietnamese           25932    4860       18.74
9  Serbian              36643    6851       18.70

Bottom 10

#   Wiki         Total  Female  Female (%)
49  Tajik        46957     401        0.85
48  Malagasy     29114    1937        6.65
47  Latin        26549    2916       10.98
46  Belarusian   24816    2873       11.58
45  Tamil        19299    2503       12.97
44  Greek        39570    5180       13.09
43  Hungarian    91892   12386       13.48
42  Ukrainian   139723   19213       13.75
41  Slovak       27096    3753       13.85
40  Galician     29618    4139       13.97