Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 19 Mar '17 - 26 Mar '17

Top 10

Wiki Total Female Female (%)
0 Punjabi 82 79 96.34
1 Malayalam 93 86 92.47
2 Oriya 39 34 87.18
3 Hindi 62 52 83.87
4 Maithili 36 29 80.56
5 Slovenian 56 44 78.57
6 Western Panjabi 54 39 72.22
7 Basque 51 28 54.90
8 Spanish 545 293 53.76
9 Indonesian 115 51 44.35

Bottom 10

Wiki Total Female Female (%)
49 Min Nan 45 3 6.67
48 Arabic 895 66 7.37
47 Scots 195 15 7.69
46 Thai 53 5 9.43
45 Portuguese 736 73 9.92
44 Cantonese 39 4 10.26
43 Latvian 58 6 10.34
42 Romanian 46 5 10.87
41 Hungarian 263 34 12.93
40 Ukrainian 244 35 14.34

All time, as of 26 Mar '17

Top 10

Wiki Total Female Female (%)
0 Welsh 17927 9361 52.22
1 Korean 72192 17060 23.63
2 Japanese 243342 54123 22.24
3 Norwegian (Bokmål) 140928 31052 22.03
4 Persian 101996 21044 20.63
5 Swedish 197663 40240 20.36
6 Simple English 32340 6208 19.20
7 Vietnamese 23232 4293 18.48
8 Chinese 130925 23869 18.23
9 Spanish 302319 54787 18.12

Bottom 10

Wiki Total Female Female (%)
49 Malagasy 28889 1927 6.67
48 Slovenian 38673 3880 10.03
47 Latin 26118 2826 10.82
46 Belarusian 22922 2503 10.92
45 Greek 34972 4182 11.96
44 Tamil 16890 2172 12.86
43 Hungarian 85320 11339 13.29
42 Ukrainian 123685 16592 13.41
41 Slovak 26400 3602 13.64
40 Galician 28227 3864 13.69