Gender by language

This plot shows the top 40 Wikipedia languages, ranked by number of gendered biographies, and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff of 40 is arbitrary and was chosen to keep the distribution across the major Wikipedia languages readable. For complete coverage, fetch the full data from the data repository.
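
As a rough illustration of the selection described above, the following Python sketch ranks wikis by their number of gendered biographies, keeps the top n, and computes the female percentage for each. The names `WikiRow`, `female_pct`, and `top_languages` are hypothetical and are not taken from the project's code; the sample rows are copied from the all-time tables on this page.

```python
from __future__ import annotations

from typing import NamedTuple


class WikiRow(NamedTuple):
    wiki: str
    total: int   # biographies with a recorded gender
    female: int  # of those, biographies recorded as female


def female_pct(row: WikiRow) -> float:
    """Female share of gendered biographies, as a percentage."""
    return 100.0 * row.female / row.total if row.total else 0.0


def top_languages(rows: list[WikiRow], n: int = 40) -> list[WikiRow]:
    """Keep the n wikis with the most gendered biographies."""
    return sorted(rows, key=lambda r: r.total, reverse=True)[:n]


# A few rows copied from the all-time tables (as of 21 Jan '19).
rows = [
    WikiRow("Asturian", 27912, 14660),
    WikiRow("Korean", 87304, 20842),
    WikiRow("Spanish", 361773, 72943),
    WikiRow("Russian", 401288, 58030),
    WikiRow("Tajik", 25714, 505),
]

# n=3 only because the sample is tiny; the plot uses a cutoff of 40.
for row in top_languages(rows, n=3):
    print(f"{row.wiki:<10} {row.total:>7} {female_pct(row):6.2f}")
```

Each printed row corresponds to one circle in the plot: the total gives one coordinate and the female percentage the other.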

The changes show the number of biographies added and the female percentage change between the two latest snapshots.
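
The comparison between two snapshots can be sketched as follows. This is a minimal example, not the project's actual code: the `Snapshot` class and `change` function are hypothetical names, and the earlier snapshot is derived by subtracting the Ukrainian row of the changes table from its all-time row, which assumes the change counts are net additions.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    total: int   # gendered biographies
    female: int  # of those, biographies recorded as female

    @property
    def female_pct(self) -> float:
        return 100.0 * self.female / self.total if self.total else 0.0


def change(before: Snapshot, after: Snapshot) -> tuple[Snapshot, float]:
    """Return the biographies added and the percentage-point change."""
    added = Snapshot(after.total - before.total, after.female - before.female)
    return added, after.female_pct - before.female_pct


# Ukrainian Wikipedia: all-time figures as of 21 Jan '19, and the 14 Jan '19
# snapshot reconstructed from the changes table (assumption: net additions).
after = Snapshot(total=159953, female=23262)
before = Snapshot(total=159953 - 356, female=23262 - 36)

added, pp_change = change(before, after)
print(added.total, added.female, round(added.female_pct, 2))
print(f"{pp_change:+.4f} percentage points")
```

The sketch reports two distinct quantities: the female share among the newly added biographies, and the shift in the overall female percentage, which is much smaller because a week's additions are few compared with the all-time total.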

As of January 2016, about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Changes 14 Jan '19 - 21 Jan '19

Top 10

#   Wiki        Total  Female  Female (%)
0   Danish        260     204       78.46
1   Malay          51      35       68.63
2   Basque         66      44       66.67
3   Galician       58      32       55.17
4   Cantonese      34      18       52.94
5   Irish         132      66       50.00
6   Indonesian    168      75       44.64
7   Catalan       369     163       44.17
8   Tamil          42      17       40.48
9   Vietnamese    284     104       36.62

Bottom 10

#   Wiki               Total  Female  Female (%)
49  South Azerbaijani     72       0        0.00
48  Tatar                 40       0        0.00
47  Kirghiz              162       9        5.56
46  Low Saxon             58       4        6.90
45  Croatian              33       3        9.09
44  Western Panjabi       31       3        9.68
43  Ukrainian            356      36       10.11
42  Serbo-Croatian        38       4       10.53
41  Bulgarian            107      12       11.21
40  Persian              338      39       11.54

All time, as of 21 Jan '19

Top 10

#   Wiki                 Total  Female  Female (%)
0   Asturian             27912   14660       52.52
1   South Azerbaijani    30738    7700       25.05
2   Korean               87304   20842       23.87
3   Norwegian (Bokmål)  154027   34731       22.55
4   Japanese            278444   62473       22.44
5   Swedish             219133   45663       20.84
6   Spanish             361773   72943       20.16
7   Persian             130794   26000       19.88
8   Simple English       39592    7677       19.39
9   Hebrew               73059   14075       19.27

Bottom 10

#   Wiki         Total  Female  Female (%)
49  Tajik        25714     505        1.96
48  Malagasy     29933    1963        6.56
47  Latin        27378    3023       11.04
46  Belarusian   26733    3142       11.75
45  Greek        45889    6348       13.83
44  Hungarian    99778   13938       13.97
43  Slovak       27896    3941       14.13
42  Russian     401288   58030       14.46
41  Slovenian    42134    6122       14.53
40  Ukrainian   159953   23262       14.54