Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 13 Aug '17 - 20 Aug '17

Top 10

Wiki Total Female Female (%)
0 Slovenian 64 63 98.44
1 Azerbaijani 64 34 53.12
2 Catalan 196 97 49.49
3 Danish 58 28 48.28
4 Armenian 51 22 43.14
5 Cantonese 36 14 38.89
6 Estonian 38 14 36.84
7 Turkish 55 20 36.36
8 English 970 332 34.23
9 Persian 175 53 30.29

Bottom 10

Wiki Total Female Female (%)
49 Tagalog 1150 16 1.39
48 Georgian 22 1 4.55
47 Belarusian 39 2 5.13
46 Scots 52 4 7.69
45 Egyptian Arabic 25 2 8.00
44 Hindi 59 5 8.47
43 Galician 31 3 9.68
42 South Azerbaijani 24 3 12.50
41 Hungarian 108 14 12.96
40 Bulgarian 74 10 13.51

All time, as of 20 Aug '17

Top 10

Wiki Total Female Female (%)
0 Welsh 18288 9450 51.67
1 Korean 77905 17854 22.92
2 Japanese 259989 58337 22.44
3 Norwegian (Bokmål) 144480 31906 22.08
4 Persian 107583 21989 20.44
5 Swedish 203920 41476 20.34
6 Simple English 33664 6457 19.18
7 Vietnamese 24750 4660 18.83
8 Chinese 136020 25214 18.54
9 Spanish 312352 57398 18.38

Bottom 10

Wiki Total Female Female (%)
49 Malagasy 29011 1930 6.65
48 Latin 26340 2880 10.93
47 Belarusian 23965 2729 11.39
46 Slovenian 39626 4557 11.50
45 Tamil 19071 2425 12.72
44 Greek 37410 4791 12.81
43 Hungarian 90441 12117 13.40
42 Ukrainian 130222 17775 13.65
41 Galician 28899 3987 13.80
40 Slovak 26962 3731 13.84