Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of October 2014 about 99% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 12 Mar '17 - 19 Mar '17

Top 10

Wiki Total Female Female (%)
0 Oriya 39 40 102.56
1 Malayalam 147 143 97.28
2 Maithili 66 63 95.45
3 Punjabi 74 69 93.24
4 Slovenian 113 102 90.27
5 Hindi 52 46 88.46
6 Nepali 31 20 64.52
7 Chinese 386 235 60.88
8 Estonian 46 27 58.70
9 Swedish 229 134 58.52

Bottom 10

Wiki Total Female Female (%)
49 Galician 32 1 3.12
48 Urdu 35 3 8.57
47 Greek 156 15 9.62
46 Polish 891 99 11.11
45 Arabic 449 56 12.47
44 Slovak 80 11 13.75
43 Portuguese 753 114 15.14
42 French 1188 186 15.66
41 German 811 129 15.91
40 Russian 548 94 17.15

All time

Top 10

Wiki Total Female Female (%)
0 Welsh 17915 9357 52.23
1 Korean 72024 17024 23.64
2 Japanese 243122 54076 22.24
3 Norwegian (Bokmål) 140849 31034 22.03
4 Persian 101646 20987 20.65
5 Swedish 197509 40191 20.35
6 Simple English 32280 6197 19.20
7 Vietnamese 23156 4273 18.45
8 Chinese 130774 23835 18.23
9 Serbian 32963 5965 18.10

Bottom 10

Wiki Total Female Female (%)
49 Malagasy 28883 1927 6.67
48 Slovenian 38617 3836 9.93
47 Latin 26111 2826 10.82
46 Belarusian 22862 2489 10.89
45 Greek 34860 4158 11.93
44 Tamil 16860 2169 12.86
43 Hungarian 85058 11305 13.29
42 Ukrainian 123442 16557 13.41
41 Slovak 26382 3597 13.63
40 Galician 28149 3841 13.65