Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 21 May '17 - 28 May '17

Top 10

Wiki Total Female Female (%)
0 Hindi 75 70 93.33
1 Punjabi 53 46 86.79
2 Basque 70 58 82.86
3 Cantonese 23 12 52.17
4 Danish 47 22 46.81
5 Hebrew 132 52 39.39
6 Thai 98 35 35.71
7 Lao 23 8 34.78
8 Arabic 170 54 31.76
9 Hungarian 85 25 29.41

Bottom 10

Wiki Total Female Female (%)
49 Macedonian 31 0 0.00
48 Malay 842 9 1.07
47 Bihari 126 2 1.59
46 Tamil 40 1 2.50
45 Urdu 60 2 3.33
44 Ukrainian 299 26 8.70
43 Serbian 83 8 9.64
42 Japanese 131 13 9.92
41 Norwegian (Bokmål) 74 8 10.81
40 Italian 604 67 11.09

All time, as of 28 May '17

Top 10

Wiki Total Female Female (%)
0 Welsh 18064 9378 51.92
1 Korean 73712 17507 23.75
2 Thai 16260 3830 23.55
3 Japanese 257343 57547 22.36
4 Norwegian (Bokmål) 142330 31190 21.91
5 Swedish 201535 41167 20.43
6 Persian 105500 21470 20.35
7 Simple English 32968 6321 19.17
8 Vietnamese 24134 4542 18.82
9 Chinese 133950 24841 18.54

Bottom 10

Wiki Total Female Female (%)
49 Malagasy 28974 1927 6.65
48 Latin 26218 2854 10.89
47 Belarusian 23460 2622 11.18
46 Slovenian 39359 4432 11.26
45 Greek 36074 4470 12.39
44 Tamil 17257 2219 12.86
43 Hungarian 89215 11882 13.32
42 Ukrainian 126512 17146 13.55
41 Slovak 26755 3669 13.71
40 Galician 28566 3921 13.73