Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of October 2014 about 99% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 10 Sep '18 - 17 Sep '18

Top 10

Wiki Total Female Female (%)
0 Thai 149 110 73.83
1 Turkish 98 63 64.29
2 Hindi 36 20 55.56
3 Chinese 187 103 55.08
4 Arabic 585 301 51.45
5 Romanian 34 17 50.00
6 Hebrew 75 37 49.33
7 Korean 120 59 49.17
8 Galician 57 28 49.12
9 Serbian 30 14 46.67

Bottom 10

Wiki Total Female Female (%)
49 Bashkir 26 1 3.85
48 Portuguese 194 17 8.76
47 Wu 44 6 13.64
46 Bulgarian 62 10 16.13
45 German 698 115 16.48
44 Spanish 813 142 17.47
43 Russian 320 59 18.44
42 Catalan 178 34 19.10
41 Urdu 68 13 19.12
40 Czech 117 25 21.37

All time

Top 10

Wiki Total Female Female (%)
0 Welsh 21818 11579 53.07
1 Asturian 21598 11459 53.06
2 South Azerbaijani 30426 7664 25.19
3 Korean 85791 20380 23.76
4 Norwegian (Bokmål) 151386 34001 22.46
5 Japanese 274747 61454 22.37
6 Swedish 216173 44798 20.72
7 Spanish 350403 70218 20.04
8 Persian 127748 25319 19.82
9 Simple English 38261 7398 19.34

Bottom 10

Wiki Total Female Female (%)
49 Tajik 33416 496 1.48
48 Malagasy 29914 1965 6.57
47 Latin 27093 2995 11.05
46 Belarusian 26281 3092 11.77
45 Greek 44328 6011 13.56
44 Hungarian 97691 13414 13.73
43 Slovak 27669 3887 14.05
42 Russian 394011 56481 14.33
41 Ukrainian 155228 22518 14.51
40 Slovenian 41934 6084 14.51