Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 17 Sep '18 - 24 Sep '18

Top 10

Wiki Total Female Female (%)
0 Vietnamese 118 92 77.97
1 Asturian 104 79 75.96
2 Punjabi 30 17 56.67
3 Tamil 28 15 53.57
4 Volapük 30 14 46.67
5 Latvian 30 14 46.67
6 Thai 168 77 45.83
7 Malayalam 58 26 44.83
8 English 1036 454 43.82
9 French 397 145 36.52

Bottom 10

Wiki Total Female Female (%)
49 Min Nan 56 -1 -1.79
48 Hungarian 85 0 0.00
47 Arabic 9232 16 0.17
46 Belarusian 29 1 3.45
45 Kannada 28 1 3.57
44 Estonian 31 2 6.45
43 Finnish 37 3 8.11
42 Bulgarian 54 5 9.26
41 Serbian 32 3 9.38
40 Catalan 149 14 9.40

All time, as of 24 Sep '18

Top 10

Wiki Total Female Female (%)
0 Asturian 21702 11538 53.17
1 Welsh 21825 11576 53.04
2 South Azerbaijani 30430 7664 25.19
3 Korean 85860 20392 23.75
4 Norwegian (Bokmål) 151438 34008 22.46
5 Japanese 275031 61543 22.38
6 Swedish 216288 44817 20.72
7 Spanish 351460 70356 20.02
8 Persian 127899 25341 19.81
9 Simple English 38354 7417 19.34

Bottom 10

Wiki Total Female Female (%)
49 Tajik 29295 495 1.69
48 Malagasy 29917 1963 6.56
47 Latin 27107 2996 11.05
46 Belarusian 26311 3093 11.76
45 Greek 44434 6031 13.57
44 Hungarian 97776 13414 13.72
43 Slovak 27678 3887 14.04
42 Russian 394436 56567 14.34
41 Slovenian 41943 6084 14.51
40 Ukrainian 155435 22576 14.52