Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 08 Jan '17 - 15 Jan '17

Top 10

Wiki Total Female Female (%)
0 Arabic 531 301 56.69
1 Korean 158 68 43.04
2 Finnish 67 24 35.82
3 Romanian 104 36 34.62
4 Azerbaijani 81 27 33.33
5 Bengali 36 12 33.33
6 Persian 273 86 31.50
7 Serbian 32 10 31.25
8 Swahili 39 12 30.77
9 Danish 70 20 28.57

Bottom 10

Wiki Total Female Female (%)
49 South Azerbaijani 686 2 0.29
48 Chuvash 359 2 0.56
47 Hakka 41 1 2.44
46 Sicilian 86 5 5.81
45 Belarusian 38 3 7.89
44 Latvian 37 3 8.11
43 Basque 59 6 10.17
42 Hungarian 82 9 10.98
41 Simple English 53 6 11.32
40 Russian 532 62 11.65

All time, as of 15 Jan '17

Top 10

Wiki Total Female Female (%)
0 Welsh 17484 9324 53.33
1 Korean 63127 15835 25.08
2 Japanese 240153 53494 22.27
3 Norwegian (Bokmål) 139575 30697 21.99
4 Persian 99764 20695 20.74
5 Swedish 196148 39766 20.27
6 Simple English 31527 6080 19.29
7 Vietnamese 22596 4111 18.19
8 Chinese 127539 23170 18.17
9 Serbian 32633 5893 18.06

Bottom 10

Wiki Total Female Female (%)
49 Malagasy 28770 1920 6.67
48 Slovenian 38170 3580 9.38
47 Belarusian 22472 2411 10.73
46 Latin 26017 2813 10.81
45 Greek 33843 4040 11.94
44 Tamil 16704 2154 12.90
43 Hungarian 83989 11135 13.26
42 Ukrainian 120634 16122 13.36
41 Galician 27752 3767 13.57
40 Basque 25440 3451 13.57