Gender by language

This plot shows the top 40 Wikipedia languages, ranked by number of gendered biographies, and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff is arbitrary and was chosen to keep the distribution across major Wikipedia languages easy to read. For comprehensive coverage, the complete data can be fetched from the data repository.
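
The selection described above can be sketched in a few lines of Python. This is only an illustration, not the code behind the plot: the function name `top_languages` and the tuple layout are assumptions, and the four sample rows are copied from the all-time tables on this page.

```python
# Rows are (wiki, total gendered biographies, female biographies).
# The values are copied from the all-time tables on this page.
ROWS = [
    ("Russian", 405419, 58926),
    ("Spanish", 366479, 74355),
    ("Japanese", 281473, 63276),
    ("Tajik", 25767, 509),
]


def top_languages(rows, n=40):
    """Keep the n largest wikis by total and attach the female percentage.

    Hypothetical helper: the cutoff n=40 mirrors the text, nothing more.
    """
    largest = sorted(rows, key=lambda r: r[1], reverse=True)[:n]
    return [
        (wiki, total, female, round(100 * female / total, 2))
        for wiki, total, female in largest
    ]


if __name__ == "__main__":
    for row in top_languages(ROWS, n=3):
        print(row)
```

Sorting by total before computing the percentage keeps the cutoff independent of the female share, so a small wiki with an unusual ratio cannot displace a large one.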

The changes show the number of biographies added and the female percentage change between the two latest snapshots.
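
One way to read the change tables is that each row is the difference between two snapshots, with Female (%) being the female share of the biographies added (for Afrikaans, 1205 of 1302 is 92.55%). The sketch below follows that reading. The function name `snapshot_changes` and the dictionary layout are assumptions, and the earlier snapshot values are not published on this page: they are back-calculated by subtracting the change rows from the all-time rows.

```python
# Snapshots map wiki -> (total gendered biographies, female biographies).
# LATER is copied from the all-time tables (18 Mar '19).
# EARLIER is an assumption: all-time values minus the weekly change rows.
EARLIER = {"Afrikaans": (21638, 5840), "Spanish": (365666, 73885)}
LATER = {"Afrikaans": (22940, 7045), "Spanish": (366479, 74355)}


def snapshot_changes(earlier, later):
    """Return wiki -> (added total, added female, female % of added)."""
    changes = {}
    for wiki, (total_new, female_new) in later.items():
        # A wiki missing from the earlier snapshot counts as starting at zero.
        total_old, female_old = earlier.get(wiki, (0, 0))
        added_total = total_new - total_old
        added_female = female_new - female_old
        # No share is defined when nothing was added.
        share = 100 * added_female / added_total if added_total else None
        changes[wiki] = (added_total, added_female, share)
    return changes


if __name__ == "__main__":
    for wiki, (total, female, share) in snapshot_changes(EARLIER, LATER).items():
        print(wiki, total, female, None if share is None else round(share, 2))
```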

As of January 2016, about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Changes 11 Mar '19 - 18 Mar '19

Top 10

Rank  Wiki                   Total  Female  Female (%)
0     Afrikaans               1302    1205       92.55
1     Malayalam                 76      70       92.11
2     Basque                    80      67       83.75
3     Vietnamese               199     166       83.42
4     Finnish                  218     165       75.69
5     Tamil                     83      62       74.70
6     Hindi                     73      46       63.01
7     Spanish                  813     470       57.81
8     Galician                  37      21       56.76
9     Norwegian (Nynorsk)       44      22       50.00

Bottom 10

Rank  Wiki                   Total  Female  Female (%)
49    Newar / Nepal Bhasa       93       1        1.08
48    Arabic                 21805     770        3.53
47    Wu                        55       2        3.64
46    Azerbaijani               46       2        4.35
45    Simple English            40       2        5.00
44    Danish                  2237     119        5.32
43    Welsh                     84       5        5.95
42    Estonian                  50       3        6.00
41    Malay                     51       5        9.80
40    Bulgarian                 65       7       10.77

All time, as of 18 Mar '19

Top 10

Rank  Wiki                   Total  Female  Female (%)
0     Asturian               27925   14661       52.50
1     Afrikaans              22940    7045       30.71
2     Korean                 88777   21302       23.99
3     Norwegian (Bokmål)    155026   34956       22.55
4     Japanese              281473   63276       22.48
5     Swedish               220418   46092       20.91
6     Spanish               366479   74355       20.29
7     Persian               133227   26417       19.83
8     Hebrew                 73903   14320       19.38
9     Simple English         39987    7729       19.33

Bottom 10

Rank  Wiki                   Total  Female  Female (%)
49    Tajik                  25767     509        1.98
48    Malagasy               29960    1965        6.56
47    Latin                  27498    3031       11.02
46    Belarusian             26967    3166       11.74
45    Arabic                240223   33421       13.91
44    Greek                  46625    6536       14.02
43    Hungarian             100786   14201       14.09
42    Slovak                 28076    3961       14.11
41    Slovenian              42285    6146       14.53
40    Russian               405419   58926       14.53