Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 11 Jun '18 - 18 Jun '18

Top 10

Wiki Total Female Female (%)
0 Marathi 28 25 89.29
1 Galician 77 54 70.13
2 Bulgarian 161 92 57.14
3 English 617 245 39.71
4 Italian 553 213 38.52
5 Hebrew 85 32 37.65
6 Spanish 582 219 37.63
7 Vietnamese 170 63 37.06
8 French 460 169 36.74
9 Malayalam 34 12 35.29

Bottom 10

Wiki Total Female Female (%)
49 Latvian 65 1 1.54
48 Uzbek 38 1 2.63
47 Quechua 37 1 2.70
46 Serbo-Croatian 36 1 2.78
45 Danish 54 2 3.70
44 Irish 46 3 6.52
43 Bengali 135 13 9.63
42 Chinese 210 21 10.00
41 Esperanto 36 4 11.11
40 Persian 534 66 12.36

All time, as of 18 Jun '18

Top 10

Wiki Total Female Female (%)
0 Welsh 21609 11513 53.28
1 Tagalog 20813 5285 25.39
2 South Azerbaijani 30250 7649 25.29
3 Korean 83845 19703 23.50
4 Japanese 271142 60584 22.34
5 Norwegian (Bokmål) 149596 33316 22.27
6 Swedish 212783 43828 20.60
7 Spanish 344339 68474 19.89
8 Persian 122965 24321 19.78
9 Simple English 36862 7097 19.25

Bottom 10

Wiki Total Female Female (%)
49 Tajik 47524 476 1.00
48 Malagasy 29907 1962 6.56
47 Latin 26922 2978 11.06
46 Belarusian 25825 3057 11.84
45 Greek 42918 5761 13.42
44 Hungarian 96740 13177 13.62
43 Slovak 27544 3862 14.02
42 Ukrainian 150215 21128 14.07
41 Russian 386313 55257 14.30
40 Slovenian 41766 6043 14.47