Gender by language

This plot shows the top 40 Wikipedia languages, ranked by number of gendered biographies, and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff of 40 is arbitrary and was chosen so that the distribution across major Wikipedia languages can be visualized clearly. For comprehensive coverage, the complete data can be fetched from the data repository.
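As a rough illustration of the two quantities being compared, the sketch below computes the female percentage for a language and keeps the languages with the most gendered biographies. The function names `female_percentage` and `top_by_total` are hypothetical and not part of the project's code; the figures are copied from the tables on this page.

```python
# Hypothetical helpers for illustration; not the project's actual code.

def female_percentage(total, female):
    """Return the female share of gendered biographies, in percent."""
    return round(100.0 * female / total, 2)


def top_by_total(rows, n):
    """Keep the n languages with the most gendered biographies."""
    return sorted(rows, key=lambda row: row[1], reverse=True)[:n]


# (wiki, total gendered biographies, female biographies),
# copied from the tables on this page.
rows = [
    ("Asturian", 22931, 12559),
    ("Russian", 395415, 56777),
    ("Spanish", 354227, 70811),
    ("Tajik", 25623, 496),
]

for wiki, total, female in top_by_total(rows, 3):
    print(wiki, total, female_percentage(total, female))
```

With these four rows and a cutoff of 3, Asturian is dropped because it has the fewest gendered biographies, even though its female percentage is the highest. This is why the cutoff affects which languages appear at all.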

The changes show, for each language, the number of biographies added and the change in female percentage between the two latest snapshots.
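A minimal sketch of how such a change could be computed for one language follows. It assumes that the "female percentage change" is a difference in percentage points, which the page does not state. The function name `snapshot_change` is hypothetical, and the numbers are invented for illustration rather than taken from any real snapshot.

```python
# Hypothetical helper for illustration; not the project's actual code.

def snapshot_change(previous, latest):
    """Compare two (total, female) snapshots for one language.

    Returns (biographies added, change in female percentage points).
    """
    prev_total, prev_female = previous
    new_total, new_female = latest
    added = new_total - prev_total
    pct_change = 100.0 * new_female / new_total - 100.0 * prev_female / prev_total
    return added, round(pct_change, 2)


# Invented numbers, for illustration only.
previous = (1000, 150)   # 15% female
latest = (1100, 176)     # 16% female
print(snapshot_change(previous, latest))
```

Under this assumption, two identical snapshots give a change of zero on both measures.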

As of January 2016, about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

No changes were detected since last week, probably because Wikidata did not export its data as expected. See the "All time" statistics below for now.

All time, as of 15 Oct '18

Top 10

#   Wiki                 Total  Female  Female (%)
0   Asturian             22931   12559       54.77
1   Welsh                21846   11580       53.01
2   South Azerbaijani    30457    7672       25.19
3   Korean               86195   20497       23.78
4   Norwegian (Bokmål)  151576   34050       22.46
5   Japanese            275674   61760       22.40
6   Swedish             216750   44914       20.72
7   Spanish             354227   70811       19.99
8   Persian             128414   25445       19.81
9   Simple English       38606    7468       19.34

Bottom 10

#   Wiki                 Total  Female  Female (%)
49  Tajik                25623     496        1.94
48  Malagasy             29922    1964        6.56
47  Latin                27169    3003       11.05
46  Belarusian           26430    3102       11.74
45  Greek                44808    6095       13.60
44  Hungarian            98202   13608       13.86
43  Slovak               27717    3893       14.05
42  Russian             395415   56777       14.36
41  Slovenian            41978    6093       14.51
40  Ukrainian           155961   22678       14.54