Gender by language

This plot shows the top 40 Wikipedia languages (by number of gendered biographies) and compares each language's number of gendered biographies with the female percentage of those biographies. The cutoff of 40 is arbitrary and was chosen to keep the distribution across major Wikipedia languages easy to read. For comprehensive coverage, the complete data can be fetched from the data repository.
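The selection described above can be sketched in a few lines. This is only an illustration, not the site's actual code: the function name `top_languages` and the `(wiki, total, female)` row shape are assumptions, and the sample rows are copied from the all-time table on this page.

```python
def top_languages(rows, n=40):
    """Rank wikis by gendered-biography count and attach the female percentage.

    `rows` is a list of (wiki, total, female) tuples; this shape is an
    assumption made for the sketch, not the repository's format.
    """
    # Sort by total gendered biographies, largest first, and keep the top n.
    ranked = sorted(rows, key=lambda r: r[1], reverse=True)[:n]
    # Female (%) = female biographies / gendered biographies * 100.
    return [(wiki, total, round(100 * female / total, 2))
            for wiki, total, female in ranked]


# Four rows copied from the all-time table on this page.
sample = [
    ("Tagalog", 20074, 5186),
    ("Korean", 79900, 18386),
    ("Japanese", 263102, 58928),
    ("Spanish", 318537, 59100),
]

top2 = top_languages(sample, n=2)
print(top2)  # [('Spanish', 318537, 18.55), ('Japanese', 263102, 22.4)]
```

Changing `n` moves the cutoff, which is why the choice of 40 affects only what is displayed and not the underlying data.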

The changes show the number of biographies added and the female percentage change between the two latest snapshots.
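As a rough sketch of how such change figures could be derived, the snippet below subtracts two snapshots and computes the female share of the added biographies. It assumes that Female (%) in the Changes tables is the female count divided by the total count of the additions, which matches the published rows (for example 67 / 75 = 89.33). The function name `snapshot_change` is hypothetical, and the older Slovenian counts are back-computed from this page's tables rather than published figures.

```python
def snapshot_change(old, new):
    """Return (total added, female added, female % of the additions).

    `old` and `new` are (total, female) count pairs for one language in two
    consecutive snapshots; this input shape is an assumption.
    """
    total_added = new[0] - old[0]
    female_added = new[1] - old[1]
    # A negative female_added (net removals) gives a negative percentage.
    # With no net change in the total, the percentage is undefined.
    pct = round(100 * female_added / total_added, 2) if total_added else None
    return total_added, female_added, pct


# Slovenian: all-time counts as of 20 Nov '17 are from this page; the older
# pair is back-computed by subtracting the published change (75 total, 67 female).
slovenian = snapshot_change(old=(40765, 5573), new=(40840, 5640))
print(slovenian)  # (75, 67, 89.33)
```

The same arithmetic explains how a row can show a negative percentage: a net loss of one female biography against 22 added in total gives -4.55.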

As of January 2016, about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 12 Nov '17 - 20 Nov '17

Top 10

Rank  Wiki            Total  Female  Female (%)
0     Slovenian          75      67       89.33
1     Serbian            91      72       79.12
2     English          1844     832       45.12
3     Tagalog           111      48       43.24
4     Finnish            55      20       36.36
5     Bengali            81      28       34.57
6     Turkish            61      21       34.43
7     Simple English     67      23       34.33
8     Hebrew             91      29       31.87
9     Azerbaijani        62      19       30.65

Bottom 10

Rank  Wiki        Total  Female  Female (%)
49    Scots          22      -1       -4.55
48    Hausa          24       0        0.00
47    Vietnamese    104       5        4.81
46    Basque         51       3        5.88
45    Romanian       40       4       10.00
44    Portuguese    197      25       12.69
43    Swedish       240      31       12.92
42    Armenian       84      11       13.10
41    Ukrainian     248      33       13.31
40    Galician      167      23       13.77

All time, as of 20 Nov '17

Top 10

Rank  Wiki                 Total   Female  Female (%)
0     Tagalog              20074     5186       25.83
1     Korean               79900    18386       23.01
2     Japanese            263102    58928       22.40
3     Norwegian (Bokmål)  145827    32181       22.07
4     Swedish             205324    41725       20.32
5     Persian             111877    22708       20.30
6     Simple English       34544     6637       19.21
7     Chinese             138412    26035       18.81
8     Vietnamese           25639     4821       18.80
9     Spanish             318537    59100       18.55

Bottom 10

Rank  Wiki         Total   Female  Female (%)
49    Tajik        46922      380        0.81
48    Malagasy     29108     1936        6.65
47    Latin        26499     2906       10.97
46    Belarusian   24316     2771       11.40
45    Tamil        19265     2491       12.93
44    Greek        39148     5086       12.99
43    Hungarian    91520    12331       13.47
42    Ukrainian   135526    18471       13.63
41    Slovenian    40840     5640       13.81
40    Slovak       27073     3747       13.84