Gender by language

This plot shows the top 40 Wikipedia Languages (by number of gendered biographies), and compares their number of gendered biographies to female percentage of those biographies. This cutoff is arbitrary for the sake of clearly visualizing the distribution across major Wikipedia languages. For a comprehensive coverage, one can fetch the complete data from the data repository.

The changes show the number of biographies added and the female percentage change between the two latest snapshots.

As of January 2016 about 98% of biographies were attached to at least one Wikipedia site, so this data is mostly complete.

Hover over the language circle to view its details.

Changes 08 Jan '18 - 15 Jan '18

Top 10

Wiki Total Female Female (%)
0 Danish 56 41 73.21
1 Tamil 42 21 50.00
2 Asturian 85 41 48.24
3 Galician 37 17 45.95
4 Korean 303 134 44.22
5 Cantonese 54 23 42.59
6 Bengali 48 20 41.67
7 Tagalog 46 19 41.30
8 Estonian 42 17 40.48
9 Armenian 124 44 35.48

Bottom 10

Wiki Total Female Female (%)
49 Arabic 1425 22 1.54
48 Japanese 259 7 2.70
47 Polish 308 23 7.47
46 Javanese 59 5 8.47
45 Russian 589 54 9.17
44 Chinese 235 23 9.79
43 Italian 631 68 10.78
42 Latvian 101 11 10.89
41 Tatar 55 7 12.73
40 Hungarian 113 16 14.16

All time, as of 15 Jan '18

Top 10

Wiki Total Female Female (%)
0 South Azerbaijani 20492 7301 35.63
1 Tagalog 20782 5368 25.83
2 Korean 81219 18864 23.23
3 Japanese 265746 59542 22.41
4 Norwegian (Bokmål) 146563 32365 22.08
5 Swedish 207213 42313 20.42
6 Persian 112934 23010 20.37
7 Simple English 35170 6762 19.23
8 Spanish 326334 61426 18.82
9 Chinese 139881 26252 18.77

Bottom 10

Wiki Total Female Female (%)
49 Tajik 46975 406 0.86
48 Malagasy 29132 1937 6.65
47 Latin 26629 2932 11.01
46 Belarusian 24936 2887 11.58
45 Greek 40078 5231 13.05
44 Hungarian 92586 12453 13.45
43 Ukrainian 141604 19618 13.85
42 Slovak 27163 3765 13.86
41 Russian 372547 52343 14.05
40 Malay 19959 2806 14.06