How accurate are gender detection tools in predicting the gender for Chinese names? A study with 20,000 given names in Pinyin format.

Publication information

J Med Libr Assoc. 2022 Apr 1;110(2):205-211. doi: 10.5195/jmla.2022.1289.

DOI: 10.5195/jmla.2022.1289

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9014919/

Abstract

OBJECTIVE

We recently showed that the gender detection tools NamSor, Gender API, and Wiki-Gendersort accurately predicted the gender of individuals with Western given names. Here, we aimed to evaluate the performance of these tools with Chinese given names in Pinyin format.

METHODS

We constructed two datasets for the purpose of the study. File #1 was created by randomly drawing 20,000 names from a gender-labeled database of 52,414 Chinese given names in Pinyin format. File #2, which contained 9,077 names, was created by removing from File #1 all unisex names that we were able to identify (i.e., those that were listed in the database as both male and female names). We recorded for both files the number of correct classifications (correct gender assigned to a name), misclassifications (wrong gender assigned to a name), and nonclassifications (no gender assigned). We then calculated the proportion of misclassifications and nonclassifications (errorCoded).
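The procedure described above can be sketched in Python. This is a minimal illustration, not the authors' code: it assumes the database is a list of `(name, gender)` pairs in which a unisex name appears once per gender, and that a tool is represented by a hypothetical `predict(name)` callable returning `"male"`, `"female"`, or `None`. The helper names `build_files` and `error_coded` and the toy data are invented for the example.

```python
import random
from collections import defaultdict


def build_files(database, sample_size, seed=0):
    """Draw a random sample (File #1) and drop unisex names from it (File #2).

    Assumption: `database` is a list of (name, gender) pairs, and a name
    counts as unisex when it is listed under both genders.
    """
    genders_by_name = defaultdict(set)
    for name, gender in database:
        genders_by_name[name].add(gender)

    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    file1 = rng.sample(database, sample_size)
    file2 = [(n, g) for n, g in file1 if len(genders_by_name[n]) == 1]
    return file1, file2


def error_coded(records, predict):
    """Proportion of misclassifications plus nonclassifications."""
    if not records:
        raise ValueError("records must not be empty")
    correct = wrong = unassigned = 0
    for name, gender in records:
        guess = predict(name)
        if guess is None:
            unassigned += 1  # nonclassification: no gender assigned
        elif guess == gender:
            correct += 1     # correct classification
        else:
            wrong += 1       # misclassification
    return (wrong + unassigned) / (correct + wrong + unassigned)


# Toy data, invented purely for illustration.
toy_database = [
    ("wei", "male"), ("wei", "female"),  # listed under both genders: unisex
    ("li na", "female"), ("jun", "male"), ("fang", "female"),
]
toy_predictions = {"li na": "female", "jun": "female"}  # "fang" gets no answer

file1, file2 = build_files(toy_database, sample_size=5)
print(error_coded(file1, toy_predictions.get))
print(error_coded(file2, toy_predictions.get))
```

Counting nonclassifications as errors means a tool cannot lower its errorCoded by declining to answer, which is why the metric suits comparing tools with very different rates of unassigned names.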

RESULTS

For File #1, errorCoded was 53% for NamSor, 65% for Gender API, and 90% for Wiki-Gendersort. For File #2, errorCoded was 43% for NamSor, 66% for Gender API, and 94% for Wiki-Gendersort.
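To give a sense of scale, the reported percentages can be converted into approximate numbers of names that were misclassified or left unclassified. The sketch below uses only the file sizes and percentages stated in the abstract. Because those percentages are rounded, the resulting counts are estimates, not figures from the study, and the function name `approx_error_count` is invented for the example.

```python
# File sizes and errorCoded percentages as reported in the abstract.
FILE_SIZES = {"File #1": 20_000, "File #2": 9_077}
ERROR_CODED_PCT = {
    "File #1": {"NamSor": 53, "Gender API": 65, "Wiki-Gendersort": 90},
    "File #2": {"NamSor": 43, "Gender API": 66, "Wiki-Gendersort": 94},
}


def approx_error_count(file_label, tool):
    """Estimate how many names were misclassified or unclassified.

    Only approximate: the abstract reports percentages rounded to integers.
    """
    n = FILE_SIZES[file_label]
    pct = ERROR_CODED_PCT[file_label][tool]
    return round(n * pct / 100)


for file_label, tools in ERROR_CODED_PCT.items():
    for tool in tools:
        print(file_label, tool, approx_error_count(file_label, tool))
```

Under these assumptions, even the best-performing tool on File #1 (NamSor) corresponds to roughly 10,600 of 20,000 names being wrong or unassigned.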

CONCLUSION

We found that all three gender detection tools inaccurately predicted the gender of individuals with Chinese given names in Pinyin format and therefore should not be used in this population.

Similar articles

Performance of gender detection tools: a comparative study of name-to-gender inference services.

J Med Libr Assoc. 2021 Jul 1;109(3):414-421. doi: 10.5195/jmla.2021.1185.

Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference.

J Med Libr Assoc. 2021 Oct 1;109(4):609-612. doi: 10.5195/jmla.2021.1252.

Erratum to "How accurate are gender detection tools in predicting the gender for Chinese names? A study with 20,000 given names in Pinyin format," 2022;110(2):205-11.

J Med Libr Assoc. 2022 Apr 1;110(2):E33. doi: 10.5195/jmla.2022.1544.

What Is the Performance of ChatGPT in Determining the Gender of Individuals Based on Their First and Last Names?

JMIR AI. 2024 Mar 13;3:e53656. doi: 10.2196/53656.

How well does NamSor perform in predicting the country of origin and ethnicity of individuals based on their first and last names?

PLoS One. 2023 Nov 16;18(11):e0294562. doi: 10.1371/journal.pone.0294562. eCollection 2023.

Are Accuracy Parameters Useful for Improving the Performance of Gender Detection Tools? A Comparative Study with Western and Chinese Names.

J Gen Intern Med. 2022 Nov;37(15):4024-4027. doi: 10.1007/s11606-022-07469-6. Epub 2022 Mar 15.

Difficult name, cold man: Chinese names, gender stereotypicality and trustworthiness.

Int J Psychol. 2021 Jun;56(3):349-360. doi: 10.1002/ijop.12727. Epub 2020 Dec 7.

Novel Evidence for the Increasing Prevalence of Unique Names in China: A Reply to Ogihara.

Front Psychol. 2021 Dec 6;12:731244. doi: 10.3389/fpsyg.2021.731244. eCollection 2021.

The Role of Semantic Gender in Name Comprehension: An Event-Related Potentials Study.

J Psycholinguist Res. 2020 Feb;49(1):175-185. doi: 10.1007/s10936-019-09677-4.

Cited by

Female first and senior authorship in high-impact critical care journals 2005-2024.

Crit Care. 2025 Sep 8;29(1):395. doi: 10.1186/s13054-025-05649-4.

Scientific publications that use promotional language in the abstract receive more citations and public attention.

Commun Psychol. 2025 Aug 5;3(1):118. doi: 10.1038/s44271-025-00293-8.

Comparative analysis of automatic gender detection from names: evaluating the stability and performance of ChatGPT, Namsor, and Gender-API.

PeerJ Comput Sci. 2024 Oct 17;10:e2378. doi: 10.7717/peerj-cs.2378. eCollection 2024.

Inferring gender from first names: Comparing the accuracy of Genderize, Gender API, and the gender R package on authors of diverse nationality.

PLOS Digit Health. 2024 Oct 29;3(10):e0000456. doi: 10.1371/journal.pdig.0000456. eCollection 2024 Oct.

What Is the Performance of ChatGPT in Determining the Gender of Individuals Based on Their First and Last Names?

JMIR AI. 2024 Mar 13;3:e53656. doi: 10.2196/53656.

How well does NamSor perform in predicting the country of origin and ethnicity of individuals based on their first and last names?

PLoS One. 2023 Nov 16;18(11):e0294562. doi: 10.1371/journal.pone.0294562. eCollection 2023.

A gender perspective on the global migration of scholars.

Proc Natl Acad Sci U S A. 2023 Mar 7;120(10):e2214664120. doi: 10.1073/pnas.2214664120. Epub 2023 Feb 27.

Scientific authorship by gender: trends before and during a global pandemic.

Humanit Soc Sci Commun. 2022;9(1):348. doi: 10.1057/s41599-022-01365-4. Epub 2022 Oct 4.
