Quantifying and Debiasing Gender Bias in Japanese Gender-specific Words with Word Embedding

Leisi Chen, Toru Sugimoto

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Machine Learning is playing a significant role in modern life. However, the problem that Machine Learning has biases and stereotypes has also drawn the researcher's attention. Word2Vec, a popular framework in the NLP field to encode the word's meaning as a real-valued vector, has been used in many machine learning and natural language processing tasks. Still, it also has been proved that it contains severe biases toward women. In this paper, we used Word2Vec to analyze the relationship between gender-specific words and personality adjectives in Japanese to Figure out the latent gender bias in those gender-specific words. We first found that the Word2Vec model trained by Japanese Wikipedia data shows that some occupation gender-specific words strongly connect with negative personality adjectives. The experiment results reflect that people commonly use these gender-specific words to criticize women in these specific occupations. Then we eliminated the projection of word vectors of personality adjectives on the gender subspace and reduced the relationship between negative personality adjectives and gender-specific words by word vector calculation.

Original languageEnglish
Title of host publication2022 Joint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665499248
DOIs
Publication statusPublished - 2022
EventJoint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2022 - Ise, Japan
Duration: 2022 Nov 292022 Dec 2

Publication series

Name2022 Joint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2022

Conference

ConferenceJoint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2022
Country/TerritoryJapan
CityIse
Period22/11/2922/12/2

Keywords

  • Machine Learning
  • NLP
  • Word2Vec
  • gender bias

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Control and Optimization
  • Modelling and Simulation
  • Numerical Analysis

Fingerprint

Dive into the research topics of 'Quantifying and Debiasing Gender Bias in Japanese Gender-specific Words with Word Embedding'. Together they form a unique fingerprint.

Cite this