A calculation cost reduction method for a log-likelihood maximization in word2vec

Sakuya Nakamura, Masaomi Kimura

Research output: Conference contribution

Abstract

Word2vec models learn from text data and assign distributed representations to words. These representations are vectors that capture the meaning of the words, which makes word2vec models useful for Natural Language Processing (NLP). However, it is difficult to update a model when new data is added, because generating a word2vec model takes a long time. This computation time is an impediment to analyzing text data that contains many unknown words, and it is caused by the calculation of the likelihood function. The purpose of this study was to speed up the training of the Continuous Bag-of-Words model (CBOW), which is one of the word2vec models, by reducing the calculation cost of the likelihood function. The likelihood function in CBOW is expressed with a softmax function, which is computationally expensive. In this paper, a sigmoid function replaces the softmax function as the approximated likelihood function, because the sigmoid function can reproduce the characteristic change of the likelihood function in CBOW.
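The cost difference described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the approximated likelihood, so the version below is an assumption: it applies a sigmoid to the target word's score only, which avoids the sum over the whole vocabulary that the softmax needs. All function and variable names here are hypothetical and are not taken from the paper.

```python
import math
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def context_mean(input_vectors, context_ids):
    """CBOW hidden vector: the mean of the context words' input vectors."""
    dim = len(input_vectors[0])
    n = len(context_ids)
    return [sum(input_vectors[i][d] for i in context_ids) / n for d in range(dim)]


def softmax_log_likelihood(h, output_vectors, target_id):
    """Exact CBOW log-likelihood; scores every word, so cost grows with vocabulary size."""
    scores = [dot(h, v) for v in output_vectors]
    m = max(scores)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return scores[target_id] - log_z


def sigmoid_log_likelihood(h, output_vectors, target_id):
    """Assumed approximation: log sigmoid of the target score only (no vocabulary sum)."""
    s = dot(h, output_vectors[target_id])
    # Numerically stable log(sigmoid(s)).
    if s >= 0:
        return -math.log1p(math.exp(-s))
    return s - math.log1p(math.exp(s))


# Toy data: random vectors stand in for trained parameters.
rng = random.Random(0)
vocab_size, dim = 1000, 16
input_vectors = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(vocab_size)]
output_vectors = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(vocab_size)]

h = context_mean(input_vectors, [3, 7, 11, 42])
ll_softmax = softmax_log_likelihood(h, output_vectors, target_id=5)
ll_sigmoid = sigmoid_log_likelihood(h, output_vectors, target_id=5)
print(ll_softmax, ll_sigmoid)
```

In this sketch the softmax version computes one inner product per vocabulary word, while the sigmoid version computes a single inner product, which is where the saving would come from. The two values are not expected to be equal; whether the sigmoid form tracks the softmax likelihood well enough during training is the question the paper addresses.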

Original language: English
Title of host publication: ICAC 2019 - 2019 25th IEEE International Conference on Automation and Computing
Editor: Hui Yu
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781861376664
DOI
Publication status: Published - Sep 2019
Event: 25th IEEE International Conference on Automation and Computing, ICAC 2019 - Lancaster, United Kingdom
Duration: 5 Sep 2019 to 7 Sep 2019

Publication series

Name: ICAC 2019 - 2019 25th IEEE International Conference on Automation and Computing

Conference

Conference: 25th IEEE International Conference on Automation and Computing, ICAC 2019
Country/Territory: United Kingdom
City: Lancaster
Period: 5 Sep 2019 to 7 Sep 2019

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Control and Optimization
