TY - GEN
T1 - A calculation cost reduction method for a log-likelihood maximization in word2vec
AU - Nakamura, Sakuya
AU - Kimura, Masaomi
N1 - Publisher Copyright:
© 2019 Chinese Automation and Computing Society in the UK-CACSUK.
PY - 2019/9
Y1 - 2019/9
N2 - Word2vec models learn text data and provide distributed representations to words. The distributed representations use vectors which show the meaning of the words. Thus the word2vec models are useful for Natural Language Processing (NLP). However, it is difficult to update the models for new data addition because it takes a long time to generate the word2vec model. This calculation time has become an impediment to analize text data which contains a lot of unknown words. This is caused by computational time in the calculation of the likelihood function. The purpose of this study was to speed up the training of Continuous Bag-of-Word Model(CBOW), which is one of the word2vec models, by reducing the calculation cost of the likelihood function. The likelihood function in CBOW has been expressed by the use of a softmax function and has a huge amount of computational time. In this paper, a sigmoid function replaces the softmax function as the approximated likelihood function, because the sigmoid function can reproduce the charactaristic change of the likelihood function in CBOW.
AB - Word2vec models learn text data and provide distributed representations to words. The distributed representations use vectors which show the meaning of the words. Thus the word2vec models are useful for Natural Language Processing (NLP). However, it is difficult to update the models for new data addition because it takes a long time to generate the word2vec model. This calculation time has become an impediment to analize text data which contains a lot of unknown words. This is caused by computational time in the calculation of the likelihood function. The purpose of this study was to speed up the training of Continuous Bag-of-Word Model(CBOW), which is one of the word2vec models, by reducing the calculation cost of the likelihood function. The likelihood function in CBOW has been expressed by the use of a softmax function and has a huge amount of computational time. In this paper, a sigmoid function replaces the softmax function as the approximated likelihood function, because the sigmoid function can reproduce the charactaristic change of the likelihood function in CBOW.
KW - CBOW
KW - Component
KW - Computational time
KW - Softmax
KW - Training acceleration
KW - Word2Vec
UR - http://www.scopus.com/inward/record.url?scp=85075783079&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85075783079&partnerID=8YFLogxK
U2 - 10.23919/IConAC.2019.8895214
DO - 10.23919/IConAC.2019.8895214
M3 - Conference contribution
AN - SCOPUS:85075783079
T3 - ICAC 2019 - 2019 25th IEEE International Conference on Automation and Computing
BT - ICAC 2019 - 2019 25th IEEE International Conference on Automation and Computing
A2 - Yu, Hui
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Automation and Computing, ICAC 2019
Y2 - 5 September 2019 through 7 September 2019
ER -