Word2vec is a toolkit launched by Google in 2013 by open source for obtaining word vector. It is simple and efficient, so it has aroused the attention of many people. Tomas Mikolov, the author of word2vec, did not talk about too many algorithm details in two related papers [3, 4], thus increasing the mystery of this toolkit to a certain extent. Some people who couldn't help but choose to take a look at the source code through anatomy. Out of curiosity, I became one of them. After reading the code, I felt that I had gained a lot. I wrote it into my documents and gave it to anyone who needed it.
Related Links
(1) contents and Preface
(2) Prerequisites
(3) Background Knowledge
(4) Hierarchical softmax-based model
(5) Negative sampling-based model
(6) several source code details
Author: peghoty
Source: http://blog.csdn.net/itplus/article/details/37969519
You are welcome to repost/share the article, but be sure to declare the source of the article.
Explanation of mathematical principles in word2vec (1) contents and Preface