Abstract
type:Article
WeChat is a social network application that connects people widely. Users generate a large amount of data when they hold conversations, and this data can be used to enhance their lives. This paper describes how this data is collected and how a personalized chatbot is developed from personal conversation records. The system has a cognitive map based on the word2vec model, which learns and stores the relationships among the words that appear in the chat records. Each word is mapped to a continuous high-dimensional vector space. The sequence-to-sequence (seq2seq) framework is then adopted to learn chatting styles from all pairs of chat sentences. The traditional one-hot embedding layer in the seq2seq model is replaced with our word2vec embedding layer. Furthermore, an autoencoder with a seq2seq architecture is trained to learn a vector representation of each sentence, so that the cosine similarity between the model-generated response and the pre-existing response in the test set can be evaluated, and the distance under a principal component analysis (PCA) projection can also be displayed. As a result, our word2vec-embedded seq2seq model significantly outperforms the one-hot-embedded one.
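The cosine-similarity evaluation mentioned in the abstract can be illustrated with a minimal sketch. It assumes that sentence vectors have already been produced by some encoder; the function names `cosine_similarity` and `mean_similarity` and the toy vectors are hypothetical and are not taken from the paper, which does not give its implementation here.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector scores 0.
        return 0.0
    return dot / (norm_u * norm_v)


def mean_similarity(generated, reference) -> float:
    """Average cosine similarity over paired sentence vectors."""
    if len(generated) != len(reference) or not generated:
        raise ValueError("need two non-empty lists of equal length")
    scores = [cosine_similarity(g, r) for g, r in zip(generated, reference)]
    return sum(scores) / len(scores)


# Toy sentence vectors standing in for encoder outputs (made-up numbers).
generated_vecs = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
reference_vecs = [[2.0, 0.0, 0.0], [0.0, 0.0, 3.0]]

if __name__ == "__main__":
    print(mean_similarity(generated_vecs, reference_vecs))
```

Under these assumptions, a higher mean score would indicate that generated responses lie closer in direction to the reference responses in the sentence-vector space.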
Published in
法政大学大学院紀要. 情報科学研究科編 14, 1-6, 2019-03-31
法政大学大学院情報科学研究科
Details
- CRID: 1390290699801598080
- NII Article ID: 120006714896
- NII Bibliographic ID: AA12746425
- ISSN: 24321192
- Web Site: http://hdl.handle.net/10114/00021920
- Text language code: en
- Data source type: JaLC, IRDB, CiNii Articles