Emoji2Vec Visualization

Intro

The Emoji Cloud is a visualization of the semantic space of emojis, based on a recent paper at ICWSM. Embeddings of emojis are trained using LINE and the 3D layout is learned using LargeVis.

Word2vec is a group of related models that are used to produce word embeddings. Emoji2vec has the smilar idea, which use deep neural network to train the model and get the middle layer as the representation of word vector.

Word vectors are positioned in the vector space such that words that share common contexts in the corpus are located in close proximity to one another in the space. As emoji can be treated as word too, the model can be transformed to form the embedding of emoji.

Emoji Cloud

Since that emoji vector after trained have very high dimension,s the embedding needs to be reduced dimension, in which I used LargeVis, which is a method that use the random projection trees and other techniques to find the approximate representation of high dimensional vectors in low dimension. Here I set the dimension to 3.

The 3D-model visulization is created by Three.js. Which supports interaction and searching function. The model can be seen and played here.

发布于 2017-06-08