When I use the NLP API, in particular documents.classifyText, the text is classified under one of the categories listed here. My question is: do we know what was used to create these categories? Were they built from datasets or corpora such as Wikipedia, Gigaword, and Freebase? Do the Word2Vec term embeddings relate to the category embeddings at all? Any information, references, or resources would be greatly appreciated.