Our ESR Shaheen Syed has had a paper accepted at the 4th IEEE International Conference on Data Science and Advanced Analytics. He will present the paper, entitled “Full-Text or Abstract? Examining Topic Coherence Scores Using Latent Dirichlet Allocation”, at the conference, which will be held in Tokyo on 19-21 October 2017.
The paper examines how different types of textual data, specifically from fisheries research articles, affect the quality of the topics produced by the topic model Latent Dirichlet Allocation (LDA). LDA can be used to automatically uncover topics in documents without prior labeling or annotation of those documents.
The paper will be made public after the conference.