Boosting algorithms with topic modeling for multi-label text categorization: A comparative empirical study

2015 ◽  
Vol 41 (5) ◽  
pp. 732-746 ◽  
Author(s):  
Bassam Al-Salemi ◽  
Mohd. Juzaiddin Ab Aziz ◽  
Shahrul Azman Noah
2018 ◽  
Vol 11 (4) ◽  
pp. 77 ◽  
Author(s):  
Malek Mouhoub ◽  
Mustakim Al Helal

Topic modeling is a powerful technique for unsupervised analysis of large document collections. Topic models have a wide range of applications including tag recommendation, text categorization, keyword extraction and similarity search in the text mining, information retrieval and statistical language modeling. The research on topic modeling is gaining popularity day by day. There are various efficient topic modeling techniques available for the English language as it is one of the most spoken languages in the whole world but not for the other spoken languages. Bangla being the seventh most spoken native language in the world by population, it needs automation in different aspects. This paper deals with finding the core topics of Bangla news corpus and classifying news with similarity measures. The document models are built using LDA (Latent Dirichlet Allocation) with bigram.


Sign in / Sign up

Export Citation Format

Share Document