Myanmar Text Classifier Using Genetic Algorithm

Khin Mar Soe
Page No: 1264-1267

Text classification is the task of automatically assigning documents to categories (classes or topics) drawn from a predefined set. It plays an important role in natural language processing and sits at the crossroads of information retrieval and machine learning. The rapid growth of digital text, such as articles on news websites, has made text classification increasingly necessary, and the task has grown in popularity over the last ten years. Its applications include spam filtering, question answering, and language identification. This paper presents the text classification process from a machine learning perspective and illustrates how Myanmar news documents were classified by applying a genetic algorithm. The system uses Myanmar online news articles from a Myanmar news website for training and testing. The term frequency-inverse document frequency (tf_idf) algorithm, which is also applied in many text mining methods, was used to select relevant features according to the labelled documents.
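As an illustration of the tf_idf feature selection step mentioned above, the following is a minimal sketch and not the paper's implementation. It assumes the documents are already segmented into word tokens (Myanmar text would need a word segmenter first), uses the common weighting tf × log(N / df), and the function names `tf_idf` and `top_terms` as well as the toy tokens are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute tf-idf weights for a list of tokenized documents.

    docs: list of token lists (assumed to be pre-segmented words).
    Returns one dict per document mapping term -> tf * log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter()
    for tokens in docs:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


def top_terms(doc_weights, k):
    """Return the k highest-weighted terms of one document (ties broken alphabetically)."""
    ranked = sorted(doc_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Toy example with placeholder tokens standing in for segmented news text.
docs = [
    ["election", "vote", "election"],
    ["vote", "football"],
]
weights = tf_idf(docs)
selected = top_terms(weights[0], 1)
```

Terms that occur in every document receive a weight of zero, so only terms that help distinguish one document from another are kept as features. How the selected features are then encoded for the genetic algorithm is not specified in the abstract, so that step is left out here.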
