Text classification is a process of assigning tags or categories to a text according to its content . It's one of the fundamental tasks in Natural Language Processing (NLP) with broad applications such as sentiment analysis, topic labeling, spam detection etc.
- Performed Text Classification using Multinomial Naive Bayes(already implemented in sklearn).
- Implemented Naive Bayes from scratch for Text Classification.
- Compared the Results of my implementation of Naive Bayes with the one in Sklearn.
Here,We are provided with several news articles related to various fields like science,sports,politics etc. and we are supposed to build a classifier and train that classifier upon these articles such that in future, given a news article it can predict the category or the field to which this news belong.
I have tried to keep it as simple as possible(by sticking to the algorithm behind NaiveBayes' classification) . I have tried to mention every minute detail in the form of comments so that one can analyze each and every aspect of it.