How to implement incremental learning in NLP

Question

We are building a system wherein, we would have a initial very small amount of trained data to start with. The job is to Classify the incoming data(Document, for our case) into 2 categories: Category A & B. Data is document , so the user needs to classify the Document to belong to Category A or B. So, with the limited amount of data, we create the trained data set and we start predicting the Category of next Document using the trained data set.

Now if the prediction is correct user moves to the next Document. But if the prediction is incorrect , the user inputs the correct Category (Lets Say Category A was predicted by the system, wherein the correct assignment to the data should be Category B). So now the system should use this learning(Category B instead of Category A) to enrich(learn) itself in near real time.

It should train only the added data and not the complete dataset , which is already trained. So it should be incremental learning. For classification we would be applying Naive Bayes Classification.

Now the question is :

How do we implement the incremental training, without training the whole data set every time?
I know there are incremental learning libraries like Vowpal Wabbit & creme. Will it be a good solution using these library for my case?

How to implement incremental learning in NLP

Answers (1)

Related Questions