-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Description
Currently, the patterns have been vectorized into Bag of Words (BoW) Model, in a most simple way - on the basis of if the word is present in a pattern or not.
But since other factors like the frequency of that word in pattern might also play a role, we can try other approaches too like -
- CountVectorizer
- TF-IDF Vectorizer
Try models using different vectorizers and see which works best.
Ref:
https://medium.com/greyatom/an-introduction-to-bag-of-words-in-nlp-ac967d43b428
https://machinelearningmastery.com/gentle-introduction-bag-words-model/
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers