
Try other methods for vectorizing patterns #2

@bhu800

Currently, the patterns are vectorized with a simple binary Bag of Words (BoW) model: each feature only records whether a word is present in a pattern or not.
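
For reference, a binary BoW of this kind can be sketched roughly as below. This is only an illustration, not the project's actual code: the function name `binary_bow`, the lowercase-and-split tokenization, and the sample patterns are all assumptions made for the example.

```python
def binary_bow(patterns):
    """Return (vocabulary, vectors); each vector marks word presence with 0 or 1."""
    # Assumption: lowercasing and whitespace splitting stand in for the real tokenizer.
    tokenized = [pattern.lower().split() for pattern in patterns]
    # One feature per distinct word, in sorted order so the columns are stable.
    vocabulary = sorted({word for tokens in tokenized for word in tokens})
    vectors = [
        [1 if word in tokens else 0 for word in vocabulary]
        for tokens in tokenized
    ]
    return vocabulary, vectors


# Hypothetical patterns, chosen so that one word repeats.
patterns = ["hello hello there", "is anyone there"]
vocabulary, vectors = binary_bow(patterns)
print(vocabulary)  # ['anyone', 'hello', 'is', 'there']
print(vectors)     # [[0, 1, 0, 1], [1, 0, 1, 1]]
```

Note that "hello" occurs twice in the first pattern but still contributes only a 1, which is the information this representation throws away.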

Since other factors, such as how often a word occurs in a pattern, may also matter, we can try other approaches, such as:

  • CountVectorizer
  • TF-IDF Vectorizer
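
To make the difference between these two options concrete, here is a small dependency-free sketch of what count-based and TF-IDF vectors look like. The helper names, the tokenization, and the sample patterns are assumptions for illustration. The IDF used is one common smoothed variant, `ln((1 + n) / (1 + df)) + 1`; library implementations such as scikit-learn's `TfidfVectorizer` may tokenize and normalize differently, so their numbers will not necessarily match.

```python
import math
from collections import Counter


def count_vectors(patterns):
    """Return (vocabulary, vectors); each entry is how often the word occurs."""
    # Assumption: lowercasing and whitespace splitting stand in for the real tokenizer.
    tokenized = [pattern.lower().split() for pattern in patterns]
    vocabulary = sorted({word for tokens in tokenized for word in tokens})
    vectors = []
    for tokens in tokenized:
        occurrences = Counter(tokens)
        vectors.append([occurrences[word] for word in vocabulary])
    return vocabulary, vectors


def tfidf_vectors(patterns):
    """Return (vocabulary, vectors) with counts scaled by a smoothed IDF weight."""
    vocabulary, counts = count_vectors(patterns)
    n_patterns = len(patterns)
    # Document frequency: in how many patterns each word appears at least once.
    doc_freq = [
        sum(1 for row in counts if row[column] > 0)
        for column in range(len(vocabulary))
    ]
    # Smoothed IDF: words found in fewer patterns receive a larger weight.
    idf = [math.log((1 + n_patterns) / (1 + df)) + 1 for df in doc_freq]
    # No length normalization is applied in this sketch.
    weighted = [[c * w for c, w in zip(row, idf)] for row in counts]
    return vocabulary, weighted


# Hypothetical patterns, chosen so that one word repeats and one word is shared.
patterns = ["hello hello there", "is anyone there"]
vocabulary, counts = count_vectors(patterns)
_, tfidf = tfidf_vectors(patterns)
print(vocabulary)  # ['anyone', 'hello', 'is', 'there']
print(counts)      # [[0, 2, 0, 1], [1, 0, 1, 1]]
print(tfidf)       # "there" (in both patterns) is weighted lower than "hello"
```

Unlike a presence-only encoding, the count vector keeps the repeated "hello" as a 2, and the TF-IDF vector additionally down-weights "there" because it appears in every pattern.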

Train models with each vectorizer and compare which one performs best.

Ref:
https://medium.com/greyatom/an-introduction-to-bag-of-words-in-nlp-ac967d43b428
https://machinelearningmastery.com/gentle-introduction-bag-words-model/
