Vectorized Text Example

This repo contains a minimal working example of how to process text data, transform it into vectors (using TF-IDF), and then build a simple clustering model (KMeans) on the vectors. It also includes a demo script that makes a word cloud of the top terms in the corpus.

The corpus is the lyrics of the songs Taylor Swift performed on her "Eras" Tour: over 40 songs across 11 albums. The songs were selected based on this article, and the lyrics were taken directly from Google search results.

You can see the resulting vector file here.

Files in this repo: README.md, analyze.py, cloud.py, tswift.png, vectors.csv
