Keywords
Marcel Heinz edited this page Aug 3, 2018 · 5 revisions
- Inspect the words found under computer languages, annotated with their frequency: https://github.com/softlang/wikionto/blob/master/data/nndict.json
- Plot a bar chart of the top 10 words using https://github.com/softlang/wikionto/blob/master/src/data/plotting_nn_top10.py
Later, we extended the keyword exploration to the whole article set under the identified root categories. We observe similar results, especially for songs. For each word stem, we count the number of articles in which it appears. The top ten stems are listed below with their article counts.
- 'song', 31737
- 'system', 30818
- 'number', 26394
- 'group', 24422
- 'time', 21760
- 'state', 21739
- 'star', 19338
- 'year', 18898
- 'unit', 18197
- 'languag', 17325
See https://github.com/softlang/wikionto/blob/master/src/data/explore/all_nouns.py for reproduction.
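
The counting described above is a document frequency: each stem counts at most once per article, however often it occurs there. The following is a minimal sketch of that idea only, not the code in `all_nouns.py`. The helper `naive_stem`, the function `document_frequency`, and the three sample texts are hypothetical; the crude suffix stripping stands in for whatever stemmer the original script uses, and the sketch does not filter for nouns.

```python
import re
from collections import Counter


def naive_stem(word):
    """Crude stand-in for a real stemmer (hypothetical helper).

    Strips one common suffix if at least three characters remain.
    """
    for suffix in ("es", "s", "e"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def document_frequency(articles):
    """Count, for each stem, the number of articles that contain it."""
    df = Counter()
    for text in articles:
        tokens = re.findall(r"[a-z]+", text.lower())
        # A set ensures a stem is counted once per article.
        stems = {naive_stem(token) for token in tokens}
        df.update(stems)
    return df


# Made-up sample texts, for illustration only.
articles = [
    "The song was a hit song.",
    "Songs and languages.",
    "A programming language system.",
]

df = document_frequency(articles)
print(df["song"], df["languag"])  # prints "2 2"
```

Note that `song` occurs twice in the first sample text but contributes only one article to its count, which is the behaviour the list above relies on. Calling `df.most_common(10)` would give a top-ten list of the same shape.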