Topic Modeling



No file uploaded.
Loading data...
Model loaded successfully.
KMeans model fitted successfully.

First 10 Keywords:
['root', 'user', 'login']

Nearest keyword to cluster centroid:
('login', 1.5349699706090836)
('login', 1.4020770036117205)
('login', 2.4155707880485386)
('root', 1.7054998871098839)
('root', 0.6479849504961069)
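Each pair above is a keyword and its distance to one cluster centroid. The log does not say how that distance is computed, so the following is only a sketch that assumes Euclidean distance over keyword embedding vectors. The function name `nearest_keyword` and the 2-D vectors are hypothetical, chosen for illustration.

```python
import math


def nearest_keyword(centroid, keyword_vectors):
    """Return (keyword, distance) for the vector closest to the centroid.

    Assumes Euclidean distance; the original run may use another metric.
    """
    return min(
        ((kw, math.dist(vec, centroid)) for kw, vec in keyword_vectors.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical 2-D embeddings, not taken from the actual model.
keyword_vectors = {
    "root": (0.0, 1.0),
    "user": (1.0, 1.0),
    "login": (2.0, 0.0),
}

print(nearest_keyword((1.9, 0.2), keyword_vectors))
```

Under this reading, the same keyword can be nearest to several centroids, as 'login' and 'root' are in the output above, because each centroid is compared against the full keyword list independently.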

Top TF-IDF scores on each cluster:
Cluster 0:
(check,0.50) (pass,0.50) (unknown,0.50) (user,0.50) (authentication,0.33) (euid,0.33) (failure,0.33) (logname,0.33) (nodevssh,0.33) (rhost,0.33)
Cluster 1:
ValueError: max_df corresponds to < documents than min_df
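This error message is the one scikit-learn's text vectorizers raise when, for the documents passed in, the document count implied by `max_df` is smaller than the one implied by `min_df`. That can happen when a cluster holds only a few documents. The log does not show the settings that were used, so the values below are hypothetical. The sketch mirrors the rule in plain Python (integer thresholds are absolute counts, float thresholds are proportions of the document count) and is meant as a pre-check, not as the library's own code.

```python
from numbers import Integral


def doc_count_bounds(n_docs, min_df, max_df):
    """Convert min_df/max_df into document counts.

    Integers are taken as absolute counts, floats as proportions of n_docs.
    """
    min_count = min_df if isinstance(min_df, Integral) else min_df * n_docs
    max_count = max_df if isinstance(max_df, Integral) else max_df * n_docs
    return min_count, max_count


def thresholds_are_consistent(n_docs, min_df, max_df):
    """Return True when max_df allows at least as many documents as min_df."""
    min_count, max_count = doc_count_bounds(n_docs, min_df, max_df)
    return max_count >= min_count


# Hypothetical settings: min_df=2 (absolute), max_df=0.5 (proportion).
print(thresholds_are_consistent(n_docs=3, min_df=2, max_df=0.5))   # 1.5 < 2
print(thresholds_are_consistent(n_docs=10, min_df=2, max_df=0.5))  # 5.0 >= 2
```

One option, if this matches the actual cause, is to run such a check per cluster and fall back to looser thresholds (for example `min_df=1`, `max_df=1.0`) or skip the TF-IDF step for clusters that are too small.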