Topic Modeling



No file uploaded.Loading data...
Model loaded successfully.
KMeans model fitted successfully.

First 10 Keywords:
['root', 'user', 'login']

Nearest keyword to cluster centroid:
('root', 0.6702372014950745)
('login', 2.1869818996886834)
('login', 1.5382413210111627)
('login', 2.897773535337693)
('login', 2.4409011105452407)

Top TF-IDF scores on each cluster:
Cluster 0:
(root,9.00) (session,1.96) (closed,1.49) (check,1.15) (pass,1.15) (unknown,1.15) (eventtemplate,1.00) (opened,0.79)
Cluster 1:
(check,0.58) (pass,0.58) (unknown,0.58) (authentication,0.32) (euid,0.32) (failure,0.32) (logname,0.32) (nodevssh,0.32) (rhost,0.32) (root,0.32)
Cluster 2:
(check,0.50) (pass,0.50) (unknown,0.50) (user,0.50) (authentication,0.33) (euid,0.33) (failure,0.33) (logname,0.33) (nodevssh,0.33) (rhost,0.33)
Cluster 3:
ValueError: max_df corresponds to < documents than min_df