Table 3 Hierarchical clustering accuracy and running time on SIDER2 dataset

From: A heuristic approach to determine an appropriate number of topics in topic modeling

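| T* | 5 | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Misclassified | 443 | 411 | 362 | 355 | 285 | 205 | 223 | 246 | 251 | 269 | 269 |
| Time (ms) | 43,378 | 45,233 | 48,252 | 49,278 | 50,493 | 51,443 | 52,526 | 52,577 | 54,298 | 54,468 | 54,608 |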
 
*T: Number of topics.
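
As a reading aid, the values in Table 3 can be scanned for the topic count with the fewest misclassified items. The following is a minimal Python sketch, not code from the article; the variable names (`topics`, `misclassified`, `time_ms`, `best`) are illustrative assumptions, and the numbers are transcribed directly from the table.

```python
# Values transcribed from Table 3 (SIDER2 dataset); names are illustrative only.
topics = [5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
misclassified = [443, 411, 362, 355, 285, 205, 223, 246, 251, 269, 269]
time_ms = [43378, 45233, 48252, 49278, 50493, 51443, 52526, 52577, 54298, 54468, 54608]

# Position of the smallest misclassification count in the table.
best = min(range(len(topics)), key=misclassified.__getitem__)
best_t = topics[best]

# Report the topic count, its misclassified count, and its running time.
print(best_t, misclassified[best], time_ms[best])
```

On these values the minimum falls at T = 50 (205 misclassified), while the running time grows steadily across the whole range of T.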