From: A novel multiple kernel fuzzy topic modeling technique for biomedical data
Datasets | Documents (Preprocess) | Words | Unique words |
---|---|---|---|
MuchMore Springer | 1527 | 19,835 | 5008 |
Ohsumed | 2092 | 22,669 | 13,238 |
Genia | 2000 | 21,560 | 17,834 |
Biotext | 40 | 25,921 | 10,267 |
58,927 | 395,636 | 25,309 | |
WSJ | 1300 | 680Â K | 36Â K |