You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

TableOfContents(4)


Abstract

Basically, It'll shows how to construct the matrix from the files.

This word count matrix (document-word) approach is often referred to as latent semantic indexing and document clustering (Of course, A word frequently present in all documents will not be useful for clustering – The length of all documents is not uniform so a lengthy document will have higher word counts).

  • No labels