-
LSA assumes words close in meaning will occur in similar peices of text with distributional hypothesis.
-
LSA tries finding
low-rank approximtionforterm-document matrixbydot productbetween all term-vectors (giving correlation between terms) and document-vectors (giving document correlation over terms). -
From theory of Linear Algebra, there exists a decomposition of
term-document matrixgiving adiagonal matrixalongwith a tall and a wide matrix. This is SVD. -
Uses a BoW (Bag of Word) model resulting in term-document matrix (rows for words, columns for documents).
-
LSA learns latent topics by performing matrix decomposition on document-term matrix using SVD.
-
LSA typically used as dimesnion reduction or noise reduction technique.