Latent Semantic Indexing

seobangalore


Can anyone explain Latent Semantic Indexing?
KevinMax

Put simply, search engines try to understand the content of your page by reading the words and phrases that occur most often on it. This is what is generally called LSI.

Latent semantic indexing is a fairly old term, and it is often said to be used by Google and other major search engines. Web spiders don't have time to read all the content on the internet from top to bottom, nor do they skim here and there to work out what a site or page is all about.

The terms keyword proximity, prominence, and frequency may be derived from the concept of LSI.
surenot

Here is what Wikipedia says about it:

Quote: Latent semantic indexing (LSI) is an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text.

In other words, a normal indexing process scans each document for keywords and records them. LSI takes it to the next level by comparing documents against each other to see which keywords they share.

Documents that have many words in common are classified as semantically close; those that share only a few keywords are called semantically distant.

That is how a search engine can determine whether a piece of content is a duplicate.
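
To make the SVD part of the Wikipedia definition a bit more concrete, here is a minimal sketch in Python. It assumes NumPy is installed; the five toy documents, the variable names, and the choice of k = 2 latent dimensions are all made up for illustration, and this is not how any particular search engine does it. It builds a term-document count matrix, runs SVD on it, and compares documents both by raw keyword overlap and in the reduced "latent" space.

```python
import numpy as np

# Toy corpus (hypothetical): three car-related documents, two fruit-related ones.
docs = [
    "car engine repair",
    "automobile engine repair",
    "car automobile dealer",
    "banana fruit smoothie",
    "fruit banana recipe",
]

# Term-document count matrix: one row per term, one column per document.
vocab = sorted({word for doc in docs for word in doc.split()})
A = np.array([[doc.split().count(term) for doc in docs] for term in vocab],
             dtype=float)

# Singular value decomposition: A = U @ diag(s) @ Vt.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Keep only the k strongest latent dimensions (k = 2 is an arbitrary choice here).
k = 2
doc_vecs = (np.diag(s[:k]) @ Vt[:k]).T  # one k-dimensional row per document


def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


# Documents 0 and 2 share only the word "car".
raw_02 = cosine(A[:, 0], A[:, 2])            # similarity on raw keyword counts
lsi_02 = cosine(doc_vecs[0], doc_vecs[2])    # similarity in the latent space
lsi_03 = cosine(doc_vecs[0], doc_vecs[3])    # car document vs. fruit document

print(f"raw 0-2: {raw_02:.2f}  latent 0-2: {lsi_02:.2f}  latent 0-3: {lsi_03:.2f}")
```

On this toy data the two car documents look only weakly related by raw keyword overlap, but end up much closer in the latent space, because "car" and "automobile" both co-occur with the same neighbouring terms. The car and fruit documents stay unrelated either way.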
Erin Nagata

Search Engine Journal explained that in latent semantic indexing "the contents of a webpage are crawled by a search engine and the most common words and phrases are collated and identified as the keywords for the page".
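
As a rough illustration of the "most common words are collated" idea in that quote, here is a small Python sketch using only the standard library. The stop-word list, the function name top_keywords, and the sample text are my own assumptions, and real search engines certainly do far more than this. Note that this is plain word counting, not the SVD-based technique from the Wikipedia definition.

```python
import re
from collections import Counter

# Tiny, hypothetical stop-word list; a real one would be much longer.
STOPWORDS = {"a", "an", "and", "is", "of", "the", "to"}


def top_keywords(text, n=3):
    """Return the n most frequent non-stop-words in text, most frequent first."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [word for word, _ in counts.most_common(n)]


page = ("Latent semantic indexing is an indexing method. "
        "Indexing uses semantic patterns.")
print(top_keywords(page, 2))  # ['indexing', 'semantic']
```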

