Thursday, January 24, 2008

Latent Semantic Indexing

Latent semantic indexing (LSI) is a process for extracting related words and concepts from your website content. It is a very interesting topic in information retrieval (IR) systems. Top search engines are believed to use techniques based on latent semantic indexing.

Lexical indexing is based entirely on lexical analysis. Lexical analysis processes an input sequence of characters and produces as output a sequence of symbols called lexical tokens. A lexical analyzer is divided into two stages: the first is known as the scanner and the second as the evaluator. Latent semantic indexing depends on these two stages. LSI-based search engine optimization is much more complex than normal search engine optimization. In it, the search engine ranking for a particular website passes through several processes. These take into account the occurrence of a keyword in a document, its close relationship with the other words of the document, and the overall flavor of your website content.
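The scanner and evaluator stages described above can be sketched in a few lines of Python. This is only an illustration, not how any particular search engine does it: the token pattern, the function names scan and evaluate, and the sample sentence are all assumptions made for the example.

```python
import re
from collections import Counter

# Assumed token pattern: runs of ASCII letters or runs of ASCII digits.
TOKEN_PATTERN = re.compile(r"[A-Za-z]+|[0-9]+")

def scan(text):
    """Scanner stage: split the raw character sequence into lexemes."""
    return TOKEN_PATTERN.findall(text)

def evaluate(lexemes):
    """Evaluator stage: attach a type and a normalized value to each lexeme."""
    tokens = []
    for lexeme in lexemes:
        if lexeme.isdigit():
            tokens.append(("NUMBER", int(lexeme)))
        else:
            tokens.append(("WORD", lexeme.lower()))
    return tokens

# Hypothetical sample content.
text = "Search engines index 2 documents; search engines rank documents."
tokens = evaluate(scan(text))

# Keyword occurrence: how often each word token appears in the document.
word_counts = Counter(value for kind, value in tokens if kind == "WORD")
print(word_counts.most_common(3))
```

The word counts produced here are the kind of keyword-occurrence data that a later indexing step would work from.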

If you search a database indexed with latent semantic indexing (LSI), the search engine looks at the similarity values it has calculated for every word, including synonyms, and returns the websites that best fit the query. This works because latent semantic indexing does not require exactly matching words to rank a result.
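To see how a match can happen without exact matching words, here is a small sketch of the usual textbook formulation of LSI: build a term-document matrix, keep only the strongest latent concepts from its singular value decomposition, and compare the query with the documents in that reduced space. The five tiny documents, the query, and all function names are made up for illustration. To stay dependency-free, the sketch uses plain power iteration with deflation as a stand-in for a proper SVD routine.

```python
import math

# Hypothetical mini-corpus: three vehicle documents, two fruit documents.
docs = [
    "car engine repair",
    "automobile engine repair",
    "car automobile dealer",
    "banana fruit smoothie",
    "fruit banana recipe",
]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def cosine(x, y):
    nx, ny = math.sqrt(dot(x, x)), math.sqrt(dot(y, y))
    return dot(x, y) / (nx * ny) if nx and ny else 0.0

def top_eigenpairs(matrix, k, iters=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(matrix)
    m = [list(row) for row in matrix]
    pairs = []
    for _concept in range(k):
        v = [1.0] * n
        for _step in range(iters):
            w = [dot(row, v) for row in m]
            norm = math.sqrt(dot(w, w))
            v = [x / norm for x in w]
        lam = dot(v, [dot(row, v) for row in m])
        pairs.append((lam, v))
        # Deflate: remove the component just found before looking for the next.
        m = [[m[i][j] - lam * v[i] * v[j] for j in range(n)] for i in range(n)]
    return pairs

vocab = sorted({t for d in docs for t in d.split()})
# Term-document matrix A: one row per term, one column per document.
A = [[d.split().count(t) for d in docs] for t in vocab]
columns = list(zip(*A))
gram = [[dot(ci, cj) for cj in columns] for ci in columns]  # A^T A

pairs = top_eigenpairs(gram, k=2)            # keep two latent concepts
sigmas = [math.sqrt(lam) for lam, _ in pairs]
V = [v for _, v in pairs]                    # V[c][d]: document d on concept c
U = [[dot(row, v) / s for row in A] for v, s in zip(V, sigmas)]

def fold_in(query):
    """Project a query into the concept space: q^T U Sigma^-1."""
    q = [query.split().count(t) for t in vocab]
    return [dot(q, u) / s for u, s in zip(U, sigmas)]

doc_vecs = [[V[c][d] for c in range(len(V))] for d in range(len(docs))]
q_hat = fold_in("automobile")
scores = [cosine(q_hat, dv) for dv in doc_vecs]

for doc, score in zip(docs, scores):
    print(f"{score:6.3f}  {doc}")
```

The first document never contains the word "automobile", yet it scores close to the documents that do, because it shares other words with them. The fruit documents score near zero.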

For LSI-based search engine optimization, we go through the following process:

Categorization of the documents
Contextual Explanation from Lexically Similar Words
Conceptual Comparison
Cross-Lingual Text Analysis
Content Relationship Discovery
Document Summarization
Taxonomy Generation
