Extensive pages
along with their URLs are taken as samples, a query which can best summarize
the page itself is constructed and sent to the search engine, the samples’ URLs
are compared with the returned URLs, if there is a match between them or their
content, consider the query as a lexical signature query or strong query. Assume
that the page and its URL in surface web are supposed to be found by general
search engine leads to the search engine’s quality measurement. By sending the
strong query to different search engines, the qualities are derived. It will be
a good source for the measurement only if the query extraction and processing
targeted on web pages are well designed and implemented. This process is called
find a lexical signature query for a given web page.
Keywords:
lexical signature, query, search engine, Google, Yahoo, HTML tags, term
frequency, document frequency, graph-based ranking algorithm, word rank,
sentence rank.