Characters that are skipped during indexing will also be ignored during a search. When searching for a term with skipped characters, it will retrieve all documents containing the term, excluding the skipped character or any other skip characters in a similar format.
Example 1:
ALL (“1+2”)
As "+" is considered a skip character (refer to KB Characters Excluded During Indexing), this search will return all documents containing the term "1+2." It will also retrieve documents containing "1=2," "1&2," or any other terms with skip characters in a similar format. In additio to this, it will include documents with "1 2" (whitespace in the middle), but it will not include documents with "1-2" since "–" is not indexed with splitting or skipping.
Example 2:
ALL (“at&t”) AND ALL (“p & g”)
This search will retrieve all documents containing the term "at&t" and all documents containing "P & G" with a space. However, since "&" is skipped during indexing for "p & g" (due to the whitespace between the terms), it will also retrieve documents with "p & g," "p = g," "p g," or any other terms with skipped characters in a similar format.
Example 3:
ALL (“p g”~1)
This is a proximity query. The query will match documents containing "p & g" as well because "&" is skipped during indexing. However, it will not match documents with "p&g" since that term is indexed as is.
Example 4:
ALL (“pg&”)
In this case, since "&" is considered a skip character (see Table 1), and "pg&" is indexed as "pg," this search will retrieve documents with the term "pg&" as well as documents with "pg=" or "pg+" or any other terms with skip characters in a similar format.
Example 5:
ALL (“www.google.com”)
Since website names are indexed without skipping ".", this search will retrieve all documents containing the exact term.
Example 6:
ALL (“www.google.com.”)
Since "www.google.com." is indexed as "wwwgooglecom" (without acronyms), periods are skipped during search as well. Therefore, this search query is equivalent to
ALL (“wwwgooglecom”)
This will retrieve all documents with the term "www.google.com." It will also retrieve documents with the term "wwwgooglecom" but will NOT retrieve documents with the term "www.google.com."
Comments
0 comments
Please sign in to leave a comment.