search-web.rst
author Oleksandr Gavenko <gavenkoa@gmail.com>
Mon, 25 Oct 2010 17:24:11 +0300
changeset 624 76016e0c69f9
parent 615 43c69dfaf8ea
permissions -rwxr-xr-x
Driver Selection Process.
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
614
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     1
-*- mode: outline; coding: utf-8 -*-
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     3
* Google search query syntax.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     4
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     5
  http://www.google.com/support/websearch/bin/answer.py?answer=136861
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     6
                Google search basics: More search help
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     7
  http://code.google.com/intl/ru/apis/soapsearch/reference.html
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     8
                Google SOAP Search API Reference
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     9
  http://www.google.com/cse/docs/resultsxml.html
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    10
                Google WebSearch Protocol Reference for Google Site Search
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    11
  http://en.wikipedia.org/wiki/Google_Search
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    12
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    13
** Phrase Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    14
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    15
Use double quotes to search exactly mutch of string. Words marked in this way will appear together
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    16
in all results exactly as entered.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    17
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    18
  "WORD1 WORD2 WORD3"
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    19
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    20
Note: You may need to use a "+" to force inclusion of common words in a phrase.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    21
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    22
** Boolean OR Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    23
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    24
"OR" capital is essential.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    25
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    26
  WORD1 OR WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    27
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    28
Remove site from search by "-site:"
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    29
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    30
  WORD1 WORD2 -site:ebay.com -site:shopping.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    31
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    32
** Include query term (search exactly as is).
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    33
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    34
If a common word is essential to getting the results you want, you can include it by putting a "+"
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    35
sign in front of it.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    36
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    37
 +WORD WORD1 WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    38
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    39
** Exclude query term.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    40
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    41
You can exclude a word from your search by putting a minus sign ("-") immediately in front of the
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    42
term you want to exclude from the search results.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    43
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    44
  WORD1 WORD2 -WORD
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    45
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    46
** Fill in the blanks.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    47
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    48
  GNU *
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    49
  Mozilla *
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    50
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    51
** Site Restricted Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    52
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    53
  site:example.com WORD1 WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    54
  site:.gov WORD
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    55
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    56
** Cached Results Page.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    57
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    58
The query prefix "cache:" returns the cached HTML version of the specified web document that the
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    59
Google search crawled. Note there can be no space between "cache:" and the web page URL. If you
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    60
include other words in the query, Google will highlight those words within the cached document.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    61
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    62
  cache:www.google.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    63
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    64
Use Google as a free proxy (if direct access bloked): cache:example.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    65
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    66
** Title Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    67
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    68
Restricts the results to those with all of the query words in the title.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    69
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    70
  intitle:WORD1 intitle:WORD2 WORD3
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    71
  allintitle:WORD1 WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    72
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    73
Note: Putting "intitle:" in front of every word in your query is equivalent to putting "allintitle:"
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    74
at the front of your query.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    75
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    76
** URL Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    77
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    78
If you prepend "inurl:" to a query term, Google search restricts the results to documents containing
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    79
that word in the result URL. Note there can be no space between the "inurl:" and the following word.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    80
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    81
Starting a query with the term "allinlinks:" restricts the results to those with all of the query
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    82
words in the URL links on the page.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    83
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    84
  inurl:WORD1 inurl:WORD2 WORD
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    85
  allinurl: WORD1 WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    86
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    87
Note: "inurl:" works only on words, not URL components. In particular, it ignores punctuation and
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    88
uses only the first word following the "inurl:" operator. To find multiple words in a result URL,
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    89
use the "inurl:" operator for each word.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    90
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    91
Note: Putting "inurl:" in front of every word in your query is equivalent to putting "allinurl:" at
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    92
the front of your query.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    93
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    94
** Text Only Search.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    95
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    96
Starting a query with the term "allintext:" restricts the results to those with all of the query
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    97
words in only the body text, ignoring link, URL, and title matches.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    98
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    99
  allintext: WORD1 WORD2
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   100
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   101
** File Type Filtering.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   102
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   103
The query prefix "filetype:" filters the results returned to include only documents with the
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   104
extension specified immediately after. Note there can be no space between "filetype:&quot; and the
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   105
specified extension.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   106
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   107
  WORD filetype:doc OR filetype:pdf
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   108
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   109
** File Type Exclusion.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   110
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   111
The query prefix "-filetype:" filters the results to exclude documents with the extension specified
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   112
immediately after. Note there can be no space between "-filetype:" and the specified extension.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   113
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   114
  WORD -filetype:doc -filetype:pdf
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   115
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   116
** Web Document Info.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   117
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   118
The query prefix "info:" returns a single result for the specified URL if it exists in the index.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   119
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   120
  info:www.google.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   121
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   122
Note: No other query terms can be specified when using this special query term.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   123
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   124
** Back Links.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   125
615
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   126
The query prefix "link:" lists web pages that have links to the specified web page.
614
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   127
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   128
  link:www.google.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   129
615
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   130
Note: there can be no space between "link:" and the web page URL.
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   131
614
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   132
Note: No other query terms can be specified when using this special query term.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   133
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   134
** Related Links.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   135
615
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   136
Lists web pages that are similar to the specified web page.
614
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   137
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   138
  related:www.google.com
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   139
615
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   140
Note: there can be no space between "related:" and the web page URL.
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   141
614
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   142
Note: No other query terms can be specified when using this special query term.
e26ea9748d66 Google search query syntax.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   143
615
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   144
** Word definition.
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   145
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   146
The query prefix "define:" will provide a definition of the words listed after it.
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   147
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   148
  define:WORD
43c69dfaf8ea Word definition.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 614
diff changeset
   149