web-search.rst
author Oleksandr Gavenko <gavenkoa@gmail.com>
Thu, 23 Feb 2012 13:36:53 +0200
.. -*- coding: utf-8 -*-

=============
 WEB search.
=============

.. contents::

Disable page indexing by search engine.
=======================================

Add the following tag inside the ``<head>`` element of the HTML page::

  <meta name="ROBOTS" content="NOINDEX,NOFOLLOW" />
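
To confirm that a page really carries the tag, its HTML can be parsed and the
robots directives collected. The following is a minimal Python sketch using only
the standard library; the names ``RobotsMetaParser`` and ``robots_directives``
are hypothetical and chosen for illustration only.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() != "robots":
            return
        for item in attr.get("content", "").split(","):
            item = item.strip().lower()
            if item:
                self.directives.add(item)


def robots_directives(html):
    """Return the set of lower-cased robots directives found in the HTML."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives
```

For the tag shown above the function returns a set containing ``noindex`` and
``nofollow``; a page without the tag yields an empty set.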

Image by image search.
======================

 * http://www.gazopa.com/
 * http://images.google.com/
 * http://www.tineye.com/

Image by text search.
=====================

 * http://en.wikipedia.org/wiki/List_of_CBIR_Engines

Dictionary Search.
==================

 * http://www.onelook.com/

Code search.
============

  http://code.google.com/
                Google Code.
  http://www.koders.com/
                Search for code in released Open Source tarballs. You can select
                license and language.
  http://www.merobase.com/
                Search for file names in Open Source VCS.
  http://snipplr.com/
                Search for code snippets from its service.

Search in blogs.
================

 * http://blogsearch.google.com/
 * http://www.technorati.com/

DuckDuckGo.
===========

General search engine.

  http://duckduckgo.com/
                search page

Google historical corpus statistics.
====================================

  http://ngrams.googlelabs.com/

Google search query syntax.
===========================

  http://www.google.com/support/websearch/bin/answer.py?answer=136861
                Google search basics: More search help
  http://www.google.ru/help/operators.html
                Advanced Operators
  http://code.google.com/intl/ru/apis/soapsearch/reference.html
                Google SOAP Search API Reference
  http://www.google.com/cse/docs/resultsxml.html
                Google WebSearch Protocol Reference for Google Site Search
  http://en.wikipedia.org/wiki/Google_Search
                Google Search

Phrase Search.
--------------

Use double quotes to search for an exact match of a string. Words enclosed this
way appear together in all results exactly as entered::

  "WORD1 WORD2 WORD3"

Note: You may need to use a "+" to force inclusion of common words in a phrase.

Boolean OR Search.
------------------

The "OR" operator must be written in capital letters::

  WORD1 OR WORD2

Exclude a site from the results with "-site:"::

  WORD1 WORD2 -site:ebay.com -site:shopping.com

Include query term (search exactly as is).
------------------------------------------

If a common word is essential to getting the results you want, you can include
it by putting a "+" sign in front of it::

  +WORD WORD1 WORD2

Exclude query term.
-------------------

You can exclude a word from your search by putting a minus sign ("-")
immediately in front of the term you want to exclude from the search results::

  WORD1 WORD2 -WORD
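
The phrase, OR and exclusion operators described so far are plain text, so a
query can be assembled with ordinary string handling. Below is a minimal Python
sketch; ``build_query`` and its parameter names are hypothetical, introduced
only for this illustration, and it makes no attempt to escape quotes inside the
phrase.

```python
def build_query(words=(), phrase=None, any_of=(), exclude=(), exclude_sites=()):
    """Assemble a search query string from its parts.

    words          plain terms, all of which should match
    phrase         text to match exactly (wrapped in double quotes)
    any_of         alternatives joined with the capitalised OR operator
    exclude        terms prefixed with "-"
    exclude_sites  hosts prefixed with "-site:"
    """
    parts = list(words)
    if phrase:
        parts.append('"%s"' % phrase)
    if any_of:
        parts.append(" OR ".join(any_of))
    # No space is allowed between the operator and its argument.
    parts.extend("-" + word for word in exclude)
    parts.extend("-site:" + site for site in exclude_sites)
    return " ".join(parts)
```

For example, ``build_query(words=["WORD1", "WORD2"], exclude_sites=["ebay.com",
"shopping.com"])`` gives ``WORD1 WORD2 -site:ebay.com -site:shopping.com``.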

Fill in the blanks.
-------------------

An asterisk ("*") stands in for any unknown word::

  GNU *
  Mozilla *

Site Restricted Search.
-----------------------
::

  site:example.com WORD1 WORD2
  site:.gov WORD

Cached Results Page.
--------------------

The query prefix "cache:" returns the cached HTML version of the specified web
document that Google crawled. Note there can be no space between "cache:" and
the web page URL. If you include other words in the query, Google will highlight
those words within the cached document::

  cache:www.google.com

Use Google as a free proxy (if direct access is blocked): cache:example.com

Title Search.
-------------

Restricts the results to those with all of the query words in the title::

  intitle:WORD1 intitle:WORD2 WORD3
  allintitle:WORD1 WORD2

Note: Putting "intitle:" in front of every word in your query is equivalent to
putting "allintitle:" at the front of your query.

URL Search.
-----------

If you prepend "inurl:" to a query term, Google search restricts the results to
documents containing that word in the result URL. Note there can be no space
between the "inurl:" and the following word.

Starting a query with the term "allinurl:" restricts the results to those with
all of the query words in the result URL::

  inurl:WORD1 inurl:WORD2 WORD
  allinurl: WORD1 WORD2

Note: "inurl:" works only on words, not URL components. In particular, it
ignores punctuation and uses only the first word following the "inurl:"
operator. To find multiple words in a result URL, use the "inurl:" operator for
each word.

Note: Putting "inurl:" in front of every word in your query is equivalent to
putting "allinurl:" at the front of your query.
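
The equivalence stated in the notes for "allintitle:" and "allinurl:" can be
written down as a small rewrite rule. The Python sketch below is only an
illustration of that rule; ``expand_all_prefix`` and the ``EQUIVALENTS`` table
are hypothetical names, and the table covers just the two operators for which
this document states the equivalence.

```python
# "all..." operators and the per-word operator each one stands for.
EQUIVALENTS = {"allinurl": "inurl", "allintitle": "intitle"}


def expand_all_prefix(query):
    """Rewrite 'allinurl: W1 W2' as 'inurl:W1 inurl:W2'.

    Queries that do not start with a known "all..." operator are
    returned unchanged.
    """
    prefix, sep, rest = query.partition(":")
    single = EQUIVALENTS.get(prefix.strip().lower())
    if not sep or single is None:
        return query
    # Attach the per-word operator to every term, without a space.
    return " ".join("%s:%s" % (single, word) for word in rest.split())
```

With this rule ``allinurl: WORD1 WORD2`` becomes ``inurl:WORD1 inurl:WORD2``,
while a query such as ``site:example.com WORD1`` passes through untouched.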

Link anchor search.
-------------------

Searches for text in a page's link anchors. A link anchor is the descriptive
text of a link::

  inanchor:"WORD1 WORD2"

Text Only Search.
-----------------

Starting a query with the term "allintext:" restricts the results to those with
all of the query words in only the body text, ignoring link, URL, and title
matches::

  intext:WORD
  allintext: WORD1 WORD2

File Type Filtering.
--------------------

The query prefix "filetype:" filters the results returned to include only
documents with the extension specified immediately after. Note there can be no
space between "filetype:" and the specified extension::

  WORD filetype:doc OR filetype:pdf

File Type Exclusion.
--------------------

The query prefix "-filetype:" filters the results to exclude documents with the
extension specified immediately after. Note there can be no space between
"-filetype:" and the specified extension::

  WORD -filetype:doc -filetype:pdf
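
When such a query is placed in a link or sent from a script, the ":" and space
characters have to be URL-encoded. The Python sketch below shows one way to do
that with the standard library. The helper names are hypothetical, and the base
address ``https://www.google.com/search`` with its ``q`` parameter is an
assumption of this example that the notes above do not state.

```python
from urllib.parse import urlencode


def filetype_query(words, include=(), exclude=()):
    """Build a query that filters by file extension.

    include  extensions joined as 'filetype:a OR filetype:b'
    exclude  extensions added as '-filetype:x'
    """
    parts = list(words)
    if include:
        parts.append(" OR ".join("filetype:" + ext for ext in include))
    parts.extend("-filetype:" + ext for ext in exclude)
    return " ".join(parts)


def search_url(query, base="https://www.google.com/search"):
    """Return the search URL with the query percent-encoded in 'q'.

    The base address is an assumption of this sketch.
    """
    return base + "?" + urlencode({"q": query})
```

For instance, ``search_url(filetype_query(["WORD"], exclude=["doc", "pdf"]))``
yields ``https://www.google.com/search?q=WORD+-filetype%3Adoc+-filetype%3Apdf``.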

Web Document Info.
------------------

The query prefix "info:" returns a single result for the specified URL if it
exists in the index::

  info:www.google.com

Note: No other query terms can be specified when using this special query term.

Back Links.
-----------

The query prefix "link:" lists web pages that have links to the specified web
page::

  link:www.google.com

Note: there can be no space between "link:" and the web page URL.

Note: No other query terms can be specified when using this special query term.
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   230
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   231
Related Links.
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   232
--------------
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   233
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   234
Lists web pages that are similar to the specified web page::
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   235
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   236
  related:www.google.com
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   237
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   238
Note: there can be no space between "related:" and the web page URL.
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   239
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   240
Note: No other query terms can be specified when using this special query term.
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   241
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   242
Word definition.
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   243
----------------
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   244
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   245
The query prefix "define:" will provide a definition of the words listed after
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   246
it::
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   247
a898726dc330 Google historical corpus statistics.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
   248
  define:WORD
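
The special prefixes above share one shape: the prefix, a colon, and the
argument with no space in between, sent as the whole query. A minimal Python
sketch of that rule follows. The helper names ``special_query`` and
``search_url`` and the ``https://www.google.com/search?q=`` endpoint are
assumptions made for illustration; these notes do not specify them.

```python
from urllib.parse import quote_plus

# Prefixes described in the sections above.
PREFIXES = {"info", "link", "related", "define"}

# Assumed search endpoint, used only to show URL encoding.
SEARCH_URL = "https://www.google.com/search?q="


def special_query(prefix, argument):
    """Return 'prefix:argument' with no space after the colon."""
    if prefix not in PREFIXES:
        raise ValueError("unknown prefix: %r" % prefix)
    argument = argument.strip()
    if not argument:
        raise ValueError("empty argument")
    return "%s:%s" % (prefix, argument)


def search_url(query):
    """Percent-encode the query and append it to the assumed endpoint."""
    return SEARCH_URL + quote_plus(query)


print(special_query("link", " www.google.com"))
print(search_url(special_query("define", "corpus")))
```

Stripping the argument before joining is what enforces the "no space" notes;
the query is returned on its own because no other terms may accompany these
prefixes.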

Yahoo search query syntax.
==========================

  http://help.yahoo.com/l/uk/yahoo/search/basics/index.html
                Yahoo! Search Help Topics
  http://help.yahoo.com/l/uk/yahoo/search/basics/basics-04.html
                Search Tips
  http://help.yahoo.com/l/uk/yahoo/search/basics/basics-08.html
                What is Advanced Search?
  http://help.yahoo.com/l/uk/yahoo/search/basics/basics-19.html
                How do I search for a specific URL, sub-page, or find sites that link to mine?

All of these words.
-------------------

Includes all of the words you typed in the search box. This is similar to
inserting "AND" between words or the symbol "+" before a word.

At least one of these words.
----------------------------

Searches for results that match one or more of the words. This is similar
to inserting "OR" between the words.

Exact phrase.
-------------

Searches for the words in exactly the order you enter them. This is similar to
putting quotes (" ") around a set of words.

None of these words.
--------------------

Excludes words from your search. This is similar to inserting "NOT" between the
words or the symbol "-" before a word.
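
The four advanced-search fields above map onto plain query operators. The
sketch below composes them in Python. The function name ``build_query`` and
its parameters are hypothetical, and the parentheses around the "OR" group are
an assumption added to keep the group unambiguous; how the search engine
treats the combined string is not verified here.

```python
def build_query(all_words=(), any_words=(), exact_phrase="", none_words=()):
    """Combine the four advanced-search fields into one query string."""
    parts = []
    # "All of these words": a "+" before each word.
    parts.extend("+" + word for word in all_words)
    # "At least one of these words": "OR" between the words.
    # The surrounding parentheses are an assumption, not from the notes.
    if any_words:
        parts.append("(" + " OR ".join(any_words) + ")")
    # "Exact phrase": quotes around the set of words.
    if exact_phrase:
        parts.append('"' + exact_phrase + '"')
    # "None of these words": a "-" before each word.
    parts.extend("-" + word for word in none_words)
    return " ".join(parts)


print(build_query(all_words=["python", "regex"],
                  any_words=["tutorial", "howto"],
                  exact_phrase="named group",
                  none_words=["perl"]))
```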

site:
-----

Finds all documents within a particular domain and all of its subdomains.

To exclude DOMAIN from search::

  -site:DOMAIN

hostname:
---------

Finds all documents from a particular host only.

link:
-----

Finds documents that link to a particular URL.

url:
----

Finds a specific document in the Yahoo index.

inurl:
------

Finds a specific keyword as part of indexed URLs.

intitle:
--------

Finds a specific keyword as part of the indexed titles.

Back links.
-----------

To find pages that link to any page within DOMAIN::

  linkdomain:DOMAIN

Bing search query syntax.
=========================

  http://onlinehelp.microsoft.com/en-WW/bing/ff808535.aspx
                Bing Help
  http://onlinehelp.microsoft.com/en-us/bing/ff808438.aspx
                Advanced search options
  http://onlinehelp.microsoft.com/en-us/bing/ff524480.aspx
                Search effectively
  http://onlinehelp.microsoft.com/en-us/bing/ff808421.aspx
                Advanced search keywords

"+"
---

Finds webpages that contain all the terms that are preceded by the + symbol.
Also allows you to include terms that are usually ignored.

" "
---

Finds the exact words in a phrase.

"()"
----

Finds or excludes webpages that contain a group of words.

AND or "&".
-----------

Finds webpages that contain all the terms or phrases.

NOT or "-".
-----------

Excludes webpages that contain a term or phrase.

OR or "|".
----------

Finds webpages that contain either of the terms or phrases.

contains:
---------

Keeps results focused on sites that have links to the file types that you
specify::

  contains:wma

filetype:
---------

Returns only webpages created in the file type that you specify::

  filetype:pdf

inanchor: or inbody: or intitle:
--------------------------------

These keywords return webpages that contain the specified term in the metadata,
such as the anchor, body, or title of the site, respectively. Specify only one
term per keyword. You can string multiple keyword entries as needed.

ip:
---

Finds sites that are hosted by a specific IP address. The IP address must be a
dotted quad address. Type the ip: keyword, followed by the IP address of the
website.
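
Since ip: accepts only a dotted quad, it can help to validate the address
before building the query. A minimal Python sketch, using the standard
``ipaddress`` module; the helper name ``ip_query`` is hypothetical and
``192.0.2.1`` is a reserved documentation address, not a real site.

```python
import ipaddress


def ip_query(address):
    """Return an 'ip:' query, rejecting anything that is not a dotted quad."""
    # IPv4Address raises AddressValueError (a ValueError subclass) for
    # strings that are not four dot-separated decimal octets.
    parsed = ipaddress.IPv4Address(address)
    return "ip:%s" % parsed


print(ip_query("192.0.2.1"))
```

Passing a host name such as ``example.com`` raises ``ValueError`` instead of
producing a query the search engine would not understand.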

language:
---------

Returns webpages for a specific language. Specify the language code directly
after the language: keyword. You can also access this function using the Search
Builder Language function. For more information about using Search Builder, see
"Use advanced search".

loc: or location:
-----------------

Returns webpages from a specific country or region. Specify the country or
region code directly after the loc: keyword. To focus on two or more countries
or regions, use a logical OR to group them::

  WORD1 WORD2 (loc:US OR loc:GB)
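
The grouping in the example above can be generated mechanically. The Python
sketch below joins several ``keyword:value`` terms with OR inside parentheses;
the helper names ``any_of`` and ``with_locations`` are hypothetical, and only
the query text is produced, nothing is sent to a search engine.

```python
def any_of(keyword, values):
    """Group several 'keyword:value' terms with OR inside parentheses."""
    terms = ["%s:%s" % (keyword, value) for value in values]
    if not terms:
        raise ValueError("at least one value is required")
    if len(terms) == 1:
        # A single term needs no grouping.
        return terms[0]
    return "(" + " OR ".join(terms) + ")"


def with_locations(words, codes):
    """Append a location group to plain search words."""
    return " ".join(list(words) + [any_of("loc", codes)])


print(with_locations(["WORD1", "WORD2"], ["US", "GB"]))
```

The same ``any_of`` helper applies to other keywords that accept an OR group,
for example ``any_of("site", [...])``.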

prefer:
-------

Adds emphasis to a search term or another operator to help focus the search
results.

site:
-----

Returns webpages that belong to the specified site. To focus on two or more
domains, use a logical OR to group the domains. You can use site: to search for
web domains, top-level domains, and directories that are not more than two
levels deep. You can also search for webpages that contain a specific search
word on a site.

feed:
-----

Finds RSS or Atom feeds on a website for the terms you search for.

hasfeed:
--------

Finds webpages that contain an RSS or Atom feed on a website for the terms you
search for::

  site:www.nytimes.com hasfeed:football

url:
----

Checks whether the listed domain or web address is in the Bing index.

Statistics.
===========

  http://marketshare.hitslink.com/
                Market Share for Mobile and Desktop. Browsers, Operating
                Systems, Search Engines and Social Media Marketing