web-site.rst
author Oleksandr Gavenko <gavenkoa@gmail.com>
Mon, 22 Feb 2016 12:46:36 +0200
changeset 1905 fba288d59662
parent 1334 9bf0d5a1f0cf
child 1912 8b81a8f0f692
permissions -rw-r--r--
Include only local subsections into TOC. This prevent duplication of TOC when build single page HTML document. Also this make unnecessary CSS hack to hide document title as top level section.
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
1334
9bf0d5a1f0cf Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1331
diff changeset
     1
.. -*- coding: utf-8; -*-
9bf0d5a1f0cf Include common header with quick links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1331
diff changeset
     2
.. include:: HEADER.rst
1325
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     3
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     4
===========
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     5
 Web site.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     6
===========
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     7
.. contents::
1905
fba288d59662 Include only local subsections into TOC. This prevent duplication of
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1334
diff changeset
     8
   :local:
1325
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
     9
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    10
Speeding up web site loading.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    11
=============================
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    12
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    13
  http://developer.yahoo.com/performance/rules.html
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    14
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    15
robots.txt.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    16
===========
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    17
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    18
To exclude all robots from the entire server::
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    19
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    20
  User-agent: *
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    21
  Disallow: /
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    22
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    23
To exclude all robots from part of the server::
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    24
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    25
  User-agent: *
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    26
  Disallow: /cgi-bin/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    27
  Disallow: /tmp/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    28
  Disallow: /junk/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    29
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    30
To allow a single robot::
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    31
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    32
  User-agent: Google
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    33
  Disallow:
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    34
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    35
  User-agent: *
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    36
  Disallow: /
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    37
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    38
To allow all robots complete access::
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    39
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    40
  User-agent: *
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    41
  Disallow:
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    42
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    43
See:
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    44
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    45
  http://www.robotstxt.org/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    46
                home page
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    47
  http://www.robotstxt.org/robotstxt.html
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    48
                About /robots.txt
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    49
  http://www.robotstxt.org/faq.html
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    50
                Frequently Asked Questions
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    51
  http://googlewebmastercentral.blogspot.com/2008/06/improving-on-robots-exclusion-protocol.html
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    52
                Improving on Robots Exclusion Protocol
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    53
1331
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    54
Sitemap.
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    55
========
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    56
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    57
Sitemaps protocol allows a webmaster to inform search engines about URLs on a
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    58
website that are available for crawling.
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    59
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    60
  http://www.sitemaps.org/protocol.html
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    61
                Sitemap protocol.
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    62
  http://en.wikipedia.org/wiki/Sitemaps
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    63
                Wikipedia article.
7d93a4940822 Sitemap.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents: 1325
diff changeset
    64
1325
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    65
Web document structure useage.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    66
==============================
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    67
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    68
  http://dev.opera.com/articles/view/mama/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    69
                Metadata Analysis and Mining Application
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    70
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    71
Validation.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    72
===========
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    73
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    74
  http://validator.w3.org/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    75
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    76
Add search to your site.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    77
========================
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    78
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    79
  http://www.google.com/support/customsearch/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    80
                Custom Search Help
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    81
  http://help.yahoo.com/l/uk/yahoo/search/basics/basics-13.html
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    82
                Can I add a Yahoo! Search box to my site?
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    83
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    84
Check websites for broken links.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    85
================================
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    86
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    87
  http://linkchecker.sourceforge.net/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    88
                linkchecker home page.
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    89
  http://arthurdejong.org/webcheck/
ea51f96a6a47 Check websites for broken links.
Oleksandr Gavenko <gavenkoa@gmail.com>
parents:
diff changeset
    90
                webcheck home page.