SEO tips. robots.txt. rel nofollow attribute in link.
authorOleksandr Gavenko <gavenkoa@gmail.com>
Sat, 06 Aug 2016 23:53:20 +0300
changeset 2007 ebd98c98304b
parent 2006 ddd4e1ddd8b8
child 2008 5919a8903b3f
SEO tips. robots.txt. rel nofollow attribute in link.
web-seo.rst
--- a/web-seo.rst	Sat Aug 06 00:07:09 2016 +0300
+++ b/web-seo.rst	Sat Aug 06 23:53:20 2016 +0300
@@ -6,17 +6,136 @@
 .. contents::
    :local:
 
-Informing search engine about site structure.
-=============================================
+General search engines
+======================
+
+Major search engines are:
+
+* https://google.com from Google Co.
+* https://bing.com from Microsoft Co.
+* https://yahoo.com or https://search.yahoo.com/ from Yahoo Co and it uses Bing
+  engine.
+
+Major local search engine are:
+
+* https://yandex.com from Yandex Co that have large market on Russia, Ukraine, Belarus.
+
+SEO tips
+========
+
+* Use pain text rather than images, CSS, JavaScript, Flash, SilverLite or other
+  technologies for important content.
+
+* Make sure your site work without with CSS, JavaScript, Flash, Java, etc
+  disabled. Check your site with screen reader or text browser (lynx, w3m).
+
+* Avoid HTML frames and iframes.
+
+* Market your site so other sites link to yours.
+
+* Tend to focus on single topic per page.
+
+* Avoid cryptic URLs. Craft human attractive URLs as URL words directly can be
+  indexed. Use dashes instead of spaces.
+
+* Shutdown mirrors or use ``link rel="canonical"`` tag to point to page on main
+  site.
+
+https://support.google.com/webmasters/answer/139066
+  Use canonical URLs
+https://en.wikipedia.org/wiki/Canonical_link_element
+  Wiki article.
+
+Informing search engine about site structure
+============================================
 ::
 
   http://www.example.org/sitemap.xml
   http://www.example.org/robots.txt
 
- * http://en.wikipedia.org/wiki/Sitemaps
- * http://www.sitemaps.org/
- * http://en.wikipedia.org/wiki/Robots.txt
- * http://www.robotstxt.org/
+robots.txt
+==========
+
+Start line with ``#`` to commet it out.
+
+Disable all robots (that respect ``robots.txt``)::
+
+  User-agent: *
+  Disallow: /
+
+To allow all robots complete access::
+
+  User-agent: *
+  Disallow:
+
+To allow a single robot::
+
+  User-agent: Google
+  Disallow:
+
+  User-agent: *
+  Disallow: /
+
+All valuable search engines support ``Allow`` and ``Sitemap`` tags::
+
+  User-agent: *
+  Allow: /
+  Sitemap: http://example.com/sitemap1.xml
+  Sitemap: http://example.com/my_sitemap2.xml
+
+The longer path the higher priority of ``Allow`` and ``Disallow`` tags.
+
+
+http://www.robotstxt.org/orig.html
+  Original specification. Only ``User-agent`` and ``Disallow`` tags supported.
+http://robotstxt.org.ru/
+  Russian followup to http://www.robotstxt.org
+https://developers.google.com/webmasters/control-crawl-index/docs/robots_txt
+  Google extension specification.
+https://yandex.ru/support/webmaster/controlling-robot/robots-txt.xml
+  Yandex extension specification.
+http://en.wikipedia.org/wiki/Robots.txt
+  Wiki article.
+
+meta robots
+===========
+
+Instruction to not index current page::
+
+  <meta name="robots" content="noindex">
+
+Instruction to not follow links on current page::
+
+  <meta name="robots" content="nofollow">
+
+Pair combinations::
+
+  <meta name="robots" content="noindex, nofollow">
+  <meta name="robots" content="noindex, follow">
+  <meta name="robots" content="index, nofollow">
+  <meta name="robots" content="index, follow">
+
+rel nofollow attribute in link
+==============================
+
+To prevent spam on forums and blogs or to including commertial links::
+
+  <a href="http://spam.com/sale/20-off" rel="nofollow">You win 100$!!</a>
+
+https://support.google.com/webmasters/answer/96569
+  Use rel="nofollow" for specific links.
+https://support.google.com/webmasters/answer/81749
+  Ways to Prevent Comment Spam.
+https://en.wikipedia.org/wiki/Nofollow
+  Wiki article.
+https://codex.wordpress.org/Nofollow
+  Wordpress suggestions.
+
+sitemap
+=======
+
+* http://en.wikipedia.org/wiki/Sitemaps
+* http://www.sitemaps.org/
 
 Alexa.
 ======