<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.2.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Sebastian's Pamphlets &#187; Recommendations</title>
	<link>http://sebastians-pamphlets.com</link>
	<description>If you've read my articles somewhere on the Internet, expect something different here.</description>
	<pubDate>Wed, 11 Aug 2010 18:57:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.3</generator>
	<language>en</language>
			<item>
		<title>Validate your robots.txt - Googlebot becomes smarter</title>
		<link>http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/</link>
		<comments>http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/#comments</comments>
		<pubDate>Tue, 20 Nov 2007 18:58:18 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Web development]]></category>

		<category><![CDATA[Testing]]></category>

		<category><![CDATA[Recommendations]]></category>

		<category><![CDATA[Crawler Directives]]></category>

		<category><![CDATA[Webmaster Central]]></category>

		<category><![CDATA[SEO]]></category>

		<category><![CDATA[robots.txt]]></category>

		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/</guid>
		<description><![CDATA[
Last week I reported that Google experiments with new crawler directives for use in robots.txt. Today Google has confirmed that Googlebot understands experimental REP syntax like Noindex:.
That means that forgotten &#8211;and, until recently, ignored&#8211; statements in your robots.txt might change the crawler&#8217;s behavior all of a sudden, without notice. I don&#8217;t know for sure which [...]]]></description>
			<content:encoded><![CDATA[
<p><img src="http://sebastians-pamphlets.com/img/google/validate-your-robots-txt.png" width="250" height="280" style="margin-left:4px;" align="right" alt="Validate your robots.txt!" title="Validate your robots.txt!"  />Last week I reported that <a href="http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/">Google experiments with new crawler directives for use in robots.txt</a>. Today <a href="http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-710">Google has confirmed</a> that Googlebot understands experimental REP syntax like <code>Noindex:</code>.</p>
<p>That means that forgotten &#8211;and, until recently, ignored&#8211; statements in your robots.txt might change the crawler&#8217;s behavior all of a sudden, without notice. I don&#8217;t know for sure which experimental crawler directives Google has implemented, but for example a line like<br />
<code><b>Noindex: /</b></code><br />
in your robots.txt will now deindex your complete Web site. </p>
<p>&#8220;Noindex:&#8221; is not defined in the Robots Exclusion Protocol from 1994, and not mentioned in <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40360">Google&#8217;s official documents</a>.</p>
<p><a href="http://johnmu.com/">John Müller</a> from Google Zürich states:</p>
<blockquote><p>At the moment we will usually accept the &#8220;noindex&#8221; directive in the robots.txt, but we are not yet at a point where we are willing to set it into stone and announce full support.</p>
<p>[&#8230;] <b>I just want to remind everyone again that this is something that may still change over time. Be careful when playing with things like this.</b></p>
</blockquote>
<p><b>My understanding of &#8220;be careful&#8221; is:</b></p>
<ul>
<li>Create a separate section for Googlebot. Do not rely on directives addressing all Web robots. Especially when you&#8217;ve a Googlebot section already, Google&#8217;s crawler will ignore directives set under &#8220;all user agents&#8221; and process only the Googlebot section. Repeat all statements under <code>User-agent: *</code> in <code>User-agent: Googlebot</code> to make sure that Googlebot obeys them.</li>
<li><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40364">RTFM</a></li>
<li>Do not use other crawler directives than<br />
Disallow:<br />
Allow:<br />
Sitemap:<br />
in the Googlebot section.</li>
<li>Don&#8217;t mess-up <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40364">pattern matching</a>.<br />
<code>*</code> matches a sequence of characters<br />
<code>$</code> specifies the end of the URL<br />
<code>?</code> separates the path from the query string, you can&#8217;t use it as wildcard!</li>
<li>Validate your robots.txt with the cool <a href="https://www.google.com/webmasters/tools/robots?siteUrl=" rel="nofollow incomplete-uri">robots.txt analyzer</a> in your <a href="https://www.google.com/webmasters/tools/" rel="nofollow incomplete-uri">Google Webmaster Console</a>.</li>
</ul>
<p>Folks put the funniest stuff into their robots.txt, for example images or crawl delays like &#8220;Don&#8217;t crawl this site during our office hours&#8221;. Crawler directives from robots meta tags aren&#8217;t very popular, but they appear in many robots.txt files. Hence it makes sound sense to use what people express, regardless the syntax errors. </p>
<p>Also, having the opportunity to manage page specific crawler directives like &#8220;noindex&#8221;, &#8220;nofollow&#8221;, &#8220;noarchive&#8221; and perhaps even &#8220;nopreview&#8221; on site level is a huge time saver, and eliminates many points of failure. Kudos to Google for this initiative, I hope it will make it into the standards.</p>
<p>I&#8217;ll test the experimental robots.txt directives and post the results. Perhaps I&#8217;ll set up a live test like <a href="http://sebastians-pamphlets.com/noindex/">this one</a>. </p>
<p>Take care.</p>
<hr width="126" color="silver" /></p>
<p><b id="live-robots-txt-test">Update:</b> Here is the live test of suspected respectively <a href="http://sebastians-pamphlets.com/in-need-of-a-web-robot-directives-standard/">desired</a> new crawler directives for robots.txt. I&#8217;ve added a few unusual statements to my robots.txt and uploaded scripts to monitor search engine crawling. The test pages provide links to search queries so you can check whether Google indexed them or not. </p>
<p>Please don&#8217;t link to the crawler traps, I&#8217;ll update <a href="http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/#live-robots-txt-test" title="Canonical link">this post</a> with my findings. Of course I appreciate links, so here is the canonical URL:<br />
<small>http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/#live-robots-txt-test</small></p>
<p><b>Please note that you should not make use of the crawler directives below on production systems!</b> Bear in mind that you can achive all that with simple <a href="http://sebastians-pamphlets.com/links/categories/?cat=x-robots-tag">X-Robots-Tags in the HTTP headers</a>. That&#8217;s a bullet-proof way to apply robots meta tags to files without touching them, and it works with virtual URIs too. X-Robots-Tags are sexy, but many site owners can&#8217;t handle them due to various reasons, whereas corresponding robots.txt syntax would be usable for everybody (not suffering from restrictive and/or free hosts).</p>
<h3>Noindex:</h3>
<p><b>robots.txt:</b><code><br />
<b>Noindex: /repstuff/noindex.php</b></code><br />
<b>Expected behavior:</b><br />
No crawling/indexing. It <a href="http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/">seems</a> Google interprets &#8220;Nofollow:&#8221; as &#8220;Disallow:&#8221;.<br />
<b>Desired behavior:</b><br />
&#8220;Follow:&#8221; is the REP&#8217;s default, hence Google should fetch everything and follow the outgoing links, but shouldn&#8217;t deliver Noindex&#8217;ed contents on the SERPs, not even as URL-only listings.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
http://sebastians-pamphlets.com/repstuff/noindex.php 	Blocked by line 30: Noindex: /repstuff/noindex.php<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/noindex.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled (possibly caused by an outdated robots.txt cache).<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from noindex.php.<br />
2007-11-23: indexed and cached a page linked only from noindex.php.<br />
(If an outdated robots.txt cache falsely allowed crawling, the search result(s) should disappear shortly after the next crawl.)<br />
2007-11-26: deindexed, the same goes for the linked page (without recrawling).<br />
2007-12-07: appeared under &#8220;URLs restricted by robots.txt&#8221; in GWC.<br />
2007-12-17: I consider this case closed. <b><em>Noindex:</em> blocks crawling, deindexes previously indexed pages, and is suspected to block incoming PageRank.</b></p>
<h3>Nofollow:</h3>
<p><b>robots.txt:</b><code><br />
<b>Nofollow: /repstuff/nofollow.php</b></code><br />
<b>Expected behavior:</b><br />
Crawling, indexing, and following the links as if there&#8217;s no &#8220;Nofollow:&#8221;.<br />
<b>Desired behavior:</b><br />
Crawling, indexing, and ignoring outgoing links.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
Line 31:  Nofollow: /repstuff/nofollow.php 	Syntax not understood<br />
http://sebastians-pamphlets.com/repstuff/nofollow.php 	Allowed<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/nofollow.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled.<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from nofollow.php (21 Nov 2007 23:19:37 GMT, for some reason not logged properly).<br />
2007-11-23: indexed and cached a page linked only from nofollow.php.<br />
2007-11-26: recrawled, deindexed, no longer cached. The same goes for the linked page.<br />
2007-11-28: cached again, the timestamp on the cached copy &#8220;27 Nov 2007 01:11:12 GMT&#8221; doesn&#8217;t match the last crawl on &#8220;2007-11-26 16:47:11 EST&#8221; (EST = GMT-5).<br />
2007-12-07: recrawled, still deindexed, cached. Linked page recrawled, cached.<br />
2007-12-17: recrawled, still deindexed (probably caused by near duplicate content on noarchive.php and other pages involved in this test), cached copy dated 2007-12-07. Cache of the linked page still dated 2007-11-21. I consider this case closed. <b><em>Nofollow:</em> doesn&#8217;t work as expected, Google doesn&#8217;t support this statement.</b></p>
<h3>Noarchive:</h3>
<p><b>robots.txt:</b><code><br />
<b>Noarchive: /repstuff/noarchive.php</b></code><br />
<b>Expected behavior:</b><br />
Crawling, indexing, following links, but no &#8220;Cached&#8221; links on the SERPs and no access to cached copies from the toolbar.<br />
<b>Desired behavior:</b><br />
Crawling, indexing, following links, but no &#8220;Cached&#8221; links on the SERPs and no access to cached copies from the toolbar.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
http://sebastians-pamphlets.com/repstuff/noarchive.php 	Allowed<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/noarchive.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled.<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from noarchive.php.<br />
2007-11-23: indexed and cached a page linked only from noarchive.php.<br />
2007-11-26: recrawled, deindexed, no longer cached. The linked page was deindexed without recrawling.<br />
2007-11-28: cached again, the timestamp on the cached copy &#8220;27 Nov 2007 01:11:19 GMT&#8221; doesn&#8217;t match the last crawl on &#8220;2007-11-26 16:47:18 EST&#8221; (EST = GMT-5).<br />
2007-11-29: recrawled, cache not yet updated.<br />
2007-12-07: recrawled. Linked page recrawled.<br />
2007-12-08: recrawled.<br />
2007-12-11: recrawled the linked page, which is cached but not indexed.<br />
2007-12-12: recrawled.<br />
2007-12-17: still indexed, cached copy dated 2007-12-08. I consider this case closed. <b><em>Noarchive:</em> doesn&#8217;t work as expected, actually it does nothing although according to the robots.txt validator that&#8217;s supported &#8211;or at least known and accepted&#8211; syntax.</b></p>
<p>(It looks like Google understands <code><b>Nosnippet:</b></code> too, but I didn&#8217;t test that.)</p>
<h3>Nopreview:</h3>
<p><b>robots.txt:</b><code><br />
<b>Nopreview: /repstuff/nopreview.pdf</b></code><br />
<b>Expected behavior:</b><br />
None, unfortunately.<br />
<b>Desired behavior:</b><br />
No &#8220;view as HTML&#8221; links on the SERPs. Neither &#8220;nosnippet&#8221; nor &#8220;noarchive&#8221; suppress these helpful preview links, which can be pretty annoying in some cases. See <a href="http://sebastians-pamphlets.com/nopreview-the-missing-x-robots-tag/">NOPREVIEW: The missing X-Robots-Tag</a>.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
Line 33:  Nopreview: /repstuff/nopreview.pdf 	Syntax not understood<br />
http://sebastians-pamphlets.com/repstuff/nopreview.pdf 	Allowed<br />
<b>Status:</b><br />
Crawler requests of <a href="http://sebastians-pamphlets.com/repstuff/nopreview.pdf">nopreview.pdf</a> are <a href="http://sebastians-pamphlets.com/repstuff/nopreview.php">logged here</a>.<br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled the nopreview-pdf and the log page nopreview.php.<br />
2007-11-23: indexed and cached the log file nopreview.php.<br />
[2007-11-23: I replaced the PDF document with a version carrying a hidden link to an HTML file, and resubmitted it via Google&#8217;s add-url page and a sitemap.]<br />
2007-11-26: The old version of the PDF is cached as a &#8220;view-as-HTML&#8221; version without links (considering the PDF was a captured print job, that&#8217;s a pretty decent result), and appears on SERPs for a quoted search. The page linked from the PDF and the new PDF document were not yet crawled.<br />
2007-12-02: PDF recrawled. Googlebot followed the hidden link in the PDF and crawled the linked page.<br />
2007-12-03: &#8220;View as HTML&#8221; preview not yet updated, the linked page not yet indexed.<br />
2007-12-04: PDF recrawled. The preview link reflects the content crawled on 12/02/2007. The page linked from the PDF is not yet indexed.<br />
2007-12-07: PDF recrawled. Linked page recrawled.<br />
2007-12-09: PDF recrawled.<br />
2007-12-10: recrawled linked page.<br />
2007-12-14: PDF recrawled. Cached copy of the linked page dated 2007-12-11.<br />
2007-12-17: I consider this case closed. <b>Neither <em>Nopreview:</em> nor <em>Noarchive:</em> (in robots.txt since 2007-12-04) are suitable to suppress the HTML preview of PDF files.</b></p>
<h3>Noindex: Nofollow:</h3>
<p><b>robots.txt:</b><code><br />
<b>Noindex: /repstuff/noindex-nofollow.php<br />
Nofollow: /repstuff/noindex-nofollow.php</b></code><br />
<b>Expected behavior:</b><br />
No crawling/indexing, invisible on SERPs.<br />
<b>Desired behavior:</b><br />
No crawling/indexing, and no URL-only listings, ODP titles/descriptions and stuff like that on the SERPs. &#8220;Noindex:&#8221; in combination with &#8220;Nofollow:&#8221; is a paraphrased &#8220;Disallow:&#8221;.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
http://sebastians-pamphlets.com/repstuff/noindex-nofollow.php 	Blocked by line 35: Noindex: /repstuff/noindex-nofollow.php<br />
Line 36:  Nofollow: /repstuff/noindex-nofollow.php 	Syntax not understood<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/noindex-nofollow.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled.<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from noindex-nofollow.php.<br />
2007-11-23: indexed and cached a page linked only from noindex-nofollow.php.<br />
2007-11-26: deindexed without recrawling, the same goes for the linked page.<br />
2007-11-29: the cached copy retrieved on 11/21 reappeared.<br />
2007-12-08: appeared under &#8220;URL restricted by robots.txt&#8221; in my GWC acct.<br />
2007-12-17: <b>Case closed, see <em>Noindex:</em> above.</b></p>
<h3>Noindex: Follow:</h3>
<p><b>robots.txt:</b><code><br />
<b>Noindex: /repstuff/noindex-follow.php<br />
Follow: /repstuff/noindex-follow.php</b></code><br />
<b>Expected behavior:</b><br />
No crawling/indexing, hence unfollowed links.<br />
<b>Desired behavior:</b><br />
Crawling, following and indexing outgoing links, but no SERP listings.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
http://sebastians-pamphlets.com/repstuff/noindex-follow.php 	Blocked by line 38: Noindex: /repstuff/noindex-follow.php<br />
Line 39:  Follow: /repstuff/noindex-follow.php 	Syntax not understood<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/noindex-follow.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled.<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from noindex-follow.php.<br />
2007-11-23: indexed and cached a page linked only from noindex-follow.php.<br />
2007-11-26: deindexed without recrawling, the same goes for the linked page.<br />
2007-12-08: appeared under &#8220;URL restricted by robots.txt&#8221; in my GWC acct.<br />
2007-12-17: Case closed, see <em>Noindex:</em> above. <b>Google didn&#8217;t crawl respectively deindexed despite the <em>Follow:</em> directive.</b></p>
<h3>Index: Nofollow:</h3>
<p><b>robots.txt:</b><code><br />
<b>Index: /repstuff/index-nofollow.php<br />
Nofollow: /repstuff/index-nofollow.php</b></code><br />
<b>Expected behavior:</b><br />
Crawling/indexing, following links.<br />
<b>Desired behavior:</b><br />
Crawling/indexing but ignoring outgoing links.<br />
<b>Google&#8217;s robots.txt validator:</b><br />
Line 41:  Index: /repstuff/index-nofollow.php 	Syntax not understood<br />
Line 42:  Nofollow: /repstuff/index-nofollow.php 	Syntax not understood<br />
http://sebastians-pamphlets.com/repstuff/index-nofollow.php 	Allowed<br />
<b>Status:</b><br />
<a href="http://sebastians-pamphlets.com/repstuff/index-nofollow.php">See test page</a><br />
<b>Google&#8217;s crawler / indexer:</b><br />
2007-11-21: crawled.<br />
2007-11-23: indexed and cached.<br />
2007-11-21: crawled a page linked only from from index-nofollow.php.<br />
2007-11-23: indexed and cached a page linked only from from index-nofollow.php.<br />
2007-11-26: recrawled and deindexed. The linked page was deindexed witout recrawling.<br />
2007-11-28: cached again, the timestamp on the cached copy &#8220;27 Nov 2007 01:11:26 GMT&#8221; doesn&#8217;t match the last crawl on &#8220;2007-11-26 16:47:25 EST&#8221; (EST = GMT-5).<br />
2007-12-02: recrawled, the cached copy has vanished.<br />
2007-12-07: recrawled. Linked page recrawled.<br />
2007-12-08: recrawled.<br />
2007-12-09: recrawled.<br />
2007-12-10: recrawled.<br />
2007-12-17: cached under 2007-12-10, not indexed. Linked page not cached, not indexed. I consider this case closed. <b>Google currently doesn&#8217;t support <em>Index:</em> nor <em>Nofollow:</em>.</b></p>
<p>(I didn&#8217;t test <code><b>Noodp:</b></code> and <code><b>Unavaliable_after [RFC 850 formatted timestamp]:</b></code>, although both directives would make sense in robots.txt too.)</p>
<p><b>2007-11-20:</b><br />
Added the experimental statements to robots.txt.</p>
<p><b>2007-11-21:</b><br />
Linked the test pages. Google crawled all of them, including the pages submitted via links on test pages.</p>
<p><b>2007-11-23:</b><br />
Most (all but the PDF document) URLs appear on search result pages. If an outdated robots.txt cache falsely allowed crawling although the WC-validator said &#8220;Blocked&#8221;, the search results should disappear shortly after the next crawl. I&#8217;ve created a sitemap for all URLs above and submitted it. Although I&#8217;ve &#8211;for the sake of this experiment&#8211; cloaked text as well as links and put white links on white background, luckily there is no &#8220;we caught you black hat spammer&#8221; message in my Webmaster Console. Googlebot nicely followed the cloaked links and indexed everything.</p>
<p><b>2007-11-26:</b><br />
Google recrawled a few pages (noarchive.php, index-nofollow.php and nofollow.php), then deindexed all of them. Only the PDF document is indexed, and Google created a &#8220;view-as-HTML&#8221; preview from this captured print job. It seems that Google crawled something from another host than &#8220;*.googlebot.com&#8221;, unfortunately I didn&#8217;t log all requests. Probably the deindexing was done by a sneaky bot discovering the simple cloaking. Since the linked URLs are out and 3rd party links to them can&#8217;t ruin the experiment any longer, I&#8217;ve stopped cloaking and show the same text/links to bots and users (actually, users see one more link but that should be fine with Google). There&#8217;s still no &#8220;thou shalt not cloak&#8221; message in my GWC account. Well, those pages are fairly new, perhaps not fully settled in the search index, so lets see what happens next.</p>
<p><b id="robots-txt-test-results-2007-11-28">2007-11-28</b><br />
The <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=NOPREVIEW%3A+The+missing+X-Robots-Tag+filetype%3Apdf" rel="nofollow not-indexable">PDF file</a> as well as the three pages recrawled on 11/26/2007 21:45:00 GMT were <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=%22Below+is+a+list+of+search+engine+crawlers+from+Google%2C+Yahoo%2C+MSN+and+Ask+which+requested+it.%22&#038;filter=0" rel="nofollow not-indexable">reindexed</a>, but the timestamp on the cached copies says &#8220;retrieved on 27 Nov 2007 01:15:00 GMT&#8221;. Maybe the date/time displayed on cached page copies doesn&#8217;t reflect Ms. Googlebot&#8217;s &#8220;fetched&#8221; timestamp, but the time the indexer pulled the page out of the centralized crawl results cache 3.5 hours after crawling. </p>
<p>It seems the &#8220;Noarchive:&#8221; directive doesn&#8217;t work, because noarchive.php was crawled and indexed twice providing a cached page copy. My &#8220;Nopreview:&#8221; creation isn&#8217;t supported either, but maybe Dan Crow&#8217;s team picks it up for a future update of their neat X-Robots-Tags (I hope so). </p>
<p><span id="just-in-case-rant"></span>The noindex&#8217;ed pages (noindex.php, noindex-nofollow.php and noindex-follow.php) weren&#8217;t recrawled and remain deindexed. Interestingly, they don&#8217;t appear under &#8220;URLs blocked by robots.txt&#8221; in my GWC account. Provided the first crawling and indexing on 11/21/2007 was a &#8220;mistake&#8221; caused by a way too long cached robots.txt, and the second crawl on 11/26/2007 obeyed the &#8220;Noindex:&#8221; but ignored the (implicit) &#8220;Follow:&#8221;, it seems that indeed Google interprets &#8220;Noindex:&#8221; in robots.txt as &#8220;Disallow:&#8221;. <b>If that is so and if it&#8217;s there to stay, they&#8217;re going to totally mess up the <a href="http://www.robotstxt.org/">REP</a>.</b> </p>
<p><b>&lt;rant&gt;</b> I mean, promoting a rel-nofollow microformat that &#8211;at least at launchtime&#8211; didn&#8217;t share its semantics with the REP&#8217;s meta tags nor the &#8211;later introduced&#8211; X-Robots-Tags was <strike>evil</strike> bad enough. Ok, meanwhile they&#8217;ve corrected this <a href="http://microformats.org/wiki/rel-nofollow#open_issues"><strike>conspiracy</strike> flaw</a> by altering the rel-nofollow semantics step by step until &#8220;nofollow&#8221; in the REL attribute actually means <em>nofollow</em>&nbsp; and no longer <em>pass no reputation</em>, at least at Google. Other engines still handle rel-nofollow according to the initial and officially still binding <a href="http://microformats.org/wiki/rel-nofollow">standard</a>, and a gazillion Webmasters are confused as hell. In other words only a few search geeks understand what rel-nofollow is all about, but Google jauntily penalizes the great unwashed for not complying to the incomprehensible. By the way, that&#8217;s <a href="http://sebastians-pamphlets.com/links/categories/?cat=nofollow">why I code <code>rel="nofollow crap"</code></a>. Standards should be clear and unambiguous. <b>&lt;/rant&gt;</b></p>
<p>If Google really would introduce a &#8220;Noindex:&#8221; directive in robots.txt that equals &#8220;Disallow:&#8221;, that would be totally evil. A few sites out there might have an erroneous &#8220;Noindex:&#8221; statement in their robots.txt that could mean &#8220;Disallow:&#8221;, and it&#8217;s nice that Google tries to do them a favor. Screwing the REP for the sole purpose of complying to syntax errors on the other hand makes no sense. &#8220;Noindex&#8221; means crawl it, follow its links, but don&#8217;t index it. Semantically &#8220;Noindex: Nofollow:&#8221; equals &#8220;Disallow:&#8221;, but a &#8220;Noindex:&#8221; alone implies a &#8220;Follow:&#8221;, hence crawling is not only allowed but required. </p>
<p>I really hope that we watch an experiment in its early stage, and that Google will do the right thing eventually. Allowing the REP&#8217;s page specific crawler directives in robots.txt is a fucking brilliant move, because technically challenged publishers can&#8217;t handle the HTTP header&#8217;s X-Robots-Tag, and applying those directives to groups of URIs is a great method to steer crawling and indexing not only with static sites. </p>
<p>Dear Google engineers, please consider the nopreview directive too, and implement (no)index, (no)follow, noarchive, nosnippet, noodp/noydir and unavailable_after with the REP&#8217;s meaning. And while you&#8217;re at it, I want <a href="http://sebastians-pamphlets.com/in-need-of-a-web-robot-directives-standard/">block level instructions in robots.txt</a> too. For example <code><br />
Area: /products/ DIV.hMenu,TD#bNav,SPAN.inherited "noindex,nofollow"</code><br />
could instruct crawlers to ignore duplicated properties in product descriptions and the horizontal menu as well as the navigation elements in a table cell with the DOM-ID &#8220;bNav&#8221; at the very bottom of all pages in /products/, <code><br />
Area: / A.advertising REL="nofollow"</code><br />
could condomize all links with the class name &#8220;advertising&#8221;, and so on.</p>
<p><b>2007-11-29</b><br />
The pages linked from the test pages still don&#8217;t come up in search results, noarchive.php was recrawled and remains cached, the cached copy of noindex-nofollow.php retrieved on 11/21/2007 reappeared (probably a DC roller coaster issue).</p>
<p><b>2007-11-30</b><br />
Three URLs remain indexed: nopreview.pdf, noarchive.php and noindex-nofollow.php. The cached copies show the content crawled on Nov/21/2007. Everything else is deindexed. That&#8217;s not to stay (index roller coaster).<br />
As a side note: the <a href="http://sebastians-pamphlets.com/noindex/">URL from my first noindex-robots.txt test</a> appeared in my GWC account under &#8220;URLs restricted by robots.txt (Nov/27/2007)&#8221;, three days after the unsuccessful crawl.</p>
<p><b>2007-12-02</b><br />
A few pages were recrawled, Googlebot followed the hidden link in the PDF file.</p>
<p><b>2007-12-03</b><br />
In my GWC crawl stats noindex-nofollow.php appeared under &#8220;URLs restricted by robots.txt&#8221;, but it&#8217;s still indexed. </p>
<p><b>2007-12-04</b><br />
The preview (cache) of nopreview.pdf was updated. Since obviously Nopreview: doesn&#8217;t work, I&#8217;ve added <code><b><br />
Noarchive: /repstuff/nopreview.pdf</b></code><br />
to my robots.txt. Lets see whether Google removes the cache respectively the HTML preview or not.</p>
<p><b>2007-12-06</b><br />
Shortly after the change in robots.txt (Noarchive: /repstuff/nopreview.pdf) Googlebot recrawled the PDF file on 12/04/2007. Today it&#8217;s still cached, the HTML preview is still available and linked from SERPs.</p>
<p><b>2007-12-07</b><br />
Googlebot has recrawled a few pages. Everything except noarchive.php and nopreview.pdf is deindexed.</p>
<p><b>2007-12-17</b><br />
I consider the test closed, but I&#8217;ll keep the test pages up so that you can monitor crawling and <a href="http://www.google.com/search?q=site:sebastians-pamphlets.com+inurl:%5C.php&#038;num=100&#038;hl=en&#038;safe=off&#038;filter=0">indexing</a> yourself. <b><em>Noindex:</em> is the only directive that somewhat works, but it&#8217;s implemented completely wrong and is not acceptable in its current shape.</b></p>
<p>Interestingly the sitemaps report in my GWC account says that 9 pages from 9 submitted URLs were indexed. Obviously &#8220;indexed&#8221; means something like &#8220;crawled at least once, perhaps indexed, maybe not, so if you want to know that definitively then get your lazy butt to check the SERPs yourself&#8221;. How expensive would it be to tell something like &#8220;Total URLs in sitemap: 9 | Indexed URLs in sitemap: 2&#8243;?</p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/", "style": "big", "title": "Validate your robots.txt - Googlebot becomes smarter" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The day the routers died</title>
		<link>http://sebastians-pamphlets.com/the-day-the-routers-died/</link>
		<comments>http://sebastians-pamphlets.com/the-day-the-routers-died/#comments</comments>
		<pubDate>Mon, 29 Oct 2007 11:08:10 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Recommendations]]></category>

		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/the-day-the-routers-died/</guid>
		<description><![CDATA[
Why the fuck do we dumb and clueless Internet marketers care about Google&#8217;s Toolbar PageRank when the Internet faces real issues? Well, both the toolbar slider as well as IPv4 are somewhat finite.
I can hear the IM crowd singing &#8220;The day green pixels died&#8221; &#8230; whilst Matt&#8217;s gang in building 43 intones &#8220;No mercy, smack [...]]]></description>
			<content:encoded><![CDATA[
<p>Why the fuck do we dumb and clueless Internet marketers care about <a href="http://sebastians-pamphlets.com/google-pagerank-deductions-october-2007/">Google&#8217;s Toolbar PageRank</a> when the Internet faces real issues? Well, both the toolbar slider as well as IPv4 are somewhat finite.</p>
<p>I can hear the <acronym title="Internet Marketing">IM</acronym> crowd singing &#8220;<a href="http://seo2.0.onreact.com/the-day-pagerank-died">The day green pixels died</a>&#8221; &#8230; whilst <a href="http://mattcutts.com/blog/">Matt&#8217;s</a> <a href="http://www.brianwhite.org/">gang</a> <a href="http://www.bladam.com/">in</a> <a href="http://valleywag.com/tech/google/25-things-to-see-at-the-googleplex-before-you-die-234103.php">building 43</a> intones &#8220;<a href="http://cre8pc.com/blog/archives/372">No mercy</a>, <a href="http://www.highrankings.com/advisor/paid-link-smack/">smack paid links</a>, <a href="http://sebastians-pamphlets.com/google-pagerank-deductions-october-2007/">no place to hide</a> <a href="http://sebastians-pamphlets.com/a-pragmatic-defense-against-googles-anti-paid-links-campaign/">for TLA links</a>&#8221; &#8230; <b>Enjoy this video, it&#8217;s friggin&#8217; hilarious</b>:
</p>
<p><object width="425" height="366">
<param name="movie" value="http://www.youtube.com/v/_y36fG2Oba0&#038;rel=1&#038;border=0"></param>
<param name="wmode" value="transparent"></param><embed src="http://www.youtube.com/v/_y36fG2Oba0&#038;rel=1&#038;border=0" type="application/x-shockwave-flash" wmode="transparent" width="425" height="366"></embed></object></p>
<p>&nbsp;
</p>
<p></p>
<p>Since <a href="http://www.secret-wg.org/">Gary Feldman</a>&#8217;s song &#8220;<b>The Day The Routers Died</b>&#8221; will become an evergreen soon, I thought you might be interested in a transcript:</p>
<p>
<blockquote>
A long long time ago<br />
I can still remember<br />
When my laptop could connect elsewhere.</p>
<p>And I tell you all there was a day<br />
The network card I threw away<br />
Had a purpose and it worked for you and me.</p>
<p>But 18 years completely wasted<br />
With each address we&#8217;ve aggregated<br />
The tables overflowing<br />
The traffic just stopped flowing.</p>
<p>And now we&#8217;re bearing all the scars<br />
And all my traceroutes showing stars<br />
The packets would travel faster in cars<br />
The day the routers died.</p>
<p><b>So bye bye, folks at RIPE:55<br />
Be persuaded to upgrade it or your network will die<br />
IPv6 makes me let out a sigh<br />
But I spose we&#8217;d better give it a try<br />
I suppose we’d better give it a try!</b></p>
<p>Now did you write an RFC<br />
That dictated how we all should be<br />
Did we listen like we should that day?</p>
<p>Now were you back at RIPE fifty-four<br />
Where we heard the same things months before<br />
And the people knew they’d have to change their ways.</p>
<p>And we knew that all the ISPs<br />
Could be future proof for centuries.</p>
<p>But that was then not now<br />
Spent too much time playing WoW.</p>
<p>Ooh there was time we sat on IRC<br />
Making jokes on how this day would be<br />
Now there&#8217;s no more use for TCP<br />
The day the routers died.</p>
<p><b>So bye bye, folks at RIPE:55<br />
Be persuaded to upgrade it or your network will die<br />
IPv6 just makes me let out a sigh<br />
But I spose we’d better give it a try<br />
I suppose we’d better give it a try!</b></p>
<p>I remember those old days I mourn<br />
Sitting in my room, downloading porn<br />
Yeah that’s how it used to be.</p>
<p>When the packets flowed from A to B<br />
Via routers that could talk IP<br />
There was data [that] could be exchanged between you and me.</p>
<p>Oh but I could see you all ignore<br />
The fact we’d fill up IPv4!</p>
<p>But we all lost the nerve<br />
And we got what we deserved!</p>
<p>And while we threw our network kit away<br />
And wished we&#8217;d heard the things they say<br />
Put all our lives in disarray<br />
The day the routers died.</p>
<p><b>So bye bye, folks at RIPE:55<br />
Be persuaded to upgrade it or your network will die<br />
IPv6 just makes me let out a sigh<br />
But I spose we&#8217;d better give it a try<br />
I suppose we’d better give it a try!</b></p>
<p>Saw a man with whom I used to peer<br />
Asked him to rescue my career<br />
He just sighed and turned away.</p>
<p>I went down to the &#8216;net cafe<br />
That I used to visit everyday<br />
But the man there said I might as well just leave.</p>
<p>[And] now we&#8217;ve all lost our purpose<br />
My cisco shares completely worthless<br />
No future meetings for me<br />
At the Hotel Krasnapolsky.</p>
<p>And the men that make us push and push<br />
Like Geoff Huston and Randy Bush<br />
Should&#8217;ve listened to what they told us<br />
The day the routers died.</p>
<p><b>So bye bye, folks at RIPE:55<br />
Be persuaded to upgrade it or your network will die<br />
IPv6 just makes me let out a sigh<br />
But I spose we’d better give it a try<br />
[I suppose we&#8217;d better give it a try!]</b></p></blockquote>
<p><small>Recorded at the <a href="http://www.ripe.net/ripe/meetings/ripe-55/"><b>RIPE:55</b> meeting</a> in Amsterdam (NL) at the Krasnapolsky Hotel between 22 and 26 October 2007.<br /></small></p>
<p>Just in case the video doesn&#8217;t load, here is <a href="http://www.youtube.com/watch?v=phSpBCdWq1U">another recording</a>.</p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/the-day-the-routers-died/", "style": "big", "title": "The day the routers died" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/the-day-the-routers-died/feed/</wfw:commentRss>
		</item>
		<item>
		<title>How to get the perfect logo for your blog</title>
		<link>http://sebastians-pamphlets.com/how-to-get-the-perfect-logo-for-your-blog/</link>
		<comments>http://sebastians-pamphlets.com/how-to-get-the-perfect-logo-for-your-blog/#comments</comments>
		<pubDate>Wed, 19 Sep 2007 14:20:23 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Blogging]]></category>

		<category><![CDATA[Recommendations]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/how-to-get-the-perfect-logo-for-your-blog/</guid>
		<description><![CDATA[
When I moved my blog from blogspot to this domain, using my avatar image (which I&#8217;ve dropped at tons of places) in the blog&#8217;s logo was a natural thing to do. It&#8217;s somewhat unique in my atmosphere, it helps folks to remember my first name, and branding with an image in a signal color like [...]]]></description>
			<content:encoded><![CDATA[
<p>When I moved my blog from blogspot to this domain, using my avatar image (which I&#8217;ve dropped at tons of places) in the blog&#8217;s logo was a natural thing to do. It&#8217;s somewhat unique in my atmosphere, it helps folks to remember my first name, and branding with an image in a signal color like red fitted my marketing instincts. </p>
<p><img src="http://sebastians-pamphlets.com/img/blog/sebastian-red-120x120.png" id="avatar" border="0" width="48" height="48" align="left" alt="My red crab avatar" title="My red crab avatar" style="margin-right:3px;" />Now the bad news. A few days later my 5yo daughter taught me that it was not exactly clever. She got a Disney video she wanted to watch with me, and (not so much to my surprise) my blog&#8217;s logo was one of the kingpins. Way back when I got the image from a freelance designer, I didn&#8217;t think about copyright issues, because I planned to use the image only as thumbnail respectively icon connecting my name to a rememberable picture (of the little mermaid&#8217;s crab Sebastian). The bigger version on top of all pages here however had way too much similarities with the Disney character. </p>
<p>Kinda dilemma. Reverting to a text logo was no option. Fortunately I click new home page links on all of my comments, so I remembered that not long ago a <a href="http://www.clangnuts.com/">blogging cartoonist</a> submitted a <a href="http://sebastians-pamphlets.com/free-hosted-blogger-flee-now/#comment-264">note</a> to one of my posts. I wrote an email telling him that I need a new red crab, he replied with a reasonable quote, thus I ordered the logo design. Long story short, I was impressed by his professional attitude, and now you can admire his drawing skills at my blog&#8217;s header and footer as well. </p>
<p>Before I bore you with the longish story of my red crab, think of your (blog&#8217;s) branding. <strong>Do you have a unique logo? Is it compelling and rememberable?</strong> If you put it on a page along with 100+ icons gathered from your usual hangouts, <strong>will its thumbnail stick out? Does it represent you, your business, your niche, or whatever you blog for?</strong> Do you brand yourself at all? Why not? Do it. </p>
<p>Look at a <a href="http://andybeard.eu/">few</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/andybeard-favicon.ico" style="margin-bottom:-2px;" /> <a href="http://www.cornwallseo.com/">very</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/cornwallseo-favicon.ico" style="margin-bottom:-2px;" /> <a href="http://www.jimboykin.com/">popular</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/jimboykin-favicon.ico" style="margin-bottom:-2px;" /> <a href="http://www.seobook.com/">marketing</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/seobook-favicon.ico" style="margin-bottom:-2px;" /> <a href="http://www.shoemoney.com/">blogs</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/shoemoney-favicon.ico" style="margin-bottom:-2px;" /> and <a href="http://www.fantomaster.com/fantomNews/">follow</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/fantomaster-favicon.ico" width="16" height="16" style="margin-bottom:-2px;" /> the <a href="http://www.wolf-howl.com/">authors</a><img src="http://sebastians-pamphlets.com/img/blog/favicos/graywolf-favicon.ico" style="margin-bottom:-2px;" /> to their <a href="http://sphinn.com/">hangouts</a>. You&#8217;ll spot that they make consistent use of their individual logos respectively avatars. That&#8217;s not coincidence, that&#8217;s <a href="http://www.cornwallseo.com/search/index.php/2007/09/01/avatar-branding-consultant/">professionalism</a>. For the records, you can become a rockstar without a logo. If you&#8217;re <a href="http://www.vanessafoxnude.com/">Vanessa Fox</a> or <a href="http://www.johnon.com/">John Andrews</a> you get away with frequently changing icons and even <acronym title="Not Safe For Work">NSFW</acronym> domain names. However, a conspicuous and witty logo makes branding easier, but <a href="http://sethgodin.typepad.com/seths_blog/2007/06/logos.html">a logo is not your brand</a>. </p>
<p>Become creative, but please don&#8217;t use a red Disney character known as Sebastian as avatar, or red crabs at all, because I&#8217;ve trademarked that. Ok, if you can imagine a cartoonized logo might fit your blog, then read on. </p>
<p><a href="http://sebastians-pamphlets.com/img/blog/red-crab-draft-1-rough.jpg"><img src="http://sebastians-pamphlets.com/img/blog/red-crab-draft-1-thumbnail.png" border="0" width="50" height="53" align="left" alt="Red crab - rough draft 1" title="Click to view the very first red crab (complete rough draft)" style="margin-right:3px;" /></a>As order confirmation Steven from <a href="http://www.clangnuts.com/">Clangnuts</a> sent me his first ideas asking whether he&#8217;s on the right track or not. Indeed he was, and I liked it. Actually, I liked it a lot. </p>
<p><a href="http://sebastians-pamphlets.com/img/blog/red-crab-draft-2-rough.jpg"><img src="http://sebastians-pamphlets.com/img/blog/red-crab-draft-2-thumbnail.png" border="0" width="110" height="69" align="left" alt="Red crab - rough draft 2" title="Click to view the next red crab (2nd draft)" style="margin-right:3px;" /></a>Shortly after my reply I got the next previews. Steven had nicely worked in my few wishes. He even showed me with an edited screen shot how the new red crab would look in my blog&#8217;s header in variations (manually colored vs. photoshop colored). It looked great. </p>
<p>Finally he sent me four poses to choose from. Bummer. I liked all of them:<br />
<img src="http://sebastians-pamphlets.com/img/blog/red-crab-4-versions.png" border="0" width="498" height="373" align="center" alt="My red crab - 4 versions" title="Do you agree that the first crab is the best choice?" /><br />
I picked the first, and got the colored version today. Thanks Steven, you did a great job! I hereby declare that when you need an outstanding logo for your blog you better contact <a href="mailto:clangnuts@gmail.com?subject=Sebastians_recommendation">Steven at Clangnuts dot com</a> before you fire up photoshop yourself.</p>
<p><b>What do you think, is #1 the best choice?</b> Feel free to vote in the comments! </p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/how-to-get-the-perfect-logo-for-your-blog/", "style": "big", "title": "How to get the perfect logo for your blog" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/how-to-get-the-perfect-logo-for-your-blog/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Share Your Sphinn Love!</title>
		<link>http://sebastians-pamphlets.com/share-your-sphinn-love/</link>
		<comments>http://sebastians-pamphlets.com/share-your-sphinn-love/#comments</comments>
		<pubDate>Wed, 19 Sep 2007 11:26:40 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Blogging]]></category>

		<category><![CDATA[Folks]]></category>

		<category><![CDATA[Ego Food]]></category>

		<category><![CDATA[Recommendations]]></category>

		<category><![CDATA[Social Web]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/share-your-sphinn-love/</guid>
		<description><![CDATA[
Donna started a meme with a very much appreciated compliment - Thanks Donna! Like me she discovered a lot of &#8220;new&#8221; folks at Sphinn and enjoyed their interesting blogs. 
Savoring Sphinn comes with a duty Donna thinks, so she appeals to share the love. She&#8217;s right. All of us benefit from Sphinn love, it&#8217;s only [...]]]></description>
			<content:encoded><![CDATA[
<p><img src="http://sebastians-pamphlets.com/img/posts/sphinn-rockstars.png" border="0" width="200" height="235" alt="Sphinn Rockstars" align="right" style="margin-left:3px;" title="Rockstars at Sphinn" /><a href="http://www.seo-scoop.com/">Donna</a> started a meme with a very much appreciated <a href="http://www.seo-scoop.com/2007/09/18/3-new-feed-worthy-seo-blogs/">compliment</a> - Thanks Donna! Like me she discovered a lot of &#8220;new&#8221; folks at <a href="http://sphinn.com/">Sphinn</a> and enjoyed their interesting blogs. </p>
<p>Savoring Sphinn comes with a duty Donna thinks, so she appeals to share the love. She&#8217;s right. All of us benefit from Sphinn love, it&#8217;s only fair to spread it a little. However, picking only three people I&#8217;d never have come accross without <a href="http://daggle.com/">Danny</a>&#8217;s newest donation to the Internet marketing community is a tough task. Hence I wrote a long numbered list and diced. Alea iacta est. Here are three of the many nice people I met at Sphinn:</p>
<table align="center" width="99%" style="margin-bottom:20px;">
<tr>
<th align="center" valign="top" width="30%"><a href="http://sphinn.com/user/view/profile/HamletBatista">Hamlet Batista</a></th>
<th align="center" valign="top" width="30%"><a href="http://sphinn.com/user/view/profile/onreact">Tadeusz Szewczyk</a></th>
<th align="center" valign="top" width="30%"><a href="http://sphinn.com/user/view/profile/login/Tinu">Tinu Abayomi-Paul</a></th>
</tr>
<tr>
<td align="center" valign="middle"><img src="http://sebastians-pamphlets.com/img/blog/favicos/hamlet-batista.jpg" border="0" width="48" height="48" alt="Hamlet Batista" style="margin:3px;"  /></td>
<td align="center" valign="middle"><img src="http://sebastians-pamphlets.com/img/blog/favicos/tadeusz-szewczyk.jpg" border="0" width="50" height="38" alt="Tadeusz Szewczyk" style="margin:3px;"  /></td>
<td align="center" valign="middle"><img src="http://sebastians-pamphlets.com/img/blog/favicos/tinu-abayomi-paul.jpg" border="0" width="43" height="50" alt="Tinu Abayomi-Paul" style="margin:3px;"  /></td>
</tr>
<tr>
<td align="center" valign="top" style="text-align:center;"><a href="http://hamletbatista.com/">Blog</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://seo2.0.onreact.com/">Blog</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://www.freetraffictip.com/">Blog</a></td>
</tr>
<tr>
<td align="center" valign="top" style="text-align:center;"><a href="http://hamletbatista.com/feed/">Feed</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://seo2.0.onreact.com/feed">Feed</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://feeds.feedburner.com/FreeTrafficTips">Feed</a></td>
</tr>
<tr>
<td align="center" valign="top" style="text-align:center;"><a href="http://hamletbatista.com/2007/09/15/success-is-about-not-giving-up/">A post I like</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://seo2.0.onreact.com/57-reasons-why-you-shouldnt-read-my-seo-blog">A post I like</a></td>
<td align="center" valign="top" style="text-align:center;"><a href="http://www.freetraffictip.com/google-tuesdays-all-the-google-blogs-that-fit-with-feeds-and-discussion-groups.php">A post I like</a></td>
</tr>
</table>
<p><span style="margin-top:10px;"></span>To those who didn&#8217;t make it on this list: That&#8217;s just kismet, not bad karma! I bet you&#8217;ll appear in someone&#8217;s <b>share the sphinn love</b> post in no time.</p>
<p><a href="http://hamletbatista.com/">To</a> <a href="http://seo2.0.onreact.com/">you</a> <a href="http://www.freetraffictip.com/">three</a>: Get out your sphinn love post and choose three sphinners writing a <acronym title="Means: 'must have feed subsctiptions' according to Donna">feed-worthy</acronym> blog, preferably people not yet featured elsewhere. I&#8217;ve subscribed to a couple feeds of blogs discovered at Sphinn, and so did you. There&#8217;s so much great stuff at Sphinn that you&#8217;re spoilt for choice. </p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/share-your-sphinn-love/", "style": "big", "title": "Share Your Sphinn Love!" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/share-your-sphinn-love/feed/</wfw:commentRss>
		</item>
		<item>
		<title>How to bait link baiters and attention whores properly</title>
		<link>http://sebastians-pamphlets.com/how-to-bait-link-baiters-and-attention-whores-properly/</link>
		<comments>http://sebastians-pamphlets.com/how-to-bait-link-baiters-and-attention-whores-properly/#comments</comments>
		<pubDate>Thu, 09 Aug 2007 08:14:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Internet Marketing]]></category>

		<category><![CDATA[Recommendations]]></category>

		<category><![CDATA[Link Building]]></category>

		<category><![CDATA[Social Web]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/how-to-bait-link-baiters-and-attention-whores-properly/</guid>
		<description><![CDATA[
What a brilliant marketing stunt. Click here! Err&#8230; click: Brilliant. Marketing. Stunt.
Best of luck John 
Copyright &#169; 2010 Sebastian`s Pamphlets. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put [...]]]></description>
			<content:encoded><![CDATA[
<p>What a <a href="http://www.johnon.com/381/sphinn-this-page.html" style="text-decoration:none; font-weight:normal;">brilliant marketing stunt</a>. <u title="NOT!" style="color:blue; font-weight:bolder;">Click here</u>! Err&#8230; click: <a href="http://sphinn.com/story/2432" title="SPHINN THIS">Brilliant</a>. <a href="http://www.johnon.com/" title="John Andrews">Marketing</a>. <a href="http://sphinn.com/greatesthits/" title="Sphinn's hall of fame">Stunt</a>.</p>
<p>Best of luck John <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/how-to-bait-link-baiters-and-attention-whores-properly/", "style": "big", "title": "How to bait link baiters and attention whores properly" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/how-to-bait-link-baiters-and-attention-whores-properly/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Rediscover Google&#8217;s free ranking checker!</title>
		<link>http://sebastians-pamphlets.com/rediscover-googles-free-ranking-checker/</link>
		<comments>http://sebastians-pamphlets.com/rediscover-googles-free-ranking-checker/#comments</comments>
		<pubDate>Fri, 27 Jul 2007 21:29:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Recommendations]]></category>

		<category><![CDATA[Tools]]></category>

		<category><![CDATA[SEO]]></category>

		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/rediscover-googles-free-ranking-checker/</guid>
		<description><![CDATA[
Nowadays we&#8217;re searching via toolbar, personalized homepage, or in the browser address bar by typing in &#8220;google&#8221; to get the search box, typing in a search query using &#8220;I feel lucky&#8221; functionality, or -my favorite- typing in google.com/search?q=free+pizza+service+nearby. 
Old fashioned, uncluttered and nevertheless sexy user interfaces are forgotten, and pretty much disliked due to the [...]]]></description>
			<content:encoded><![CDATA[
<p>Nowadays we&#8217;re searching via toolbar, personalized homepage, or in the browser address bar by typing in &#8220;google&#8221; to get the search box, typing in a search query using &#8220;I feel lucky&#8221; functionality, or -my favorite- typing in google.com/search?q=free+pizza+service+nearby. </p>
<p>Old fashioned, uncluttered and nevertheless sexy user interfaces are forgotten, and pretty much disliked due to the lack of nifty rounded corners. Luckily Google still maintains them. Look at this beautiful SERP:<br /><a href="http://www.smart-it-consulting.com/img/misc/google-free-ranking-checker.png"><img  src="http://www.smart-it-consulting.com/img/misc/google-free-ranking-checker.png" width="99%" border="0" alt="Google's free ranking checker" title="Google's free ranking checker" /></a><br />It&#8217;s free of personalized search, wonderful uncluttered because the snippets appear as tooltip only, results are nicely numbered from 1 to 1,000 on just 10 awesome fast loading pages, and when I&#8217;ve visited my URLs before I spot my purple rankings quickly.</p>
<p><a href="http://www.google.com/ie?num=100&#038;hl=en&#038;safe=off&#038;q=minimalist"><b>http://google.com/ie?num=100&#038;q=keyword1+keyword2</b></a> is an ideal free ranking checker. It supports <b>&#038;filter=0</b> and other URL parameters, so it&#8217;s a perfect tool when I need to lookup particular search terms.</p>
<p>Mass ranking checks are totally and utterly useless, at least for the average site, and <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66357">penalized</a> by Google. Well, I can think of ways to semi-automate a couple queries, but honestly, I almost never need that. Providing fully automated ranking reports to clients gave SEO services a more or less well deserved snake oil reputation, because nice rankings for preselected keywords may be great ego food, but they don&#8217;t pay the bills. I admit that with some setups automated mass ranking checks make sense, but those are off-topic here.</p>
<p>By the way, <a href="http://sebastians-pamphlets.com/google-assists-serp-click-through-optimization/">Google&#8217;s query stats</a> are a pretty useful resource too.</p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/rediscover-googles-free-ranking-checker/", "style": "big", "title": "Rediscover Google's free ranking checker!" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/rediscover-googles-free-ranking-checker/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Hey, there is content in the widgets!</title>
		<link>http://sebastians-pamphlets.com/hey-there-is-content-in-the-widgets/</link>
		<comments>http://sebastians-pamphlets.com/hey-there-is-content-in-the-widgets/#comments</comments>
		<pubDate>Wed, 25 Jul 2007 01:38:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Recommendations]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/hey-there-is-content-in-the-widgets/</guid>
		<description><![CDATA[
Yeah, I do know the layout of this blog is somewhat cluttered. Especially the sidebar with all the JS script calls slowing down page loads. Not that Blogger page load times are exiting at all, especially not with the classic template. Forgive me, I just can&#8217;t stay away from fancy stuff. 
Perhaps you&#8217;re not exactly [...]]]></description>
			<content:encoded><![CDATA[
<p>Yeah, I do know the layout of this blog is somewhat cluttered. Especially the sidebar with all the JS script calls slowing down page loads. Not that Blogger page load times are exiting at all, especially not with the classic template. Forgive me, I just can&#8217;t stay away from fancy stuff. </p>
<p>Perhaps you&#8217;re not exactly interested in my <a href="http://twitter.com/SebastianX">twits</a> telling you that my monsters are asleep and I can code untroubled, or that I&#8217;ve dugg or sphunn my friends&#8217; posts. Perfectly legit votings of course, since we share that many interests so that I often like what my buddies write and submit to whatever social bookmarking services or communities. </p>
<p>Of course you couldn&#8217;t care less on stats like how many blogs in the Technorati universe (which is a tiny subset of the GoogleBlogSearch universe, which is a tiny subset of the blogosphere, which is a tiny subset of the Web &#8230; Ok, you don&#8217;t give a f***) link to my pamplets. Actually, here you could help me out, just put me on your blogroll. Honestly, the lack of backlinks is scandalous. Everybody reads my stuff but very few of you dear readers link to me. I don&#8217;t consider scrapers readers, so their links don&#8217;t count. Since my audience consists of 99% Webmasters, I hope all of you understand the syntax of my beloved <a href="http://www.smart-it-consulting.com/article.htm?node=155&#038;page=90">A element</a>. I promote lots of nice folks in my diverse blogroll sections, but very few return the honor. Not even the <a href="http://googleblog.blogspot.com/">Google blog</a> lists me under &#8220;What We&#8217;re Reading&#8221; (please notice the capital &#8220;W&#8221; indicating a pluralis majestatis), although I spam FeedFetcher with Google bashing quite frequently. Weird &#8230;</p>
<p>And no, the MBL users list doesn&#8217;t count as content (but it&#8217;s nice to see who visited), and the AdSense stuff is just informational (and remains unclicked by the way, you guys and gals are way too savvy). Oups, I did it again: four inexpressively paragraphs before I come to the point - vice. </p>
<p>Since I add widgets when I discover them, you&#8217;ve to scroll down for the <a href="http://www.google.com/reader/">GoogleReader</a> <a href="http://www.google.com/reader/shared/17784309474383380743">thingy</a>. It&#8217;s titled &#8220;<b>Sebastian&#8217;s picked gems</b>&#8220;, and I mean that. </p>
<p>When I stumble upon a great post, I share it. That does not mean that I agree 100%, perhaps I even disagree 100%, but when I share a post I believe it&#8217;s worth reading. Honestly, you wouldn&#8217;t read my pamplets if you wouldn&#8217;t share (a few of) my pet peeves, would you? </p>
<p>I guess it&#8217;s safe to assume that you&#8217;ll enjoy reading my shared articles. Good news is, you can subscribe to the <a href="http://www.google.com/reader/public/atom/user/17784309474383380743/state/com.google/broadcast">feed of my selected readings</a>. I don&#8217;t recycle news, so I don&#8217;t blog every tidbit I find on the &#8216;Net. Hence you should subscribe to the feed and read the content I&#8217;d like to have on my blog although I&#8217;m too busy (Ok Ok, that&#8217;s just a lame excuse for laziness) to publish it myself. </p>
<p>If you read my blog in your preferred feed reader, you&#8217;ll miss out on some exciting stuff!</p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/hey-there-is-content-in-the-widgets/", "style": "big", "title": "Hey, there is content in the widgets!" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/hey-there-is-content-in-the-widgets/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
