<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.2.3" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: Q&#038;A: An undocumented robots.txt crawler directive from Google</title>
	<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/</link>
	<description>If you've read my articles somewhere on the Internet, expect something different here.</description>
	<pubDate>Thu, 24 Jul 2008 16:24:31 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.3</generator>

	<item>
		<title>By: XML Sitemaps and Competitive Intelligence - Pocket SEO</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-887</link>
		<dc:creator>XML Sitemaps and Competitive Intelligence - Pocket SEO</dc:creator>
		<pubDate>Wed, 26 Dec 2007 09:34:55 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-887</guid>
		<description>[...] engines can&#8217;t index your sitemap files you might be able to block them with robots.txt (e.g., noindex directive), and/or send an x-robots-tag HTTP header telling Google and Yahoo not to index them.   This entry [...]</description>
		<content:encoded><![CDATA[<script type='text/javascript' src='http://www.sezwho.com/widgets/profile/js_output/wp/abeautifulday/1.3/1.3/8bd533845c1fc43d8202c6362e715395/47a227eeb615e'></script><script type="text/javascript">var sz_global_config_params = {cppluginurl:"http://sebastians-pamphlets.com/wp-content/plugins/sezwho",cpserverurl:"http://www.sezwho.com", sitekey:"8bd533845c1fc43d8202c6362e715395",blogkey:"47a227eeb615e",blogid:"0", plugin_version:"1.3"} ; </script><p>[&#8230;] engines can&#8217;t index your sitemap files you might be able to block them with robots.txt (e.g., noindex directive), and/or send an x-robots-tag HTTP header telling Google and Yahoo not to index them.   This entry [&#8230;]<script type="text/javascript" id="szCommentHiddenTag:887">var sz_comment_config_params = {use_cross_domain_posting:1,post_id:"215", comment_rating_submit_path:"/cpratingsubmit.php",sortOrder:"",sz_auto_comment:0,sz_auto_option_bar:0,comment_number:20, sz_comment_data:[]};sz_comment_config_params.sz_comment_data[0]= {comment_id:"887", comment_author:"XML%20Sitemaps%20and%20Competitive%20Intelligence%20-%20Pocket%20SEO", comment_author_url:"http://pocketseo.com/site-analysis/204", comment_author_email:"",sz_score:"0",comment_score:"0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sebastian</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-743</link>
		<dc:creator>Sebastian</dc:creator>
		<pubDate>Mon, 26 Nov 2007 15:46:04 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-743</guid>
		<description>Jordan, thanks and give &lt;a href="http://sebastians-pamphlets.com/cloak-the-hell-out-of-your-robots-txt/"&gt;this&lt;/a&gt; a try.</description>
		<content:encoded><![CDATA[<p>Jordan, thanks and give <a href="http://sebastians-pamphlets.com/cloak-the-hell-out-of-your-robots-txt/">this</a> a try.<script type="text/javascript" id="szCommentHiddenTag:743">sz_comment_config_params.sz_comment_data[1]= {comment_id:"743", comment_author:"Sebastian", comment_author_url:"http://sebastians-pamphlets.com/about/", comment_author_email:"GX%2FM%2FRMuCXKp3Lga0Efp6euKyPuFADogwnj7IIDCOc0QVXurbSYXOqS%2FFjGx%2BQr2Y5HaNjs8D9NPHItf8b%2BFHXhUyjH%2B2WHBvp4YTH8lXHsLU%2FAo0iQXMKWR%2FpWTRNwhScKR3Hcx2UTNb0NbMzuH41bqmlQgJQTJnlJ2Gb6LWG4%3D",sz_score:"7.1",comment_score:"5.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Advantages of a smart robots.txt file</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-741</link>
		<dc:creator>Advantages of a smart robots.txt file</dc:creator>
		<pubDate>Mon, 26 Nov 2007 15:16:29 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-741</guid>
		<description>[...] loyal reader of my pamphlets asked me: I foresee many new capabilities with robots.txt in the future due to this [Google&#8217;s [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] loyal reader of my pamphlets asked me: I foresee many new capabilities with robots.txt in the future due to this [Google&#8217;s [&#8230;]<script type="text/javascript" id="szCommentHiddenTag:741">sz_comment_config_params.sz_comment_data[2]= {comment_id:"741", comment_author:"Advantages%20of%20a%20smart%20robots.txt%20file", comment_author_url:"http://sebastians-pamphlets.com/cloak-the-hell-out-of-your-robots-txt/", comment_author_email:"",sz_score:"0",comment_score:"0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Utah SEO Pro</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-737</link>
		<dc:creator>Utah SEO Pro</dc:creator>
		<pubDate>Sun, 25 Nov 2007 22:01:12 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-737</guid>
		<description>This discovery is quite the milestone. I foresee many new capabilities with robots.txt in the future due to this. However, how the hell can a webmaster hide their robots.txt from the public while serving it up to bots without doing anything shady? Do you have any example .htaccess code? Cuz I know you're the pimp master at that kinda shit.

Way to lay it thick on Igor. I love how you're so forward. haha</description>
		<content:encoded><![CDATA[<p>This discovery is quite the milestone. I foresee many new capabilities with robots.txt in the future due to this. However, how the hell can a webmaster hide their robots.txt from the public while serving it up to bots without doing anything shady? Do you have any example .htaccess code? Cuz I know you&#8217;re the pimp master at that kinda shit.</p>
<p>Way to lay it thick on Igor. I love how you&#8217;re so forward. haha<script type="text/javascript" id="szCommentHiddenTag:737">sz_comment_config_params.sz_comment_data[3]= {comment_id:"737", comment_author:"Utah%20SEO%20Pro", comment_author_url:"http://www.jordankasteler.com/utah-seo-pro-blog/", comment_author_email:"blzD9QeFcPZ4q7C6BHrx9rOwn4tqNvEKuBEQIE0hUM600V7798IrfRPDNhS%2FmGLiWTaJ22sQVmJ6b1t7FOxvQ58LQZp9fX78Q2JRgVYTf%2F%2FlMFYV8VIDE39oKCXrTGFRTNZqhDQepOJMw2UhW49SwUqMYDC4Kzw11DTmwL9IIIE%3D",sz_score:"6.0",comment_score:"6.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sebastian</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-718</link>
		<dc:creator>Sebastian</dc:creator>
		<pubDate>Wed, 21 Nov 2007 10:00:09 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-718</guid>
		<description>Probably a trust issue, no SEO-voodoo involved. It seems mine is one of very few sites still allowing you to leave your nick. I'm not trying to outrank your absurd home page; why should I?

You've created your nick yourself, and it's suitable. At Google I just told you that trolling is not appreciated. 

Igor, you were warned that insane and childish off-topic comments will get you banned eventually. I've deleted a few of them and if you don't behave you're history. That's my last warning. Behave yourself or get the fuck outta here.</description>
		<content:encoded><![CDATA[<p>Probably a trust issue, no SEO-voodoo involved. It seems mine is one of very few sites still allowing you to leave your nick. I&#8217;m not trying to outrank your absurd home page; why should I?</p>
<p>You&#8217;ve created your nick yourself, and it&#8217;s suitable. At Google I just told you that trolling is not appreciated. </p>
<p>Igor, you were warned that insane and childish off-topic comments will get you banned eventually. I&#8217;ve deleted a few of them and if you don&#8217;t behave you&#8217;re history. That&#8217;s my last warning. Behave yourself or get the fuck outta here.<script type="text/javascript" id="szCommentHiddenTag:718">sz_comment_config_params.sz_comment_data[4]= {comment_id:"718", comment_author:"Sebastian", comment_author_url:"http://sebastians-pamphlets.com/about/", comment_author_email:"GX%2FM%2FRMuCXKp3Lga0Efp6euKyPuFADogwnj7IIDCOc0QVXurbSYXOqS%2FFjGx%2BQr2Y5HaNjs8D9NPHItf8b%2BFHXhUyjH%2B2WHBvp4YTH8lXHsLU%2FAo0iQXMKWR%2FpWTRNwhScKR3Hcx2UTNb0NbMzuH41bqmlQgJQTJnlJ2Gb6LWG4%3D",sz_score:"7.1",comment_score:"5.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Igor The Troll</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-716</link>
		<dc:creator>Igor The Troll</dc:creator>
		<pubDate>Wed, 21 Nov 2007 09:20:49 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-716</guid>
		<description>Sebastian, why do you rank number 1 for Igor The Troll in Google?

Is Sebastian Igor The Troll?
http://www.google.com/search?hl=en&#38;q=Igor+The+Troll

I know you created me in GWHG, but now you are taking over my Nick...are you some Alien being that sucks the souls out of people and their Websites?

NP, Everyone knows that Sebastian is the father of Igor The Troll

Or, are you gaming Google with Black Hat Seo?</description>
		<content:encoded><![CDATA[<p>Sebastian, why do you rank number 1 for Igor The Troll in Google?</p>
<p>Is Sebastian Igor The Troll?<br />
<a href="http://www.google.com/search?hl=en&amp;q=Igor+The+Troll">http://www.google.com/search?hl=en&amp;q=Igor+The+Troll</a></p>
<p>I know you created me in GWHG, but now you are taking over my Nick&#8230;are you some Alien being that sucks the souls out of people and their Websites?</p>
<p>NP, Everyone knows that Sebastian is the father of Igor The Troll</p>
<p>Or, are you gaming Google with Black Hat Seo?<script type="text/javascript" id="szCommentHiddenTag:716">sz_comment_config_params.sz_comment_data[5]= {comment_id:"716", comment_author:"Igor%20The%20Troll", comment_author_url:"http://www.igorthetroll.com/dontfollow", comment_author_email:"OO1TNf9xmvcAoAP54GzdmdcuVEgM%2FHn8P079X6GuEj1lj0Dz3QrAWm1IZa7yHygio8J1pPcBhv3CEJOf9aARRmvaQLmX3fy2IU2VW4yAq%2Fsbuj%2Bec92p4OSAw6s21NwPRunJ4nuAoI52FgKDKc%2Ft86ICTU5klzYKrigN2OIGiEY%3D",sz_score:"6.0",comment_score:"6.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Validate your robots.txt - Googlebot becomes smarter</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-714</link>
		<dc:creator>Validate your robots.txt - Googlebot becomes smarter</dc:creator>
		<pubDate>Tue, 20 Nov 2007 18:58:27 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-714</guid>
		<description>[...] week I reported that Google experiments with new crawler directives for use in robots.txt. Today Google has confirmed that Googlebot understands experimental REP syntax like [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] week I reported that Google experiments with new crawler directives for use in robots.txt. Today Google has confirmed that Googlebot understands experimental REP syntax like [&#8230;]<script type="text/javascript" id="szCommentHiddenTag:714">sz_comment_config_params.sz_comment_data[6]= {comment_id:"714", comment_author:"Validate%20your%20robots.txt%20-%20Googlebot%20becomes%20smarter", comment_author_url:"http://sebastians-pamphlets.com/validate-your-robots-txt-or-google-might-deindex-your-site/", comment_author_email:"",sz_score:"0",comment_score:"0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JohnMu</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-712</link>
		<dc:creator>JohnMu</dc:creator>
		<pubDate>Tue, 20 Nov 2007 12:12:28 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-712</guid>
		<description>Wildcards should be ok, as they are for the normal disallow: and allow: directives. I just want to remind everyone again that this is something that may still change over time. Be careful when playing with things like this :) .</description>
		<content:encoded><![CDATA[<p>Wildcards should be ok, as they are for the normal disallow: and allow: directives. I just want to remind everyone again that this is something that may still change over time. Be careful when playing with things like this <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .<script type="text/javascript" id="szCommentHiddenTag:712">sz_comment_config_params.sz_comment_data[7]= {comment_id:"712", comment_author:"JohnMu", comment_author_url:"http://www.google.com/", comment_author_email:"c8rdBbcnLYU11j10vtGM0Hto8b8MtUJUHZWmv1dKT2rAl3BUNaqXB3eFGRiM5udOayZarc3vcb203oXnCc5y%2FvP8KoSdcfJnpdMP5bW3z0CsWheWUbuavZC5eAiEm1qJ%2B4AAoe8EiXc2IGNIdIfcwxfJzM%2FDgYrBIHvjfIYo4LM%3D",sz_score:"5.0",comment_score:"5.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sebastian</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-711</link>
		<dc:creator>Sebastian</dc:creator>
		<pubDate>Tue, 20 Nov 2007 11:59:22 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-711</guid>
		<description>Thanks for your confirmation, John! Googlebot fetched the &lt;a href="http://sebastians-pamphlets.com/noindex/"&gt;crawler trap&lt;/a&gt; on 2007-11-14 07:23:03 and it's not &lt;a href="http://www.google.com/search?q=noindex+crawler+trap+site:sebastians-pamphlets.com&#038;num=100&#038;hl=en&#038;safe=off&#038;filter=0"&gt;indexed&lt;/a&gt;. Maybe the page will still make it into the index if you had a cached robots.txt without the Noindex: statement at the time of crawling and indexing, but I doubt it. I'll watch that for a while and then launch another experiment without revealing the URI. ;)

Do you support wildcards with "Noindex:" too, for example 
Noindex: /*.txt$ 
Noarchive: /cloaked/*.html$
Nofollow: /links/*.php$
Nopreview: /scholar/*.pdf$
or so? That would be exciting! Please LMK :)</description>
		<content:encoded><![CDATA[<p>Thanks for your confirmation, John! Googlebot fetched the <a href="http://sebastians-pamphlets.com/noindex/">crawler trap</a> on 2007-11-14 07:23:03 and it&#8217;s not <a href="http://www.google.com/search?q=noindex+crawler+trap+site:sebastians-pamphlets.com&#038;num=100&#038;hl=en&#038;safe=off&#038;filter=0">indexed</a>. Maybe the page will still make it into the index if you had a cached robots.txt without the Noindex: statement at the time of crawling and indexing, but I doubt it. I&#8217;ll watch that for a while and then launch another experiment without revealing the URI. <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /><br />
Do you support wildcards with &#8220;Noindex:&#8221; too, for example<br />
Noindex: /*.txt$<br />
Noarchive: /cloaked/*.html$<br />
Nofollow: /links/*.php$<br />
Nopreview: /scholar/*.pdf$<br />
or so? That would be exciting! Please LMK <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> <script type="text/javascript" id="szCommentHiddenTag:711">sz_comment_config_params.sz_comment_data[8]= {comment_id:"711", comment_author:"Sebastian", comment_author_url:"http://sebastians-pamphlets.com/about/", comment_author_email:"GX%2FM%2FRMuCXKp3Lga0Efp6euKyPuFADogwnj7IIDCOc0QVXurbSYXOqS%2FFjGx%2BQr2Y5HaNjs8D9NPHItf8b%2BFHXhUyjH%2B2WHBvp4YTH8lXHsLU%2FAo0iQXMKWR%2FpWTRNwhScKR3Hcx2UTNb0NbMzuH41bqmlQgJQTJnlJ2Gb6LWG4%3D",sz_score:"7.1",comment_score:"5.0"};</script></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JohnMu</title>
		<link>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-710</link>
		<dc:creator>JohnMu</dc:creator>
		<pubDate>Tue, 20 Nov 2007 11:23:51 +0000</pubDate>
		<guid>http://sebastians-pamphlets.com/about-noindex-crawler-directives-in-robots-txt/#comment-710</guid>
		<description>Good catch, Sebastian. How is your experiment going? At the moment we will usually accept the "noindex" directive in the robots.txt, but we are not yet at a point where we are willing to set it in stone and announce full support.</description>
		<content:encoded><![CDATA[<p>Good catch, Sebastian. How is your experiment going? At the moment we will usually accept the &#8220;noindex&#8221; directive in the robots.txt, but we are not yet at a point where we are willing to set it in stone and announce full support.<script type="text/javascript" id="szCommentHiddenTag:710">sz_comment_config_params.sz_comment_data[9]= {comment_id:"710", comment_author:"JohnMu", comment_author_url:"http://www.google.com/", comment_author_email:"c8rdBbcnLYU11j10vtGM0Hto8b8MtUJUHZWmv1dKT2rAl3BUNaqXB3eFGRiM5udOayZarc3vcb203oXnCc5y%2FvP8KoSdcfJnpdMP5bW3z0CsWheWUbuavZC5eAiEm1qJ%2B4AAoe8EiXc2IGNIdIfcwxfJzM%2FDgYrBIHvjfIYo4LM%3D",sz_score:"5.0",comment_score:"5.0"};</script></p>
]]></content:encoded>
	</item>
</channel>
</rss>
