<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.2.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Sebastian's Pamphlets &#187; Netscape</title>
	<link>http://sebastians-pamphlets.com</link>
	<description>If you've read my articles somewhere on the Internet, expect something different here.</description>
	<pubDate>Mon, 30 Jun 2008 20:12:40 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.3</generator>
	<language>en</language>
			<item>
		<title>No search, more fun: Netscape spamming Google</title>
		<link>http://sebastians-pamphlets.com/no-search-more-fun-netscape-spamming-google/</link>
		<comments>http://sebastians-pamphlets.com/no-search-more-fun-netscape-spamming-google/#comments</comments>
		<pubDate>Mon, 21 May 2007 15:54:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Netscape]]></category>

		<category><![CDATA[Spam]]></category>

		<category><![CDATA[Crap]]></category>

		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/no-search-more-fun-netscape-spamming-google/</guid>
		<description><![CDATA[Google dislikes crawlable SERPs. But Google still indexes huge chunks of SERPs, and to make it worse, these disliked URLs sometimes rank above other useless webspam from Amazon, Ebay, and cohorts on the very first search result page.
For example Netscape is still flooding Google&#8217;s search index with crap as per the quality guidelines, which clearly [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.google.com/support/webmasters/bin/search.py?query=Use+robots.txt+to+prevent+crawling+of+search+results&#038;ctx=en%3Asearchbox&#038;Action.Search=Search">Google</a> <a href="http://www.mattcutts.com/blog/search-results-in-search-results/">dislikes</a> crawlable SERPs. But Google still indexes huge chunks of SERPs, and to make it worse, these disliked URLs sometimes rank above other useless webspam from Amazon, Ebay, and cohorts on the very first search result page.</p>
<p>For example Netscape is still flooding Google&#8217;s search index with <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=%22Web+results+for%22+site%3Anetscape.com+&#038;filter=0">crap</a> as per the quality guidelines, which clearly state:<br />
<blockquote>Use robots.txt to prevent crawling of search results pages [&#8230;] that don&#8217;t add much value for users coming from search engines.</p></blockquote>
<p>Netscape.com lacks a <a href="http://www.netscape.com/robots.txt">robots.txt</a>, but how many patterns does it need to identify these pages as <a href="http://www.netscape.com/search/?show=&#038;s=serp">SERPs</a>? Next search.netscape.com has a <a href="http://search.netscape.com/robots.txt">robots.txt</a>, but it lacks a <code><a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;c2coff=1&#038;q=site%3Asearch.netscape.com+&#038;filter=0">Disallow: /</a></code> directive, respectively Disallow&#8217;s of all their scripts generating search results.</p>
<p>Is it that simple to get gazillions of useless autogenerated pages ranking at Google? Indeed. Following the Netscape precedent every assclown out there can buy a SE-script, can crawl the Web for a bunch of niche keywords, and will earn free Google traffic just because he has &#8220;forgotten&#8221; to upload a proper robots.txt file and Google isn&#8217;t capable of detecting SERPs. I mean when they don&#8217;t run a few tests with Netscape-SERPs, where&#8217;s the point of an unenforced no-crawlable-SERPs policy?</p>
<p>I just found another <a href="http://www.google.com/support/webmasters/bin/search.py?query=If+a+site+doesn%27t+meet+our+quality+guidelines%2C+it+may+be+blocked+from+the+index.&#038;ctx=en%3Asearchbox&#038;Action.Search=Search">interesting snippet</a> in Google&#8217;s quality guidelines:<br />
<blockquote>If a site doesn&#8217;t meet our quality guidelines, it may be <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=40052">blocked from the index</a>.</p></blockquote>
<p>I certainly will not miss <a href="http://www.google.com/search?q=site%3Anetscape.com&#038;filter=0&#038;num=100">1,360,000 URLs from a spamming site</a> <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /></p>
<hr />Copyright &copy; 2008 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span>]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/no-search-more-fun-netscape-spamming-google/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
