<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.2.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>Sebastian's Pamphlets &#187; Hotlinking</title>
	<link>http://sebastians-pamphlets.com</link>
	<description>If you've read my articles somewhere on the Internet, expect something different here.</description>
	<pubDate>Wed, 11 Aug 2010 18:57:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.3</generator>
	<language>en</language>
			<item>
		<title>Getting the most out of Google&#8217;s 404 stats</title>
		<link>http://sebastians-pamphlets.com/getting-the-most-out-of-googles-404-stats/</link>
		<comments>http://sebastians-pamphlets.com/getting-the-most-out-of-googles-404-stats/#comments</comments>
		<pubDate>Mon, 16 Jul 2007 23:10:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Tools]]></category>

		<category><![CDATA[Testing]]></category>

		<category><![CDATA[404grabber]]></category>

		<category><![CDATA[Hotlinking]]></category>

		<category><![CDATA[.htaccess]]></category>

		<category><![CDATA[SEO]]></category>

		<category><![CDATA[Webmaster Central]]></category>

		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/getting-the-most-out-of-googles-404-stats/</guid>
		<description><![CDATA[
The 404 reports in Google&#8217;s Webmaster Central panel are great for debugging your site, but they also contain URLs generated by invalid (or truncated) URL drops and typos made by other Webmasters. Are you sick of wasting the link love from invalid inbound links, just because you lack a suitable procedure to 301-redirect all these 404 [...]]]></description>
			<content:encoded><![CDATA[
<p>The 404 reports in Google&#8217;s Webmaster Central panel are great for debugging your site, but they also contain URLs generated by invalid (or truncated) URL drops and typos made by other Webmasters. Are you sick of wasting the link love from invalid inbound links, just because you lack a suitable procedure to 301-redirect all these 404 errors to canonical URLs?</p>
<p>Your pain ends here. At least when you&#8217;re on a *nix server running Apache with PHP 4 or 5 and .htaccess enabled. (If you suffer from IIS, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=funeral+services">go</a> search for another hobby.)</p>
<p>I&#8217;ve developed a tool that grabs all 404 requests and lets you map a canonical URL to each 404 error. The tool captures and records 404s, and you can add invalid URLs from Google&#8217;s 404 reports yourself if Ms. Googlebot hasn&#8217;t requested them (yet) and they therefore aren&#8217;t recorded.</p>
<p>It&#8217;s kind of a layer between your standard 404 handling and your error page. If a request results in a 404 error, your .htaccess calls the tool instead of the error page. If you&#8217;ve assigned a canonical URL to the invalid URL, the tool 301-redirects the request to the canonical URL. Otherwise it sends a 404 header and outputs your standard 404 error page. Google&#8217;s 404-probe requests during the Webmaster Tools verification procedure are unredirectable (is this a word?), meaning the tool never redirects them.</p>
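<p>To make that flow concrete, here is a minimal sketch in Python. It is not the tool&#8217;s actual code (the tool runs on PHP); the function name <code>handle_404</code>, the <code>MAPPINGS</code> table and the sample URLs are hypothetical and exist for illustration only.</p>

```python
from typing import Dict, Optional, Tuple

# Hypothetical mapping table: invalid URL path -> canonical URL.
# The real tool keeps its data in flat files; a dict stands in for that here.
MAPPINGS: Dict[str, str] = {
    "/fruit/appel": "/fruit/apple",
    "/about.htm.": "/about.htm",
}


def handle_404(path: str, mappings: Dict[str, str] = MAPPINGS) -> Tuple[int, Optional[str]]:
    """Decide what to send for a request the server could not resolve.

    Returns (status, location); location is None when there is no redirect.
    """
    target = mappings.get(path)
    if target is not None:
        # A canonical URL was assigned to this invalid URL: permanent redirect.
        return 301, target
    # No mapping: send a 404 header and fall back to the standard error page.
    return 404, None
```

<p>With this table, <code>handle_404("/fruit/appel")</code> yields a 301 to <code>/fruit/apple</code>, while any unmapped path gets the plain 404.</p>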
<p>Besides 1:1 mappings of invalid URLs to canonical URLs, you can assign keywords to canonical URLs. For example, you can define that all invalid requests go to <code>/fruit</code> when the requested URI or the HTTP referrer (usually a SERP) contains one of the strings &#8220;apple&#8221;, &#8220;orange&#8221;, &#8220;banana&#8221; or &#8220;strawberry&#8221;. If there&#8217;s no persistent mapping, these requests get 302-redirected to the guessed canonical URL, so you should check the redirect log frequently to find invalid URLs that deserve a persistent 301-redirect.</p>
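<p>The keyword guess could look roughly like the Python sketch below. Again, this is an illustration under assumptions, not the tool&#8217;s code: <code>guess_redirect</code> and <code>KEYWORD_RULES</code> are made-up names, and plain case-insensitive substring matching is my assumption about how the strings are compared.</p>

```python
from typing import Dict, Optional, Sequence, Tuple

# Hypothetical keyword rules: canonical URL -> strings that point to it.
KEYWORD_RULES: Dict[str, Sequence[str]] = {
    "/fruit": ("apple", "orange", "banana", "strawberry"),
}


def guess_redirect(
    uri: str,
    referrer: str = "",
    rules: Dict[str, Sequence[str]] = KEYWORD_RULES,
) -> Tuple[int, Optional[str]]:
    """Guess a canonical URL from keywords in the requested URI or the referrer."""
    # Assumption: case-insensitive substring search over both values.
    haystack = (uri + " " + referrer).lower()
    for canonical, keywords in rules.items():
        if any(keyword in haystack for keyword in keywords):
            # Only a guess, so redirect temporarily (302), not permanently.
            return 302, canonical
    return 404, None
```

<p>Note that substring matching is greedy: a request for &#8220;pineapple&#8221; would land on <code>/fruit</code> too, which is one more reason to review the redirect log.</p>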
<p>Next, there are tons of bogus requests from spambots searching for exploits or whatever, and from hotlinkers, that result in 404 errors where it makes no sense to maintain URL mappings. Just update an ignore list to make sure those get 301-redirected to <code>example.com/goFuckYourself</code>, or to a cruel and scary image hosted on your domain or a free host of your choice.</p>
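<p>An ignore list of that kind might be sketched in Python as follows. The patterns, the name <code>check_ignore_list</code> and the placeholder target URL are all assumptions for illustration; what belongs on the list depends entirely on the junk your own site attracts.</p>

```python
import re
from typing import Optional, Sequence, Tuple

# Hypothetical ignore list: regular expressions for requests not worth mapping.
IGNORE_PATTERNS: Sequence[str] = (
    r"\.(gif|jpe?g|png|bmp)$",  # hotlinked images that no longer exist
    r"/phpmyadmin",             # typical exploit probe
    r"\.\./",                   # directory traversal attempt
)

# Placeholder destination for ignored requests (not a real page).
IGNORE_TARGET = "http://example.com/go-away"


def check_ignore_list(
    uri: str, patterns: Sequence[str] = IGNORE_PATTERNS
) -> Optional[Tuple[int, str]]:
    """Return a 301 to the fixed target if the URI is on the ignore list."""
    for pattern in patterns:
        if re.search(pattern, uri, re.IGNORECASE):
            return 301, IGNORE_TARGET
    # Not ignored: the caller continues with mappings and keyword rules.
    return None
```

<p>Running this check before any mapping lookup keeps the bogus requests out of the list of URLs you still have to look at.</p>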
<p>Everything that matches neither a persistent redirect rule nor an expression ends up in a 404 response, as before, but it gets logged so that you can define a mapping to a canonical URL. You can also use this tool when you plan to change (a lot of) URLs: it can 301-redirect each old URL to the new one without adding those redirects to your .htaccess file.</p>
<p>I&#8217;ve tested this tool for a while on a couple of smaller sites, and I think it can be trained to run smoothly without too many edits once the ignore lists etc. are up to date, that is, once they match the site&#8217;s requirements. A couple of friends got the script and will provide useful input. Thanks! <a href="http://www.smart-it-consulting.com/contact.htm?cSubject=404Grabber_BETA">If you&#8217;d like to join the BETA test, drop me a message</a>. </p>
<p>Disclaimer: All data is stored in flat files. For large sites we&#8217;d need to change that to a database. The UI sucks; I mean, it&#8217;s usable, but it comes with the browser&#8217;s default fonts and all that. IOW, the current version is still at the &#8220;proof of concept&#8221; stage. But it works just fine <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /></p>
<hr />Copyright &copy; 2010 <strong><a href="http://sebastians-pamphlets.com/">Sebastian`s Pamphlets</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator/feed reader, the site you are looking at is guilty of copyright infringement and will be put down immediately. Please contact sebastians-pamphlets.com so we can take legal action immediately.<br /><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><div class="topsy_widget_data topsy_theme_light-green" style="float: right;margin-left: 0.75em;"><!-- { "url": "http://sebastians-pamphlets.com/getting-the-most-out-of-googles-404-stats/", "style": "big", "title": "Getting the most out of Google's 404 stats" } --></div>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/getting-the-most-out-of-googles-404-stats/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Please don&#8217;t run your counter on my servers</title>
		<link>http://sebastians-pamphlets.com/please-dont-run-your-counter-on-my-servers/</link>
		<comments>http://sebastians-pamphlets.com/please-dont-run-your-counter-on-my-servers/#comments</comments>
		<pubDate>Wed, 18 Apr 2007 19:43:00 +0000</pubDate>
		<dc:creator>Sebastian</dc:creator>
		
		<category><![CDATA[Copyrights]]></category>

		<category><![CDATA[Copy+Paste-Penalties]]></category>

		<category><![CDATA[Plagiarism]]></category>

		<category><![CDATA[Hotlinking]]></category>

		<category><![CDATA[.htaccess]]></category>

		<guid isPermaLink="false">http://sebastians-pamphlets.com/please-dont-run-your-counter-on-my-servers/</guid>
		<description><![CDATA[
I deeply understand that sharing other people&#8217;s resources makes sense sometimes. I just ask you to rethink your technical approach. Running your page view stats on my server comes with a serious disadvantage: my server logs and referrer reports are protected, hence you can&#8217;t read your stats. Rest assured I&#8217;m really not eager to know [...]]]></description>
			<content:encoded><![CDATA[
<p><img src="http://www.smart-it-consulting.com/img/misc/do-not-hotlink-beauty.jpg" border="0" align="left" alt="DO NOT HOTLINK" title="GTFOH" style="margin-right:5px;" />I deeply understand that sharing other people&#8217;s resources makes sense sometimes. I just ask you to rethink your technical approach. Running your page view stats on my server comes with a serious disadvantage: my server logs and referrer reports are protected, hence you can&#8217;t read your stats. Rest assured I&#8217;m really not eager to know who views your pages.</p>
<p>So please: when you copy my HTML code, be so kind and steal the invisible 1&#215;1px images too. It&#8217;s really not that hard to upload them to your server and edit my HTML in a way that your visitors&#8217; user agents request these images from your server. </p>
<p>Signing up with a free counter service that doesn&#8217;t add hidden links to all your pages is less hassle than my reaction when I get annoyed. </p>
<p>Disclaimer: I don&#8217;t like it when you steal my code, because for some reason it&#8217;s often crappy enough to break your layout. Also, copying code without permission is as bad as content theft. So don&#8217;t copy, but feel free to ask.</p>
<p>Go to <a href="http://www.htmlbasix.com/disablehotlinking.shtml">HTML Basix</a> to figure out how you can block hotlinking with .htaccess:<br /><code style="font-family:arial;font-size:78%;"># Redirect image requests whose referrer is not my own blog<br />RewriteEngine on<br />RewriteCond %{HTTP_REFERER} !^http://(www\.)?sebastianx\.blogspot\.com(/.*)?$ [NC]<br /># Never redirect the replacement image itself (avoids a loop if it sits on the same host)<br />RewriteCond %{REQUEST_URI} !/img/misc/do-not-hotlink-beauty\.jpg$ [NC]<br />RewriteRule \.(gif|jpg|jpeg|bmp|png)$ http://www.smart-it-consulting.com/img/misc/do-not-hotlink-beauty.jpg [R,NC,L]</code><br />But please don&#8217;t steal or hotlink the <a href="http://www.seroundtable.com/archives/013227.html">offensive blonde beauty</a> <img src='http://sebastians-pamphlets.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://sebastians-pamphlets.com/please-dont-run-your-counter-on-my-servers/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
