<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google Bot has privileges</title>
	<atom:link href="http://www.ghacks.net/2007/08/15/google-bot-has-privileges/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/</link>
	<description>A technology blog covering software, mobile phones, gadgets, security, the Internet and other relevant areas.</description>
	<lastBuildDate>Tue, 24 Nov 2009 21:14:15 -0600</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: simpleminded</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-429156</link>
		<dc:creator>simpleminded</dc:creator>
		<pubDate>Thu, 31 Jul 2008 10:53:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-429156</guid>
		<description>In case that wasn&#039;t very clear, I&#039;ll simplify it.

Websites don&#039;t grant Googlebot special access to their pages so that people go to them and have to register to see. 

&lt;b&gt;Googlebot just breaks in. It&#039;s not a lure to make you register.&lt;/b&gt;</description>
		<content:encoded><![CDATA[<p>In case that wasn&#8217;t very clear, I&#8217;ll simplify it.</p>
<p>Websites don&#8217;t grant Googlebot special access to their pages so that people go to them and have to register to see. </p>
<p><b>Googlebot just breaks in. It&#8217;s not a lure to make you register.</b></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: simpleminded</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-429139</link>
		<dc:creator>simpleminded</dc:creator>
		<pubDate>Thu, 31 Jul 2008 10:30:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-429139</guid>
		<description>Actually it never was &quot;a trick&quot;. It never worked to begin with. It&#039;s just been people saying &quot;oh I bet this would work&quot;. It doesn&#039;t work because the concept is flawed. Googlebot does indeed have special access to &quot;registered only&quot; pages, but it&#039;s not because of its user-agent, IP address, or anything like that.

Websites (forums particularly) don&#039;t allow Googlebot special permissions, and they don&#039;t check to see if Googlebot&#039;s IP or user-agent is spoofed, unless an admin does it manually out of suspicion. The most they do is disallow certain parts of the site to Googlebot through the use of Robots.txt.

The REAL reason Googlebot gets into &quot;registered only&quot; pages is because the website didn&#039;t disallow Googlebot from crawling them in Robots.txt, and Googlebot just breaks in essentially, because the only rules it follows are those defined in Robots.

I&#039;m not sure what methods it uses to do such, but as an admin on several forums, I know for fact that it just &quot;breaks into&quot; our registered users only pages, as I’ve seen the Googlebot show up in logs as accessing anything from the main index to user profiles and private messages, even the Moderators Only sections of the forum, as a guest (or anonymous) user.

The only way to stop it is Robots.txt or to ban the entire Googlebot IP range (and even that doesn&#039;t work for long because there&#039;s always new IP ranges).

So pretending you&#039;re Googlebot will never, ever work. You have to actually BE the Googlebot software.</description>
		<content:encoded><![CDATA[<p>Actually it never was &#8220;a trick&#8221;. It never worked to begin with. It&#8217;s just been people saying &#8220;oh I bet this would work&#8221;. It doesn&#8217;t work because the concept is flawed. Googlebot does indeed have special access to &#8220;registered only&#8221; pages, but it&#8217;s not because of its user-agent, IP address, or anything like that.</p>
<p>Websites (forums particularly) don&#8217;t allow Googlebot special permissions, and they don&#8217;t check to see if Googlebot&#8217;s IP or user-agent is spoofed, unless an admin does it manually out of suspicion. The most they do is disallow certain parts of the site to Googlebot through the use of Robots.txt.</p>
<p>The REAL reason Googlebot gets into &#8220;registered only&#8221; pages is because the website didn&#8217;t disallow Googlebot from crawling them in Robots.txt, and Googlebot just breaks in essentially, because the only rules it follows are those defined in Robots.</p>
<p>I&#8217;m not sure what methods it uses to do such, but as an admin on several forums, I know for fact that it just &#8220;breaks into&#8221; our registered users only pages, as I’ve seen the Googlebot show up in logs as accessing anything from the main index to user profiles and private messages, even the Moderators Only sections of the forum, as a guest (or anonymous) user.</p>
<p>The only way to stop it is Robots.txt or to ban the entire Googlebot IP range (and even that doesn&#8217;t work for long because there&#8217;s always new IP ranges).</p>
<p>So pretending you&#8217;re Googlebot will never, ever work. You have to actually BE the Googlebot software.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nobody</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-195922</link>
		<dc:creator>Nobody</dc:creator>
		<pubDate>Sun, 11 Nov 2007 13:28:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-195922</guid>
		<description>Just use Google&#039;s cache of the page.</description>
		<content:encoded><![CDATA[<p>Just use Google&#8217;s cache of the page.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163911</link>
		<dc:creator>John</dc:creator>
		<pubDate>Sat, 18 Aug 2007 04:25:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163911</guid>
		<description>hey anyone know how to do so in opera browser? like i do opera:config then go to user agent but i dont know how to add google bot.</description>
		<content:encoded><![CDATA[<p>hey anyone know how to do so in opera browser? like i do opera:config then go to user agent but i dont know how to add google bot.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Threshold</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163836</link>
		<dc:creator>Threshold</dc:creator>
		<pubDate>Fri, 17 Aug 2007 00:55:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163836</guid>
		<description>Thanks Martin,
I tried several different combinations of googlebot found on the Internet but it hasn&#039;t worked for me on any website I have tried so far.

I guess webmasters have wised up since this trick has been around for almost 3 years.

Anybody has had any more luck with this?</description>
		<content:encoded><![CDATA[<p>Thanks Martin,<br />
I tried several different combinations of googlebot found on the Internet but it hasn&#8217;t worked for me on any website I have tried so far.</p>
<p>I guess webmasters have wised up since this trick has been around for almost 3 years.</p>
<p>Anybody has had any more luck with this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Martin</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163777</link>
		<dc:creator>Martin</dc:creator>
		<pubDate>Thu, 16 Aug 2007 06:11:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163777</guid>
		<description>Threshold try the following settings: 
Description: Googlebot 2.1
User-Agent: Googlebot 2.1</description>
		<content:encoded><![CDATA[<p>Threshold try the following settings:<br />
Description: Googlebot 2.1<br />
User-Agent: Googlebot 2.1</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Threshold</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163768</link>
		<dc:creator>Threshold</dc:creator>
		<pubDate>Thu, 16 Aug 2007 02:14:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163768</guid>
		<description>Well, in case this actually works, it would be nice if you explained how to fill the Googlebot details in User-Agent-Switcher for the teckie-challenged out here.

What goes into each field?

This trick is reported all over the net but not one that explains it clearly.</description>
		<content:encoded><![CDATA[<p>Well, in case this actually works, it would be nice if you explained how to fill the Googlebot details in User-Agent-Switcher for the teckie-challenged out here.</p>
<p>What goes into each field?</p>
<p>This trick is reported all over the net but not one that explains it clearly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ser0</title>
		<link>http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163731</link>
		<dc:creator>ser0</dc:creator>
		<pubDate>Wed, 15 Aug 2007 09:42:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.ghacks.net/2007/08/15/google-bot-has-privileges/#comment-163731</guid>
		<description>I think everyone should report sites like EE to Google for violating their policy of showing the bot one page and giving users a completely different page.

I only used the cached version of EE pages now as the normal linked page is completely useless.</description>
		<content:encoded><![CDATA[<p>I think everyone should report sites like EE to Google for violating their policy of showing the bot one page and giving users a completely different page.</p>
<p>I only used the cached version of EE pages now as the normal linked page is completely useless.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
