<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>m&#38;i Tutorials Blog&#187; robots txt free download</title>
	<atom:link href="http://www.learnhowtodopro.com/seobooks/tag/robots-txt-free-download/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.learnhowtodopro.com/seobooks</link>
	<description>Be brave enough to live creatively.  Alan Alda</description>
	<lastBuildDate>Fri, 09 Apr 2010 19:35:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Robots.txt Guidelines</title>
		<link>http://www.learnhowtodopro.com/seobooks/robots-txt-guidelines/</link>
		<comments>http://www.learnhowtodopro.com/seobooks/robots-txt-guidelines/#comments</comments>
		<pubDate>Tue, 09 Mar 2010 23:15:31 +0000</pubDate>
		<dc:creator>Irina Overy</dc:creator>
				<category><![CDATA[Search Engine Optimization]]></category>
		<category><![CDATA[robots txt free download]]></category>
		<category><![CDATA[robots txt guidelines]]></category>
		<category><![CDATA[robots txt options]]></category>

		<guid isPermaLink="false">http://www.learnhowtodopro.com/seobooks/?p=444</guid>
		<description><![CDATA[<p>Why do I need a robots.txt file?  Are there robots.txt options?  Where can I get robots.txt free download?  In this article I am going to give answers to all these questions.</p>
<p>I have just created a new blog situated at http://irinaovery.com A few days after the blog submission to Google AdSense I noticed that AdSense ads stopped [...]]]></description>
			<content:encoded><![CDATA[<p>Why do I need a robots.txt file?  Are there robots.txt options?  Where can I get robots.txt free download?  In this article I am going to give answers to all these questions.</p>
<p>I have just created a new blog situated at <a title="My Affiliate Marketing" href="http://irinaovery.com/" target="_blank">http://irinaovery.com</a> A few days after the blog submission to Google AdSense I noticed that AdSense ads stopped coming.  I checked Google AdSense Site Diagnostics page and found that my blog URL were blocked, because of Robots.txt file.  What does it mean?</p>
<ul>
<li>The      robots.txt file ‘disallows’ or does not let Google crawl my blog</li>
<li>The      robots.txt file does not exist in the root directory of my blog</li>
</ul>
<p>First thing I had to do was to check if I had a robots.txt file in my blog root directory.  For this purpose I used FTP FileZilla Client application.  By the way it is free and you can download following the link below.</p>
<p><a title="FTP Solution" href="http://filezilla-project.org/" target="_blank">http://filezilla-project.org/</a></p>
<p>I opened my blog root directory and found that a robots.txt file did not exist.</p>
<p>Next step was to create a robots.txt file.  The purpose of robots.txt file is to provide the search engine spiders with information. The information lets the search engines know which pages should be crawled and which should not.  Theoretically, if you do not have a robots.txt file, the search engines would crawl and index all your pages.  However, in reality, it could be a different situation.  This is what had happened in my case.</p>
<p>Even if you have the file, I would recommend amending it in the way you want the search engines crawl your pages.</p>
<p>For example, your website has a lot of images.  These images could be spidered by the search engines such as Google Images.  It could increase your server bandwidth and create a problem.  For this reason you may not want to authorize search engines spiders to crawl your image folder, which is on the root directory.</p>
<p>Another reason why you may not want Google to crawl your pages is a sale of a copyrighted digital product on your website.  If you prevent the search engine spiders from crawling a page where your product is located, then nobody can download your product without paying.</p>
<p>For creating a robots.txt file you can use any text editor.  For example, it could be Notepad.  However, if you know nothing about creating a robots.txt file, you can use the robots.txt generator at <a title="SEO Tools" href="http://www.mcanerin.com/EN/search-engine/robots-txt.asp" target="_blank">http://www.mcanerin.com/EN/search-engine/robots-txt.asp</a></p>
<p>For my blog I created the following code:</p>
<p><strong>User-agent: googlebot-image</strong></p>
<p><strong>Disallow: /</strong></p>
<p><strong>User-agent: psbot</strong></p>
<p><strong>Disallow: /</strong></p>
<p><strong>User-agent: *</strong></p>
<p><strong>Disallow:</strong></p>
<p><strong>Disallow: /cgi-bin/</strong></p>
<p><strong>‘User-agent’</strong> line names search engine spiders.  The next line is a message for these search engines spiders.  I do not want Google Images to crawl images on pages.  I let know ‘<strong>googlebot-image</strong>’ do not spider pages – ‘<strong>Disallow:/</strong>’.</p>
<p>I also did not allow MSN PicSearch crawling pages:</p>
<p><strong>User-agent: psbot</strong></p>
<p><strong>Disallow: /</strong></p>
<p>For all other search engines spiders (<strong>User-agent: *</strong>) I permitted to crawl (<strong>Disallow:</strong>) pages.</p>
<p>Restricted directories for every search engine is <strong>/cgi-bin/</strong>.</p>
<p>Thus creating a robots.txt file could help you instruct search engines spiders to crawl a particular page.</p>
<p>Do not forget to upload your robots.txt file to your server using FTP application.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://www.learnhowtodopro.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p>]]></content:encoded>
			<wfw:commentRss>http://www.learnhowtodopro.com/seobooks/robots-txt-guidelines/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
