Every website should have a robots.txt file! Unfortunately, in the rush to get a new website launched, this unseen file often gets forgotten.
When you use WordPress and want to minimize your duplicate content exposure to the search engines, having a proper robots.txt file is really important. I blogged about this a few months ago, and you can find the post here.
A good WordPress robots.txt file should look something like this, including the link to your sitemap (in XML format) at the bottom.
User-agent: *
Disallow: /wp-content/
Disallow: /wp-content/cache/
Disallow: /wp-content/themes/
Disallow: /wp-content/plugins/
Disallow: /wp-admin/
Disallow: /wp-login.php
Disallow: /wp-includes/
Disallow: /trackback/
Disallow: /cgi-bin/
Disallow: /*/feed/$
Disallow: /*/feed/rss/$
Disallow: /*/trackback/$
Disallow: /wp-
Disallow: /feed/
Disallow: /comments/
Disallow: /page/
Disallow: /date/
Disallow: /category/
Disallow: /archive/
Disallow: /rss/
# Google Image
User-agent: Googlebot-Image
Disallow:
Allow: /*
Sitemap: http://www.yoursite.com/sitemap.xml
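If you want to sanity-check rules like these before uploading them, here is a minimal sketch using Python's standard urllib.robotparser module. It uses a shortened copy of the file above with plain path prefixes only, because as far as I know the standard-library parser does not understand the * and $ wildcard patterns. The ExampleBot crawler name and the helper functions build_parser and is_allowed are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Shortened copy of the rules above: plain path prefixes only,
# since this parser is assumed not to handle * and $ wildcards.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-content/
Disallow: /feed/

User-agent: Googlebot-Image
Disallow:

Sitemap: http://www.yoursite.com/sitemap.xml
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, agent, path, site="http://www.yoursite.com"):
    """Return True if the named crawler may fetch the given path."""
    return parser.can_fetch(agent, site + path)


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler that falls under "User-agent: *".
checks = [
    ("ExampleBot", "/wp-admin/"),
    ("ExampleBot", "/hello-world/"),
    ("ExampleBot", "/wp-content/uploads/photo.jpg"),
    ("Googlebot-Image", "/wp-content/uploads/photo.jpg"),
]
for agent, path in checks:
    print(agent, path, is_allowed(parser, agent, path))
```

The last two checks show why the Googlebot-Image section matters: a general crawler is kept out of /wp-content/, while the image crawler is still allowed in to find your pictures.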
But, like the true geek I really am, I got tired of typing this in every time I made a new site. So I created a WordPress plugin that makes this file for you.
Yes, there are other WordPress plugins out there, some better, some worse, but this one works exactly the way I want it to work. In other words, it's optimized to the MAX!
So if you want to get the plugin for free, simply download it from here.
Go to the Plugins menu in the admin area and activate it.
Then click on the BH Robots.txt menu item in the Settings section, and DON’T forget to remove the leading # symbol at the start of the last line. Only remove it if you have a sitemap; otherwise leave it there.
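To make the idea of the commented-out last line concrete, here is a rough sketch in Python of how a generator could behave. This is not the plugin's actual code; the function name build_robots_txt and its parameters are hypothetical. It writes each Disallow rule once and leaves the sitemap line behind a # unless you give it a sitemap URL.

```python
def build_robots_txt(disallow_paths, sitemap_url=None):
    """Return robots.txt text for all crawlers.

    Hypothetical helper, not the plugin's real implementation.
    """
    lines = ["User-agent: *"]
    seen = set()
    for path in disallow_paths:
        if path not in seen:  # skip repeated rules, keep first-seen order
            seen.add(path)
            lines.append(f"Disallow: {path}")

    if sitemap_url:
        # A sitemap exists, so the line is live.
        lines.append(f"Sitemap: {sitemap_url}")
    else:
        # No sitemap yet: keep the line commented out with a placeholder URL.
        lines.append("# Sitemap: http://www.yoursite.com/sitemap.xml")
    return "\n".join(lines) + "\n"


# Without a sitemap, the last line stays commented out.
print(build_robots_txt(["/wp-admin/", "/feed/", "/wp-admin/"]))

# With a sitemap, the leading # is gone.
print(build_robots_txt(["/wp-admin/"], "http://www.yoursite.com/sitemap.xml"))
```

Crawlers ignore anything after a #, so a commented-out sitemap line is harmless until you have a real sitemap to point to.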
Let me know what you think about this plugin.
Tags: duplicate content, robots.txt, search engine optimization, search engines, wordpress, wordpress plugin
Have you tried this online service to see if your robots.txt file validates? http://tool.motoricerca.info/robots-checker.phtml
I find it very helpful to get things in order to pass validation.
Thanks for your help with this plugin. I appreciate it.
Hey there fellow warrior, I went looking for you on http://www.brucehearder.com and found you were retooling.
I'm going to download and install your robots.txt plugin, and I'd like to reblog this post on one of our blogs. I will link back here, of course.
I also bought the Blog Farming 101 ebook from you (with resell rights) and was hoping to find a sales page to mimic, because I want to sell it from one of our DIY SEO sites.