Home Webmaster tools and tips SEO Tips What is robots.txt file?

What is robots.txt file?

E-mail Print PDF

Sometimes you may not want the search engines to spider specific directories of your site because you don’t want the informationto be read by the public. You can accomplish this by creating a robots.txt file and placing those files in it.

The robots.txt file tells the search engine spider, which Web pages of your site should be indexed and which Web pages should be ignored.

The robots.txt file is a simple text file (not an HTML file), which can be created in Notepad and placed in your root directory, for example:

http://www.yourwebsite.com/robots.txt

Benefits of robots.txt file

All the major search engines look for the robots.txt file on your site. I recommend including a robots.txt file even if you don’t need to prevent spiders from accessing any part of your site. It helps invite spiders to crawl your web site.

Here are some circumstances for excluding search engines from your web site:

1. During the site design

I often create a new web site in a sub directory of my main site. I therefore don’t want the client’s site to be spidered while it is being built. An alternative method is to create a password protected directory. The client can only access his site with a user name and
password.

2. Prevent certain directories from being crawled

Directories such as your cgi-bin don’t need to be crawled. You may have a directory containing images you designed and you don’t want them to be made available for public consumption. Place these directories in the robots.txt file so they can’t be crawled.

Example:

User-agent: *
Disallow: /images/

3. Prevent specific spiders

You may want to stop certain spiders from accessing your site. ie if you don’t want Google to spider your site you can add the Googlebot spider to your robots.txt file.

Here’s an example:

User-agent: googlebot
Disallow: /cgi-bin/

This robots.txt file would allow the “googlebot”, to retrieve every page from your site except for files from the “cgi-bin” directory. All files in the “cgi-bin” directory will be ignored by the googlebot.

 

Sponsored links


iVista - Easy Script Installation

Fastest free host in the world


Fastest free host in the world
Testimonials: "I was very surprised at the fastest free hosts listed. In fact, several of the fastest free web servers below were faster than a very expensive managed dedicated server I rent, and much faster than the dedicated web hosting server this website is hosted on." Reviewed by the owner and webmasters of Free-WebHosts.com .... read more

 

webmaster tools & tips

1000+ hosting answers

setup your domain for google apps

Sponsored links

About Us

Oni.cc - Top 15 Hosting Provider

Oni.cc - Certified by Godaddy

Oni.cc - Verified by Starfield Secure

Link to Our site


Login Panel

Username
Password

Register

Lost Password

Sponsored links

on Twitter

  1. oniccnews Sign up for Free Hosting: http://t.co/Fwh23rd via @vagrantweb
  2. oniccnews How to manage files via File Manager: http://t.co/3l993UZ via @vagrantweb
  3. oniccnews Website Reminder: Cleaning for security : http://t.co/iDojV5q via @vagrantweb
  4. Oni News say: Click here to Oni News. on Twitter.com

Free software download

Lastest News

Top views