awesomeindex.com awesomeindex.com
   Main :> About Us :> Security & Privacy :> ToS :> Add Your Link :> Add Article
Search:   
Get Free Links
 
 

Culture & Art

 

Home Family & Garden

 

Online Shopping

 

People & Society

 

Automobile & Automotive

 

News & Media

 

Jobs & Careers

 

Computers & Networking

 

Drink & Food

 

Science & Space

 

Academics & Education

 

Policies & Law

 

Finance & Banking

 

Companies & Business

 

Children

 

Property & Estate

 

Hotels & Travel

 

Relationship & Lifestyle

 

Self Enhancement

 

Recreation

 

Online & Indoor Games

 

Adventure & Sports

 

Medical Care

 

Health & Hygiene

 

Main › Computers & Networking › SEO Solutions
 

Robocops

 

Author: Philip Nicosia
The Robots.txt protocol, also called the 'robots exclusion standard' is designed to lock out web spiders from accessing part of a website. It is a security or privacy measure, the equivalent of hanging a 'Keep Out' sign on your door.

This protocol is used by web site administrators when there are sections or files that they would rather not be accessed by the rest of the world. This could include employee lists, or files that they are circulating internally. For example, the White House website uses robots.txt to block any inquiries on speeches by the Vice President, a photo essay of the First Lady, and profiles of the 911 victims.

How does the protocol work? It lists the files that shouldn't be scanned, and places it in the top-level directory of the website. The robots.txt protocol was created by consensus in June 1994 by members of the robots mailing list (robots-request@nexor.co.uk). There is no official standards body or RFC for the protocol, so it's difficult to legislate or mandate that the protocol be followed. In fact, the file is treated as strictly advisory, and does not have absolute guarantee that those contents won't be read.

In effect, robot.txt requires cooperation by the web spider and even the reader, since anything that is uploaded into the internet becomes publicly available. You aren't locking them out of those pages, you are just making it harder for them to get in. But it takes very little for them to ignore these instructions. Computer hackers can also easily penetrate the files and retrieve information. So the rule of thumb is'if it's that sensitive, it shouldn't be on your website to begin with

Care, however, should be taken to ensure that the Robots.txt protocol doesn't block the website robots from other areas of the website. This will dramatically affect your search engine ranking, as the crawlers rely on the robots to count the keywords, review metatags, titles and crossheads, and even register the hyperlinks.

One misplaced hyphen or dash can have catastrophic effects. For example, the robots.txt patterns are matched by simple substring comparisons, so care should be taken to make sure that patterns matching directories have the final '/' character appended: otherwise all files with names starting with that substring will match, rather than just those in the directory intended.

To avoid these problems, consider submitting your site to a search engine spider simulator, also called search engine robot simulator. These simulators'which can be bought or downloaded from the internet' use the same processes and strategies of different search engines and give you a 'dry run' of how they will read your site. They will tell you which pages are skipped, which links are ignored, and which errors are encountered. Since the simulators will also reenact how the bots will follow your hyperlinks, you'll see if your robot.txt protocol is interfering with the search engine's ability to read through all the necessary pages.

It's also important to review your robot.txt files, which will enable you to spot any problems and correct them before you submit them to real search engines.

Author Bio:

Lowcarbdiets.eu.com is a website providing information on low carb diets, Atkins diet food and diet products to help you on your way to losing weight.

You can also reach this article by using: search engine optimization services, search engine optimization firm
 
 
 

Related Articles

 
10 First Rate Tips To Getting More Ezine Subscribers
 
Two Imperative Keys For Profitable Pay Per Click Marketing
 
Best Cellular Phone Plans
 
Back It Up With Backup
 
Design vs Content: Who is KING?
 
Scum Bags On the Internet Who Attack Good Ideas
 
ZoneLabs Zone Alarm Pro
 
The Most Important Aspect Of Internet Marketing Ever
 
Worm_Grew.A Threat, Hype, or Dud?
 
Bridging the Gap Between the Page, Keywords and Copywriting
 
 
 
 

Leveraging Website Exit Strategies to Maximize Profit

Tired of tweaking your website in an effort to get more sales? A clever set of exit strategies can r ... - Matt Bacak
 

Make Money on the Internet

If you read my other articles, you know already that there are many ways to make money on the Intern ... - Nathaniel Tabares
 

Google Adsense and Content - Where To Now?

A slightly tongue-in-cheek look at where Google Adsense is going next, and the meaning of Content. - Patricia Howitt
 
 

Further Proof That Blogs Rule the World

Wal*Mart has raised the ire of some of the intelligentsia by employing a major PR firm to get blogge ... - Matt DeAngelis
 

Build a Web Site Quickly with a Web Site Builder

Building web sites by hand takes time. More than that it takes knowledge. Many people don't have the ... - Joe Duchesne
 

The Power Of Blogging

Blogs are becoming more popular nowadays. You will notice that the numbers of blogging sites are inc ... - Sandra S
 

Introduction To Regular Expressions In PHP

Regex can be scary at first but if you can get the basics, it is really not too hard to understand. ... - Bernard Peh
 

Improve Your Website And Increase Profits

To increase the money you make from your website increase the amount of visitors to your website and ... - Gregory marathonge
 
 
Main :> Security & Privacy :> ToS
© 2006-2008 www.awesomeindex.com All Rights Reserved Worldwide.