Definition of Robots/Robots.txt


The robots.txt protocol is also known as the robots exclusion standard or the Robots Exclusion Protocol. It is designed to keep cooperating spiders out of parts of a website that are otherwise viewable by the general public. A robots.txt file placed at the root of a website makes a request that states which files or directories the specified robots should ignore when they crawl the site.
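As a rough illustration of how a cooperating spider might read such a request, the sketch below parses a small robots.txt file with Python's standard urllib.robotparser module. The crawler names (ExampleBot, OtherBot), the host www.example.com, and all paths are made up for the example and do not come from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the crawler name and paths are invented for illustration.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /private/

User-agent: *
Disallow: /drafts/
Disallow: /tmp/report.html
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content that is already held in memory."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "http://www.example.com"  # assumed host, not a real deployment

# A well-behaved spider asks before each request.
for agent, path in [
    ("ExampleBot", "/private/x.html"),
    ("ExampleBot", "/index.html"),
    ("OtherBot", "/drafts/a.html"),
    ("OtherBot", "/private/x.html"),
]:
    allowed = parser.can_fetch(agent, base + path)
    print(f"{agent:10} {path:18} {'allowed' if allowed else 'ignored'}")
```

In this sketch the directory rule covers everything beneath /private/ for ExampleBot, while the rules under the wildcard apply to any robot that has no group of its own.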

A separate robots.txt file must be set up for each sub-domain of a website. The file can indicate which directories and files should be ignored, either to keep information private or because the content might be misleading when seen out of context. However, the protocol is purely advisory: nothing obliges spiders or robots to obey the request, so cooperation is not guaranteed.
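The per-sub-domain requirement follows from where a spider looks for the file: at the root of whichever host it is visiting. A minimal sketch, assuming a hypothetical helper named robots_url and made-up example.com hosts:

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt location for the host that serves page_url.

    Hypothetical helper for illustration: it keeps the scheme and host
    and replaces the path, query, and fragment with /robots.txt.
    """
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Two sub-domains of the same site resolve to two different files.
print(robots_url("http://www.example.com/products/list.html"))
print(robots_url("http://blog.example.com/posts/1?x=2"))
```

Rules published on one host therefore say nothing about another host, even when both belong to the same website.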

