• Welcome to NamesLot.com Domain Name Forum

    NamesLot.com Domain Name Forum offers a fully open forum to discuss domain industry news and a 0% commission marketplace for you to buy and sell domain names.

    We have reorganized our Marketplace so now it is easier to get attention to your domain while it is also easier for buyers to find the right domain name.

    If you want us to add more threads to our marketplace, please contact us! Listing on our marketplace as always, 100% free! Register NOW or Login HERE!

Robots.txt

Status
Not open for further replies.

vjackcon

New Member
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way from preventing search engines from crawling your site (i.e. it is not a firewall, or a kind of password protection) and the fact that you put a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door – e.g. you cannot prevent thieves from coming in but the good guys will not open to door and enter.
Structure of a Robots.txt File

The structure of a robots.txt is pretty simple (and barely flexible) – it is an endless list of user agents and disallowed files and directories. Basically, the syntax is as follows:

User-agent:

Disallow:

“User-agent” are search engines' crawlers and disallow: lists the files and directories to be excluded from indexing. In addition to “user-agent:” and “disallow:” entries, you can include comment lines – just put the # sign at the beginning of the line:

# All user agents are disallowed to see the /temp directory.

User-agent: *

Disallow: /temp/
 
"Robots.txt" is a approved text file that through its name, has appropriate aim to the majority of "honorable" robots on the web. By defining a few rules in this text file, you can inform robots to not crawl and basis definite files, directories within your site, or at all.
 
Robots.txt is a regular text file here. Its name has a special meaning to the majority of honorable robots on the web. This file is very helpful to us.
 
It contains restrictions for Web Spiders, telling them where they have permission to search. It is like defining rules for search engine spiders (robots) what to follow and what not to. It provides you with more functionality than Meta robots tag which is available only partially to control behaviour of search engines. You can use it to prevent indexing totally, prevent certain areas of your site from being indexed or to issue individual indexing instructions to specific search engines. Robot.txt protocols are simply advisory though. There is no law requiring websites to have Robot.txt files, or to use them on their web pages.



Thanks,
 
from what i have learned.. .robots are there to be used as parameters.. .or in a way limit search engines for your data base to be indexed.. .so, i guess its kinda in your side.. .cause it sounds so cool.. .
 
Robot.txt is a text file where in search engine spiders will read important pages from your sites and disregard unnecessary pages.
 
Status
Not open for further replies.

Members online

No members online now.

Forum statistics

Threads
19,799
Messages
69,796
Members
44,585
Latest member
tsscgroup
Active members today
1
New members today
0
New threads today
3
New posts today
3

Follow NamesLot on Twitter!

NamesLot proudly supported by

NamesLot proudly supported by

Top