What is the use of robots.txt?

In short:

Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called the Robots Exclusion Protocol.

It works like this: a robot wants to visit a Web site URL, say http://www.example.com/welcome.html. Before it does so, it first checks for http://www.example.com/robots.txt, and finds:

User-agent: *

Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.
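As an illustrative sketch, the two-line policy above can also be evaluated programmatically with Python's standard-library urllib.robotparser, which implements the same exclusion rules:

```python
# Sketch: evaluating the example robots.txt rules with Python's
# standard-library urllib.robotparser.
from urllib.robotparser import RobotFileParser

# The same two-line policy shown above: every robot, everything blocked.
rules = [
    "User-agent: *",
    "Disallow: /",
]

rp = RobotFileParser()
rp.parse(rules)

# Under this policy no crawler may fetch any page of the site.
print(rp.can_fetch("*", "http://www.example.com/welcome.html"))  # False
```

A real crawler would normally call set_url() and read() to fetch the live /robots.txt over HTTP instead of parsing a hard-coded list, but the checking logic is the same.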

There are two important considerations when using /robots.txt:

robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers, will pay no attention.

the /robots.txt file is a publicly available file. Anyone can see which sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.

The details

The /robots.txt file is a de-facto standard and is not owned by any standards body. There are two historical descriptions:

the original 1994 A Standard for Robot Exclusion document;

a 1997 Internet Draft specification, A Method for Web Robots Control.

In addition there are external resources:

the HTML 4.01 specification, Appendix B.4.1

Wikipedia - Robots Exclusion Standard
 
 
A robots.txt file is a file at the root of your site that indicates those parts of your site you don’t want to be accessed by search engine crawlers. The file uses the Robots Exclusion Standard, which is a protocol with a small set of commands that can be used to indicate access to your site by section and by specific kinds of web crawlers (such as mobile crawlers vs desktop crawlers).
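For example, a file with per-crawler sections might look like the sketch below. The agent name and paths are invented for illustration; a robot obeys the record whose User-agent line best matches its name, and falls back to the "*" record otherwise:

User-agent: *

Disallow: /search/

User-agent: ExampleBot-Mobile

Disallow: /

Here most crawlers are only kept out of /search/, while the hypothetical mobile crawler "ExampleBot-Mobile" is excluded from the whole site. A blank line separates one record from the next.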
 
 
The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned.
 

Yes, I agree with you.
 
The robots.txt is a simple text file on your web site that tells search engine bots how to crawl and index your website or web pages. It is great when search engines frequently visit your site and index your content, but there are often cases when indexing parts of your online content is not what you want.
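For instance, to keep crawlers out of selected sections while leaving the rest of the site open to indexing, a file like the following sketch would do (the directory names are made up for illustration):

User-agent: *

Disallow: /drafts/

Disallow: /cgi-bin/

Everything not matched by a Disallow line remains available to crawlers, so only the listed directories are excluded.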
 