Prohibition of indexing of different types of robots.txt files

by Nikolaenko Maxim · Published September 20, 2012 · Updated January 4, 2017

Recently on the Internet, in one English forum, found the list commands to block file indexing for expansion and various addresses on the site through a file robots.txt. I decided that it might be useful to someone in three cases.

If you do not want to show hackers sites that you yourself programmed.
In order to prevent indexing of canonical pages, pages that are similar and are not taken into account by search engines, but they can lower the site in search results. Although only developers of search engines and analytical systems can judge this, if they do it.
When developing a closed site, it is also desirable to indicate a ban on indexing, but you can make a complete ban on site indexing.

So, maybe if instead of using
User-agent: Googlebot-Image
Disallow: / 

You tried:
User-agent: Googlebot-Image
Disallow: /

User-agent: Googlebot
Disallow: /images/
Disallow: /img/
Disallow: /icons/
Disallow: /icons/small/
Disallow: /gallery/
Disallow: /graphics/
Disallow: /gfx/
Disallow: /buttons/
Disallow: /thumbs/
Disallow: /thumbnails/
Disallow: /*.pdf$
Disallow: /*.ico$
Disallow: /*.tif$
Disallow: /*.pict$
Disallow: /*.png$
Disallow: /*.gif$
Disallow: /*.jpg$
Disallow: /*.jpeg$
Disallow: /*.doc$
Disallow: /*.xls$
Disallow: /*.pps$
Disallow: /*.ppt$
Disallow: /*.eml$
Disallow: /*.url$
Disallow: /*.log$
Disallow: /*.txt$
Disallow: /*.js$
Disallow: /*.pac$
Disallow: /*.css$
Disallow: /*.csv$
Disallow: /*.ext$
Disallow: /*.class$
Disallow: /*.cls$
Disallow: /*.jar$
Disallow: /*.java$
Disallow: /*.c$
Disallow: /*.htx$
Disallow: /*.idc$
Disallow: /*.qry$
Disallow: /*.wo$
Disallow: /*.woa$
Disallow: /*.wos$
Disallow: /*.lp$
Disallow: /*.ls$
Disallow: /*.lsp$
Disallow: /*.au$
Disallow: /*.mid$
Disallow: /*.wav$
Disallow: /*.avi$
Disallow: /*.dat$
Disallow: /*.mov$
Disallow: /*.mpeg$
Disallow: /*.mpg$
Disallow: /*.dir$
Disallow: /*.dcr$
Disallow: /*.dxr$
Disallow: /*.aam$
Disallow: /*.aas$
Disallow: /*.aab$
Disallow: /*.fh$
Disallow: /*.spl$
Disallow: /*.swf$
Disallow: /*.fla$
Disallow: /*.ipx$
Disallow: /*.bin$
Disallow: /*.hqx$
Disallow: /*.sea$
Disallow: /*.sit$
Disallow: /*.dmg$
Disallow: /*.conf$
Disallow: /*.plist$
Disallow: /*.cab$
Disallow: /*.dll$
Disallow: /*.exe$
Disallow: /*.zip$
Disallow: /*.tar$
Disallow: /*.gz$
Disallow: /*.gzip$
Disallow: /*?
Disallow: /*.t$
Disallow: /*.cgi$
Disallow: /*.pl$
Disallow: /*.plx$
Disallow: /*.pm$
Disallow: /*.py$
Disallow: /*.pyc$

Complete prohibition of the site for indexing through robots.txt done like this:

User-Agent: *
Disallow: /

Prevent indexing of php files:

User-agent: Googlebot
Disallow: /*.php$

Online tutoring services. List of courses I teach

Basic web design course;
Site layout;
General course on CMS WordPress and continuation of the course on template development;
Website development in PHP.

Nastya says:
December 5, 2012 at 9:54 pm
Hello, Tell me please, what does string mean 88 Disallow: /*? – prohibition of what? pages without extension?
- WordPress Tutorials says:
  December 6, 2012 at 3:03 pm
  I think it's correct to read it like this. Do not index if internal pages of the site have a GET request.
  For example:
  So it will index:
  http://wp-admin.com.ua/zapret-indeksatsii-raznyih-tipov-faylov-robots-txt/#comment-728559463
  But there will be no such link:
  http://wp-admin.com.ua/zapret-indeksatsii-raznyih-tipov-faylov-robots-txt?zapros=123
  * – in this case means any number of any characters between the first (root) slash in the address and a question mark. Roughly speaking, all pages in which there is a question mark in the address.
  If something is not clear write.

WordPress Tutorials

Prohibition of indexing of different types of robots.txt files

You may also like...

2 Responses

Leave a Reply Cancel reply

Translation

WordPress

Programming

Responsive

Free video tutorials

Ukrainian software

The best freelance in Ukraine

Services: WordPress

SEO services

For students / Clients

Website development from our studio

Hosting for USA and Europe

Where to earn

Prohibition of indexing of different types of robots.txt files

You may also like...

How to Read a Waterfall Chart

IT IS: keyword effectiveness index

How to speed up a wordpress site?

2 Responses

Leave a Reply Cancel reply

Translation

WordPress

Programming

Responsive

Free video tutorials

Ukrainian software

The best freelance in Ukraine

Services: WordPress

SEO services

For students / Clients

Website development from our studio

Hosting for USA and Europe

Where to earn