# $Id: robots.txt,v 1.9.2.1 2008/12/10 20:12:19 goba Exp $ # # robots.txt # # This file is to prevent the crawling and indexing of certain parts # of your site by web crawlers and spiders run by sites like Yahoo! # and Google. By telling these "robots" where not to go on your site, # you save bandwidth and server resources. # # This file will be ignored unless it is at the root of your host: # Used: http://example.com/robots.txt # Ignored: http://example.com/site/robots.txt # # For more information about the robots.txt standard, see: # http://www.robotstxt.org/robotstxt.html # # For syntax checking, see: # http://www.frobee.com/robots-txt-check User-agent: PetalBot Disallow: / User-agent: SemrushBot Disallow: / User-agent: SemrushBot-SA Disallow: / User-agent: yacybot Disallow: / User-agent: dotbot Disallow: / User-agent: MJ12bot Disallow: / User-agent: AhrefsBot disallow: / User-agent: sistrix Disallow: / User-agent: Yandex Disallow: / User-agent: BotOnParade Disallow: / User-agent: amibot Disallow: / User-agent: gonzo* Disallow: / User-agent: Baiduspider Disallow: / User-agent: GingerCrawler Disallow: / User-agent: ICCrawler - iCjobs Disallow: / User-agent: Slurp Disallow: / User-agent: googlebot-image Allow: /content/ User-agent: googlebot-mobile Allow: /content/ User-agent: yahoo-mmcrawler Disallow: / User-agent: TurnitinBot Disallow: / User-agent: psbot Disallow: / User-agent: asterias Disallow: / User-agent: yahoo-blogs/v3.9 Disallow: / User-agent: URLSpion Disallow: / User-agent: Googlebot Allow: /content/ User-agent: RBot Disallow: / User-agent: * Disallow: /BAUSATZ Allow: /content/ Crawl-delay: 120 User-agent: metajobbot Disallow: / # Directories Disallow: /content/sites/default/files/css Disallow: /content/sites/default/files/color Disallow: /content/sites/default/files/js Disallow: /includes/ Disallow: /misc/ Disallow: /modules/ Disallow: /profiles/ Disallow: /scripts/ Disallow: /themes/ # Files Disallow: /CHANGELOG.txt User-agent: * Disallow: /AAAevil.php Allow: /AAAnice.php User-agent: Cliqzbot Disallow: / Sitemap: http://aatis.de/content/sitemap.xml