Robots.txt Directive "Humanizer"

Don't let a misplaced * break your crawl. Paste your directive below to translate it into plain English instantly.


Robots.txt Symbol Cheat Sheet

*
Wildcard: Matches any sequence of characters.
$
End Anchor: Forces the match to the very end of the URL path.
/
Directory: Designates root or folder paths.
#
Comments: Notes for humans that bots skip.
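The wildcard and end-anchor rules above can be sketched in a few lines of Python. Robots.txt patterns are prefix matches unless "$" pins them to the end of the path. This is a minimal illustration, not a full spec-compliant parser, and `rule_matches` is a hypothetical helper, not a standard-library function:

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Check whether a robots.txt path pattern matches a URL path.

    '*' matches any sequence of characters; '$' anchors the match
    to the very end of the path. Without '$', the rule is a prefix match.
    """
    # Escape regex metacharacters, then restore '*' as '.*'.
    regex = re.escape(pattern).replace(r"\*", ".*")
    if regex.endswith(r"\$"):
        regex = regex[:-2] + "$"  # honor the end anchor
    return re.match(regex, path) is not None
```

For instance, `rule_matches("/*.pdf$", "/files/report.pdf")` is true, but the same pattern rejects `/files/report.pdf?v=2` because "$" forces the match to end at `.pdf`.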

Can I share one Robots.txt across subdomains?

No. Each subdomain is treated as a separate host, and robots.txt files are never shared between hosts. You must place a separate file at the root of every subdomain (e.g., https://blog.yoursite.com/robots.txt).
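Because the file is scoped per host, the robots.txt location for any page is always the root of that page's scheme and host. A small sketch using Python's standard library (`robots_url` is a hypothetical helper name):

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url: str) -> str:
    """Return the robots.txt URL for the host that serves page_url.

    Every scheme + host combination has its own robots.txt, so the
    path is always '/robots.txt' at that host's root.
    """
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))
```

So a page on blog.yoursite.com resolves to https://blog.yoursite.com/robots.txt, while the same path on www.yoursite.com resolves to a completely different file.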

Common Issues: The Query String Pitfall

Blocking the ? character too broadly (e.g., Disallow: /*?) is a major technical risk. It can prevent bots from crawling vital CSS/JS files or images that use query parameters for cache-busting, which can cause rendering issues in search engines.
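To see why the broad rule is risky, here is a quick check using simple robots.txt prefix-matching semantics (`disallowed_by` is a hypothetical helper, and the asset URL is an assumed example):

```python
import re

def disallowed_by(pattern: str, path_and_query: str) -> bool:
    """Prefix-match a robots.txt pattern ('*' = any run of characters)."""
    regex = re.escape(pattern).replace(r"\*", ".*")
    return re.match(regex, path_and_query) is not None

# The broad rule 'Disallow: /*?' catches a cache-busted stylesheet:
disallowed_by("/*?", "/assets/app.css?v=20240101")      # blocked

# A narrower rule scoped to one path leaves the stylesheet alone:
disallowed_by("/search?", "/assets/app.css?v=20240101")  # not blocked
```

The safer pattern is to disallow query strings only under the specific paths that generate them, rather than site-wide.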

Technical SEO Guide: Robots.txt FAQ

Does robots.txt stop a page from appearing in search results?

No. It only controls crawling, not indexing. If Google finds links to a blocked page elsewhere on the web, it can still index the URL even though it can't read the page content. To keep a page out of search results, use a noindex directive on a page that bots are allowed to crawl.

Why do some bots ignore my robots.txt file?

Robots.txt is a "voluntary" protocol (formalized as RFC 9309). Reputable search engines honor it, but aggressive scrapers and rogue AI bots often ignore these rules entirely.

How does robots.txt help my "Crawl Budget"?

It keeps bots from wasting crawl requests on low-value pages like internal search results, filter combinations, or admin panels, so they spend their limited budget on the pages you actually want indexed.
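A sketch of what that looks like in practice. The paths below are placeholders, not a recommendation for any specific site; adjust them to your own URL structure:

```
User-agent: *
# Keep bots out of internal search and faceted filter pages
Disallow: /search/
Disallow: /*?sort=
Disallow: /*?filter=

# Admin panels have no business in a search index
Disallow: /admin/
```

Note that each Disallow rule is a prefix match, so /search/ also covers every URL beneath it.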