Robots.txt Generator
Generate a valid robots.txt file for your website.
What is robots.txt?
The robots.txt file is a plain text file placed in the root of a website that tells search engine crawlers which pages or sections they may or may not crawl. It is part of the Robots Exclusion Protocol (REP), a widely adopted convention that was standardized as RFC 9309 in 2022.
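As a rough illustration of how a compliant crawler might read such rules, the sketch below feeds a made-up rule set to Python's standard `urllib.robotparser` module. The `ExampleBot` name, the example.com URLs, and the /admin/ path are placeholders chosen for illustration; they are not output of this tool.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block /admin/ for every crawler, allow everything else.
SAMPLE_RULES = """\
User-agent: *
Disallow: /admin/
"""

parser = RobotFileParser()
parser.parse(SAMPLE_RULES.splitlines())

# A compliant crawler checks each URL against the rules before fetching it.
admin_allowed = parser.can_fetch("ExampleBot", "https://example.com/admin/login")
blog_allowed = parser.can_fetch("ExampleBot", "https://example.com/blog/post")
print(admin_allowed, blog_allowed)  # False True
```

Individual crawlers may interpret edge cases differently from the standard library parser, so treat this as a quick sanity check rather than a guarantee of how any particular search engine behaves.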
How to Use This Tool
- Select which user-agents (crawlers) you want to configure.
- Add disallow rules for paths you want to block (e.g., /admin/, /private/).
- Add your sitemap URL as a full absolute URL (e.g., https://yourdomain.com/sitemap.xml).
- Download the generated robots.txt file.
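The steps above could be reproduced in a few lines of Python. This is only a minimal sketch of what a generator might do, not this tool's actual implementation; the function name `build_robots_txt`, its parameters, and the example.com sitemap URL are all hypothetical.

```python
def build_robots_txt(rules, sitemap_url=None):
    """Render robots.txt text from a {user_agent: [disallowed_path, ...]} mapping."""
    lines = []
    for user_agent, disallowed_paths in rules.items():
        lines.append(f"User-agent: {user_agent}")
        if disallowed_paths:
            lines.extend(f"Disallow: {path}" for path in disallowed_paths)
        else:
            # An empty Disallow value means nothing is blocked for this group.
            lines.append("Disallow:")
        lines.append("")  # a blank line separates groups
    if sitemap_url:
        lines.append(f"Sitemap: {sitemap_url}")
    # Normalize to exactly one trailing newline.
    return "\n".join(lines).rstrip("\n") + "\n"


# Hypothetical input: one group for all crawlers plus a sitemap.
robots_txt = build_robots_txt(
    {"*": ["/admin/", "/private/"]},
    sitemap_url="https://example.com/sitemap.xml",
)
print(robots_txt)
```

With the input shown, the printed file contains a `User-agent: *` group with two `Disallow` lines, a blank line, and the `Sitemap` line.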
Important Notes
robots.txt is advisory, not an enforcement mechanism: well-behaved crawlers follow it, but malicious bots may ignore it. Do not use robots.txt to hide sensitive content. The file is publicly readable, so listing a path in it reveals that the path exists; use authentication instead. Disallowing a page in robots.txt does not remove it from Google's index if other pages link to it.
Frequently Asked Questions
Does robots.txt prevent pages from being indexed?
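Disallowing a URL in robots.txt prevents crawling but not indexing. Google may still index a URL it has never crawled if other sites link to it. To prevent indexing, use the noindex meta tag (or the X-Robots-Tag HTTP header) and leave the page crawlable: a crawler that is blocked by robots.txt never fetches the page, so it never sees the noindex directive.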
Where do I place robots.txt?
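The file must be at the root of the host: https://yourdomain.com/robots.txt. Crawlers do not look for it in a subdirectory, and each subdomain (for example, shop.yourdomain.com) needs its own robots.txt.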
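To make the location rule concrete, the following sketch derives the robots.txt URL that would apply to a given page, assuming crawlers look the file up per scheme and host. The helper name `robots_txt_url` and the sample page URLs are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url):
    """Return the robots.txt URL that would govern the given page URL."""
    parts = urlsplit(page_url)
    # Keep the scheme and host (including any port); replace the path
    # and drop the query string and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


main_site = robots_txt_url("https://yourdomain.com/blog/post?id=7")
shop_site = robots_txt_url("https://shop.yourdomain.com/cart")
print(main_site)  # https://yourdomain.com/robots.txt
print(shop_site)  # https://shop.yourdomain.com/robots.txt
```

Note that the two pages resolve to different files because they live on different hosts.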