What is crawl-delay and does Google respect it?

**Crawl-delay** is the number of seconds between consecutive bot requests. `Crawl-delay: 5` means "wait 5 seconds". **Googlebot ignores it** since 2019 and uses its own algorithm (tune in Search Console > Settings > Crawl rate). **Bingbot, Yandex, Yahoo Slurp respect it**. Values 1-10 are the typical range, more usually makes no sense.

How does Disallow differ from noindex?

**Disallow** in robots.txt blocks **access** to the file, **noindex** in a meta tag blocks **indexing**. If you want a page **completely invisible in Google**, use **noindex** (Google must have access to see it). If you want to **save crawl budget** (e.g. for 10,000 internal search results), use **Disallow**.

What does "User-agent: *" do?

The asterisk is a **wildcard**, meaning "every bot without exception". It is the **default group** that bots without their own rules will use. In practice you **almost always** start your robots.txt with `User-agent: *` and add separate groups only for bots you want to treat differently (e.g. Googlebot with more freedom, GPTBot with a total block).

How do I block every AI bot (ChatGPT, Claude, Perplexity)?

Click **"Block AI crawlers"** in the tool. It adds separate groups for **GPTBot** (OpenAI), **ChatGPT-User** (ChatGPT browse), **ClaudeBot** (Anthropic), **anthropic-ai**, **CCBot** (Common Crawl, the source for many models), **Google-Extended** (data for Gemini), **PerplexityBot**, **Bytespider** (ByteDance), each with `Disallow: /`. This is a **non-binding request**, but every listed company publicly says they honour it.

Will "Disallow: /admin" also block "/admin/login"?

**Yes.** No trailing slash = **prefix match**. `Disallow: /admin` blocks **everything** that starts with `/admin`: `/admin`, `/admin/`, `/admin/login`, but also `/administrator` (because "administrator" starts with "admin"). If you only want the `/admin/` folder, use `Disallow: /admin/` (with trailing slash). The tool warns you when you type a path without a slash.

What happens if I leave "Disallow: /"?

**You block the entire site** for that bot. This is the **standard setting for staging and dev environments**, but if you accidentally leave it in production **the site will drop out of Google within days**. The tool will show a large warning. After deploy, always check `your-domain.com/robots.txt` to confirm the file is what you wanted.

Where do I upload robots.txt on the server?

**Root directory of your domain**, right next to your `index.html` / home page. It must be reachable at `https://your-domain.com/robots.txt` (not `https://your-domain.com/static/robots.txt` or `/public/robots.txt`). For **Next.js** drop it in as `public/robots.txt` (or use the `app/robots.ts` API route). For **WordPress** add a physical file in the root directory (via FTP); SEO plugins often do this automatically.

Can I have multiple sitemap.xml files?

**Yes.** In robots.txt you can list **several `Sitemap:` lines**, each pointing to a different file. The typical layout for large sites: `sitemap-pages.xml` (static pages), `sitemap-blog.xml` (articles), `sitemap-products.xml` (shop products). You can also have a **sitemap index** (`sitemap.xml` pointing to a list of other maps). Google happily accepts both approaches.

Why does my robots.txt not work despite correct syntax?

Most common reasons: **1)** The file is at the wrong path (must be in the domain root). **2)** CDN cache (Cloudflare). Force a cache purge. **3)** The file returns **HTTP 404** instead of **HTTP 200** (Google treats this as "no file", meaning everything is allowed). **4)** The file returns **HTTP 500** (Google stops crawling the entire site until you fix it). Check both the HTTP status **and** the [robots.txt validator](/en/robots-sitemap-validator).

robots.txt builder - free

Presets (one click)

Quick rule sets for typical situations

Rule groups

Each group = one User-agent + its rules. The first group is usually * (every bot).

Group #1

User-agent

Quick add bot:

Bot name (`*` = every bot, `Googlebot`, `Bingbot`, `GPTBot` etc.)

Allow (one path per line)Paths the bot CAN access. Typically used to whitelist a subpath inside a blocked section.Disallow (one path per line)Paths the bot cannot access. `/admin/` blocks the folder, `/` blocks everything, empty = nothing.

Crawl-delay (optional, seconds)Pause between requests. Bingbot/Yandex honour it, Googlebot ignores it since 2019.

Sitemap

Full URLs of your sitemap.xml files. One URL per line.

Warnings

No warnings, the file looks fine.

Preview

robots.txt

89 B · 5 lines

User-agent: *
Disallow: /admin/
Disallow: /api/

Sitemap: https://example.com/sitemap.xml

What is robots.txt and why bother setting one up?

robots.txt is the first file Googlebot, Bingbot or GPTBot reaches for before reading your site. It tells them: "these paths you can browse, those you can't". It's a plain text file sitting at `https://your-domain.com/robots.txt`.

Here you click and pick which bots get which rules, add sitemap URLs and immediately see the finished file ready to copy onto your server. You can also block every AI crawler with one button (GPTBot, ClaudeBot, PerplexityBot) if you don't want your content ending up in language models.

Important: robots.txt is a request, not a security measure. Well-behaved bots (Google, Bing) listen, but malicious scrapers will ignore the file. For real protection use authentication, a web application firewall or IP blocking.

How to use it

Decide what rules you want. Usually one `User-agent: *` group (all bots) is enough.

For each group add Allow paths (allowed) and Disallow paths (blocked). E.g. `Disallow: /admin/` blocks the admin panel.

Type sitemap URLs in the Sitemap field. They should be full URLs with `https://`.

Use the preset buttons (block AI, block staging, allow everything except admin) to save time.

Copy the generated file or download it as `robots.txt`. Drop it into the root of your site (next to index.html). Verify at `your-domain.com/robots.txt`.

When this tool helps

The most common scenarios where you need to set up robots.txt:

Blocking AI scrapers. GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot, CCBot (Common Crawl), Google-Extended. More companies don't want their content training language models. The "Block AI" preset handles it.
Hiding admin panel and API. `Disallow: /admin/`, `Disallow: /api/`, `Disallow: /wp-admin/`. Should not show in Google results.
Staging and test environments. `staging.your-company.com` must stay invisible to Google. Full block: `Disallow: /`.
Pointing to sitemaps. Google finds all pages faster when robots.txt contains a line like `Sitemap: https://your-domain.com/sitemap.xml`.
Crawl-delay for slow servers. If your server has a weak CPU and Bingbot is generating too much load, add `Crawl-delay: 10` (10-second pause between requests). Googlebot doesn't support this, use Search Console instead.
Different rules for different bots. You can let Google in everywhere but block Yandex on specific paths. Each User-agent gets its own group.

After generating the file check it with the robots.txt + sitemap.xml validator. After uploading to the server also add sitemap.xml, so Google discovers all pages faster.

Questions and answers

No. robots.txt says "don't visit this page", while `<meta name="robots" content="noindex">` says "you can visit, but don't show in results". What's more, if you block a page in robots.txt, Google won't see the noindex inside it, so the page may still end up in the index (without a description, just as a URL). For hiding content, prefer noindex; for saving crawl budget, robots.txt.

Presets (one click)

Quick rule sets for typical situations

Rule groups

Each group = one User-agent + its rules. The first group is usually * (every bot).

Group #1

User-agent

Quick add bot:

Bot name (`*` = every bot, `Googlebot`, `Bingbot`, `GPTBot` etc.)

Crawl-delay (optional, seconds)Pause between requests. Bingbot/Yandex honour it, Googlebot ignores it since 2019.

Sitemap

Full URLs of your sitemap.xml files. One URL per line.

Warnings

No warnings, the file looks fine.

Preview

robots.txt

89 B · 5 lines

User-agent: *
Disallow: /admin/
Disallow: /api/

Sitemap: https://example.com/sitemap.xml

What is robots.txt and why bother setting one up?

How to use it

Decide what rules you want. Usually one `User-agent: *` group (all bots) is enough.

For each group add Allow paths (allowed) and Disallow paths (blocked). E.g. `Disallow: /admin/` blocks the admin panel.

Type sitemap URLs in the Sitemap field. They should be full URLs with `https://`.

Use the preset buttons (block AI, block staging, allow everything except admin) to save time.

Copy the generated file or download it as `robots.txt`. Drop it into the root of your site (next to index.html). Verify at `your-domain.com/robots.txt`.

When this tool helps

The most common scenarios where you need to set up robots.txt:

Blocking AI scrapers. GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot, CCBot (Common Crawl), Google-Extended. More companies don't want their content training language models. The "Block AI" preset handles it.
Hiding admin panel and API. `Disallow: /admin/`, `Disallow: /api/`, `Disallow: /wp-admin/`. Should not show in Google results.
Staging and test environments. `staging.your-company.com` must stay invisible to Google. Full block: `Disallow: /`.
Pointing to sitemaps. Google finds all pages faster when robots.txt contains a line like `Sitemap: https://your-domain.com/sitemap.xml`.
Crawl-delay for slow servers. If your server has a weak CPU and Bingbot is generating too much load, add `Crawl-delay: 10` (10-second pause between requests). Googlebot doesn't support this, use Search Console instead.
Different rules for different bots. You can let Google in everywhere but block Yandex on specific paths. Each User-agent gets its own group.

After generating the file check it with the robots.txt + sitemap.xml validator. After uploading to the server also add sitemap.xml, so Google discovers all pages faster.

Questions and answers

robots.txt builder

What is robots.txt and why bother setting one up?

How to use it

When this tool helps

Questions and answers

Related tools

sitemap.xml builder

Meta tags suite generator

JSON-LD schema builder

robots.txt + sitemap.xml validator

OpenGraph / Twitter Card Preview

robots.txt builder

What is robots.txt and why bother setting one up?

How to use it

When this tool helps

Questions and answers

Related tools

sitemap.xml builder

Meta tags suite generator

JSON-LD schema builder

robots.txt + sitemap.xml validator

OpenGraph / Twitter Card Preview