Robots.txt & Indexing
You don’t need every page crawled. But you do need the right ones.
What Is robots.txt?
Instructions for Crawlers
robots.txt is a plain text file at the root of your domain that tells search engine bots which URLs they can and can’t crawl.
Important:
- It’s about crawling, not indexing
- It’s the first file bots check when visiting a domain
How Does It Work?
Example:
User-agent: *
Disallow: /checkout/
Allow: /blog/
This means all bots may crawl everything on the site except URLs under /checkout/. The Allow line is technically redundant (anything not disallowed is crawlable by default), but it makes the intent for /blog/ explicit.
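Rules can also be scoped to individual crawlers and can point bots at your sitemap. A quick illustrative sketch (the /internal-search/ path and the sitemap URL are placeholders, not recommendations):

User-agent: Googlebot
Disallow: /internal-search/

User-agent: *
Disallow: /checkout/

Sitemap: https://www.example.com/sitemap.xml

Each bot obeys only the most specific User-agent group that matches it, so Googlebot would follow its own rules here while every other crawler follows the * group.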
Common robots.txt Mistakes
What NOT to Do
Checklist:
- ❌ Blocking the entire site with Disallow: /
- ❌ Disallowing pages that need to be indexed
- ❌ Forgetting to allow CSS/JS (blocked assets break rendering)
- ❌ Thinking 'Disallow' prevents indexing (use noindex instead; see the snippet after this list)
- ❌ Not updating when site structure changes
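If the goal is keeping a URL out of search results rather than out of the crawl, leave it crawlable and give it a noindex directive instead. A minimal illustration of the two common forms:

In the page's <head>:
<meta name="robots" content="noindex">

Or as an HTTP response header (useful for PDFs and other non-HTML files):
X-Robots-Tag: noindex

Either way the URL must stay crawlable: if it is also disallowed in robots.txt, bots never fetch it, never see the noindex, and the page can linger in the index as a URL-only result.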
Best Practices
Crawl Smarter, Not Harder
Tips:
- ✅ Allow all assets needed for rendering
- ✅ Block low-value or duplicate URLs (cart, filters, etc.; see the sample file after this list)
- ✅ Always test changes in Google Search Console's robots.txt report
- ✅ Place robots.txt at yourdomain.com/robots.txt
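Putting the tips together, here is a sketch of a lean robots.txt for a typical store; every path and the sitemap URL are placeholders to adapt to your own URL structure:

User-agent: *
Disallow: /cart/
Disallow: /checkout/
Disallow: /*?filter=

# Only needed if a broader Disallow rule would otherwise catch these assets
Allow: /*.css$
Allow: /*.js$

Sitemap: https://www.yourdomain.com/sitemap.xml

Blocking parameterized filter URLs saves crawl budget, but they can still be indexed if other sites link to them, so pair the rule with canonical tags or noindex where that matters. (The * wildcard and $ anchor are honored by Google and Bing, though not necessarily by every crawler.)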
Related Technical Pages
Build Around Your Stack
XML Sitemaps
Give search engines a clean list of the URLs you actually want crawled and indexed.
Indexing Basics
How pages get discovered, crawled, rendered, and added to Google's index.
Canonical Tags
Point duplicate or near-duplicate URLs to a preferred version so ranking signals consolidate.
Control the Crawl
Don’t let Google waste time on URLs that don’t help you rank.