We're now on LinkedIn: For news and resources from Google Search on making your site discoverable, follow us on LinkedIn.

Controlling crawling and indexing now documented on code.google.com

Wednesday, November 24, 2010

Do you know how Google's crawler, Googlebot, handles conflicting rules in your robots.txt file? Do you know how to prevent a PDF file from being indexed? Do you know Googlebot's favorite song? The answers to these questions (except for the last one :)), along with lots of other information about controlling the crawling and indexing of your site, are now available on code.google.com:

Controlling crawling and indexing

Now site owners have a comprehensive resource where they can learn about robots.txt files, robots meta tags, and X-Robots-Tag HTTP header rules. Please share your comments, and if you have questions you can post them in our Webmaster Help Forum.

Posted by Jonathan Simon, Webmaster Trends Analyst

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

LinkedIn
Join us on LinkedIn
YouTube
Watch our videos
Blog
Subscribe to our RSS feed
Podcast
Listen to Search Off the Record
X (Twitter)
Join us on X (Twitter)