Lỗi no link to sitemap.xml found in robots.txt file
Blogger Help Sign in Google Help
Send feedback on... This help content & information General Help Center experience
Blogger Sitemaps and robots.txt files both serve important purposes for search engine crawling and indexation. A sitemap is an XML file that lists all the pages of a website. It allows search engines like Google to more efficiently crawl a site by informing them of all the URLs to be indexed. Sitemaps are especially useful for sites with a large number of pages. The robots.txt file gives crawling instructions to search bots. It can list parts of a site that should not be accessed or indexed (though it’s not its primary purpose). The robots.txt file also commonly indicates the location of the sitemap with a line like:
However, many sites omit a sitemap reference in their robots.txt file altogether. In some cases this can lead to issues with indexation, but other times it may not matter. In this article, we’ll look at when the omission of a sitemap in robots.txt can be problematic versus when it is likely fine. When your site should have the sitemap indicated in robots.txtThe robots.txt file serves as the instruction manual for search engine crawlers visiting a website. It is typically the very first file they will access to determine indexing directives. By listing the sitemap location in robots.txt, you immediately notify search bots of its existence as soon as they start crawling. This makes the sitemap URL one of the first pages fetched and parsed. Having the sitemap declared in robots.txt tends to be most crucial for:
The common thread is that declaring the sitemap in robots.txt makes Google distinctly aware of its existence. Search bots will spider and parse it early on when first crawling a website. While a sitemap can be submitted directly in Search Console, robots.txt catches any new bots and future re-crawls. Let me know if you would like me to modify or add anything to this more detailed section! When your site doesn’t need to have the XML sitemap indicated in robots.txtFor some websites, it is not crucial to have the sitemap URL declared in the robots.txt file. Omitting it may cause no issues at all. Types of sites that likely do not require a sitemap reference in robots.txt include:
The commonality is that these types of sites all have a reduced need for the fast discovery and change monitoring that declaring a sitemap in robots.txt facilitates. As long as Google is sufficiently crawling new and existing content via regular periodic scans, the omission may have minimal impact. However, adding the sitemap reference is still good practice when possible. There are few downsides, and it provides benefits like redundancy and faster crawl prioritization. |