Saturday, August 30, 2014

Sitemaps Best Practices Including Large Web Sites

Fabrice CanelAugust 28, 2014August 8, 2014426shares4CommentsOne of the key Search Engine Optimization (SEO) strategies for web sites is to have high quality sitemaps helping search engines to discover and access all relevant content posted on that web site. Sitemaps offer this really simple way for site owners to share information with every search engine about the content they have on their site instead of having to rely solely on crawling algorithms (ie: crawlers, robots) to find it.

The Sitemaps protocol defined at www.sitemaps.org, is a now widely supported. Often web sites and some Content Management Systems (CMSs) offers sitemaps by default or as an option. Bing even offers an open source server-side technology, Bing XML Sitemap Plugin, for websites running on Internet Information Services (IIS) for Windows® Server, as well as Apache HTTP Server.

Best Practices if You Want to Enable SitemapsIf you don’t have a sitemap yet, we recommend first that you explore if your web site or your CMS can manage this, or install a sitemap plugin.

If you have to, or want to, develop your own sitemaps, we suggest the following best practices:

First, follow the sitemaps reference at www.sitemaps.org. Common mistakes we see are people thinking that HTML Sitemaps are sitemaps, malformed XML Sitemaps, XML Sitemaps too large (max 50,000 links and up to 10 megabytes uncompressed) and links in sitemaps not correctly encoded.Have relevant sitemaps linking to the most relevant content on your sites. Avoid duplicate links and dead links: a best practice is to generate sitemaps at least once a day, to minimize the number of broken links in sitemaps.Select the right format:Use RSS feed, to list real-time all new and updated content posted on your site, during the last 24 hours. Avoid listing only the past 10 newest links on your site, search engines may not visit RSS as often as you want and may miss new URLs. (This can also be submitted inside Bing Webmaster Tools as a Sitemap option.)Use XML Sitemap files and sitemaps index file to generate a complete snapshot of all relevant URLs on your site daily.Consolidate sitemaps: Avoid too many XML Sitemaps per site and avoid too many RSS feeds: Ideally, have only one sitemap index file listing all relevant sitemap files and sitemap index files, and only one RSS listing the latest content on your site.Use sitemap properties and RSS properties as appropriate.Tell search engines where our sitemaps XML URLs and RSS URLs are located by referencing them in your robots.txt files or by publishing the location of your sitemaps in search engines’ Webmaster Tools.Scaling Up Sitemaps to Very Large SitesInterestingly some sites these days, are large… really large… with millions to billions of URLs. Sitemap index files or sitemap files can link up to 50,000 links, so with one sitemap index file, you can list 50,000 x 50,000 links

View the Original article