Givin' it to Google: Using an XML Sitemap to Boost Your Site

Givin' it to Google: Using an XML Sitemap to Boost Your Site

Search engine experts have been hounding Google to accept XML feeds from webmasters who want to update their site information. When Google recently introduced the beta version of Google Sitemaps, they showed experts that they have been listening – at least in part.
Google Sitemaps is a very useful feature for large shopping sites and other sites with frequently updated or changing content. It ensures that the search engine result listings related to their web site stay fresh and up to date.
The initial results of Google Sitemaps seems encouraging. In a recent test, pages were crawled by Google within 14 hours of submitting a sitemap file. Without a sitemap file, this site had taken weeks until updates were detected and applied.
Some search engine experts are discouraged that Google didn’t implement a real-time ping server that allows sitemap submissions to occur within seconds. And, that feeds need to be continually submitted when changes occur. Nevertheless, offering Google Sitemaps is a step in the right direction.
It’s also empowering one.
Until the availability of Google Sitemaps, Yahoo was the only search engine that offered webmasters the ability to submit an XML feed to notify the search engine of site changes. However, it has only been offered by paid inclusion. Google argues that no one should pay to be included in the regular search results, and is apparently not planning to implement a similar fee based service.
Nuts and Bolts
The sitemaps protocol uses very simple XML tags. Four tags are used to define individual pages:
location – This is the URL path to your web, beginning with http://.
For example, http://www.mywebsite.com

priority
– The priority of the page within your site on a scale of zero to one. By placing high priority on these pages, you will increase their importance in Google. The least important pages in your site should be assigned a priority of 0.0. The most important pages should be ranked 1.0. Pages that are common should be ranked 0.5.
last modified – When the page was last modified. this timestamp allows Google to avoid re-indexing pages that haven’t changed.
change frequency – Specifies how often a page is likely to change: Never, weekly, daily, or hourly. This help you identify pages that need to be indexed by Google most often.
These tags are wrapped inside tags used to define the sitemap. The result is a XML file that looks like this:
<?xml version=”1.0” encoding=”UTF-8”?>
<urlset xmlns=”http://www.google.com/schemas/sitemap/0.84”>
<url>
<loc>http://www.iconinteractive.com/</loc>
<lastmod>2005-07-06</lastmod>
<changefreq>weekly</changefreq>
<priority>1.0</priority>
</url>
</>
The restrictions on sitemap files are modest. URLs must not include embedded newlines; You must fully specify URLs because Google tries to crawl the URLs exactly as you provide them. Your sitemap files must use UTF-8 encoding. And, each sitemap file is limited to 50,000 URLs and 10 megabytes when uncompressed.
When you submit a sitemap file to Google, you’re notifying Google what URLs on your web site are ready to be crawled. Each time a change occurs, such as the addition of a new page, you need to resubmit your sitemap file for changes to be considered.
You can submit it in a number of ways: using the Google Sitemaps home page; using a URL; or by such as Google’s Sitemap Generator, that you install on your web server and schedule to run automatically.

Troy Janisch is president and founder of the Icon Interactive Group (www.iconinteractive.com), an industry leader helping companies integrate Internet and other Interactive media into sales channels, marketing strategies, and overall branding. He can be contacted by email at tjanisch@iconinteractive.com.

The opinions expressed herein or statements made in the above column are solely those of the author, & do not necessarily reflect the views of Wisconsin Technology Network, LLC. (WTN). WTN, LLC accepts no legal liability or responsibility for any claims made or opinions expressed herein.