Links
Previous Posts
- What's next for Travel Insurance Guide?
- How to find a travel insurance policy
- Unattended linux installation
- Calais Guide and the kind of calls we want!
- A TalkOn.it Network update
- Staff Bloggers
- For the record
- Pro-active Monitoring of a Network
- Q: Why so many websites? A: Toyota and Technology
- .htaccess vs httpd.conf
Archives
- December 2008
- November 2008
- October 2008
- September 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- October 2007
- September 2007
- August 2007
- June 2007
- April 2007
- March 2007
- February 2007
- November 2006
- August 2006
- June 2006
- March 2006
- February 2006
- January 2006
- December 2005
- November 2005
- October 2005
- September 2005
- August 2005
- July 2005
- June 2005
- May 2005
- April 2005

Sitemap Validation
Posted 4:31 PM Wednesday September 17, 2008 by Alex Buell
Introduction
Hello, my name is Alex Buell, I am profoundly deaf, and work as a Linux system administrator within the Fubra infosphere. I usually spend most of my time working on open source projects, giving back to the community extra value in tools that allow us to do our job.
What are sitemaps?
They provide a way for webmasters (people who run websites) to give out information about the content on their websites. Search engines (i.e. www.google.co.uk) look ('crawling') through websites to build up indexes to allow people to search for things that they are interested in looking for.
Essentially a sitemap is just a file containing URLs in XML format, along with additional metadata about each of these URLs (when was it last updated, how often does it change, how vital it is), enabling search engines to make intelligent decisions about searching for links to or from pages on the web sites.
Sitemaps have to be regularly updated so we have a tool that automatically generate sitemaps which are stored within the web site, search engines can download these files and read through all the URLs.
A new tool for validating sitemaps
We at Fubra have developed a web tool to help with validating sitemaps on our websites. The tool does two things; reads the sitemap files off the website, and presents to us in a human readable form, and if asked, check through the URLs, displaying the http code (i.e. 404, 301, or 200) in the status icons on the right side.
How to use the Validator webtool
To use the tool, point your browser at the Sitemap Validator site, and type in the URL (i.e. www.talkfootball.co.uk), and see what the tool does with it.
As it stands, the validator tool will look through the URLs and its associated metadata for validity and colour the icon on the right accordingly. Red icons means that the URL is a duplicate, orange means the date and time associated with the URL is invalid, whilst green means that the URL and its metadata is valid. The tool sorts the URLs in a way that all invalid URLs comes first and all valid URLs comes last. This makes it easy to see what is wrong with the sitemap.
Additionally, there is a button 'Check' in the top row, which, if clicked upon, will run a check on all the URLs on the site, and display the http code.
Any other business?
All comments, flames and feedback welcome, please do drop me an email at alex at fubra dot com
