FOI release

URLs handled by the GOV.UK Transition Tool

Published 30 January 2015

Thank you for your email dated 12 January 2015 requesting information under the terms of the Freedom of Information Act 2000.

1. The GDS Transition Tool

The Transition Tool is an application built by the Government Digital Service (GDS) to support the transition of government agency and departmental websites to GOV.UK. It’s used to manage the database of URL mappings for source websites being redirected to GOV.UK.

These mappings are then read and implemented by a separate application called ‘Bouncer’. Because Bouncer applies further rules to respond to all known variants of the URLs in the Transition Tool data, it handles many more URLs than are present in the data.

The attached files explain what happens where traffic is being redirected via GDS applications. Some traffic is handled internally by departments and agencies. GDS doesn’t hold information on this traffic, so it isn’t included in the attached data. Some redirects set up internally by departments and agencies may also override the redirects in this data.

2. Description of the files

This response is accompanied by 3 CSV files containing mapping data from 27 January 2015. The data will continue to change as a result of ongoing work to review and improve redirects following transition.

2.1 Websites (‘sites-2015-01-27T10:26:13+00:00.csv’)

This file contains data about all websites handled by the Transition Tool.

Field Description
Abbreviation A unique identifier for each website that links the data across the 3 files
Launch date Manually entered as a single date for each site, so may not reflect redirect dates for websites that were transitioned in stages, or where transitions were rearranged
New homepage URL that the homepage of the old website redirects to
TNA timestamp Identifies the date of the default National Archives crawl, which is used to construct links on archive pages (eg the link to the National Archives on the Cabinet Office Charter Mark archive page)
Significant querystring parameters (colon separated) Used for websites where the CMS generates complex query strings at the end of URLs that need to be mapped differently
Global HTTP status Used to indicate if a whole website redirects to a single URL or is archived, along with two additional fields to specify behaviour (‘New URL for global 301’ and ‘Append requested path to new URL’)
New URL for global 301 Single URL for a whole website if all source URLs redirect to one place
Append requested path to new URL Specifies whether the path from the request is added to the end of the ‘new URL for global 301’ for the redirect (true or false)

2.2 Hosts (‘hosts-2015-01-27T10:26:13+00:00.csv’)

This file contains a list of all domains for each website.

Field Description
Site abbreviation Unique website identifier from ‘sites’ CSV
Hostname Full list of all domains for each department and agency website

2.3 Mappings (‘mappings-2015-01-27T10:26:13+00:00.csv’)

This file contains a list of mappings for each website. It excludes superseded mappings for sites that now have global HTTP statuses (where the whole site now redirects to a single URL).

Field Description
Site abbreviation Unique website identifier from ‘sites’ CSV
Old path Source URL on redirected website
HTTP status 301 for redirected content, or 410 for archived content
Redirect URL Destination URL for redirected content
Custom archive URL Used to override the default National Archives URL (which is constructed by Bouncer using the website’s TNA timestamp)
Suggested URL Additional URL where an archive page links on to recommended content that doesn’t sit on a government domain (eg archived guidance on firework safety)