TB notifications reported to enhanced surveillance systems quality and methodology information report
Published 4 September 2025
About this report
This report outlines the quality and methodology information (QMI) relevant to the Tuberculosis (TB) notifications reported to enhanced TB surveillance systems: UK official statistics release published by the UK Health Security Agency (UKHSA).
This QMI report supports users in understanding the strengths and limitations of these statistics, ensuring that UKHSA is compliant with the quality standards stated in the Code of Practice for Statistics. The report covers the following areas:
- The strengths and limitations of the data used to produce the statistics.
- The methods used to produce the statistics.
- The quality of the statistical outputs.
About the statistics
Tuberculosis (TB) is an infectious disease caused by bacteria of the Mycobacterium tuberculosis (M. tuberculosis) complex. It is predominantly spread by the respiratory route: people with infection in their lungs breathe out infectious bacteria, which may then be inhaled by others. TB is treatable with a combination of antibiotics, normally taken for at least 6 months and for up to 24 or 36 months in people with multidrug-resistant TB (MDR-TB) or complex disease. TB is a notifiable disease, meaning that clinicians have a statutory duty to notify local authorities or a local UKHSA region of suspected cases.
The UK annual official statistics report presents people with TB disease notified to TB surveillance systems in the UK.
The data in the statistics is provisional and is subject to revision.
Geographical coverage: England, Wales, Scotland and Northern Ireland
Publication frequency: annually
Change log
4 September 2025: QMI report first published
Contact
Lead analyst: Sharon Cox
Contact information: TBSection@ukhsa.gov.uk
Suitable data sources
Statistics should be based on the most appropriate data to meet intended uses.
This section describes the data used to produce the statistics.
Data sources
The UK annual official statistics report uses data from the National Tuberculosis Surveillance System (NTBS), which is a live, user-entered database. It was launched in 2021 and replaced 2 historical surveillance systems: the Enhanced Tuberculosis Surveillance System (ETS) and the London TB Register (LTBR). Data sets from 2018 onwards were extracted from ETS and LTBR and migrated into NTBS between July and December 2021, with all users using NTBS by December 2021. Scotland began using NTBS in March 2025, with data from 2022 being backfilled. Scottish data before this is sourced from the historical system, Enhanced Surveillance of Mycobacterial Infections (ESMI).
TB is a notifiable disease, so any TB diagnosis must be entered onto the database. Clinical teams at the TB service level provide information on TB cases either directly through the web-based system at the clinic or, in Northern Ireland, through case report forms that the health protection team enters into the system. The data includes notification details, demographic information, social risk factors, and clinical and microbiological information.
All new TB cases that meet one of the 2 following case definitions must be notified:
- Culture-confirmed cases due to M. tuberculosis complex (including M. tuberculosis, M. bovis, M. africanum or M. microti). In other words, a sample is taken from the patient, tested, and confirmed to be TB.
- In the absence of culture confirmation, cases that meet the following criteria:
  - a clinician’s judgement that the patient’s clinical and/or radiological signs and/or symptoms are compatible with tuberculosis
  - a clinician’s decision to treat the patient with a full course of anti-TB therapy
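The 2 case definitions can be read as a simple decision rule. The sketch below is illustrative only and is not how NTBS implements the rule: the function and parameter names are hypothetical, and it assumes that both criteria must be met when there is no culture confirmation.

```python
def meets_case_definition(
    culture_confirmed: bool,
    clinically_compatible: bool,
    full_treatment_planned: bool,
) -> bool:
    """Return True if a case would be notifiable under either definition.

    Parameter names are hypothetical labels for the criteria in the text.
    """
    # Definition 1: culture-confirmed M. tuberculosis complex.
    if culture_confirmed:
        return True
    # Definition 2 (assumption: both criteria are required).
    return clinically_compatible and full_treatment_planned
```

For example, a person without culture confirmation whose signs are compatible with TB, but for whom no full course of treatment is planned, would not meet either definition under this reading.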
NTBS receives a feed of reference laboratory results including patient identifiers, which are matched daily to notifications using a probabilistic matching algorithm. Matches below the threshold are notified to users with appropriate permissions. Users can then manually match or un-match from the provided list of potential matches. The national team and regional field service teams, who coordinate responses from the TB services responsible for entering the information, also periodically clean and review the data to identify duplicate records, inconsistencies and missing data.
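The idea of a match threshold can be sketched as follows. This report does not describe the NTBS algorithm, so everything here is an assumption made for illustration: the fields compared, the integer agreement weights, the threshold value, and the use of a simple weighted-agreement score in place of a true probabilistic model.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Person:
    # Hypothetical identifier fields; the real feed may differ.
    nhs_number: Optional[str]
    family_name: str
    date_of_birth: date
    postcode: Optional[str]


# Hypothetical agreement weights (integers to avoid float rounding).
WEIGHTS = {"nhs_number": 50, "family_name": 20, "date_of_birth": 20, "postcode": 10}
AUTO_MATCH_THRESHOLD = 70  # hypothetical cut-off


def _agree(a, b) -> bool:
    """Compare two field values, ignoring case and spaces for strings."""
    if a is None or b is None:
        return False
    if isinstance(a, str) and isinstance(b, str):
        return a.replace(" ", "").casefold() == b.replace(" ", "").casefold()
    return a == b


def match_score(notification: Person, lab_result: Person) -> int:
    """Sum the weights of the fields on which the two records agree."""
    return sum(
        weight
        for field_name, weight in WEIGHTS.items()
        if _agree(getattr(notification, field_name), getattr(lab_result, field_name))
    )


def triage(notification: Person, lab_result: Person) -> str:
    """Route a candidate pair: link it, send it for manual review, or drop it."""
    score = match_score(notification, lab_result)
    if score >= AUTO_MATCH_THRESHOLD:
        return "auto_match"
    if score > 0:
        return "manual_review"
    return "no_match"
```

Pairs scoring below the threshold are the ones that would be presented to users as potential matches to confirm or reject.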
NTBS also contains information on TB treatment. Users enter the date treatment was started and the date treatment was completed or other TB treatment outcomes including loss to follow-up, treatment stopped or if the person died. NTBS sends out automated reminders for users to enter treatment outcomes at 12, 24 and 36 months and outcomes are then grouped into these blocks of duration. Users can also record outcomes of ‘still on treatment’ or ‘not evaluated’ at these time points if none of the other outcomes were previously applicable.
The population data used is sourced from the Office for National Statistics (ONS), incorporating data from the National Records of Scotland and Northern Ireland Statistics and Research Agency.
Data quality
The data that we use to produce statistics must be fit for purpose. Poor quality data can cause errors and can hinder effective decision making.
We have assessed the quality of the source data against the data quality dimensions in the Government Data Quality Framework.
This assessment covers the quality of the data that was used to produce the statistics, not the quality of the final statistical outputs. The quality summary section below assesses the quality of the final statistical outputs.
Strengths and limitations of the NTBS data
The following strengths of the data have been identified:
- reporting of TB cases is mandatory, so the NTBS data is a comprehensive record of TB notifications in the UK
- NTBS is a live system and notifications are available to the TB team as soon as they are entered
- cleaning and review are regularly carried out on the data, probabilistic matching helps ensure accuracy, and validation rules mean that essential fields must be completed properly
- users cannot submit notifications without completing mandatory fields, meaning that all the required information is collected for each notification
The following limitations of the data have been identified:
- the location of people with TB is assigned to a UKHSA region based on an individual’s residence – this can differ from where people are diagnosed and treated
- before 2022 NTBS did not include people diagnosed or managed with TB in Scotland, therefore some people who are normally resident in England, but diagnosed or managed in Scotland, will not appear in the data until previous years’ data is uploaded to NTBS
NTBS is the most appropriate source of data for the statistics. TB is a notifiable disease, which means that NTBS holds a comprehensive record of TB cases. The design of NTBS helps ensure that the data is accurate and valid.
Accuracy
Accuracy is about the degree to which the data reflects the real world. This can refer to correct names, addresses or represent factual and up-to-date data.
Notification of TB is required within 3 days of a suspected or confirmed TB diagnosis. In 2021, over half of notifications (57.2%) were notified within 3 days of diagnosis and 75.0% within 7 days. These numbers have not notably changed since 2016.
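The timeliness percentages above are simple proportions of notifications whose delay falls within a limit. A minimal sketch of that calculation, assuming each record carries a diagnosis date and a notification date (the function name and record layout are hypothetical):

```python
from datetime import date
from typing import Iterable, Tuple


def percent_notified_within(
    days_limit: int, records: Iterable[Tuple[date, date]]
) -> float:
    """Percentage of records notified within `days_limit` days of diagnosis.

    Each record is (diagnosis_date, notification_date). A notification made
    before the diagnosis date has a negative delay and counts as within limit.
    """
    delays = [(notified - diagnosed).days for diagnosed, notified in records]
    if not delays:
        raise ValueError("no records supplied")
    within = sum(1 for delay in delays if delay <= days_limit)
    return 100 * within / len(delays)
```

Calling the function with limits of 3 and 7 days on the same records gives the 2 figures quoted in the text.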
Notifications cannot be deleted, but rather can be de-notified when necessary. A case is usually de-notified if the diagnosis changes, or if the record has been created in error. This helps ensure that records in NTBS are accurate and up to date.
Provisional data (which we also refer to as live data) is all data entered into NTBS after the final extraction date of the current cleaned and validated analytical data set used to generate the TB annual report and other outputs. Provisional data will not have had all checks and validation completed by the time of analysis for publication.
Completeness
Completeness describes the degree to which records are present.
For a data set to be complete, all records are included, and the most important data is present in those records. This means that the data set contains all the records that it should and all essential values in a record are populated.
Completeness is not the same as accuracy as a full data set may still have incorrect values.
NTBS contains 13 mandatory fields that must be completed in order to notify a case. The mandatory fields are personal details of patients such as date of birth, sex, ethnic group, postcode and birth country. It is also mandatory to enter at least one site of disease and date of diagnosis. This ensures that all of the necessary information is recorded for each notification.
Where significant numbers of values are missing compared with what is due to be recorded at that point in time, this is explicitly stated in the report. For example, information on social risk factors is not always available to users, or people may legitimately refuse to answer certain questions.
Uniqueness
Uniqueness describes the degree to which there is no duplication in records. This means that the data contains only one record for each entity it represents, and each value is stored once.
Some fields, such as National Insurance numbers, should be unique. Some data is less likely to be unique, for example geographical data such as town of birth.
To create a new notification, NTBS users must first perform a search to check whether the notification has already been recorded. This helps reduce the number of duplicate records.
Consistency
Consistency describes the degree to which values in a data set do not contradict other values representing the same entity. For example, a person’s date of birth should be before their date of death.
Data is consistent if it matches data in another data set. For example, if the date of birth recorded for the same person in 2 different data sets is the same.
The probabilistic matching algorithm matches laboratory results to NTBS notifications. Cases can only be matched if key fields are consistent across notifications and laboratory results.
The national and regional teams conduct routine checks of data fields for consistency to identify potential errors and return queries to the relevant case managers for resolution to ensure data consistency.
Timeliness
Timeliness describes the degree to which the data is an accurate reflection of the period that it represents, and that the data and its values are up to date.
Some data, such as date of birth, may stay the same whereas some, such as income, may not.
Data is timely if the time lag between collection and availability is appropriate for the intended use.
NTBS is a live database that is managed by the TB team. Hence there is no delay between the data collection and availability.
There may be considerable delay in the reporting of treatment events and the manual matching of laboratory results, as the case manager needs to return to NTBS and complete the fields. To mitigate this, NTBS shows each user at logon, as appropriate for their level of access, automated reminders, lists of notifications with important missing values, and required actions such as reviewing laboratory matches and accepting transfers of people between services.
Validity
Validity describes the degree to which the data is in the range and format expected. For example, date of birth does not exceed the present day and is within a reasonable range.
Valid data is stored in a data set in the appropriate format for that type of data. For example, a date of birth is stored in a date format rather than in plain text.
NTBS prevents users from entering invalid data for most of the mandatory fields: for example, date of birth must be entered in a date format and ethnic group is selected from a drop-down menu. These rules ensure that the data is entered in the correct format. There are at least 135 validations made by the system to ensure user data is entered correctly.
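The date examples used in the consistency and validity sections can be expressed as checks like those below. This is a sketch, not one of the NTBS validations: the function name, the error messages and the maximum plausible age are assumptions.

```python
from datetime import date
from typing import List, Optional

MAX_AGE_YEARS = 120  # hypothetical upper bound for a "reasonable range"


def validate_dates(
    date_of_birth: date,
    today: date,
    date_of_death: Optional[date] = None,
) -> List[str]:
    """Return a list of validation errors (empty if the dates look valid)."""
    errors: List[str] = []
    # Validity: date of birth must not exceed the present day.
    if date_of_birth > today:
        errors.append("date of birth is in the future")
    # Validity: date of birth must be within a reasonable range.
    elif today.year - date_of_birth.year > MAX_AGE_YEARS:
        errors.append("date of birth implies an implausible age")
    # Consistency: date of birth should be before date of death.
    if date_of_death is not None and date_of_death < date_of_birth:
        errors.append("date of death is before date of birth")
    return errors
```

Returning a list rather than raising on the first failure lets a data entry form show every problem at once.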
Sound methods
Statistical outputs should be made using the best available methods and recognised standards.
This section describes how the statistics were produced and quality assured.
Data set production
Data used in the UK annual official statistics report comes from NTBS after having been cleaned by the TB unit to create a UK data set. The annual report for England is a similar example: it uses the same data set, restricted to England.
Data was extracted from the surveillance systems on 23 April 2025, then cleaned and validated. For Scotland, NTBS data was extracted on 10 June 2025 and subsequently cleaned and validated.
TB notifications are reported by area of residence and by calendar year of notification.
Overall TB rates per 100,000 population, as well as those by area of reporting, are calculated using the mid-year population estimates provided by the ONS. Average annual rates per 100,000 for the 3-year period were calculated by dividing the numerator (the number of TB notifications in the 3-year period) by the denominator (the sum of the mid-year population estimates for the same 3-year period) and multiplying by 100,000. The 95% confidence intervals for incidence are model derived, assuming a Poisson distribution.
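The rate calculation can be sketched as follows. The rate formula follows the description above. The confidence interval is a stand-in: the report's intervals are model derived, whereas this sketch uses Byar's approximation to the Poisson interval so that it needs only the standard library. The function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist
from typing import Sequence, Tuple


def byar_interval(count: int, confidence: float = 0.95) -> Tuple[float, float]:
    """Approximate Poisson confidence limits for a count (Byar's method)."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    if count == 0:
        lower = 0.0
    else:
        lower = count * (1 - 1 / (9 * count) - z / (3 * sqrt(count))) ** 3
    upper_n = count + 1
    upper = upper_n * (1 - 1 / (9 * upper_n) + z / (3 * sqrt(upper_n))) ** 3
    return lower, upper


def average_annual_rate(
    notifications: Sequence[int],
    populations: Sequence[int],
    per: int = 100_000,
) -> Tuple[float, float, float]:
    """Return (rate, lower, upper) per `per` population for a multi-year period.

    Numerator: total notifications in the period.
    Denominator: sum of the mid-year population estimates for the same years.
    """
    total = sum(notifications)
    person_years = sum(populations)
    lower, upper = byar_interval(total)
    scale = per / person_years
    return total * scale, lower * scale, upper * scale
```

With made-up inputs of 100 notifications a year for 3 years in a population of 1,000,000, the rate is 10 per 100,000, and the interval is slightly asymmetric around it, as expected for a Poisson count.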
For the purposes of TB outcome reporting, the drug-sensitive cohort is defined as all TB notifications, excluding those with multidrug-resistant or rifampicin-resistant TB (MDR/RR-TB) (initial or acquired), and those treated with a second line regimen for MDR/RR-TB. TB outcomes are reported at 12 months for drug-sensitive notifications with an expected duration of treatment of less than 12 months. This group excludes notifications with central nervous system (CNS) disease, who have an expected duration of treatment of 12 months. In addition, those with spinal, cryptic, disseminated or miliary disease are excluded from this group, as CNS involvement cannot be reliably ruled out from these groups. The last recorded outcome is reported for the entire drug-sensitive cohort.
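The cohort definitions above amount to 2 nested filters. A minimal sketch, assuming each notification carries the flags and site labels shown (all field names and site labels are hypothetical):

```python
from dataclasses import dataclass
from typing import FrozenSet

# Sites excluded from 12-month outcome reporting: CNS disease, plus sites
# where CNS involvement cannot be reliably ruled out.
EXCLUDED_SITES = frozenset({"cns", "spinal", "cryptic", "disseminated", "miliary"})


@dataclass(frozen=True)
class Notification:
    mdr_rr_tb: bool                 # initial or acquired MDR/RR-TB
    second_line_mdr_regimen: bool   # treated with a second line MDR/RR-TB regimen
    sites: FrozenSet[str]           # recorded sites of disease


def in_drug_sensitive_cohort(n: Notification) -> bool:
    """All notifications except MDR/RR-TB or second line MDR/RR-TB treatment."""
    return not (n.mdr_rr_tb or n.second_line_mdr_regimen)


def in_12_month_outcome_group(n: Notification) -> bool:
    """Drug-sensitive notifications with expected treatment under 12 months."""
    return in_drug_sensitive_cohort(n) and not (n.sites & EXCLUDED_SITES)
```

A drug-sensitive notification with miliary disease stays in the drug-sensitive cohort, so its last recorded outcome is reported, but it is left out of the 12-month outcome group.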
Public Health Scotland aligned with the World Health Organization (WHO) treatment outcome definitions in 2015, and therefore treatment outcomes in this report may differ slightly to those in their regional report.
Statistical analysis was carried out using Stata 18 and R 4.5.0 (in RStudio).
Data cleaning to improve data quality
Denotifications
People with BCGosis, on chemoprophylaxis for latent TB infection or with a non-tuberculous mycobacterial infection who were notified in error were identified using comments fields, and denotified. People with culture-confirmed TB who had been denotified were queried with clinics, and lab contaminations were removed. People were renotified if they were found to have been denotified in error. In addition, a probabilistic matching process was carried out for notifications between January 2024 and December 2025 to identify people with more than one notification within a 12-month period. Identified duplicates were denotified with any missing information transferred from the duplicate to the original notification.
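The duplicate check can be illustrated with a simplified version. The report describes a probabilistic matching process. The sketch below instead assumes that notifications for the same person already share an exact key, and treats a 12-month period as 365 days. Both are assumptions, as are the function name and the tuple layout.

```python
from collections import defaultdict
from datetime import date, timedelta
from typing import Iterable, List, Tuple

WINDOW = timedelta(days=365)  # assumption: 12 months taken as 365 days


def possible_duplicates(
    notifications: Iterable[Tuple[int, str, date]],
) -> List[Tuple[int, int]]:
    """Return (original_id, duplicate_id) pairs for review.

    Each notification is (notification_id, person_key, notification_date).
    Consecutive notifications for the same person within WINDOW are paired.
    """
    by_person = defaultdict(list)
    for notification_id, person_key, notified in notifications:
        by_person[person_key].append((notified, notification_id))

    pairs: List[Tuple[int, int]] = []
    for items in by_person.values():
        items.sort()  # earliest notification first
        for (first_date, first_id), (next_date, next_id) in zip(items, items[1:]):
            if next_date - first_date <= WINDOW:
                pairs.append((first_id, next_id))
    return pairs
```

Flagged pairs would still need review before denotification, since a second notification within 12 months is not necessarily a duplicate.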
Geography
The postcode field (used to map postcodes to geographic areas) was cleaned by identifying invalid postcodes based on matching to the May 2022 Postcode Directory from ONS. Where cleaning was necessary, the correct postcode was identified using the address fields. For people who were homeless or who had a residence outside the UK, but were notified in England, the postcode of the clinic or hospital at which they were treated was assigned to the notification. For people with no postcode or treatment clinic or hospital, the local authority and UKHSA centre were updated using the recorded local authority field, which is based on the area in which the notifying case manager was located.
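The order of fallbacks described above can be sketched as follows. The directory is represented here as a plain set of formatted postcodes, and the function names and status labels are hypothetical. Correcting a postcode from the address fields is a manual step, so the sketch only flags those records for review.

```python
import re
from typing import Optional, Set, Tuple


def normalise_postcode(raw: Optional[str]) -> Optional[str]:
    """Upper-case a postcode and put one space before the 3-character inward code."""
    if raw is None:
        return None
    compact = re.sub(r"\s+", "", raw).upper()
    if len(compact) < 5:
        return None
    return compact[:-3] + " " + compact[-3:]


def assign_postcode(
    residence: Optional[str],
    clinic: Optional[str],
    directory: Set[str],
    homeless_or_non_uk: bool = False,
) -> Tuple[Optional[str], str]:
    """Return (postcode, source) following the fallback order in the text."""
    postcode = normalise_postcode(residence)
    if postcode in directory:
        return postcode, "residence"
    if homeless_or_non_uk:
        clinic_postcode = normalise_postcode(clinic)
        if clinic_postcode in directory:
            return clinic_postcode, "clinic"
        # No usable postcode: fall back to the recorded local authority field.
        return None, "local_authority_fallback"
    # Invalid residential postcode: needs correction from the address fields.
    return None, "review_address"
```

Returning the source alongside the postcode keeps a record of which rule produced each geographic assignment.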
The UKHSA region was derived from UKHSA region of residence based on the individual’s residential postcode. If missing, the UKHSA region in which treatment occurred (most recently, as care may have been transferred) was used. This may have happened if, for example, a person had no fixed abode.
Site of disease
The site of disease was reclassified to pulmonary if a positive sputum smear (microscopy) sample was recorded or if a positive culture was grown from a pulmonary laboratory specimen. People with laryngeal TB were included in figures for pulmonary disease, and people with miliary TB were included in both pulmonary and extra-pulmonary figures. Site of disease for people with culture confirmation was reclassified based on the site in the body from which the specimen was taken. Site of disease classifications were also updated using the free text field for site of disease.
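The reclassification rules can be sketched as a function that returns the reporting categories a notification counts towards. The site labels and function name are hypothetical. The sketch also assumes that any recorded site other than pulmonary or laryngeal counts towards extra-pulmonary figures, which the text states explicitly only for miliary TB.

```python
from typing import Iterable, Set

PULMONARY_SITES = {"pulmonary", "laryngeal"}  # laryngeal is reported as pulmonary


def reporting_categories(
    recorded_sites: Iterable[str],
    sputum_smear_positive: bool = False,
    pulmonary_culture_positive: bool = False,
) -> Set[str]:
    """Return the set of reporting categories for one notification."""
    sites = {site.strip().lower() for site in recorded_sites}
    categories: Set[str] = set()

    # Reclassify to pulmonary on laboratory evidence, whatever was recorded.
    if sputum_smear_positive or pulmonary_culture_positive:
        categories.add("pulmonary")
    # Laryngeal counts as pulmonary; miliary counts in both categories.
    if sites & PULMONARY_SITES or "miliary" in sites:
        categories.add("pulmonary")
    # Assumption: any other recorded site counts as extra-pulmonary.
    if sites - PULMONARY_SITES:
        categories.add("extra-pulmonary")
    return categories
```

Reclassification from the free text field is not modelled here, as it depends on manual review of what users have written.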
Quality assurance
The UK annual official statistics report is produced using R. The production of the figures and the supplementary data tables has been automated. This reduces the risk of human error as users do not have to manually update figures or copy and paste between documents. Quality assurance is done after running the code.
The figures and tables are sense-checked and compared with figures from previous UK annual official statistics reports for irregularities by at least 2 members of the team. All of the automated outputs are manually checked in this way. If concerns are raised regarding one figure, further checks are conducted to assess possible errors in the data.
Confidentiality and disclosure control
Personal and confidential data is collected, processed, and used in accordance with the UKHSA privacy notice. All UKHSA staff with access to personal or confidential information must complete mandatory information governance training, which must be refreshed every year. Information is stored on computer systems that are kept up-to-date and regularly tested to make sure they are secure and protected from viruses and hacking. UKHSA staff do not store data on their own laptops or computers. Instead, data is stored centrally on UKHSA servers.
No personally identifiable information is included in published data. There are no specific disclosure control methods used, as aggregation of the published figures protects people’s personal data and tables presented cannot be cross tabulated to reveal sufficient information about individuals to pose a meaningful risk of secondary disclosure. The benefits of reporting small numbers in aggregated data are compared with the risk of secondary disclosure on a case-by-case basis. For example, there are relatively few notifications of children with TB but the implications for TB control and management differ by children’s age and therefore small numbers of children by age groups may be reported, but will not be published at a location level that would likely pose a risk of a child being identifiable by combining with other data sources.
Geography
The statistics in this report are published at 2 geographical levels: country (England, Wales, Scotland and Northern Ireland), and UKHSA region.
UKHSA region is based on an individual’s residential postcode. If the postcode is missing, the UKHSA region in which treatment occurred is used, for example if a person has no fixed abode.
Most UKHSA regions are consistent with the regions of England at International Territorial Level 1. The only difference between regions and UKHSA regions is the categorisation of Milton Keynes: Milton Keynes is part of the South East region, but is in the East of England UKHSA region.
Quality summary
The Code of Practice for Statistics states that quality means that statistics:
- fit their intended uses
- are based on appropriate data and methods
- are not materially misleading
Quality requires skilled professional judgment about collecting, preparing, analysing, and publishing statistics and data in ways that meet the needs of people who want to use the statistics.
This section assesses the statistics against the European Statistical System dimensions of quality.
Relevance
Relevance is the degree to which the statistics meet user needs in both coverage and content.
There is a clear need for timely TB statistics across the UK. The data provides essential evidence to monitor progress towards initiatives aimed at controlling TB in England, Wales, Scotland and Northern Ireland. In July 2021, the UKHSA and NHS England jointly launched the TB Action Plan for England, 2021 to 2026, which aims to improve the prevention, detection and control of TB in England by setting objectives aligned with the World Health Organization (WHO) End TB Strategy and by defining indicators to monitor progress. While this plan focuses on England, similar efforts across the devolved nations contribute to the collective UK-wide response to TB.
TB statistics are published annually across the UK. All 4 UK nations (England, Wales, Scotland and Northern Ireland) are considered low-incidence countries for TB because their individual rates are below the threshold of 10 notifications per 100,000 people. Notifications had been steadily declining since 2011, but a gradual increase has been observed since 2020. To support the monitoring of this trend, we also produce quarterly reports, with a greater focus on England due to its relatively higher number of cases. In addition to the quarterly reports, we publish an annual TB report for England that provides more in-depth analysis, but at the cost of timeliness.
The TB statistics are primarily used by people in clinical care, and public health. These users report that they use the statistics for monitoring, strategy and resource allocation, and teaching.
We have continued to make changes to the publication to meet user needs. We now publish 3 products as part of the statistical release:
- The main statistics report.
- Supplementary data tables, first published in April 2023 as part of the quarter 1 of 2023 publication.
- This QMI report, first published in July 2023.
By providing this range of different outputs, we can better cater to the needs of different users from a range of backgrounds, in line with the Office for National Statistics user personas.
Accuracy and reliability
Accuracy is the proximity between an estimate and the unknown true value. Reliability is the closeness of early estimates to subsequent estimated values.
The accuracy of the statistics is largely dependent on the accuracy of the source data. We have assessed the source data to be accurate (see the data quality section), as the design of NTBS helps prevent data entry errors, and guidance given to users helps ensure the right information is collected in the proper format. The statistics report on TB notifications, which are mandatory through NTBS. The statistics therefore represent the whole population of TB notifications in the UK.
The statistics present provisional data. The data is revised and updated as additional verification, data cleaning, and recoding are completed.
Timeliness and punctuality
Timeliness refers to the time gap between publication and the reference period. Punctuality refers to the gap between planned and actual publication dates.
This report aims to provide timely and up-to-date figures of important epidemiological indicators to inform ongoing TB control efforts in the UK.
These annual reports are official statistics and are pre-announced at least 28 days in advance, in line with the Code of Practice for Statistics. Provisional publication dates for the year ahead are pre-announced online in December and can be found on the UKHSA release calendar.
Accessibility and clarity
Accessibility is the ease with which users can access the data, also reflecting the format in which the data is available and the availability of supporting information. Clarity refers to the quality and sufficiency of the metadata, illustrations and accompanying advice.
We currently publish 3 statistical products as part of this statistical release:
- The main statistics report.
- Supporting data tables.
- This QMI report.
From the 2022 publication (published February 2022) we started publishing the main statistics report as an HTML web page. The switch to HTML has made the report easier to access across different devices, and the HTML report inherits the accessibility features mentioned in the GOV.UK accessibility statement.
The publication includes visualisations that help explain the data. These are designed to be colour-blind friendly. Each element in a visualisation has a different luminance value. This means that there is always sufficient contrast between elements for them to be distinguished.
We have simplified the commentary in the publication, focusing on plain English, and shortened the publication overall. We also now include main messages in the publication to help users understand the statistics.
The supplementary data tables are published in OpenDocument Spreadsheet (ODS) format and follow accessibility guidelines. Each sheet contains only one table. We also do not use nested tables as these do not always work well with screen readers. We avoid using empty cells for the same reason. Each sheet has a descriptive heading, for example, ‘Number of individuals notified with TB and rates by country, UK, 2000 to 2024’.
Coherence and comparability
Coherence is the degree to which data that is derived from different sources or methods, but refers to the same topic, is similar. Comparability is the degree to which data can be compared over time and domain.
Data included in these and other TB reports published on GOV.UK has been collected in a consistent manner over time using web-based databases. NTBS replaced the 2 former systems in 2021 and older data (2018 onwards) was verified and migrated into NTBS. Where there have been changes in specific variables over time, either through addition or changes in definition, these are detailed in the report.
TB notification numbers and rates in the UK rose in 2021, increasing by 6.58% compared with 2020, following a decade of steady decline from 2010 to 2020. This trend may reflect the complex impact of the COVID-19 pandemic on healthcare access and delivery, migration patterns, and social behaviours, all of which could have influenced TB transmission, diagnosis and reporting. Further analysis is needed to fully understand how the pandemic has affected TB epidemiology. Current statistics present TB notifications for England, Wales, Scotland and Northern Ireland. Notifications in England, Wales and Scotland declined from 2010 to 2020, followed by a slight increase in 2021. In contrast, Northern Ireland saw a continued slight decrease in notification rates from 2010 through to 2021.
Uses and users
Users of statistics and data should be at the centre of statistical production, and statistics should meet user needs.
This section explains how the statistics are used, and how we understand user needs.
Appropriate use of the statistics
The statistics present TB notifications. Notifications occur either when someone is diagnosed with TB, or they have started treatment for suspected TB. Some individuals with TB will not receive a diagnosis or start treatment, so their case will never be notified. Users therefore should not use these statistics as a measure of TB incidence.
There are seasonal trends in TB notifications, with a peak around late spring and early summer. The seasonality of TB presentation has been repeatedly reported in the UK and in other countries, but the underlying mechanism is not clear. Users should generally compare the same quarter year on year, rather than different quarters in the same year.
Known uses
We are aware that the statistics have been used in several different ways, including:
- monitoring TB notifications and comparing different areas
- strategy and resource allocation
- awareness and teaching
- research
- clinical decision making
- evidence on the TB Action Plan and the WHO’s End TB strategy
Known users
Known users of the statistics are primarily in clinical care and public health. We are also aware of users in the media, the charity sector, and academia and research.
User engagement
UKHSA recently carried out a TB user engagement survey. Users were asked to provide information about who they are and what they use the publication for. This has provided some new insights into our users, including how they use the publication, and what they would like to see in it. The survey includes some detail on the specific parts of the publication that users find most useful, as well as suggestions for improvements to the publication.
Related statistics
TB statistics by UKHSA
This release is part of a collection of TB statistics published by UKHSA.
The annual Tuberculosis in England report describes the incidence, treatment, and prevention of TB in England. This publication contains annual figures on TB notifications in England, as well as rates of TB notifications per 100,000 population.
The National Quarterly Report of Tuberculosis in England provides provisional data on TB notifications, offering insight into recent trends in incidence, demographics and geography. This publication presents quarterly figures on TB cases in England to support surveillance, monitoring and public health response between annual reports.
TB in other countries
Most health protection functions in the UK are devolved to the other UK nations’ public health teams. Public Health Scotland publishes the Tuberculosis annual report for Scotland, the Public Health Agency reports on TB in Northern Ireland, and Public Health Wales reports on TB in Wales.
The European Centre for Disease Prevention and Control publishes the Tuberculosis surveillance and monitoring in Europe report. This provides an overview of the latest TB epidemiological situation in Europe.
WHO publishes the Global Tuberculosis Report. This provides a comprehensive and up-to-date assessment of the TB epidemic, and of progress in prevention, diagnosis and treatment of the disease, at global, regional and country levels.
Please note that it is not advisable to compare TB notification rates across countries, as TB incidence is strongly influenced by social and economic development and health-related risk factors.