12 Data collection resources

12.1 Gender and Transgender

Guidance on collecting and submitting data for the data items on gender within the Mental Health Services Data Set (MHSDS) is a useful resource for information on these protected characteristics.

12.2 NHS Data Dictionary

The NHS Data Model and Dictionary has been developed for everyone who is actively involved in the collection of data and the management of information in the NHS.

12.3 Population

NOMIS official census and labour market statistics - managed by the Office for National Statistics (ONS), NOMIS publishes statistics related to population, society and the labour market at national, regional and local levels. An account is required to save searches through the API.

An R package called {nomisr} provides access to NOMIS data from R.

POPPI Projecting Older People Population Information - requires registration.

Originally developed for the Department of Health, this system provides population data by age band, gender, ethnic group, and tenure, for English local authorities.

Calculations are applied to population figures to estimate projected numbers of older people by: those living alone, those living in a care home, provision of unpaid care, and ability to carry out domestic tasks and self-care.

Prevalence rates from research have been used to estimate the impact of: limiting long term illness, depression, severe depression, dementia, heart attack, stroke, bronchitis, falls, continence, visual impairment, hearing impairment, mobility, obesity, diabetes and learning disability including Down’s syndrome and autistic spectrum disorders (ASD).

PANSI Projecting Adult Needs and Service Information - requires registration.

Originally developed for the Department of Health, this system provides population data by age band, gender, and ethnic group.

Prevalence rates from research have been used to estimate the impact of: learning disability, including living with a parent, Down’s syndrome, challenging behaviour, autistic spectrum disorders; moderate or serious physical disability including personal care, stroke, diabetes, visual impairment and hearing impairment; mental health problems including depression, neurotic, personality and psychotic disorders, drugs and alcohol, suicide, adult survivors of childhood sexual abuse and early onset dementia.

12.4 Small number suppression

The Appendices of the Public Health Wales guidance are very useful: they give example situations for data suppression and how to handle them, including small denominator values, indirect disclosure and cases where self-disclosure can cause distress.

The UK Data Service guidance on assessing disclosure risk and managing risk includes an example from a review showing the types of direct identifiers, as well as variables affected by local knowledge.

Statistical disclosure dos and don’ts blog from Lancaster University.

NHS Digital Suppression Rules for different data sets.

ONS Policy on protecting confidentiality in tables of birth and death statistics.
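To make the ideas of small number suppression and indirect disclosure concrete, the following is a minimal sketch in Python. It is not taken from any of the guidance listed above. The threshold of 5, the `*` marker, the choice to leave zeros visible and the function name `suppress_small_counts` are all assumptions for illustration; the actual rules differ between data sets and organisations, so check the relevant guidance before applying anything like this.

```python
SUPPRESSED = "*"  # hypothetical marker for a suppressed cell


def suppress_small_counts(counts, threshold=5):
    """Suppress small counts in one table row whose total is also published.

    Assumptions (illustrative only): non-zero counts below `threshold`
    are disclosive, and zeros may be shown.
    """
    # Primary suppression: hide every non-zero count below the threshold.
    hidden = {key for key, value in counts.items() if 0 < value < threshold}

    # Secondary suppression: if exactly one cell is hidden, it could be
    # recovered by subtracting the visible cells from the published total
    # (indirect disclosure), so hide the next smallest visible cell too.
    if len(hidden) == 1:
        visible = {k: v for k, v in counts.items() if k not in hidden}
        if visible:
            hidden.add(min(visible, key=visible.get))

    return {k: (SUPPRESSED if k in hidden else v) for k, v in counts.items()}


# Made-up counts for three areas; the row total is assumed to be published.
row = {"Area A": 3, "Area B": 12, "Area C": 40}
print(suppress_small_counts(row))
```

In this made-up row, Area A is hidden because it is below the threshold, and Area B is hidden as well so that Area A cannot be worked out from the total.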

12.5 Data Linkage

Specific online courses for data linkage are available through the Analysis Function.

Splink: Fast, accurate and scalable record linkage - Data in government (blog.gov.uk).

Health Economics Unit NHS Guide - “This guide to data linkage was produced to support NHS systems (such as ICSs and STPs), commissioners, providers, NHS England and NHS Improvement and arm’s-length bodies to co-learn and co-develop, and to share and spread best practice, learning and tools relating to data linkage that extends beyond primary and secondary care”.

Faculty of Public Health - Health Knowledge - Data linkage within and across datasets
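As a rough illustration of how probabilistic record linkage scores a pair of records, the sketch below sums Fellegi-Sunter style match weights across fields. It is not the API of Splink or of any other tool listed above. The field names, the `m` and `u` probabilities and the helper names `normalise` and `match_weight` are hypothetical values chosen for the example; in practice these probabilities are estimated from the data.

```python
import math

# Hypothetical parameters per field:
#   m = P(fields agree | records are the same person)
#   u = P(fields agree | records are different people)
FIELD_PARAMS = {
    "surname": (0.90, 0.01),
    "date_of_birth": (0.95, 0.001),
    "postcode": (0.80, 0.0005),
}


def normalise(value):
    """Remove whitespace and ignore case so trivial differences still agree."""
    return "".join(value.split()).upper()


def match_weight(rec_a, rec_b, params=FIELD_PARAMS):
    """Sum log2 agreement or disagreement weights over the compared fields."""
    total = 0.0
    for field, (m, u) in params.items():
        if normalise(rec_a[field]) == normalise(rec_b[field]):
            total += math.log2(m / u)  # agreement adds evidence for a match
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement subtracts it
    return total


# Made-up records: the first two differ only in case and spacing.
a = {"surname": "Smith", "date_of_birth": "1950-01-31", "postcode": "LS1 4AP"}
b = {"surname": "SMITH", "date_of_birth": "1950-01-31", "postcode": "ls14ap"}
c = {"surname": "Jones", "date_of_birth": "1982-07-02", "postcode": "M1 1AE"}

print(match_weight(a, b) > 0)  # all fields agree, so the weight is positive
print(match_weight(a, c) < 0)  # no fields agree, so the weight is negative
```

A higher total weight indicates stronger evidence that two records refer to the same person; the cut-off for accepting a link is a separate decision that depends on how false matches and missed matches are traded off.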

12.6 Analyst surveys

12.6.1 Coding in Analysis and Research Survey (CARS)

Coordinated through the Analysis Function, this survey is open to everyone in Government and the public sector, but results are reported by department or organisation only where the number of responses is high enough.

Details on the data collection

2023 results

12.6.2 Association of Professional Healthcare Analysts (AphA)

Continuing Professional Development (CPD) Survey blog, results and Excel data