Summary information

Study title

SIPHER Synthetic Population for Individuals in Great Britain, 2019-2021

Creator

Lomax, N., University of Leeds
Hoehn, A., University of Glasgow
Heppenstall, A., University of Glasgow
Purshouse, R., University of Sheffield
Wu, G., University of Leeds
Zia, K., University of Glasgow
Meier, P., University of Glasgow

Study number / PID

9277 (UKDA)

10.5255/UKDA-SN-9277-1 (DOI)

Data access

Restricted

Series

Not available

Abstract

Abstract copyright UK Data Service and data collection copyright owner.The SIPHER Synthetic Population allows for the creation of a survey-based full-scale synthetic population for all of Great Britain, through a linkage with the UK Household Longitudinal Study (UKDS SN 6614, Understanding Society, wave k). By drawing on data reflecting 'real' survey respondents, the dataset represents over 50 million synthetic (i.e. 'not real') individuals. As a digital twin of the adult population in Great Britain, the SIPHER Synthetic Population provides a novel source of microdata for understanding 'status quo' and modelling 'what if' scenarios (e.g., via static/dynamic microsimulation model), as well as other exploratory analyses where a granular geographical resolution is required.The lack of a centralised and comprehensive register-based system in Great Britain limits opportunities for studying the interaction of aspects such as health, employment, benefit payments, or housing quality at the level of individuals and households. At the same time, the data that exist are typically strictly controlled and only available in safe haven environments under a 'create-and-destroy' model. In particular, when testing policy options via simulation models where results are required swiftly, these limitations can present major hurdles to coproduction and collaborative work connecting researchers, policymakers, and key stakeholders. In some cases, survey data can provide a suitable alternative to the lack of readily available administrative data. However, survey data does typically not allow for a small-area perspective. Although Special Licence area-level linkages of survey data can offer more detailed spatial information, the data coverage and statistical power might be too low for meaningful analysis.As the SIPHER Synthetic Population is the outcome of a statistical creation process, all results obtained from this dataset should always be treated as 'model output', including...
Read more

Methodology

Data collection period

Not available

Country

England and Wales, Scotland

Time dimension

Cross-sectional (one-time) study

Analysis unit

Individuals
Families/households
National

Universe

The data are based on respondents to Understanding Society, wave k.

Sampling procedure

No sampling (total universe)

Kind of data

Numeric
Text

Data collection mode

Compilation/Synthesis

Funding information

Grant number

MR/S037578/2

Access

Publisher

UK Data Service

Publication year

2024

Terms of data access

The Data Collection is available to UK Data Service registered users subject to the End User Licence Agreement.

Commercial use of the data requires approval from the data owner or their nominee. The UK Data Service will contact you.

Additional conditions of use apply:

The accompanying documentation must be read thoroughly before using the data to fully understand the data's structure, limitations, and intended use cases.