Summary information

Study title

SIPHER Synthetic Population for Individuals in Great Britain, 2019-2021: Supplementary Material, 2024

Creator

Lomax, N, University of Leeds
Hoehn, A, University of Glasgow
Heppenstall, A, University of Glasgow
Purshouse, R, University of Sheffield
Wu, G, University of Leeds
Zia, K, University of Glasgow
Meier, P, University of Glasgow

Study number / PID

856754 (UKDA)

10.5255/UKDA-SN-856754 (DOI)

Data access

Open

Series

Not available

Abstract

IMPORTANT: This deposit contains a range of supplementary material related to the deposit of the SIPHER Synthetic Population for Individuals, 2019-2021 (https://doi.org/10.5255/UKDA-SN-9277-1). See the shared readme file for a detailed description describing this deposit. Please note that this deposit does not contain the SIPHER Synthetic Population dataset, or any other Understanding Society survey datasets. The lack of a centralised and comprehensive register-based system in Great Britain limits opportunities for studying the interaction of aspects such as health, employment, benefit payments, or housing quality at the level of individuals and households. At the same time, the data that exist, is typically strictly controlled and only available in safe haven environments under a “create-and-destroy” model. In particular when testing policy options via simulation models where results are required swiftly, these limitations can present major hurdles to coproduction and collaborative work connecting researchers, policymakers, and key stakeholders. In some cases, survey data can provide a suitable alternative to the lack of readily available administrative data. However, survey data does typically not allow for a small-area perspective. Although special license area-level linkages of survey data can offer more detailed spatial information, the data’s coverage and statistical power might be too low for meaningful analysis. Through a linkage with the UK Household Longitudinal Study (Understanding Society, SN 6614, wave k), the SIPHER Synthetic Population allows for the creation of a survey-based full-scale synthetic population for all of Great Britain. By drawing on data reflecting “real” survey respondents, the dataset represents over 50 million synthetic (i.e. “not real”) individuals. As a digital twin of the adult population in Great Britain, the SIPHER Synthetic population provides a novel source of microdata for understanding “status quo” and modelling “what...
Read more

Methodology

Data collection period

Not available

Country

Great Britain, England, Wales, Scotland

Time dimension

Not available

Analysis unit

Individual
Family
Family: Household family
Household
Geographic Unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Numeric
Text
Geospatial

Data collection mode

Please note that this deposit does not contain the main dataset. The main dataset is available via the UK Data Service (https://doi.org/10.5255/UKDA-SN-9277-1). Please see the respective User Guide provided for this dataset for further information on the rationale for creation, methodology, quality control and intended applications.The SIPHER Synthetic Population is a digital twin of the adult population aged 16 years and older in Great Britain. It reflects more than 50 million synthetic individuals - all of which are represented through “real” individuals covered in the Understanding Society survey. The dataset is a large-scale, two-variable file including the variables “pidp” and “synthetic_zone”. The dataset shared is intended for linkage with Understanding Society survey data files such as “k_indresp” and “k_hhresp” using the survey’s person identifier variable (“pidp”). Please see the respective User Guide provided for this dataset for further information on linkages and intended applications.

Funding information

Grant number

MR/S037578/2

Access

Publisher

UK Data Service

Publication year

2024

Terms of data access

The Data Collection is available to any user without the requirement for registration for download/access.

Related publications

Not available