Summary information

Study title

Birmingham Elsevier interdisciplinary research discourse datasets

Creator

Thompson, P, University of Birmingham

Study number / PID

852198 (UKDA)

10.5255/UKDA-SN-852198 (DOI)

Data access

Open

Series

Not available

Abstract

This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website. The main lines of analysis were multidimensional analysis (MDA). From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw numerical data are available in this repository. A second computational approach taken was to use topic modelling to establish, in an inductive manner, what the papers in the GEC corpus are ‘about’. The TopicModel folder contains data for this investigation. We also conducted survey and interview data analysis and the (anonymised) data are presented here. This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website. The main lines of analysis were multidimensional analysis (MDA) for which Doug Biber (Northern Arizona University) acted as a consultant. From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw...
Read more

Methodology

Data collection period

30/08/2013 - 03/11/2015

Country

United Kingdom

Time dimension

Not available

Analysis unit

Individual

Universe

Not available

Sampling procedure

Not available

Kind of data

Numeric

Data collection mode

The corpus was built from XML files provided by the publisher Elsevier, which were converted to annotated text files in the research. The interview data were skype or phone calls, then transcribed. The survey data were collected online, throuogh a web-based survey interface maintained by Elsevier.

Funding information

Grant number

ES/K007300/1

Access

Publisher

UK Data Service

Publication year

2016

Terms of data access

Not available

Related publications

Not available