Summary information

Study title

Analysis of search queries suggested by a Swedish climate obstruction network

Creator

Rödl, Malte ( Department of Urban and Rural Development, Swedish University of Agricultural Sciences)

Study number / PID

2024-48-1 (SND)

SLU.sol.2019.4.2-67 (slu.se)

https://doi.org/10.5878/zb1v-ba15 (DOI)

Data access

Restricted

Series

Not available

Abstract

This data comprises data traces related to search queries used in climate obstruction. It is based on "klimatsans" (Climate Sense or Climate Reason; translated from Swedish, cf. Vowles & Hultman, 2021), a Swedish blog and network which exists since 2014 and runs a Swedish-language blog and submits opinion pieces and letters to the editor to various Swedish news outlets. The stated aims of the network amount to first-level obstruction, i.e. they reject the scientific consensus that increased atmospheric CO2 leads to climate change. The data concerns how the network throughout its various publications invite readers to “google” certain words (keyphrases). The data set includes: 1) all blog posts published on klimatsans.com from January 2014 to June 2022; 2) all hyperlinks from the blog; 3) tabulation, count, and coding of all search queries suggested in the blog, as identified by following after the Swedish imperative verb "googla"; 4) tabulation of all uses of 25 selected keyphrases in Swedish newspapers; 5) results of search engine results pages for these 25 queries from Google and DuckDuckGo (each run three times: in plain, in verbatim using quotation marks, and preceded by the term "googla") (original data available via Sünkler et al., 2023); 6) tabulation and coding of domains frequently targeted by hyperlinks and/or listed in search engine results pages. Furthermore, the data set includes some scripts for replication, an extensive README file for methodological additions, and details on coding schemes. The data was originally collected to investigate to trace data voids through the texts of their creators or proponents. This provides insights into how data voids are created, promoted, used, and if they do not disappear also abandoned.

Methodology

Data collection period

01/01/2014 - 31/07/2022

Country

Sweden

Time dimension

Time series: Continuous

Analysis unit

Media unit: Text
Other

Universe

The data is concerned with the search queries suggested on the blog of a Swedish climate obstruction network (hereafter referred to as CON). Further data investigates what these search queries reveal (DuckDuckGo and Google), and how they have been spread in Swedish print media. Furthermore, we compared these to hyperlinks found on the same blog.

Sampling procedure

Queries were identified by scraping the entire blog, looking for the imperative verb "googla" (Swedish for "google!") followed by a keyphrase. This was assumed to constitute one query, which we then followed through Retriever's news database (all Swedish printed press), as well as the search engines Google and DuckDuckGo.
Total universe/Complete enumeration

Kind of data

Not available

Data collection mode

1. On 1 August 2022, we used the software httrack to crawl the CON’s blog, retrieving 2654 posts.2. We extracted 1943 hyperlinks from the retrieved blog posts.3. We identified 268 occurrences of the term “googla” on 177 different blog posts since 2014.4. We identified and tabulated all explicitly suggested keyphrases, i.e., those that follow an imperative verb and are quoted or follow a colon.5. We coded the retrieved keyphrases according to their syntactical composition. Coding was carried out by the first author and validated by the second author. 6. We created a set of 25 keyphrases to use as seeds for further data creation. The set included all ten keyphrases that had been suggested at least four times, and added 15 strategically selected keyphrases used two or three times to increase variation.7. We submitted the compiled keyphrases to the Swedish media database Retriever, yielding 240 results from Swedish print media. Of those, 204 asked readers to “google” the respective keyphrase.8. We submitted the same keyphrases as queries to Google Search and DuckDuckGo, using the search retrieval analysis software RAT (Result Assessment Tool) to obtain the first SERP for each search engine as well as the HTML source code of results (Lewandowski et al., 2022; data available via Sünkler et al., 2023). We submitted (a) the suggested queries; (b) the suggested queries in quotation marks (i.e., verbatim search); and (c) the Swedish imperative form of “google” followed by the suggested keyphrase (no quotation marks). With a maximum of 10 results per query, but often fewer and sometimes no results for queries b and c, we obtained 146 SERPs and 1001 search results.9. Of these, 249 results link to the CON’s blog, and further 236 results mention the CON or its authors—usually signed by the CON or linking to it. Few referred to the blog as engaged in climate obstruction (e.g. by debunking the CON’s claims); conversely, not all climate obstruction content in the data set mentions the CON.10. Based on search results and hyperlinks, we classified 204 unique domains as frequent, i.e. they occurred in at least two SERPs, at least 10 hyperlinks, or at least 1 SERP and 4 hyperlinks. As these counts represent the possibility of finding a specific domain, we included duplicate targets in these counts. We coded these frequent domains regarding their site type and language. Coding was carried out by the first author and validated by the second author.For more details, see the included README file.
Content coding

Funding information

Funder

Swedish Research Council for Environment Agricultural Sciences and Spatial Planning

Grant number

2022-01352_Formas

Funder

Foundation for Strategic Environmental Research

Grant number

Mistra Environmental Communication

Access

Publisher

Swedish National Data Service

Publication year

2024

Terms of data access

Access to data through SND. Access to data is restricted.

Related publications

Not available