Summary information

Study title

Semantic Query Analysis from the Global Science Gateway

Creator

C. Carlesi (Istituto di Scienze e Tecnologie dell’informazione “A. Faedo”, CNR-ISTI, Italy)

Study number / PID

doi:10.17026/dans-25m-fhe2 (DOI)

easy-dataset:120618 (DANS-KNAW)

Data access

Information not available

Series

Not available

Abstract

Web portals now play an essential role in searching and retrieving information across many fields of knowledge: they are ever more technologically advanced and are designed to store the huge amount of natural-language information originating from the queries launched by users worldwide. A good example is given by the WorldWideScience search engine:

The database is available at . It is based on a similar gateway, Science.gov, which is the major path to U.S. government science information, as it pulls together Web-based resources from various agencies. The information in the database is intended to be of high quality and authority, as well as the most current available from the participating countries in the Alliance, so users will find that the results will be more refined than those from a general search of Google. It covers the fields of medicine, agriculture, the environment, and energy, as well as basic sciences. Most of the information may be obtained free of charge (the database itself may be used free of charge) and is considered “open domain.” As of this writing, there are about 60 countries participating in WorldWideScience.org, providing access to 50+ databases and information portals. Not all content is in English. (Bronson, 2009)

Given this scenario, we focused on building a corpus made up of the query logs registered by the GreyGuide: Repository and Portal to Good Practices and Resources in Grey Literature and received by the WorldWideScience.org (The Global Science Gateway) portal. The aim is to retrieve information related to social media, which today represent a considerable source of data that is increasingly used for research purposes.

This project includes eight months of query logs registered between July 2017 and February 2018, for a total of 445,827 queries. The analysis mainly concentrates on the semantics of the queries received from the portal clients: it is a process of information retrieval from a rich...

Topics

Not available

Methodology

Data collection period

Not available

Country

Time dimension

Not available

Analysis unit

Not available

Universe

Not available

Sampling procedure

Not available

Kind of data

Not available

Data collection mode

Not available

Access

Publisher

DANS Data Station Social Sciences and Humanities

Publication year

2019

Terms of data access

Not available

Related publications

Not available