w_isValid {wikiTools}R Documentation

Check if Wikidata entities are valid

Description

A entity is valid if it has a label or has a description. If one entity exists but is not valid, is possible that it has a redirection to other entity, in that case, the redirection is obtained. Other entities may have existed in the past, but have been deleted. The returned dataframe also includes the Wikidata class (another Wikidata entity) of which the searched entity are instances of. The data-frame no contains labels or descriptions about entities: the function w_LabelDesc can be used for valid entities. Duplicated entities are deleted before search. Index of the data-frame returned are also set to entity_list.

Usage

w_isValid(entity_list, nlimit = 50000, debug = FALSE)

Arguments

entity_list

A vector with de Wikidata entities.

nlimit

If the number of entities exceeds this number, chunked queries are done. This is the number of entities requested in each chunk. Please, reduce the default value if error is raised.

debug

For debugging purposes (default FALSE). If debug='info' information about chunked queries is shown. If debug='query' also the query launched is shown.

Value

A data-frame with four columns: entity, valid (TRUE or FALSE), instanceof and redirection (if the entity redirects to another Wikidata entity, the redirection column contains the last).

Author(s)

Angel Zazo, Department of Computer Science and Automatics, University of Salamanca

Examples

## Not run: 
w_isValid(c("Q9021", "Q115637688", "Q105660123"))
# Large list
l  <- w_SearchByOccupation(Qoc='Q2306091')
l2 <- append(l$entity, c("Q115637688", "Q105660123"))  # Note: adding two new entities
v <- w_isValid(l2)
# Not valid
v[!v$valid, ]

## End(Not run)

[Package wikiTools version 1.2.7 Index]