coupling_entity {biblionetwork} | R Documentation |
Creating Coupling Networks at Entity Level
Description
This function creates the edges of a network of entities from a direct citations data frame (i.e. documents citing references).
Entities could be authors, affiliations, journals, etc. Consequently, coupling links are calculated using the coupling angle measure
(like biblio_coupling()
) or the coupling strength measure (like coupling_strength()
. But it also takes into account the fact that
an entity can cite several times a reference, and considers that citing 10 times a ref is more significant that citing it only once (see details).
Usage
coupling_entity(
dt,
source,
ref,
entity,
weight_threshold = 1,
output_in_character = FALSE,
method = c("coupling_strength", "coupling_angle")
)
Arguments
dt |
The table with citing and cited documents. |
source |
The column name of the source identifiers, that is the documents that are citing. |
ref |
The column name of the cited references identifiers. |
entity |
The column name of the entity (authors, journals, institutions) that are citing. |
weight_threshold |
Corresponds to the value of the non-normalized weights of edges. The function just keeps the edges
that have a non-normalized weight superior to the |
output_in_character |
If TRUE, the function ends by transforming the |
method |
Choose the method you want to use for calculating the edges weights: either |
Details
Coupling links are calculated depending of the number of references two authors (or any entity) share,
taking into account the minimum number of times two authors are citing each references.
For instance, if two entities share a reference in common, the first one citing it twice (in other words, citing it in two different articles),
the second one three times, the function takes two as the minimum value. In addition to the features of the coupling strength measure (see coupling_strength()
)
or the coupling angle measure (see biblio_coupling()
), it means that, if two entities share two reference in common, if the first reference is cited
at least four times by the two entities, whereas the second reference is cited at least only once, the first reference contributes more to the edge weight than
the second reference. This use of minimum shared reference for entities coupling comes from Zhao and Strotmann (2008). It looks like
this for the coupling strength:
\frac{1}{L(A)}.\frac{1}{L(A)}\sum_{j} Min(C_{Aj},C_{Bj}).(log({\frac{N}{freq(R_{j})}}))
with C_{Aj}
and C_{Bj}
the number of time documents A and B cite the reference j.
Value
A data.table with the entity identifiers in from
and to
columns, with the coupling strength or coupling angle measures in
another column, as well as the method used. It also keeps a copy of from
and to
in the Source
and Target
columns. This is useful is you
are using the tidygraph package then, where from
and to
values are modified when creating a graph.
References
Zhao D, Strotmann A (2008). “Author Bibliographic Coupling: Another Approach to Citation-Based Author Knowledge Network Analysis.” Proceedings of the American Society for Information Science and Technology, 45(1), 1–10. https://asistdl.onlinelibrary.wiley.com/doi/full/10.1002/meet.2008.1450450292.
Examples
library(biblionetwork)
Ref_stagflation$Citing_ItemID_Ref <- as.character(Ref_stagflation$Citing_ItemID_Ref)
# merging the references data with the citing author information in Nodes_stagflation
entity_citations <- merge(Ref_stagflation,
Nodes_stagflation,
by.x = "Citing_ItemID_Ref",
by.y = "ItemID_Ref")
coupling_entity(entity_citations,
source = "Citing_ItemID_Ref",
ref = "ItemID_Ref",
entity = "Author.y",
method = "coupling_angle")