R: Simulate h-index and h-alpha values

simulate_hindex {hindex}

R Documentation

Simulate h-index and h-alpha values

Description

Simulate the effect of publishing, being cited, and (strategic) collaborating on the development of h-index and h-alpha values for a specified set of agents.

Usage

simulate_hindex(
  runs = 1,
  n = 100,
  periods = 20,
  subgroups_distr = 1,
  subgroup_advantage = 1,
  subgroup_exchange = 0,
  init_type = "fixage",
  distr_initial_papers = "poisson",
  max_age_scientists = 5,
  dpapers_pois_lambda = 2,
  dpapers_nbinom_dispersion = 1.1,
  dpapers_nbinom_mean = 2,
  productivity = 80,
  distr_citations = "poisson",
  dcitations_speed = 2,
  dcitations_peak = 3,
  dcitations_mean = 2,
  dcitations_dispersion = 1.1,
  coauthors = 5,
  strategic_teams = FALSE,
  diligence_share = 1,
  diligence_corr = 0,
  selfcitations = FALSE,
  update_alpha_authors = FALSE,
  boost = FALSE,
  boost_size = 0.1,
  alpha_share = 0.33
)

Arguments

`runs`	Number of times the simulation is repeated.
`n`	Number of agents acting in each simulation.
`periods`	Number of periods the agents collaborate across in each period.
`subgroups_distr`	Share of scientists in the first subgroup among all scientists
`subgroup_advantage`	Factor by which citations of papers published by agents of subgroup 2 exceed those of papers published by subgroup 1. This option is intended to reflect subdisciplines with different citation levels.
`subgroup_exchange`	Share of agents publishing (alone or in collaboration) with the other subgroup in each period. For example, when specifying subgroup_exchange = .1, 10% of each subgroup join the other subgroup each period.
`init_type`	Type of the initial setup. May be 'fixage' or 'varage'. For init_type = 'fixage', all initial papers have the same age (specified by max_age_scientists). For init_type = 'varage', papers get a random age which is less than or equal to max_age_scientists.
`distr_initial_papers`	Distribution of the papers the scientists have already published at the start of the simulation. Currently, the poisson distribution ("poisson") and the negative binomial distribution ("nbinomial") are supported.
`max_age_scientists`	Maximum age of scientists at the start of the simulation. For init_type = varage, a random age less than or equal to max_age_scientists is assigned to the initial papers. For init_type = fixage, all papers are max_age_scientists old.
`dpapers_pois_lambda`	The distribution parameter for a poisson distribution of initial papers.
`dpapers_nbinom_dispersion`	Dispersion parameter of a negative binomial distribution of initial papers.
`dpapers_nbinom_mean`	Expected value of a negative binomial distribution of initial papers.
`productivity`	The share of papers published by the 20% most productive agents in percentage. This parameter is only used for init_type = 'varage'. For init_type = 'fixage', diligence_share and diligence_corr can be used to control the productivity of scientists.
`distr_citations`	Distribution of citations the papers get. The expected value of this distribution follows a log-logistic function of time. Currently, the poisson distribution ("poisson") and the negative binomial distribution ("nbinomial") are supported.
`dcitations_speed`	The steepness (shape parameter) of the log-logistic time function of the expected citation values.
`dcitations_peak`	The period after publishing when the expected value of the citation distribution reaches its maximum.
`dcitations_mean`	The maximum expected value of the citation distribution (at period dcitations_peak after publishing, the citation distribution has dcitations_mean).
`dcitations_dispersion`	For a negative binomial citation distribution, dcitations_dispersion is a factor by which the variance exceeds the expected value.
`coauthors`	Average number of coauthors publishing papers.
`strategic_teams`	If this parameter is set to TRUE, agents with high h-index avoid co-authorships with agents who have equal or higher h-index values (they strategically select co-authors to improve their h-alpha index). This is implemented by assigning the agents with the highest h-index values to separate teams and randomly assigning the other agents to the teams. Otherwise, the collaborating agents are assigned to co-authorships at random.
`diligence_share`	The share of agents publishing in each period. Only used for init_type = 'fixage'.
`diligence_corr`	The correlation between the initial h-index value and the probability to publish in a given period. This parameter only has an effect if diligence_share < 1. Only used for init_type = 'fixage'.
`selfcitations`	If this parameter is set to TRUE, a paper gets one additional citation if at least one of its authors has a h-index value that exceeds the number of previous citations of the paper by one or two. This reflects agents strategically citing their own papers with citations just below their h-index to accelerate the growth of their h-index.
`update_alpha_authors`	If this parameter is set to TRUE, the alpha author of newly written papers is determined every period based on the current h-index values of its authors. Without this option, the alpha author is determined when the paper is written and held constant from then on.
`boost`	If this parameter is set to TRUE, papers of agents with a higher h-index are cited more frequently than papers of agents with lower h-index. For each team, this effect is based on the team's co-author with the highest h-index within this team.
`boost_size`	Magnitude of the boost effect. For every additional h point of a paper's co-author who has the highest h-index among all of the paper's co-authors, citations of the paper are increased by boost_size, rounded to the next integer.
`alpha_share`	The share of previously published papers where the corresponding agent is alpha author.

Value

For each run, the h-index values and the h-alpha values for each period are stored in a list of lists.

Examples

set.seed(123)
simdata <- simulate_hindex(runs = 2, n = 20, periods = 3)
plot_hsim(simdata, plot_hindex = TRUE)

[Package hindex version 0.2.0 Index]