| context-class {polmineR} | R Documentation | 
Context class.
Description
Class to organize information of context analysis.
Usage
## S4 method for signature 'context'
length(x)
## S4 method for signature 'context'
p_attributes(.Object)
## S4 method for signature 'context'
count(.Object)
## S4 method for signature 'context'
sample(x, size)
## S4 method for signature 'context'
enrich(
  .Object,
  s_attribute = NULL,
  p_attribute = NULL,
  decode = FALSE,
  stat = FALSE,
  verbose = TRUE,
  ...
)
## S4 method for signature 'context'
as.regions(x, node = TRUE)
## S4 method for signature 'context'
trim(
  .Object,
  s_attribute = NULL,
  positivelist = NULL,
  p_attribute = p_attributes(.Object),
  regex = FALSE,
  stoplist = NULL,
  fn = NULL,
  verbose = TRUE,
  progress = TRUE,
  ...
)
Arguments
| x | A  | 
| .Object | A  | 
| size | An  | 
| s_attribute | The s-attribute(s) to add to  | 
| p_attribute | The p-attribute(s) to add to  | 
| decode | A  | 
| stat | A  | 
| verbose | A  | 
| ... | To maintain backwards compatibility if argument  | 
| node | A logical value, whether to include the node (i.e. query matches) in the region matrix
generated when creating a  | 
| positivelist | Tokens that are required to be present to keep a match. | 
| regex | A  | 
| stoplist | Tokens that are used to exclude a match. | 
| fn | A function that will be applied on context tables splitted by match_id. | 
| progress | A  | 
Details
Objects of the class context include a data.table in the
slot cpos. The data.table will at least include the columns "match_id",
"cpos" and "position".
The length-method will return the number of hits that were achieved.
The enrich()-method can be used to add additional information to
the data.table in the cpos-slot of a context-object.
Slots
- query
- The query examined ( - character).
- count
- An - integervalue, the number of hits for the query.
- partition
- The - partitionthe- contextobject is based on.
- size_partition
- The size of the partition, a length-one - integervector.
- left
- A length-one - integervalue, the number of tokens to the left of the query match.
- right
- An - integervalue, the number of tokens to the right of the query match.
- size
- A length-one - integervalue, the number of tokens covered by the- context-object, i.e. the number of tokens in the right and left context of the node as well as query matches.
- size_match
- A length-one - integervalue, the number of tokens matches by the query. Identical with the value in slot- countif the query is not a CQP query.
- size_coi
- A length-one - integervalue, the number of tokens in the right and left context of the node (excluding query matches).
- size_ref
- A length-one - integervalue, the number of tokens in the partition, without tokens matched and the tokens in the left and right context.
- boundary
- An s-attribute ( - character).
- p_attribute
- The p-attribute of the query ( - character).
- corpus
- The CWB corpus used ( - character).
- stat
- A - data.table, the statistics of the analysis.
- encoding
- Object of class - character, encoding of the corpus.
- cpos
- A - data.table, with the columns match_id, cpos, position, word_id.
- method
- A - character-vector, statistical test used.
- call
- Object of class - character, call that generated the object.
Examples
# Keep matches for 'oil' only if first position to the left is 'crude'
.fn <- function(x) if (x[position == -1L][["word"]] == "crude") x else NULL
crude_oil <- context("REUTERS", "oil") %>%
  enrich(p_attribute = "word", decode = TRUE) %>%
  trim(fn = .fn)