Washington_content {ORKM} | R Documentation |
The second view of Washington data set.
Description
Webkb dataset contains web pages from four universities, with the corresponding clusters categorised as Professor, Student, Program, or Other pages. The data set contains four subsets of data, Cornell data set, Texas data set, Washington data set, and Wisconsin data set.
Usage
data("Washington_content")
Format
The format is: num [1:230, 1:1703] 0 0 0 0 0 0 0 0 0 0 ...
Details
Washington data set contains four views with a number of clusters of 5. This data set is the second view with a sample size of 230 and a number of features of 1703.
Source
http://www.cs.cmu.edu/~webkb/
References
M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam and S. Slattery. Learning to Extract Symbolic Knowledge from the World Wide Web. Proceedings of the 15th National Conference on Artificial Intelligence (AAAI-98).
Examples
data(Washington_content)
## maybe str(Washington_content) ; plot(Washington_content) ...