texas_content {ORKM}    R Documentation

The second view of the Texas dataset.

Description

The WebKB data set contains web pages from four universities, with the corresponding clusters categorised as Professor, Student, Program, or Other pages. It comprises four subsets: the Cornell, Texas, Washington, and Wisconsin datasets.

Usage

data("texas_content")

Format

The format is: num [1:187, 1:1703] 0 0 0 0 0 0 0 0 0 0 ...

Details

The Texas data set contains four views and has 5 clusters. This data set is the second view, with 187 samples (rows) and 1703 features (columns).
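The shape described above can be checked with a short sketch. The helper `summarise_view` is hypothetical and not part of ORKM. It assumes only that the view is a numeric matrix with samples in rows and features in columns, as the Format section indicates. The call on the packaged data runs only if ORKM is installed.

```r
# Hypothetical helper (not part of ORKM): summarise one view stored as a numeric matrix.
summarise_view <- function(x) {
  stopifnot(is.matrix(x), is.numeric(x))
  list(
    n_samples     = nrow(x),      # rows are samples
    n_features    = ncol(x),      # columns are features
    zero_fraction = mean(x == 0)  # share of entries equal to zero
  )
}

# Run on the packaged data only when ORKM is available (assumption: it is installed).
if (requireNamespace("ORKM", quietly = TRUE)) {
  data("texas_content", package = "ORKM")
  print(summarise_view(texas_content))
}
```

For this view, `n_samples` and `n_features` should match the 187 and 1703 stated above. `zero_fraction` gives a rough sense of how sparse the matrix is.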

Source

http://www.cs.cmu.edu/~webkb/

References

M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam and S. Slattery. Learning to Extract Symbolic Knowledge from the World Wide Web. Proceedings of the 15th National Conference on Artificial Intelligence (AAAI-98).

Examples

data("texas_content")
## Inspect the dimensions and structure of the view
dim(texas_content)
str(texas_content)

[Package ORKM version 0.8.0.0 Index]