data_instagram {classmap} | R Documentation |
Instagram data
Description
This dataset contains information on fake (spam) accounts on Instagram. The original source is https://www.kaggle.com/free4ever1/instagram-fake-spammer-genuine-accounts by Bardiya Bakhshandeh.
The data contains information on 696 Instagram accounts. For each account, 11 variables were recorded describing its characteristics. The goal is to detect fake instagram accounts, which are used for spamming.
Usage
data("data_instagram")
Format
A data frame with 696 observations on the following variables.
- profile.pic
binary, indicates whether profile has picture.
- nums.length.username
ratio of number of numerical chars in username to its length.
- fullname.words
number of words in full name.
- nums.length.fullname
ratio of number of numerical characters in full name to its length.
- name..username
binary, indicates whether the name and username of the profile are the same.
- description.length
length of the description/biography of the profile (in number of characters).
- external.URL
binary, indicates whether profile has external url.
- private
binary, indicates whether profile is private or not.
- X.posts
number of posts made by profile.
- X.followers
number of followers.
- X.follows
numbers of follows.
- y
whether profile is fake or not.
- dataType
vector taking the values “train” or “test” indicating whether the observation belongs to the training or the test data.
Source
https://www.kaggle.com/free4ever1/instagram-fake-spammer-genuine-accounts
Examples
data(data_instagram)
str(data_instagram)
# The data are used in:
## Not run:
vignette("Random_forest_examples")
## End(Not run)