data_instagram {classmap}R Documentation

Instagram data

Description

This dataset contains information on fake (spam) accounts on Instagram. The original source is https://www.kaggle.com/free4ever1/instagram-fake-spammer-genuine-accounts by Bardiya Bakhshandeh.

The data contains information on 696 Instagram accounts. For each account, 11 variables were recorded describing its characteristics. The goal is to detect fake instagram accounts, which are used for spamming.

Usage

data("data_instagram")

Format

A data frame with 696 observations on the following variables.

profile.pic

binary, indicates whether profile has picture.

nums.length.username

ratio of number of numerical chars in username to its length.

fullname.words

number of words in full name.

nums.length.fullname

ratio of number of numerical characters in full name to its length.

name..username

binary, indicates whether the name and username of the profile are the same.

description.length

length of the description/biography of the profile (in number of characters).

external.URL

binary, indicates whether profile has external url.

private

binary, indicates whether profile is private or not.

X.posts

number of posts made by profile.

X.followers

number of followers.

X.follows

numbers of follows.

y

whether profile is fake or not.

dataType

vector taking the values “train” or “test” indicating whether the observation belongs to the training or the test data.

Source

https://www.kaggle.com/free4ever1/instagram-fake-spammer-genuine-accounts

Examples

data(data_instagram)
str(data_instagram)

# The data are used in:
## Not run: 
vignette("Random_forest_examples")

## End(Not run)

[Package classmap version 1.2.3 Index]