Overview

Dataset info

Number of variables8
Number of observations310
Missing cells243 (9.8%)
Duplicate rows52 (16.8%)
Total size in memory21.8 KiB
Average record size in memory72.0 B

Variables types

Numeric3
Categorical5
Boolean0
Date0
URL0
Text (Unique)0
Rejected0
Unsupported0

Warnings

Dataset has 52 (16.8%) duplicate rows Warning
#_Comments has 243 (78.4%) missing values Missing

Variables

#_Comments
Numeric

Distinct count33
Unique (%)10.6%
Missing (%)78.4%
Missing (n)243
Infinite (%)0.0%
Infinite (n)0
Mean19.07462687
Minimum1
Maximum87
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
Median11
Q329
95-th percentile69.9
Maximum87
Range86
Interquartile range27

Descriptive statistics

Standard deviation22.19426177
Coef of variation1.163548935
Kurtosis1.513094373
Mean19.07462687
MAD17.16818891
Skewness1.462955716
Sum1278
Variance492.5852555
Memory size4.8 KiB
Histogram
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%) 
1 13 4.2%
 
2 10 3.2%
 
33 4 1.3%
 
3 3 1.0%
 
26 3 1.0%
 
35 2 0.6%
 
28 2 0.6%
 
31 2 0.6%
 
19 2 0.6%
 
6 2 0.6%
 
Other values (22) 24 7.7%
 
(Missing) 243 78.4%
 

Minimum 5 values

ValueCountFrequency (%) 
1 13 4.2%
 
2 10 3.2%
 
3 3 1.0%
 
4 2 0.6%
 
6 2 0.6%
 

Maximum 5 values

ValueCountFrequency (%) 
87 1 0.3%
 
81 1 0.3%
 
73 1 0.3%
 
72 1 0.3%
 
65 1 0.3%
 

Content
Categorical

Distinct count12
Unique (%)3.9%
Missing (%)0.0%
Missing (n)0
"Opportunity"
218
News
 
33
Other
 
14
Other values (9)
45
ValueCountFrequency (%) 
"Opportunity" 218 70.3%
 
News 33 10.6%
 
Other 14 4.5%
 
Investing 12 3.9%
 
Trading 6 1.9%
 
Technology 6 1.9%
 
Advertisement 6 1.9%
 
Entertainment 5 1.6%
 
Seeking Info 3 1.0%
 
Outright Scam 3 1.0%
 
Other values (2) 4 1.3%
 
Max length16
Mean length11.36129032
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

Gender
Categorical

Distinct count3
Unique (%)1.0%
Missing (%)0.0%
Missing (n)0
Male
203
Female
91
Unknown
 
16
ValueCountFrequency (%) 
Male 203 65.5%
 
Female 91 29.4%
 
Unknown 16 5.2%
 
Max length7
Mean length4.741935484
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Group_ID
Numeric

Distinct count30
Unique (%)9.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean15.03225806
Minimum1
Maximum30
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q17
Median15
Q323
95-th percentile29
Maximum30
Range29
Interquartile range16

Descriptive statistics

Standard deviation8.906140825
Coef of variation0.5924685957
Kurtosis-1.22335928
Mean15.03225806
MAD7.710718002
Skewness0.01815803906
Sum4660
Variance79.3193444
Memory size4.8 KiB
Histogram
Histogram with fixed size bins (bins=30)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 30. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 20 6.5%
 
29 10 3.2%
 
2 10 3.2%
 
3 10 3.2%
 
4 10 3.2%
 
5 10 3.2%
 
6 10 3.2%
 
7 10 3.2%
 
8 10 3.2%
 
9 10 3.2%
 
Other values (20) 200 64.5%
 

Minimum 5 values

ValueCountFrequency (%) 
1 20 6.5%
 
2 10 3.2%
 
3 10 3.2%
 
4 10 3.2%
 
5 10 3.2%
 

Maximum 5 values

ValueCountFrequency (%) 
30 10 3.2%
 
29 10 3.2%
 
28 10 3.2%
 
27 10 3.2%
 
26 10 3.2%
 

Member_Age
Categorical

Distinct count4
Unique (%)1.3%
Missing (%)0.0%
Missing (n)0
25-40
102
<25
99
>40
59
ValueCountFrequency (%) 
25-40 102 32.9%
 
<25 99 31.9%
 
>40 59 19.0%
 
Unknown 50 16.1%
 
Max length7
Mean length4.303225806
Min length3
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsTrue

Post_Length
Numeric

Distinct count5
Unique (%)1.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.370967742
Minimum1
Maximum5
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q11
Median4.5
Q35
95-th percentile5
Maximum5
Range4
Interquartile range4

Descriptive statistics

Standard deviation1.783000189
Coef of variation0.5289282858
Kurtosis-1.695252395
Mean3.370967742
MAD1.681789802
Skewness-0.354022714
Sum1045
Variance3.179089675
Memory size4.8 KiB
Histogram
Histogram with fixed size bins (bins=5)
Histogram
Histogram with variable size bins (bins=[1. 1.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 155 50.0%
 
1 92 29.7%
 
3 26 8.4%
 
2 24 7.7%
 
4 13 4.2%
 

Minimum 5 values

ValueCountFrequency (%) 
1 92 29.7%
 
2 24 7.7%
 
3 26 8.4%
 
4 13 4.2%
 
5 155 50.0%
 

Maximum 5 values

ValueCountFrequency (%) 
5 155 50.0%
 
4 13 4.2%
 
3 26 8.4%
 
2 24 7.7%
 
1 92 29.7%
 

Post_Type
Categorical

Distinct count7
Unique (%)2.3%
Missing (%)0.0%
Missing (n)0
Link W/ Text
161
Link
57
Text
38
Other values (4)
54
ValueCountFrequency (%) 
Link W/ Text 161 51.9%
 
Link 57 18.4%
 
Text 38 12.3%
 
Image W/ Text 31 10.0%
 
Image 10 3.2%
 
Video 8 2.6%
 
Video W/ Text 5 1.6%
 
Max length13
Mean length9.258064516
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

Sentiment
Categorical

Distinct count3
Unique (%)1.0%
Missing (%)0.0%
Missing (n)0
Neutral
274
Bullish
 
32
Bearish
 
4
ValueCountFrequency (%) 
Neutral 274 88.4%
 
Bullish 32 10.3%
 
Bearish 4 1.3%
 
Max length7
Mean length7
Min length7
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Correlations

Missing values

Sample

First rows

#_CommentsContentGenderGroup_IDMember_AgePost_LengthPost_TypeSentiment
019.0"Opportunity"Male1<252Image W/ TextNeutral
125.0"Opportunity"Male1<252Image W/ TextNeutral
265.0"Opportunity"Female125-402Image W/ TextNeutral
315.0OtherMale125-401TextNeutral
42.0"Opportunity"Female1<255Image W/ TextNeutral
533.0"Opportunity"Female1>401Image W/ TextNeutral
630.0"Opportunity"Male1>401Image W/ TextNeutral
71.0"Opportunity"Female1>405Link W/ TextNeutral
826.0Outright ScamMale125-402TextNeutral
931.0"Opportunity"Unknown1Unknown5TextNeutral

Last rows

#_CommentsContentGenderGroup_IDMember_AgePost_LengthPost_TypeSentiment
300NaN"Opportunity"Male30>405Link W/ TextNeutral
301NaN"Opportunity"Male3025-401LinkNeutral
302NaN"Opportunity"Male3025-405Link W/ TextNeutral
303NaN"Opportunity"Male30>405Link W/ TextNeutral
304NaN"Opportunity"Male3025-401ImageNeutral
305NaNOtherMale3025-401LinkNeutral
306NaNOtherMale30>401VideoNeutral
307NaNOtherFemale3025-401VideoNeutral
308NaNOtherMale3025-401LinkNeutral
309NaNOtherMale3025-401VideoNeutral