Dataset info
Number of variables | 8 |
---|---|
Number of observations | 310 |
Missing cells | 243 (9.8%) |
Duplicate rows | 52 (16.8%) |
Total size in memory | 21.8 KiB |
Average record size in memory | 72.0 B |
Variables types
Numeric | 3 |
---|---|
Categorical | 5 |
Boolean | 0 |
Date | 0 |
URL | 0 |
Text (Unique) | 0 |
Rejected | 0 |
Unsupported | 0 |
Warnings
Dataset has 52 (16.8%) duplicate rows | Warning |
#_Comments has 243 (78.4%) missing values | Missing |
#_Comments
Numeric
Distinct count | 33 |
---|---|
Unique (%) | 10.6% |
Missing (%) | 78.4% |
Missing (n) | 243 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 19.07462687 |
---|---|
Minimum | 1 |
Maximum | 87 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
Median | 11 |
Q3 | 29 |
95-th percentile | 69.9 |
Maximum | 87 |
Range | 86 |
Interquartile range | 27 |
Descriptive statistics
Standard deviation | 22.19426177 |
---|---|
Coef of variation | 1.163548935 |
Kurtosis | 1.513094373 |
Mean | 19.07462687 |
MAD | 17.16818891 |
Skewness | 1.462955716 |
Sum | 1278 |
Variance | 492.5852555 |
Memory size | 4.8 KiB |
Value | Count | Frequency (%) | |
1 | 13 | 4.2% | |
2 | 10 | 3.2% | |
33 | 4 | 1.3% | |
3 | 3 | 1.0% | |
26 | 3 | 1.0% | |
35 | 2 | 0.6% | |
28 | 2 | 0.6% | |
31 | 2 | 0.6% | |
19 | 2 | 0.6% | |
6 | 2 | 0.6% | |
Other values (22) | 24 | 7.7% | |
(Missing) | 243 | 78.4% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 13 | 4.2% | |
2 | 10 | 3.2% | |
3 | 3 | 1.0% | |
4 | 2 | 0.6% | |
6 | 2 | 0.6% |
Maximum 5 values
Value | Count | Frequency (%) | |
87 | 1 | 0.3% | |
81 | 1 | 0.3% | |
73 | 1 | 0.3% | |
72 | 1 | 0.3% | |
65 | 1 | 0.3% |
Content
Categorical
Distinct count | 12 |
---|---|
Unique (%) | 3.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
"Opportunity" | |
---|---|
News | 33 |
Other | 14 |
Other values (9) |
Value | Count | Frequency (%) | |
"Opportunity" | 218 | 70.3% | |
News | 33 | 10.6% | |
Other | 14 | 4.5% | |
Investing | 12 | 3.9% | |
Trading | 6 | 1.9% | |
Technology | 6 | 1.9% | |
Advertisement | 6 | 1.9% | |
Entertainment | 5 | 1.6% | |
Seeking Info | 3 | 1.0% | |
Outright Scam | 3 | 1.0% | |
Other values (2) | 4 | 1.3% |
Max length | 16 |
---|---|
Mean length | 11.36129032 |
Min length | 4 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
Gender
Categorical
Distinct count | 3 |
---|---|
Unique (%) | 1.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Male | |
---|---|
Female | |
Unknown | 16 |
Value | Count | Frequency (%) | |
Male | 203 | 65.5% | |
Female | 91 | 29.4% | |
Unknown | 16 | 5.2% |
Max length | 7 |
---|---|
Mean length | 4.741935484 |
Min length | 4 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
Group_ID
Numeric
Distinct count | 30 |
---|---|
Unique (%) | 9.7% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 15.03225806 |
---|---|
Minimum | 1 |
Maximum | 30 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
Median | 15 |
Q3 | 23 |
95-th percentile | 29 |
Maximum | 30 |
Range | 29 |
Interquartile range | 16 |
Descriptive statistics
Standard deviation | 8.906140825 |
---|---|
Coef of variation | 0.5924685957 |
Kurtosis | -1.22335928 |
Mean | 15.03225806 |
MAD | 7.710718002 |
Skewness | 0.01815803906 |
Sum | 4660 |
Variance | 79.3193444 |
Memory size | 4.8 KiB |
Value | Count | Frequency (%) | |
1 | 20 | 6.5% | |
29 | 10 | 3.2% | |
2 | 10 | 3.2% | |
3 | 10 | 3.2% | |
4 | 10 | 3.2% | |
5 | 10 | 3.2% | |
6 | 10 | 3.2% | |
7 | 10 | 3.2% | |
8 | 10 | 3.2% | |
9 | 10 | 3.2% | |
Other values (20) | 200 | 64.5% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 20 | 6.5% | |
2 | 10 | 3.2% | |
3 | 10 | 3.2% | |
4 | 10 | 3.2% | |
5 | 10 | 3.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
30 | 10 | 3.2% | |
29 | 10 | 3.2% | |
28 | 10 | 3.2% | |
27 | 10 | 3.2% | |
26 | 10 | 3.2% |
Member_Age
Categorical
Distinct count | 4 |
---|---|
Unique (%) | 1.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
25-40 | |
---|---|
<25 | |
>40 |
Value | Count | Frequency (%) | |
25-40 | 102 | 32.9% | |
<25 | 99 | 31.9% | |
>40 | 59 | 19.0% | |
Unknown | 50 | 16.1% |
Max length | 7 |
---|---|
Mean length | 4.303225806 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | True |
Post_Length
Numeric
Distinct count | 5 |
---|---|
Unique (%) | 1.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 3.370967742 |
---|---|
Minimum | 1 |
Maximum | 5 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
Median | 4.5 |
Q3 | 5 |
95-th percentile | 5 |
Maximum | 5 |
Range | 4 |
Interquartile range | 4 |
Descriptive statistics
Standard deviation | 1.783000189 |
---|---|
Coef of variation | 0.5289282858 |
Kurtosis | -1.695252395 |
Mean | 3.370967742 |
MAD | 1.681789802 |
Skewness | -0.354022714 |
Sum | 1045 |
Variance | 3.179089675 |
Memory size | 4.8 KiB |
Value | Count | Frequency (%) | |
5 | 155 | 50.0% | |
1 | 92 | 29.7% | |
3 | 26 | 8.4% | |
2 | 24 | 7.7% | |
4 | 13 | 4.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 92 | 29.7% | |
2 | 24 | 7.7% | |
3 | 26 | 8.4% | |
4 | 13 | 4.2% | |
5 | 155 | 50.0% |
Maximum 5 values
Value | Count | Frequency (%) | |
5 | 155 | 50.0% | |
4 | 13 | 4.2% | |
3 | 26 | 8.4% | |
2 | 24 | 7.7% | |
1 | 92 | 29.7% |
Post_Type
Categorical
Distinct count | 7 |
---|---|
Unique (%) | 2.3% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Link W/ Text | |
---|---|
Link | |
Text | |
Other values (4) |
Value | Count | Frequency (%) | |
Link W/ Text | 161 | 51.9% | |
Link | 57 | 18.4% | |
Text | 38 | 12.3% | |
Image W/ Text | 31 | 10.0% | |
Image | 10 | 3.2% | |
Video | 8 | 2.6% | |
Video W/ Text | 5 | 1.6% |
Max length | 13 |
---|---|
Mean length | 9.258064516 |
Min length | 4 |
Contains chars | True |
Contains digits | False |
Contains spaces | True |
Contains non-words | True |
Sentiment
Categorical
Distinct count | 3 |
---|---|
Unique (%) | 1.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Neutral | |
---|---|
Bullish | 32 |
Bearish | 4 |
Value | Count | Frequency (%) | |
Neutral | 274 | 88.4% | |
Bullish | 32 | 10.3% | |
Bearish | 4 | 1.3% |
Max length | 7 |
---|---|
Mean length | 7 |
Min length | 7 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
First rows
#_Comments | Content | Gender | Group_ID | Member_Age | Post_Length | Post_Type | Sentiment | |
---|---|---|---|---|---|---|---|---|
0 | 19.0 | "Opportunity" | Male | 1 | <25 | 2 | Image W/ Text | Neutral |
1 | 25.0 | "Opportunity" | Male | 1 | <25 | 2 | Image W/ Text | Neutral |
2 | 65.0 | "Opportunity" | Female | 1 | 25-40 | 2 | Image W/ Text | Neutral |
3 | 15.0 | Other | Male | 1 | 25-40 | 1 | Text | Neutral |
4 | 2.0 | "Opportunity" | Female | 1 | <25 | 5 | Image W/ Text | Neutral |
5 | 33.0 | "Opportunity" | Female | 1 | >40 | 1 | Image W/ Text | Neutral |
6 | 30.0 | "Opportunity" | Male | 1 | >40 | 1 | Image W/ Text | Neutral |
7 | 1.0 | "Opportunity" | Female | 1 | >40 | 5 | Link W/ Text | Neutral |
8 | 26.0 | Outright Scam | Male | 1 | 25-40 | 2 | Text | Neutral |
9 | 31.0 | "Opportunity" | Unknown | 1 | Unknown | 5 | Text | Neutral |
Last rows
#_Comments | Content | Gender | Group_ID | Member_Age | Post_Length | Post_Type | Sentiment | |
---|---|---|---|---|---|---|---|---|
300 | NaN | "Opportunity" | Male | 30 | >40 | 5 | Link W/ Text | Neutral |
301 | NaN | "Opportunity" | Male | 30 | 25-40 | 1 | Link | Neutral |
302 | NaN | "Opportunity" | Male | 30 | 25-40 | 5 | Link W/ Text | Neutral |
303 | NaN | "Opportunity" | Male | 30 | >40 | 5 | Link W/ Text | Neutral |
304 | NaN | "Opportunity" | Male | 30 | 25-40 | 1 | Image | Neutral |
305 | NaN | Other | Male | 30 | 25-40 | 1 | Link | Neutral |
306 | NaN | Other | Male | 30 | >40 | 1 | Video | Neutral |
307 | NaN | Other | Female | 30 | 25-40 | 1 | Video | Neutral |
308 | NaN | Other | Male | 30 | 25-40 | 1 | Link | Neutral |
309 | NaN | Other | Male | 30 | 25-40 | 1 | Video | Neutral |