Description
Paper
Basic Stats
General
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 10295 entries, 0 to 10294
Data columns (total 38 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 author 10295 non-null object
1 mbti 9084 non-null object
2 introverted 9078 non-null float64
3 intuitive 9083 non-null float64
4 thinking 9080 non-null float64
5 perceiving 9074 non-null float64
6 gender 3227 non-null object
7 age 2324 non-null float64
8 enneagram 794 non-null object
9 country 2146 non-null object
10 state 857 non-null object
11 type 1611 non-null object
12 agreeableness 1606 non-null float64
13 openness 1588 non-null float64
14 conscientiousness 1605 non-null float64
15 extraversion 1608 non-null float64
16 neuroticism 1603 non-null float64
17 is_description 151 non-null float64
18 is_percentile 368 non-null object
19 is_score 1098 non-null float64
20 contains_details 1386 non-null float64
21 num_comments 10295 non-null int64
22 en_comments 10295 non-null int64
23 en_comments_percentage 10295 non-null float64
24 region 852 non-null object
25 continent 2146 non-null object
26 country_code 2146 non-null object
27 enneagram_type 794 non-null float64
28 enneagram_wing 790 non-null float64
29 is_native_english_country 2146 non-null float64
30 predicted_test 1677 non-null float64
31 test_name 1677 non-null object
32 test_scale 1677 non-null object
33 16pers_ta 9 non-null object
34 test_result_type 1677 non-null object
35 is_female 3084 non-null float64
36 is_female_pred 10295 non-null int64
37 is_female_proba 10295 non-null float64
dtypes: float64(20), int64(3), object(15)
memory usage: 3.0+ MB
Gender
| age | agreeableness | openness | conscientiousness | extraversion | neuroticism | comments | in english |
---|
count | 2324 | 1606 | 1588 | 1605 | 1608 | 1603 | 10295 | 10295 |
---|
mean | 25.68 | 42.40 | 62.45 | 40.16 | 37.38 | 49.78 | 1819 | 1714 |
---|
std | 7.07 | 31.04 | 27.78 | 30.39 | 30.47 | 32.37 | 4104 | 3866 |
---|
min | 14 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
---|
25% | 21 | 14 | 44 | 13 | 10 | 19 | 219 | 206 |
---|
50% | 24 | 40 | 69 | 35 | 30 | 50 | 604 | 569 |
---|
75% | 29 | 70 | 85 | 65 | 61 | 82 | 1729 | 1616 |
---|
max | 67 | 100 | 100 | 99 | 99 | 100 | 101789 | 99568 |
---|
MBTI dimensions
| introverted | intuitive | thinking | perceiving |
---|
1.0 | 7152 | 8054 | 5857 | 5315 |
---|
0.0 | 1926 | 1029 | 3223 | 3759 |
---|
MBTI types
| intp | intj | infp | infj | entp | enfp | istp | entj | istj | enfj | isfp | isfj | estp | esfp | estj | esfj |
---|
mbti | 2336 | 1847 | 1074 | 1051 | 631 | 617 | 407 | 320 | 195 | 163 | 123 | 109 | 72 | 50 | 43 | 29 |
---|
Enneagram types
| 5.0 | 4.0 | 9.0 | 7.0 | 6.0 | 2.0 | 8.0 | 1.0 | 3.0 |
---|
enneagram_type | 239 | 164 | 100 | 79 | 56 | 44 | 41 | 41 | 30 |
---|
Continents
| north america | europe | asia | oceania | south america | africa |
---|
continent | 1325 | 598 | 106 | 87 | 26 | 4 |
---|
Top 10 countries
| us | canada | uk | australia | germany | netherlands | sweden | eu | france | finland |
---|
country | 1125 | 188 | 177 | 73 | 54 | 38 | 33 | 27 | 27 | 24 |
---|
Regions (without prefix US, Canada with prefix c)
| w | mw | se | ne | sw | cw | ce |
---|
region | 214 | 155 | 145 | 140 | 101 | 53 | 44 |
---|
Terms of use
Please note the following in case you download the datasets:
- You cannot transfer or reproduce any part of the dataset.
- You cannot attempt to identify any user in the dataset.
- You cannot contact any user in the dataset.
- You can not display users’ names and sensitive messages publicly
- You should check (via Reddit API) if users removed some of their content and/or deleted their accounts
- You can report your findings publicly only on an aggregate level