Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 45211 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 29.2 MiB |
Average record size in memory | 677.2 B |
Variable types
NUM | 7 |
---|---|
CAT | 6 |
BOOL | 4 |
Reproduction
Analysis started | 2020-02-14 00:41:42.650525 |
---|---|
Analysis finished | 2020-02-14 00:42:01.088083 |
Version | pandas-profiling v2.5.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
age
Real number (ℝ≥0)
Distinct count | 77 |
---|---|
Unique (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.93621021432837 |
---|---|
Minimum | 18 |
Maximum | 95 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 45211 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 29.2 MiB |
Average record size in memory | 677.2 B |
Variable types
NUM | 7 |
---|---|
CAT | 6 |
BOOL | 4 |
Reproduction
Analysis started | 2020-02-13 23:56:23.157177 |
---|---|
Analysis finished | 2020-02-13 23:56:42.235000 |
Version | pandas-profiling v2.5.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
age
Real number (ℝ≥0)
Distinct count | 77 |
---|---|
Unique (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.93621021432837 |
---|---|
Minimum | 18 |
Maximum | 95 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 18 |
---|---|
5-th percentile | 27 |
Q1 | 33 |
median | 39 |
Q3 | 48 |
95-th percentile | 59 |
Maximum | 95 |
Range | 77 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 10.61876204 |
---|---|
Coefficient of variation (CV) | 0.2593977797 |
Kurtosis | 0.3195703759 |
Mean | 40.93621021 |
Median Absolute Deviation (MAD) | 8.737267827 |
Skewness | 0.6848179257 |
Sum | 1850767 |
Variance | 112.7581073 |
Quantile statistics
Minimum | 18 |
---|---|
5-th percentile | 27 |
Q1 | 33 |
median | 39 |
Q3 | 48 |
95-th percentile | 59 |
Maximum | 95 |
Range | 77 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 10.61876204 |
---|---|
Coefficient of variation (CV) | 0.2593977797 |
Kurtosis | 0.3195703759 |
Mean | 40.93621021 |
Median Absolute Deviation (MAD) | 8.737267827 |
Skewness | 0.6848179257 |
Sum | 1850767 |
Variance | 112.7581073 |
Value | Count | Frequency (%) | |
32 | 2085 | 4.6% | |
31 | 1996 | 4.4% | |
33 | 1972 | 4.4% | |
34 | 1930 | 4.3% | |
35 | 1894 | 4.2% | |
36 | 1806 | 4.0% | |
30 | 1757 | 3.9% | |
37 | 1696 | 3.8% | |
39 | 1487 | 3.3% | |
38 | 1466 | 3.2% | |
Other values (67) | 27122 | 60.0% |
Value | Count | Frequency (%) | |
18 | 12 | < 0.1% | |
19 | 35 | 0.1% | |
20 | 50 | 0.1% | |
21 | 79 | 0.2% | |
22 | 129 | 0.3% |
Value | Count | Frequency (%) | |
95 | 2 | < 0.1% | |
94 | 1 | < 0.1% | |
93 | 2 | < 0.1% | |
92 | 2 | < 0.1% | |
90 | 2 | < 0.1% |
job
Categorical
Distinct count | 12 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
blue-collar | |
---|---|
management | |
technician | |
admin. | |
services | |
Other values (7) |
Value | Count | Frequency (%) | |
blue-collar | 9732 | 21.5% | |
management | 9458 | 20.9% | |
technician | 7597 | 16.8% | |
admin. | 5171 | 11.4% | |
services | 4154 | 9.2% | |
retired | 2264 | 5.0% | |
self-employed | 1579 | 3.5% | |
entrepreneur | 1487 | 3.3% | |
unemployed | 1303 | 2.9% | |
housemaid | 1240 | 2.7% | |
Other values (2) | 1226 | 2.7% |
Value | Count | Frequency (%) | |
32 | 2085 | 4.6% | |
31 | 1996 | 4.4% | |
33 | 1972 | 4.4% | |
34 | 1930 | 4.3% | |
35 | 1894 | 4.2% | |
36 | 1806 | 4.0% | |
30 | 1757 | 3.9% | |
37 | 1696 | 3.8% | |
39 | 1487 | 3.3% | |
38 | 1466 | 3.2% | |
Other values (67) | 27122 | 60.0% |
Value | Count | Frequency (%) | |
18 | 12 | < 0.1% | |
19 | 35 | 0.1% | |
20 | 50 | 0.1% | |
21 | 79 | 0.2% | |
22 | 129 | 0.3% |
Value | Count | Frequency (%) | |
95 | 2 | < 0.1% | |
94 | 1 | < 0.1% | |
93 | 2 | < 0.1% | |
92 | 2 | < 0.1% | |
90 | 2 | < 0.1% |
job
Categorical
Distinct count | 12 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
blue-collar | |
---|---|
management | |
technician | |
admin. | |
services | |
Other values (7) |
Value | Count | Frequency (%) | |
blue-collar | 9732 | 21.5% | |
management | 9458 | 20.9% | |
technician | 7597 | 16.8% | |
admin. | 5171 | 11.4% | |
services | 4154 | 9.2% | |
retired | 2264 | 5.0% | |
self-employed | 1579 | 3.5% | |
entrepreneur | 1487 | 3.3% | |
unemployed | 1303 | 2.9% | |
housemaid | 1240 | 2.7% | |
Other values (2) | 1226 | 2.7% |
Length
Max length | 13 |
---|---|
Mean length | 9.485545553 |
Min length | 6 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 22 | 91.7% | |
Dash_Punctuation | 1 | 4.2% | |
Other_Punctuation | 1 | 4.2% |
Value | Count | Frequency (%) | |
Latin | 22 | 91.7% | |
Common | 2 | 8.3% |
Value | Count | Frequency (%) | |
ASCII | 24 | 100.0% |
marital
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
married | |
---|---|
single | |
divorced | 5207 |
Value | Count | Frequency (%) | |
married | 27214 | 60.2% | |
single | 12790 | 28.3% | |
divorced | 5207 | 11.5% |
Length
Max length | 13 |
---|---|
Mean length | 9.485545553 |
Min length | 6 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 22 | 91.7% | |
Dash_Punctuation | 1 | 4.2% | |
Other_Punctuation | 1 | 4.2% |
Value | Count | Frequency (%) | |
Latin | 22 | 91.7% | |
Common | 2 | 8.3% |
Value | Count | Frequency (%) | |
ASCII | 24 | 100.0% |
marital
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
married | |
---|---|
single | |
divorced | 5207 |
Value | Count | Frequency (%) | |
married | 27214 | 60.2% | |
single | 12790 | 28.3% | |
divorced | 5207 | 11.5% |
Length
Max length | 8 |
---|---|
Mean length | 6.832275331 |
Min length | 6 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 13 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 13 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 13 | 100.0% |
education
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
secondary | |
---|---|
tertiary | |
primary | |
unknown | 1857 |
Value | Count | Frequency (%) | |
secondary | 23202 | 51.3% | |
tertiary | 13301 | 29.4% | |
primary | 6851 | 15.2% | |
unknown | 1857 | 4.1% |
Length
Max length | 8 |
---|---|
Mean length | 6.832275331 |
Min length | 6 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 13 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 13 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 13 | 100.0% |
education
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
secondary | |
---|---|
tertiary | |
primary | |
unknown | 1857 |
Value | Count | Frequency (%) | |
secondary | 23202 | 51.3% | |
tertiary | 13301 | 29.4% | |
primary | 6851 | 15.2% | |
unknown | 1857 | 4.1% |
Length
Max length | 9 |
---|---|
Mean length | 8.320585698 |
Min length | 7 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 16 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 16 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 16 | 100.0% |
default
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 815 |
Value | Count | Frequency (%) | |
no | 44396 | 98.2% | |
yes | 815 | 1.8% |
Distinct count | 7168 |
---|---|
Unique (%) | 15.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1362.2720576850766 |
---|---|
Minimum | -8019 |
Maximum | 102127 |
Zeros | 3514 |
Zeros (%) | 7.8% |
Memory size | 353.3 KiB |
Length
Max length | 9 |
---|---|
Mean length | 8.320585698 |
Min length | 7 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 16 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 16 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 16 | 100.0% |
default
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 815 |
Value | Count | Frequency (%) | |
no | 44396 | 98.2% | |
yes | 815 | 1.8% |
Distinct count | 7168 |
---|---|
Unique (%) | 15.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1362.2720576850766 |
---|---|
Minimum | -8019 |
Maximum | 102127 |
Zeros | 3514 |
Zeros (%) | 7.8% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | -8019 |
---|---|
5-th percentile | -172 |
Q1 | 72 |
median | 448 |
Q3 | 1428 |
95-th percentile | 5768 |
Maximum | 102127 |
Range | 110146 |
Interquartile range (IQR) | 1356 |
Descriptive statistics
Standard deviation | 3044.765829 |
---|---|
Coefficient of variation (CV) | 2.235064437 |
Kurtosis | 140.7515466 |
Mean | 1362.272058 |
Median Absolute Deviation (MAD) | 1551.513033 |
Skewness | 8.360308326 |
Sum | 61589682 |
Variance | 9270598.954 |
Quantile statistics
Minimum | -8019 |
---|---|
5-th percentile | -172 |
Q1 | 72 |
median | 448 |
Q3 | 1428 |
95-th percentile | 5768 |
Maximum | 102127 |
Range | 110146 |
Interquartile range (IQR) | 1356 |
Descriptive statistics
Standard deviation | 3044.765829 |
---|---|
Coefficient of variation (CV) | 2.235064437 |
Kurtosis | 140.7515466 |
Mean | 1362.272058 |
Median Absolute Deviation (MAD) | 1551.513033 |
Skewness | 8.360308326 |
Sum | 61589682 |
Variance | 9270598.954 |
Value | Count | Frequency (%) | |
0 | 3514 | 7.8% | |
1 | 195 | 0.4% | |
2 | 156 | 0.3% | |
4 | 139 | 0.3% | |
3 | 134 | 0.3% | |
5 | 113 | 0.2% | |
6 | 88 | 0.2% | |
8 | 81 | 0.2% | |
23 | 75 | 0.2% | |
10 | 69 | 0.2% | |
Other values (7158) | 40647 | 89.9% |
Value | Count | Frequency (%) | |
-8019 | 1 | < 0.1% | |
-6847 | 1 | < 0.1% | |
-4057 | 1 | < 0.1% | |
-3372 | 1 | < 0.1% | |
-3313 | 1 | < 0.1% |
Value | Count | Frequency (%) | |
102127 | 1 | < 0.1% | |
98417 | 1 | < 0.1% | |
81204 | 2 | < 0.1% | |
71188 | 1 | < 0.1% | |
66721 | 1 | < 0.1% |
housing
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
yes | |
---|---|
no |
Value | Count | Frequency (%) | |
yes | 25130 | 55.6% | |
no | 20081 | 44.4% |
loan
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 7244 |
Value | Count | Frequency (%) | |
no | 37967 | 84.0% | |
yes | 7244 | 16.0% |
contact
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
cellular | |
---|---|
unknown | |
telephone | 2906 |
Value | Count | Frequency (%) | |
cellular | 29285 | 64.8% | |
unknown | 13020 | 28.8% | |
telephone | 2906 | 6.4% |
Value | Count | Frequency (%) | |
0 | 3514 | 7.8% | |
1 | 195 | 0.4% | |
2 | 156 | 0.3% | |
4 | 139 | 0.3% | |
3 | 134 | 0.3% | |
5 | 113 | 0.2% | |
6 | 88 | 0.2% | |
8 | 81 | 0.2% | |
23 | 75 | 0.2% | |
10 | 69 | 0.2% | |
Other values (7158) | 40647 | 89.9% |
Value | Count | Frequency (%) | |
-8019 | 1 | < 0.1% | |
-6847 | 1 | < 0.1% | |
-4057 | 1 | < 0.1% | |
-3372 | 1 | < 0.1% | |
-3313 | 1 | < 0.1% |
Value | Count | Frequency (%) | |
102127 | 1 | < 0.1% | |
98417 | 1 | < 0.1% | |
81204 | 2 | < 0.1% | |
71188 | 1 | < 0.1% | |
66721 | 1 | < 0.1% |
housing
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
yes | |
---|---|
no |
Value | Count | Frequency (%) | |
yes | 25130 | 55.6% | |
no | 20081 | 44.4% |
loan
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 7244 |
Value | Count | Frequency (%) | |
no | 37967 | 84.0% | |
yes | 7244 | 16.0% |
contact
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
cellular | |
---|---|
unknown | |
telephone | 2906 |
Value | Count | Frequency (%) | |
cellular | 29285 | 64.8% | |
unknown | 13020 | 28.8% | |
telephone | 2906 | 6.4% |
Length
Max length | 9 |
---|---|
Mean length | 7.77629338 |
Min length | 7 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 13 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 13 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 13 | 100.0% |
day
Real number (ℝ≥0)
Distinct count | 31 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.80641879188693 |
---|---|
Minimum | 1 |
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Length
Max length | 9 |
---|---|
Mean length | 7.77629338 |
Min length | 7 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 13 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 13 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 13 | 100.0% |
day
Real number (ℝ≥0)
Distinct count | 31 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.80641879188693 |
---|---|
Minimum | 1 |
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 8 |
median | 16 |
Q3 | 21 |
95-th percentile | 29 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 8.322476153 |
---|---|
Coefficient of variation (CV) | 0.5265250948 |
Kurtosis | -1.059897373 |
Mean | 15.80641879 |
Median Absolute Deviation (MAD) | 7.055904428 |
Skewness | 0.09307901402 |
Sum | 714624 |
Variance | 69.26360932 |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 8 |
median | 16 |
Q3 | 21 |
95-th percentile | 29 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 8.322476153 |
---|---|
Coefficient of variation (CV) | 0.5265250948 |
Kurtosis | -1.059897373 |
Mean | 15.80641879 |
Median Absolute Deviation (MAD) | 7.055904428 |
Skewness | 0.09307901402 |
Sum | 714624 |
Variance | 69.26360932 |
Value | Count | Frequency (%) | |
20 | 2752 | 6.1% | |
18 | 2308 | 5.1% | |
21 | 2026 | 4.5% | |
17 | 1939 | 4.3% | |
6 | 1932 | 4.3% | |
5 | 1910 | 4.2% | |
14 | 1848 | 4.1% | |
8 | 1842 | 4.1% | |
28 | 1830 | 4.0% | |
7 | 1817 | 4.0% | |
Other values (21) | 25007 | 55.3% |
Value | Count | Frequency (%) | |
1 | 322 | 0.7% | |
2 | 1293 | 2.9% | |
3 | 1079 | 2.4% | |
4 | 1445 | 3.2% | |
5 | 1910 | 4.2% |
Value | Count | Frequency (%) | |
31 | 643 | 1.4% | |
30 | 1566 | 3.5% | |
29 | 1745 | 3.9% | |
28 | 1830 | 4.0% | |
27 | 1121 | 2.5% |
month
Categorical
Distinct count | 12 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
may | |
---|---|
jul | |
aug | |
jun | |
nov | |
Other values (7) |
Value | Count | Frequency (%) | |
may | 13766 | 30.4% | |
jul | 6895 | 15.3% | |
aug | 6247 | 13.8% | |
jun | 5341 | 11.8% | |
nov | 3970 | 8.8% | |
apr | 2932 | 6.5% | |
feb | 2649 | 5.9% | |
jan | 1403 | 3.1% | |
oct | 738 | 1.6% | |
sep | 579 | 1.3% | |
Other values (2) | 691 | 1.5% |
Value | Count | Frequency (%) | |
20 | 2752 | 6.1% | |
18 | 2308 | 5.1% | |
21 | 2026 | 4.5% | |
17 | 1939 | 4.3% | |
6 | 1932 | 4.3% | |
5 | 1910 | 4.2% | |
14 | 1848 | 4.1% | |
8 | 1842 | 4.1% | |
28 | 1830 | 4.0% | |
7 | 1817 | 4.0% | |
Other values (21) | 25007 | 55.3% |
Value | Count | Frequency (%) | |
1 | 322 | 0.7% | |
2 | 1293 | 2.9% | |
3 | 1079 | 2.4% | |
4 | 1445 | 3.2% | |
5 | 1910 | 4.2% |
Value | Count | Frequency (%) | |
31 | 643 | 1.4% | |
30 | 1566 | 3.5% | |
29 | 1745 | 3.9% | |
28 | 1830 | 4.0% | |
27 | 1121 | 2.5% |
month
Categorical
Distinct count | 12 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
may | |
---|---|
jul | |
aug | |
jun | |
nov | |
Other values (7) |
Value | Count | Frequency (%) | |
may | 13766 | 30.4% | |
jul | 6895 | 15.3% | |
aug | 6247 | 13.8% | |
jun | 5341 | 11.8% | |
nov | 3970 | 8.8% | |
apr | 2932 | 6.5% | |
feb | 2649 | 5.9% | |
jan | 1403 | 3.1% | |
oct | 738 | 1.6% | |
sep | 579 | 1.3% | |
Other values (2) | 691 | 1.5% |
Length
Max length | 3 |
---|---|
Mean length | 3 |
Min length | 3 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 19 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 19 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 19 | 100.0% |
duration
Real number (ℝ≥0)
Distinct count | 1573 |
---|---|
Unique (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 258.1630797814691 |
---|---|
Minimum | 0 |
Maximum | 4918 |
Zeros | 3 |
Zeros (%) | < 0.1% |
Memory size | 353.3 KiB |
Length
Max length | 3 |
---|---|
Mean length | 3 |
Min length | 3 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 19 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 19 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 19 | 100.0% |
duration
Real number (ℝ≥0)
Distinct count | 1573 |
---|---|
Unique (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 258.1630797814691 |
---|---|
Minimum | 0 |
Maximum | 4918 |
Zeros | 3 |
Zeros (%) | < 0.1% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 35 |
Q1 | 103 |
median | 180 |
Q3 | 319 |
95-th percentile | 751 |
Maximum | 4918 |
Range | 4918 |
Interquartile range (IQR) | 216 |
Descriptive statistics
Standard deviation | 257.5278123 |
---|---|
Coefficient of variation (CV) | 0.9975392782 |
Kurtosis | 18.15391527 |
Mean | 258.1630798 |
Median Absolute Deviation (MAD) | 170.9677992 |
Skewness | 3.144318099 |
Sum | 11671811 |
Variance | 66320.57409 |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 35 |
Q1 | 103 |
median | 180 |
Q3 | 319 |
95-th percentile | 751 |
Maximum | 4918 |
Range | 4918 |
Interquartile range (IQR) | 216 |
Descriptive statistics
Standard deviation | 257.5278123 |
---|---|
Coefficient of variation (CV) | 0.9975392782 |
Kurtosis | 18.15391527 |
Mean | 258.1630798 |
Median Absolute Deviation (MAD) | 170.9677992 |
Skewness | 3.144318099 |
Sum | 11671811 |
Variance | 66320.57409 |
Value | Count | Frequency (%) | |
124 | 188 | 0.4% | |
90 | 184 | 0.4% | |
89 | 177 | 0.4% | |
122 | 175 | 0.4% | |
104 | 175 | 0.4% | |
114 | 175 | 0.4% | |
136 | 174 | 0.4% | |
112 | 174 | 0.4% | |
139 | 174 | 0.4% | |
121 | 173 | 0.4% | |
Other values (1563) | 43442 | 96.1% |
Value | Count | Frequency (%) | |
0 | 3 | < 0.1% | |
1 | 2 | < 0.1% | |
2 | 3 | < 0.1% | |
3 | 4 | < 0.1% | |
4 | 15 | < 0.1% |
Value | Count | Frequency (%) | |
4918 | 1 | < 0.1% | |
3881 | 1 | < 0.1% | |
3785 | 1 | < 0.1% | |
3422 | 1 | < 0.1% | |
3366 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
Distinct count | 48 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.763840658246887 |
---|---|
Minimum | 1 |
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Value | Count | Frequency (%) | |
124 | 188 | 0.4% | |
90 | 184 | 0.4% | |
89 | 177 | 0.4% | |
122 | 175 | 0.4% | |
104 | 175 | 0.4% | |
114 | 175 | 0.4% | |
136 | 174 | 0.4% | |
112 | 174 | 0.4% | |
139 | 174 | 0.4% | |
121 | 173 | 0.4% | |
Other values (1563) | 43442 | 96.1% |
Value | Count | Frequency (%) | |
0 | 3 | < 0.1% | |
1 | 2 | < 0.1% | |
2 | 3 | < 0.1% | |
3 | 4 | < 0.1% | |
4 | 15 | < 0.1% |
Value | Count | Frequency (%) | |
4918 | 1 | < 0.1% | |
3881 | 1 | < 0.1% | |
3785 | 1 | < 0.1% | |
3422 | 1 | < 0.1% | |
3366 | 1 | < 0.1% |
campaign
Real number (ℝ≥0)
Distinct count | 48 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.763840658246887 |
---|---|
Minimum | 1 |
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 8 |
Maximum | 63 |
Range | 62 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 3.098020883 |
---|---|
Coefficient of variation (CV) | 1.120911538 |
Kurtosis | 39.2496508 |
Mean | 2.763840658 |
Median Absolute Deviation (MAD) | 1.791451104 |
Skewness | 4.898650166 |
Sum | 124956 |
Variance | 9.597733393 |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 8 |
Maximum | 63 |
Range | 62 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 3.098020883 |
---|---|
Coefficient of variation (CV) | 1.120911538 |
Kurtosis | 39.2496508 |
Mean | 2.763840658 |
Median Absolute Deviation (MAD) | 1.791451104 |
Skewness | 4.898650166 |
Sum | 124956 |
Variance | 9.597733393 |
Value | Count | Frequency (%) | |
1 | 17544 | 38.8% | |
2 | 12505 | 27.7% | |
3 | 5521 | 12.2% | |
4 | 3522 | 7.8% | |
5 | 1764 | 3.9% | |
6 | 1291 | 2.9% | |
7 | 735 | 1.6% | |
8 | 540 | 1.2% | |
9 | 327 | 0.7% | |
10 | 266 | 0.6% | |
Other values (38) | 1196 | 2.6% |
Value | Count | Frequency (%) | |
1 | 17544 | 38.8% | |
2 | 12505 | 27.7% | |
3 | 5521 | 12.2% | |
4 | 3522 | 7.8% | |
5 | 1764 | 3.9% |
Value | Count | Frequency (%) | |
63 | 1 | < 0.1% | |
58 | 1 | < 0.1% | |
55 | 1 | < 0.1% | |
51 | 1 | < 0.1% | |
50 | 2 | < 0.1% |
pdays
Real number (ℝ)
Distinct count | 559 |
---|---|
Unique (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.19782796222158 |
---|---|
Minimum | -1 |
Maximum | 871 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Value | Count | Frequency (%) | |
1 | 17544 | 38.8% | |
2 | 12505 | 27.7% | |
3 | 5521 | 12.2% | |
4 | 3522 | 7.8% | |
5 | 1764 | 3.9% | |
6 | 1291 | 2.9% | |
7 | 735 | 1.6% | |
8 | 540 | 1.2% | |
9 | 327 | 0.7% | |
10 | 266 | 0.6% | |
Other values (38) | 1196 | 2.6% |
Value | Count | Frequency (%) | |
1 | 17544 | 38.8% | |
2 | 12505 | 27.7% | |
3 | 5521 | 12.2% | |
4 | 3522 | 7.8% | |
5 | 1764 | 3.9% |
Value | Count | Frequency (%) | |
63 | 1 | < 0.1% | |
58 | 1 | < 0.1% | |
55 | 1 | < 0.1% | |
51 | 1 | < 0.1% | |
50 | 2 | < 0.1% |
pdays
Real number (ℝ)
Distinct count | 559 |
---|---|
Unique (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.19782796222158 |
---|---|
Minimum | -1 |
Maximum | 871 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | -1 |
Q1 | -1 |
median | -1 |
Q3 | -1 |
95-th percentile | 317 |
Maximum | 871 |
Range | 872 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 100.128746 |
---|---|
Coefficient of variation (CV) | 2.490899411 |
Kurtosis | 6.93519521 |
Mean | 40.19782796 |
Median Absolute Deviation (MAD) | 67.60696484 |
Skewness | 2.615715474 |
Sum | 1817384 |
Variance | 10025.76577 |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | -1 |
Q1 | -1 |
median | -1 |
Q3 | -1 |
95-th percentile | 317 |
Maximum | 871 |
Range | 872 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 100.128746 |
---|---|
Coefficient of variation (CV) | 2.490899411 |
Kurtosis | 6.93519521 |
Mean | 40.19782796 |
Median Absolute Deviation (MAD) | 67.60696484 |
Skewness | 2.615715474 |
Sum | 1817384 |
Variance | 10025.76577 |
Value | Count | Frequency (%) | |
-1 | 36954 | 81.7% | |
182 | 167 | 0.4% | |
92 | 147 | 0.3% | |
183 | 126 | 0.3% | |
91 | 126 | 0.3% | |
181 | 117 | 0.3% | |
370 | 99 | 0.2% | |
184 | 85 | 0.2% | |
364 | 77 | 0.2% | |
95 | 74 | 0.2% | |
Other values (549) | 7239 | 16.0% |
Value | Count | Frequency (%) | |
-1 | 36954 | 81.7% | |
1 | 15 | < 0.1% | |
2 | 37 | 0.1% | |
3 | 1 | < 0.1% | |
4 | 2 | < 0.1% |
Value | Count | Frequency (%) | |
871 | 1 | < 0.1% | |
854 | 1 | < 0.1% | |
850 | 1 | < 0.1% | |
842 | 1 | < 0.1% | |
838 | 1 | < 0.1% |
Distinct count | 41 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5803233726305546 |
---|---|
Minimum | 0 |
Maximum | 275 |
Zeros | 36954 |
Zeros (%) | 81.7% |
Memory size | 353.3 KiB |
Value | Count | Frequency (%) | |
-1 | 36954 | 81.7% | |
182 | 167 | 0.4% | |
92 | 147 | 0.3% | |
183 | 126 | 0.3% | |
91 | 126 | 0.3% | |
181 | 117 | 0.3% | |
370 | 99 | 0.2% | |
184 | 85 | 0.2% | |
364 | 77 | 0.2% | |
95 | 74 | 0.2% | |
Other values (549) | 7239 | 16.0% |
Value | Count | Frequency (%) | |
-1 | 36954 | 81.7% | |
1 | 15 | < 0.1% | |
2 | 37 | 0.1% | |
3 | 1 | < 0.1% | |
4 | 2 | < 0.1% |
Value | Count | Frequency (%) | |
871 | 1 | < 0.1% | |
854 | 1 | < 0.1% | |
850 | 1 | < 0.1% | |
842 | 1 | < 0.1% | |
838 | 1 | < 0.1% |
Distinct count | 41 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5803233726305546 |
---|---|
Minimum | 0 |
Maximum | 275 |
Zeros | 36954 |
Zeros (%) | 81.7% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3 |
Maximum | 275 |
Range | 275 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.303441045 |
---|---|
Coefficient of variation (CV) | 3.969237073 |
Kurtosis | 4506.86066 |
Mean | 0.5803233726 |
Median Absolute Deviation (MAD) | 0.9486748761 |
Skewness | 41.84645447 |
Sum | 26237 |
Variance | 5.305840647 |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3 |
Maximum | 275 |
Range | 275 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.303441045 |
---|---|
Coefficient of variation (CV) | 3.969237073 |
Kurtosis | 4506.86066 |
Mean | 0.5803233726 |
Median Absolute Deviation (MAD) | 0.9486748761 |
Skewness | 41.84645447 |
Sum | 26237 |
Variance | 5.305840647 |
Value | Count | Frequency (%) | |
0 | 36954 | 81.7% | |
1 | 2772 | 6.1% | |
2 | 2106 | 4.7% | |
3 | 1142 | 2.5% | |
4 | 714 | 1.6% | |
5 | 459 | 1.0% | |
6 | 277 | 0.6% | |
7 | 205 | 0.5% | |
8 | 129 | 0.3% | |
9 | 92 | 0.2% | |
Other values (31) | 361 | 0.8% |
Value | Count | Frequency (%) | |
0 | 36954 | 81.7% | |
1 | 2772 | 6.1% | |
2 | 2106 | 4.7% | |
3 | 1142 | 2.5% | |
4 | 714 | 1.6% |
Value | Count | Frequency (%) | |
275 | 1 | < 0.1% | |
58 | 1 | < 0.1% | |
55 | 1 | < 0.1% | |
51 | 1 | < 0.1% | |
41 | 1 | < 0.1% |
poutcome
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
unknown | |
---|---|
failure | 4901 |
other | 1840 |
success | 1511 |
Value | Count | Frequency (%) | |
unknown | 36959 | 81.7% | |
failure | 4901 | 10.8% | |
other | 1840 | 4.1% | |
success | 1511 | 3.3% |
Value | Count | Frequency (%) | |
0 | 36954 | 81.7% | |
1 | 2772 | 6.1% | |
2 | 2106 | 4.7% | |
3 | 1142 | 2.5% | |
4 | 714 | 1.6% | |
5 | 459 | 1.0% | |
6 | 277 | 0.6% | |
7 | 205 | 0.5% | |
8 | 129 | 0.3% | |
9 | 92 | 0.2% | |
Other values (31) | 361 | 0.8% |
Value | Count | Frequency (%) | |
0 | 36954 | 81.7% | |
1 | 2772 | 6.1% | |
2 | 2106 | 4.7% | |
3 | 1142 | 2.5% | |
4 | 714 | 1.6% |
Value | Count | Frequency (%) | |
275 | 1 | < 0.1% | |
58 | 1 | < 0.1% | |
55 | 1 | < 0.1% | |
51 | 1 | < 0.1% | |
41 | 1 | < 0.1% |
poutcome
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
unknown | |
---|---|
failure | 4901 |
other | 1840 |
success | 1511 |
Value | Count | Frequency (%) | |
unknown | 36959 | 81.7% | |
failure | 4901 | 10.8% | |
other | 1840 | 4.1% | |
success | 1511 | 3.3% |
Length
Max length | 7 |
---|---|
Mean length | 6.91860388 |
Min length | 5 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 15 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 15 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 15 | 100.0% |
y
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 5289 |
Value | Count | Frequency (%) | |
no | 39922 | 88.3% | |
yes | 5289 | 11.7% |
Length
Max length | 7 |
---|---|
Mean length | 6.91860388 |
Min length | 5 |
Value | Count | Frequency (%) | |
Lowercase_Letter | 15 | 100.0% |
Value | Count | Frequency (%) | |
Latin | 15 | 100.0% |
Value | Count | Frequency (%) | |
ASCII | 15 | 100.0% |
y
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 353.3 KiB |
no | |
---|---|
yes | 5289 |
Value | Count | Frequency (%) | |
no | 39922 | 88.3% | |
yes | 5289 | 11.7% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 58 | management | married | tertiary | no | 2143 | yes | no | unknown | 5 | may | 261 | 1 | -1 | 0 | unknown | no |
1 | 44 | technician | single | secondary | no | 29 | yes | no | unknown | 5 | may | 151 | 1 | -1 | 0 | unknown | no |
2 | 33 | entrepreneur | married | secondary | no | 2 | yes | yes | unknown | 5 | may | 76 | 1 | -1 | 0 | unknown | no |
3 | 47 | blue-collar | married | unknown | no | 1506 | yes | no | unknown | 5 | may | 92 | 1 | -1 | 0 | unknown | no |
4 | 33 | unknown | single | unknown | no | 1 | no | no | unknown | 5 | may | 198 | 1 | -1 | 0 | unknown | no |
5 | 35 | management | married | tertiary | no | 231 | yes | no | unknown | 5 | may | 139 | 1 | -1 | 0 | unknown | no |
6 | 28 | management | single | tertiary | no | 447 | yes | yes | unknown | 5 | may | 217 | 1 | -1 | 0 | unknown | no |
7 | 42 | entrepreneur | divorced | tertiary | yes | 2 | yes | no | unknown | 5 | may | 380 | 1 | -1 | 0 | unknown | no |
8 | 58 | retired | married | primary | no | 121 | yes | no | unknown | 5 | may | 50 | 1 | -1 | 0 | unknown | no |
9 | 43 | technician | single | secondary | no | 593 | yes | no | unknown | 5 | may | 55 | 1 | -1 | 0 | unknown | no |
Last rows
age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
45201 | 53 | management | married | tertiary | no | 583 | no | no | cellular | 17 | nov | 226 | 1 | 184 | 4 | success | yes |
45202 | 34 | admin. | single | secondary | no | 557 | no | no | cellular | 17 | nov | 224 | 1 | -1 | 0 | unknown | yes |
45203 | 23 | student | single | tertiary | no | 113 | no | no | cellular | 17 | nov | 266 | 1 | -1 | 0 | unknown | yes |
45204 | 73 | retired | married | secondary | no | 2850 | no | no | cellular | 17 | nov | 300 | 1 | 40 | 8 | failure | yes |
45205 | 25 | technician | single | secondary | no | 505 | no | yes | cellular | 17 | nov | 386 | 2 | -1 | 0 | unknown | yes |
45206 | 51 | technician | married | tertiary | no | 825 | no | no | cellular | 17 | nov | 977 | 3 | -1 | 0 | unknown | yes |
45207 | 71 | retired | divorced | primary | no | 1729 | no | no | cellular | 17 | nov | 456 | 2 | -1 | 0 | unknown | yes |
45208 | 72 | retired | married | secondary | no | 5715 | no | no | cellular | 17 | nov | 1127 | 5 | 184 | 3 | success | yes |
45209 | 57 | blue-collar | married | secondary | no | 668 | no | no | telephone | 17 | nov | 508 | 4 | -1 | 0 | unknown | no |
45210 | 37 | entrepreneur | married | secondary | no | 2971 | no | no | cellular | 17 | nov | 361 | 2 | 188 | 11 | other | no |