Overview

Dataset info

Number of variables: 12
Number of observations: 891
Missing cells: 866 (8.1%)
Duplicate rows: 0 (0.0%)
Total size in memory: 83.6 KiB
Average record size in memory: 96.1 B
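
The derived figures in this block can be reproduced from the raw counts. The following is a minimal sketch, assuming the missing-cell percentage is taken over all cells (observations times variables) and the average record size is total memory divided by the number of observations. The variable names are illustrative and not part of the report.

```python
# Values copied from the dataset info block.
n_variables = 12
n_observations = 891
missing_cells = 866
total_size_bytes = 83.6 * 1024  # 83.6 KiB, as reported

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total memory / number of rows.
avg_record_bytes = total_size_bytes / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the reported 8.1% and 96.1 B after rounding.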

Variable types

Numeric: 5
Categorical: 5
Boolean: 1
Date: 0
URL: 0
Text (Unique): 1
Rejected: 0
Unsupported: 0

Warnings

Age has 177 (19.9%) missing values
Cabin has a high cardinality: 148 distinct values
Cabin has 687 (77.1%) missing values
Fare has 15 (1.7%) zeros
Parch has 678 (76.1%) zeros
SibSp has 608 (68.2%) zeros
Ticket has a high cardinality: 681 distinct values
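
Warnings of this kind can be derived from simple per-column counts. The sketch below is one possible way to do it, not the profiling tool's actual implementation: the function name `column_warnings` and the cardinality threshold of 50 are assumptions made for illustration, and missing entries are represented as `None`.

```python
def column_warnings(name, values, high_cardinality=50):
    """Build report-style warning strings for one column.

    `values` is a list in which None marks a missing entry.
    `high_cardinality` is a hypothetical threshold, not taken from the report.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    messages = []

    n_missing = n - len(present)
    if n_missing:
        messages.append(
            f"{name} has {n_missing} ({100 * n_missing / n:.1f}%) missing values"
        )

    # Only numeric entries can count as zeros.
    n_zeros = sum(1 for v in present if isinstance(v, (int, float)) and v == 0)
    if n_zeros:
        messages.append(f"{name} has {n_zeros} ({100 * n_zeros / n:.1f}%) zeros")

    # Flag text columns with many distinct values.
    is_text = bool(present) and all(isinstance(v, str) for v in present)
    n_distinct = len(set(present))
    if is_text and n_distinct > high_cardinality:
        messages.append(
            f"{name} has a high cardinality: {n_distinct} distinct values"
        )
    return messages


# Toy column (not the real Parch data): one missing entry and two zeros.
print(column_warnings("Parch", [0, 0, 1, None]))
```

Percentages are computed against the total number of rows, including missing ones, which matches how the warnings above relate to the 891 observations (for example 177 / 891 = 19.9%).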

Variables

Age
Numeric

Distinct count: 89
Unique (%): 10.0%
Missing (%): 19.9%
Missing (n): 177
Infinite (%): 0.0%
Infinite (n): 0
Mean: 29.699
Minimum: 0.42
Maximum: 80
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 0.42
5-th percentile: 4
Q1: 20.125
Median: 28
Q3: 38
95-th percentile: 56
Maximum: 80
Range: 79.58
Interquartile range: 17.875

Descriptive statistics

Standard deviation: 14.526
Coef of variation: 0.48912
Kurtosis: 0.17827
Mean: 29.699
MAD: 11.323
Skewness: 0.38911
Sum: 21205
Variance: 211.02
Memory size: 7.0 KiB
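
Several of the Age statistics are functions of the others, which gives a quick consistency check. The sketch below assumes the usual definitions (range = maximum minus minimum, interquartile range = Q3 minus Q1, coefficient of variation = standard deviation over mean, variance = standard deviation squared, sum = mean times the number of non-missing values). Because the inputs are the rounded figures printed in the report, small differences in the last digits are expected.

```python
# Rounded values copied from the Age section of the report.
n_observations, n_missing = 891, 177
minimum, q1, q3, maximum = 0.42, 20.125, 38, 80
mean, std = 29.699, 14.526

value_range = maximum - minimum               # reported: 79.58
iqr = q3 - q1                                 # reported: 17.875
cv = std / mean                               # reported: 0.48912
variance = std ** 2                           # reported: 211.02
total = mean * (n_observations - n_missing)   # reported: 21205

print(f"range={value_range:.2f} iqr={iqr:.3f} cv={cv:.4f}")
print(f"variance={variance:.1f} sum={total:.0f}")
```

Note that the sum only matches when the mean is multiplied by the 714 non-missing ages, which indicates that missing values are excluded from the numeric statistics.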
Histogram
Value | Count | Frequency (%)
24 | 30 | 3.4%
22 | 27 | 3.0%
18 | 26 | 2.9%
28 | 25 | 2.8%
19 | 25 | 2.8%
30 | 25 | 2.8%
21 | 24 | 2.7%
25 | 23 | 2.6%
36 | 22 | 2.5%
29 | 20 | 2.2%
Other values (78) | 467 | 52.4%
(Missing) | 177 | 19.9%
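
A frequency table with this layout (most common values, an "Other values" row, and a "(Missing)" row) can be built with a counter. This is a minimal sketch under the assumption that frequencies are expressed relative to all rows, missing ones included, which is consistent with 467 / 891 = 52.4% and 177 / 891 = 19.9% above. The function name `frequency_table` and the use of `None` for missing entries are illustrative choices.

```python
from collections import Counter


def frequency_table(values, top=10):
    """Return (label, count, percent) rows for the `top` most common values."""
    n = len(values)
    present = [v for v in values if v is not None]
    counts = Counter(present).most_common()

    rows = [(str(value), count) for value, count in counts[:top]]

    # Everything beyond the top entries is folded into a single row.
    rest = counts[top:]
    if rest:
        rows.append((f"Other values ({len(rest)})", sum(c for _, c in rest)))

    n_missing = n - len(present)
    if n_missing:
        rows.append(("(Missing)", n_missing))

    # Percentages are relative to all rows, including missing ones.
    return [(label, count, f"{100 * count / n:.1f}%") for label, count in rows]


# Toy data, not the real Age column.
for row in frequency_table([24, 24, 22, 18, None], top=2):
    print(row)
```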
 

Minimum 5 values

Value | Count | Frequency (%)
0.42 | 1 | 0.1%
0.67 | 1 | 0.1%
0.75 | 2 | 0.2%
0.83 | 2 | 0.2%
0.92 | 1 | 0.1%
 

Maximum 5 values

Value | Count | Frequency (%)
80 | 1 | 0.1%
74 | 1 | 0.1%
71 | 2 | 0.2%
70.5 | 1 | 0.1%
70 | 2 | 0.2%
 

Cabin
Categorical

Distinct count: 148
Unique (%): 16.6%
Missing (%): 77.1%
Missing (n): 687
Value | Count | Frequency (%)
B96 B98 | 4 | 0.4%
G6 | 4 | 0.4%
C23 C25 C27 | 4 | 0.4%
C22 C26 | 3 | 0.3%
F33 | 3 | 0.3%
E101 | 3 | 0.3%
D | 3 | 0.3%
F2 | 3 | 0.3%
E44 | 2 | 0.2%
F G73 | 2 | 0.2%
Other values (137) | 173 | 19.4%
(Missing) | 687 | 77.1%
 
Max length: 15
Mean length: 3.1347
Min length: 1
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

Embarked
Categorical

Distinct count: 4
Unique (%): 0.4%
Missing (%): 0.2%
Missing (n): 2
Value | Count | Frequency (%)
S | 644 | 72.3%
C | 168 | 18.9%
Q | 77 | 8.6%
(Missing) | 2 | 0.2%
 
Max length: 3
Mean length: 1.0045
Min length: 1
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False
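
The maximum length of 3 looks odd for a column whose values are the single letters S, C and Q. The numbers are consistent with the two missing entries having been converted to the three-character string "nan" before lengths were measured. This is an inference from the arithmetic, not something the report states; the sketch below simply checks it.

```python
# Counts copied from the Embarked frequency table.
counts = {"S": 644, "C": 168, "Q": 77}
n_missing = 2

# One length per row for the observed values.
lengths = [len(value) for value, count in counts.items() for _ in range(count)]

# Assumption: each missing entry was stringified as "nan" (3 characters).
lengths += [len("nan")] * n_missing

mean_length = sum(lengths) / len(lengths)
max_length = max(lengths)

print(f"Mean length: {mean_length:.4f}")
print(f"Max length: {max_length}")
```

Under that assumption the mean length is 895 / 891, which rounds to the reported 1.0045, and the maximum is 3.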

Fare
Numeric

Distinct count: 248
Unique (%): 27.8%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 32.204
Minimum: 0
Maximum: 512.33
Zeros (%): 1.7%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 7.225
Q1: 7.9104
Median: 14.454
Q3: 31
95-th percentile: 112.08
Maximum: 512.33
Range: 512.33
Interquartile range: 23.09

Descriptive statistics

Standard deviation: 49.693
Coef of variation: 1.5431
Kurtosis: 33.398
Mean: 32.204
MAD: 28.164
Skewness: 4.7873
Sum: 28694
Variance: 2469.4
Memory size: 7.0 KiB
Histogram
Value | Count | Frequency (%)
8.05 | 43 | 4.8%
13 | 42 | 4.7%
7.8958 | 38 | 4.3%
7.75 | 34 | 3.8%
26 | 31 | 3.5%
10.5 | 24 | 2.7%
7.925 | 18 | 2.0%
7.775 | 16 | 1.8%
26.55 | 15 | 1.7%
0 | 15 | 1.7%
Other values (238) | 615 | 69.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 15 | 1.7%
4.0125 | 1 | 0.1%
5 | 1 | 0.1%
6.2375 | 1 | 0.1%
6.4375 | 1 | 0.1%

Maximum 5 values

Value | Count | Frequency (%)
512.33 | 3 | 0.3%
263 | 4 | 0.4%
262.38 | 2 | 0.2%
247.52 | 2 | 0.2%
227.53 | 4 | 0.4%
 

Name
Categorical, Unique

First 5 values
Abbing, Mr. Anthony
Abbott, Mr. Rossmore Edward
Abbott, Mrs. Stanton (Rosa Hunt)
Abelson, Mr. Samuel
Abelson, Mrs. Samuel (Hannah Wizosky)
Last 5 values
de Mulder, Mr. Theodore
de Pelsmaeker, Mr. Alfons
del Carlo, Mr. Sebastiano
van Billiard, Mr. Austin Blyler
van Melkebeke, Mr. Philemon

Parch
Numeric

Distinct count: 7
Unique (%): 0.8%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.38159
Minimum: 0
Maximum: 6
Zeros (%): 76.1%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 2
Maximum: 6
Range: 6
Interquartile range: 0

Descriptive statistics

Standard deviation: 0.80606
Coef of variation: 2.1123
Kurtosis: 9.7781
Mean: 0.38159
MAD: 0.58074
Skewness: 2.7491
Sum: 340
Variance: 0.64973
Memory size: 7.0 KiB
Histogram
Value | Count | Frequency (%)
0 | 678 | 76.1%
1 | 118 | 13.2%
2 | 80 | 9.0%
5 | 5 | 0.6%
3 | 5 | 0.6%
4 | 4 | 0.4%
6 | 1 | 0.1%

Minimum 5 values

Value | Count | Frequency (%)
0 | 678 | 76.1%
1 | 118 | 13.2%
2 | 80 | 9.0%
3 | 5 | 0.6%
4 | 4 | 0.4%

Maximum 5 values

Value | Count | Frequency (%)
6 | 1 | 0.1%
5 | 5 | 0.6%
4 | 4 | 0.4%
3 | 5 | 0.6%
2 | 80 | 9.0%
 

PassengerId
Numeric

Distinct count: 891
Unique (%): 100.0%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 446
Minimum: 1
Maximum: 891
Zeros (%): 0.0%
Mini histogram

Quantile statistics

Minimum: 1
5-th percentile: 45.5
Q1: 223.5
Median: 446
Q3: 668.5
95-th percentile: 846.5
Maximum: 891
Range: 890
Interquartile range: 445

Descriptive statistics

Standard deviation: 257.35
Coef of variation: 0.57703
Kurtosis: -1.2
Mean: 446
MAD: 222.75
Skewness: 0
Sum: 3.9739e+05
Variance: 66231
Memory size: 7.0 KiB
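
These figures are what one would obtain for the consecutive integers 1 to 891, which suggests that PassengerId is a plain row identifier rather than a feature. The sketch below checks this with the standard library, assuming the reported variance is the sample variance and that MAD denotes the mean absolute deviation from the mean; both are assumptions about the tool's conventions.

```python
import statistics

# Hypothesis: the identifiers are exactly 1, 2, ..., 891.
ids = list(range(1, 892))

mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance (n - 1 denominator)
std = statistics.stdev(ids)
mad = sum(abs(i - mean) for i in ids) / len(ids)  # mean absolute deviation
total = sum(ids)

print(f"mean={mean} variance={variance} std={std:.2f}")
print(f"mad={mad:.2f} sum={total}")
```

The results reproduce the reported mean, variance, standard deviation, MAD and sum (397386, shown as 3.9739e+05 in the report).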
Histogram
Value | Count | Frequency (%)
891 | 1 | 0.1%
293 | 1 | 0.1%
304 | 1 | 0.1%
303 | 1 | 0.1%
302 | 1 | 0.1%
301 | 1 | 0.1%
300 | 1 | 0.1%
299 | 1 | 0.1%
298 | 1 | 0.1%
297 | 1 | 0.1%
Other values (881) | 881 | 98.9%

Minimum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%

Maximum 5 values

Value | Count | Frequency (%)
891 | 1 | 0.1%
890 | 1 | 0.1%
889 | 1 | 0.1%
888 | 1 | 0.1%
887 | 1 | 0.1%
 

Pclass
Categorical

Distinct count: 3
Unique (%): 0.3%
Missing (%): 0.0%
Missing (n): 0
Value | Count | Frequency (%)
3 | 491 | 55.1%
1 | 216 | 24.2%
2 | 184 | 20.7%
 
Max length: 1
Mean length: 1
Min length: 1
Contains chars: False
Contains digits: True
Contains spaces: False
Contains non-words: False

Sex
Categorical

Distinct count: 2
Unique (%): 0.2%
Missing (%): 0.0%
Missing (n): 0
Value | Count | Frequency (%)
male | 577 | 64.8%
female | 314 | 35.2%
 
Max length: 6
Mean length: 4.7048
Min length: 4
Contains chars: True
Contains digits: False
Contains spaces: False
Contains non-words: False

SibSp
Numeric

Distinct count: 7
Unique (%): 0.8%
Missing (%): 0.0%
Missing (n): 0
Infinite (%): 0.0%
Infinite (n): 0
Mean: 0.52301
Minimum: 0
Maximum: 8
Zeros (%): 68.2%
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 1
95-th percentile: 3
Maximum: 8
Range: 8
Interquartile range: 1

Descriptive statistics

Standard deviation: 1.1027
Coef of variation: 2.1085
Kurtosis: 17.88
Mean: 0.52301
MAD: 0.71378
Skewness: 3.6954
Sum: 466
Variance: 1.216
Memory size: 7.0 KiB
Histogram
Value | Count | Frequency (%)
0 | 608 | 68.2%
1 | 209 | 23.5%
2 | 28 | 3.1%
4 | 18 | 2.0%
3 | 16 | 1.8%
8 | 7 | 0.8%
5 | 5 | 0.6%

Minimum 5 values

Value | Count | Frequency (%)
0 | 608 | 68.2%
1 | 209 | 23.5%
2 | 28 | 3.1%
3 | 16 | 1.8%
4 | 18 | 2.0%

Maximum 5 values

Value | Count | Frequency (%)
8 | 7 | 0.8%
5 | 5 | 0.6%
4 | 18 | 2.0%
3 | 16 | 1.8%
2 | 28 | 3.1%
 

Survived
Boolean

Distinct count: 2
Unique (%): 0.2%
Missing (%): 0.0%
Missing (n): 0
Value | Count | Frequency (%)
0 | 549 | 61.6%
1 | 342 | 38.4%
 

Ticket
Categorical

Distinct count: 681
Unique (%): 76.4%
Missing (%): 0.0%
Missing (n): 0
Value | Count | Frequency (%)
CA. 2343 | 7 | 0.8%
1601 | 7 | 0.8%
347082 | 7 | 0.8%
347088 | 6 | 0.7%
CA 2144 | 6 | 0.7%
3101295 | 6 | 0.7%
382652 | 5 | 0.6%
S.O.C. 14879 | 5 | 0.6%
LINE | 4 | 0.4%
17421 | 4 | 0.4%
Other values (671) | 834 | 93.6%
 
Max length: 18
Mean length: 6.7508
Min length: 3
Contains chars: True
Contains digits: True
Contains spaces: True
Contains non-words: True

Correlations

Pearson matrix
Spearman matrix
Kendall matrix
Phik (φk) matrix
Recoded matrix
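
The matrices themselves did not survive as text. As a reminder of what the first two measure, here is a minimal standard-library sketch of the Pearson coefficient and of the Spearman coefficient computed as Pearson on average ranks. The data is made up for illustration and is not taken from the Titanic columns; the function names are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson applied to the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up monotonic but non-linear relationship.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
print(f"Pearson:  {pearson(x, y):.4f}")
print(f"Spearman: {spearman(x, y):.4f}")
```

On a monotonic but non-linear relationship like this one, Spearman is 1 while Pearson stays below 1, which is why skewed columns such as Fare can look quite different in the two matrices.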

Missing values

Matrix
Count (bar chart)
Heatmap
Dendrogram
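
The plots are not available as text, but the per-column missing counts reported in the variable sections carry the same information as the Count view. A small sketch, using only figures from this report, confirms that they add up to the overview total and shows each column's share of rows affected.

```python
n_observations = 891

# Missing (n) values copied from the variable sections; all other
# columns report 0 missing values.
missing_by_column = {"Age": 177, "Cabin": 687, "Embarked": 2}

total_missing = sum(missing_by_column.values())
shares = {
    column: round(100 * count / n_observations, 1)
    for column, count in missing_by_column.items()
}

print(f"Total missing cells: {total_missing}")
for column, share in shares.items():
    print(f"{column}: {share}% of rows")
```

The total of 866 matches the "Missing cells" figure in the dataset overview.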

Sample

First rows

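Index | Age | Cabin | Embarked | Fare | Name | Parch | PassengerId | Pclass | Sex | SibSp | Survived | Ticket
0 | 22.0 | NaN | S | 7.2500 | Braund, Mr. Owen Harris | 0 | 1 | 3 | male | 1 | 0 | A/5 21171
1 | 38.0 | C85 | C | 71.2833 | Cumings, Mrs. John Bradley (Florence Briggs Th... | 0 | 2 | 1 | female | 1 | 1 | PC 17599
2 | 26.0 | NaN | S | 7.9250 | Heikkinen, Miss. Laina | 0 | 3 | 3 | female | 0 | 1 | STON/O2. 3101282
3 | 35.0 | C123 | S | 53.1000 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | 0 | 4 | 1 | female | 1 | 1 | 113803
4 | 35.0 | NaN | S | 8.0500 | Allen, Mr. William Henry | 0 | 5 | 3 | male | 0 | 0 | 373450
5 | NaN | NaN | Q | 8.4583 | Moran, Mr. James | 0 | 6 | 3 | male | 0 | 0 | 330877
6 | 54.0 | E46 | S | 51.8625 | McCarthy, Mr. Timothy J | 0 | 7 | 1 | male | 0 | 0 | 17463
7 | 2.0 | NaN | S | 21.0750 | Palsson, Master. Gosta Leonard | 1 | 8 | 3 | male | 3 | 0 | 349909
8 | 27.0 | NaN | S | 11.1333 | Johnson, Mrs. Oscar W (Elisabeth Vilhelmina Berg) | 2 | 9 | 3 | female | 0 | 1 | 347742
9 | 14.0 | NaN | C | 30.0708 | Nasser, Mrs. Nicholas (Adele Achem) | 0 | 10 | 2 | female | 1 | 1 | 237736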

Last rows

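Index | Age | Cabin | Embarked | Fare | Name | Parch | PassengerId | Pclass | Sex | SibSp | Survived | Ticket
881 | 33.0 | NaN | S | 7.8958 | Markun, Mr. Johann | 0 | 882 | 3 | male | 0 | 0 | 349257
882 | 22.0 | NaN | S | 10.5167 | Dahlberg, Miss. Gerda Ulrika | 0 | 883 | 3 | female | 0 | 0 | 7552
883 | 28.0 | NaN | S | 10.5000 | Banfield, Mr. Frederick James | 0 | 884 | 2 | male | 0 | 0 | C.A./SOTON 34068
884 | 25.0 | NaN | S | 7.0500 | Sutehall, Mr. Henry Jr | 0 | 885 | 3 | male | 0 | 0 | SOTON/OQ 392076
885 | 39.0 | NaN | Q | 29.1250 | Rice, Mrs. William (Margaret Norton) | 5 | 886 | 3 | female | 0 | 0 | 382652
886 | 27.0 | NaN | S | 13.0000 | Montvila, Rev. Juozas | 0 | 887 | 2 | male | 0 | 0 | 211536
887 | 19.0 | B42 | S | 30.0000 | Graham, Miss. Margaret Edith | 0 | 888 | 1 | female | 0 | 1 | 112053
888 | NaN | NaN | S | 23.4500 | Johnston, Miss. Catherine Helen "Carrie" | 2 | 889 | 3 | female | 1 | 0 | W./C. 6607
889 | 26.0 | C148 | C | 30.0000 | Behr, Mr. Karl Howell | 0 | 890 | 1 | male | 0 | 1 | 111369
890 | 32.0 | NaN | Q | 7.7500 | Dooley, Mr. Patrick | 0 | 891 | 3 | male | 0 | 0 | 370376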