Overview

Dataset info

Number of variables16
Number of observations45726
Missing cells117837 (16.1%)
Duplicate rows0 (0.0%)
Total size in memory5.3 MiB
Average record size in memory121.0 B

Variables types

Numeric6
Categorical5
Boolean1
Date1
URL0
Text (Unique)1
Rejected2
Unsupported0

Warnings

Counties has 44067 (> 99.9%) missing values Missing
GeoLocation has a high cardinality: 17101 distinct values Warning
GeoLocation has 7315 (16.0%) missing values Missing
mass_(g) is highly skewed (γ1 = 76.918) Skewed
recclass has a high cardinality: 466 distinct values Warning
reclat has 6438 (14.1%) zeros Zeros
reclat has 7315 (16.0%) missing values Missing
reclat_city is highly correlated with reclat (ρ = 0.99424) Rejected
reclong has 6214 (13.6%) zeros Zeros
reclong has 7315 (16.0%) missing values Missing
source has constant value "NASA" Rejected
States has 44067 (> 99.9%) missing values Missing

Variables

boolean
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
True
22953
False
22773
ValueCountFrequency (%) 
True 22953 50.2%
 
False 22773 49.8%
 

Counties
Numeric

Distinct count663
Unique (%)< 0.1%
Missing (%)> 99.9%
Missing (n)44067
Infinite (%)0.0%
Infinite (n)0
Mean1353.3
Minimum5
Maximum3210
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum5
5-th percentile30
Q1482
Median1195
Q32113
95-th percentile3109.1
Maximum3210
Range3205
Interquartile range1631

Descriptive statistics

Standard deviation994.09
Coef of variation0.73455
Kurtosis-1.19
Mean1353.3
MAD874.35
Skewness0.23764
Sum2.2452e+06
Variance9.8821e+05
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
78 187 < 0.1%
 
1987 132 < 0.1%
 
8 58 < 0.1%
 
482 24 < 0.1%
 
2357 22 < 0.1%
 
799 20 < 0.1%
 
801 19 < 0.1%
 
3192 17 < 0.1%
 
942 16 < 0.1%
 
480 16 < 0.1%
 
Other values (652) 1148 < 0.1%
 
(Missing) 44067 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
5 10 < 0.1%
 
7 6 < 0.1%
 
8 58 < 0.1%
 
9 1 < 0.1%
 
10 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
3210 1 < 0.1%
 
3200 1 < 0.1%
 
3192 17 < 0.1%
 
3190 1 < 0.1%
 
3188 10 < 0.1%
 

fall
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Found
44609
Fell
 
1117
ValueCountFrequency (%) 
Found 44609 > 99.9%
 
Fell 1117 < 0.1%
 
Max length5
Mean length4.9756
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

GeoLocation
Categorical

Distinct count17101
Unique (%)37.4%
Missing (%)16.0%
Missing (n)7315
(0.0, 0.0)
6214
(-71.5, 35.66667)
 
4761
(-84.0, 168.0)
 
3040
Other values (17097)
24396
(Missing)
7315
ValueCountFrequency (%) 
(0.0, 0.0) 6214 13.6%
 
(-71.5, 35.66667) 4761 10.4%
 
(-84.0, 168.0) 3040 6.6%
 
(-72.0, 26.0) 1505 < 0.1%
 
(-79.68333, 159.75) 657 < 0.1%
 
(-76.71667, 159.66667) 637 < 0.1%
 
(-76.18333, 157.16667) 539 < 0.1%
 
(-79.68333, 155.75) 473 < 0.1%
 
(-84.21667, 160.5) 263 < 0.1%
 
(-86.36667, -70.0) 226 < 0.1%
 
Other values (17090) 20096 43.9%
 
(Missing) 7315 16.0%
 
Max length24
Mean length15.016
Min length3
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

id
Numeric

Distinct count45716
Unique (%)> 99.9%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean26884
Minimum1
Maximum57458
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile2388.8
Q112681
Median24256
Q340654
95-th percentile54891
Maximum57458
Range57457
Interquartile range27972

Descriptive statistics

Standard deviation16863
Coef of variation0.62727
Kurtosis-1.1601
Mean26884
MAD14490
Skewness0.26653
Sum1.2293e+09
Variance2.8438e+08
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
417 2 < 0.1%
 
398 2 < 0.1%
 
1 2 < 0.1%
 
6 2 < 0.1%
 
392 2 < 0.1%
 
370 2 < 0.1%
 
379 2 < 0.1%
 
2 2 < 0.1%
 
390 2 < 0.1%
 
10 2 < 0.1%
 
Other values (45706) 45706 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1 2 < 0.1%
 
2 2 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
6 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
57458 1 < 0.1%
 
57457 1 < 0.1%
 
57456 1 < 0.1%
 
57455 1 < 0.1%
 
57454 1 < 0.1%
 

mass_(g)
Numeric

Distinct count12577
Unique (%)27.5%
Missing (%)< 0.1%
Missing (n)131
Infinite (%)0.0%
Infinite (n)0
Mean13278
Minimum0
Maximum6e+07
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile1.1
Q17.2
Median32.61
Q3202.9
95-th percentile4000
Maximum6e+07
Range6e+07
Interquartile range195.7

Descriptive statistics

Standard deviation5.7493e+05
Coef of variation43.298
Kurtosis6798.4
Mean13278
MAD25113
Skewness76.918
Sum6.0543e+08
Variance3.3054e+11
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
1.3 171 < 0.1%
 
1.2 140 < 0.1%
 
1.4 138 < 0.1%
 
2.1 130 < 0.1%
 
2.4 126 < 0.1%
 
1.6 120 < 0.1%
 
0.5 119 < 0.1%
 
1.1 116 < 0.1%
 
3.8 114 < 0.1%
 
0.7 111 < 0.1%
 
Other values (12566) 44310 > 99.9%
 
(Missing) 131 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 19 < 0.1%
 
0.01 2 < 0.1%
 
0.013 1 < 0.1%
 
0.02 1 < 0.1%
 
0.03 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6e+07 1 < 0.1%
 
5.82e+07 1 < 0.1%
 
5e+07 1 < 0.1%
 
3e+07 1 < 0.1%
 
2.8e+07 1 < 0.1%
 

mixed
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
A
22940
1
22786
ValueCountFrequency (%) 
A 22940 50.2%
 
1 22786 49.8%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

name
Categorical, Unique

First 5 values
Aachen
Aachen copy
Aarhus
Aarhus copy
Abajo
Last 5 values
Österplana 062
Österplana 063
Österplana 064
Łowicz
Święcany

First 5 values

ValueCountFrequency (%) 
Aachen 1 < 0.1%
 
Aachen copy 1 < 0.1%
 
Aarhus 1 < 0.1%
 
Aarhus copy 1 < 0.1%
 
Abajo 1 < 0.1%
 

Last 5 values

ValueCountFrequency (%) 
Święcany 1 < 0.1%
 
Łowicz 1 < 0.1%
 
Österplana 064 1 < 0.1%
 
Österplana 063 1 < 0.1%
 
Österplana 062 1 < 0.1%
 

nametype
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Valid
45651
Relict
 
75
ValueCountFrequency (%) 
Valid 45651 > 99.9%
 
Relict 75 < 0.1%
 
Max length6
Mean length5.0016
Min length5
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

recclass
Categorical

Distinct count466
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
L6
8287
H5
7143
L5
 
4797
Other values (463)
25499
ValueCountFrequency (%) 
L6 8287 18.1%
 
H5 7143 15.6%
 
L5 4797 10.5%
 
H6 4529 9.9%
 
H4 4211 9.2%
 
LL5 2766 6.0%
 
LL6 2043 < 0.1%
 
L4 1253 < 0.1%
 
H4/5 428 < 0.1%
 
CM2 416 < 0.1%
 
Other values (456) 9853 21.5%
 
Max length26
Mean length3.0525
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

reclat
Numeric

Distinct count12739
Unique (%)27.9%
Missing (%)16.0%
Missing (n)7315
Infinite (%)0.0%
Infinite (n)0
Mean-39.107
Minimum-87.367
Maximum81.167
Zeros (%)14.1%
Mini histogram

Quantile statistics

Minimum-87.367
5-th percentile-84.355
Q1-76.714
Median-71.5
Q30
95-th percentile34.494
Maximum81.167
Range168.53
Interquartile range76.714

Descriptive statistics

Standard deviation46.386
Coef of variation-1.1861
Kurtosis-1.4769
Mean-39.107
MAD43.937
Skewness0.49132
Sum-1.5021e+06
Variance2151.7
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
0 6438 14.1%
 
-71.5 4761 10.4%
 
-84 3040 6.6%
 
-72 1506 < 0.1%
 
-79.683 1130 < 0.1%
 
-76.717 680 < 0.1%
 
-76.183 539 < 0.1%
 
-84.217 263 < 0.1%
 
-86.367 226 < 0.1%
 
-86.717 217 < 0.1%
 
Other values (12728) 19611 42.9%
 
(Missing) 7315 16.0%
 

Minimum 5 values

ValueCountFrequency (%) 
-87.367 4 < 0.1%
 
-87.033 3 < 0.1%
 
-86.933 3 < 0.1%
 
-86.717 217 < 0.1%
 
-86.567 17 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
81.167 1 < 0.1%
 
76.533 1 < 0.1%
 
76.133 1 < 0.1%
 
72.883 1 < 0.1%
 
72.683 1 < 0.1%
 

reclat_city
Highly correlated

This variable is highly correlated with reclat and should be ignored for analysis

Correlation0.99424

reclong
Numeric

Distinct count14641
Unique (%)32.0%
Missing (%)16.0%
Missing (n)7315
Infinite (%)0.0%
Infinite (n)0
Mean61.053
Minimum-165.43
Maximum354.47
Zeros (%)13.6%
Mini histogram

Quantile statistics

Minimum-165.43
5-th percentile-90.427
Q10
Median35.667
Q3157.17
95-th percentile168
Maximum354.47
Range519.91
Interquartile range157.17

Descriptive statistics

Standard deviation80.655
Coef of variation1.3211
Kurtosis-0.73139
Mean61.053
MAD67.606
Skewness-0.17438
Sum2.3451e+06
Variance6505.3
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
0 6214 13.6%
 
35.667 4985 10.9%
 
168 3040 6.6%
 
26 1506 < 0.1%
 
159.75 657 < 0.1%
 
159.67 637 < 0.1%
 
157.17 542 < 0.1%
 
155.75 473 < 0.1%
 
160.5 263 < 0.1%
 
-70 228 < 0.1%
 
Other values (14630) 19866 43.4%
 
(Missing) 7315 16.0%
 

Minimum 5 values

ValueCountFrequency (%) 
-165.43 9 < 0.1%
 
-165.12 17 < 0.1%
 
-163.17 1 < 0.1%
 
-162.55 1 < 0.1%
 
-157.87 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
354.47 1 < 0.1%
 
178.2 1 < 0.1%
 
178.08 1 < 0.1%
 
175.73 1 < 0.1%
 
175.13 1 < 0.1%
 

source
Constant

This variable is constant and should be ignored for analysis

Constant valueNASA

States
Numeric

Distinct count46
Unique (%)< 0.1%
Missing (%)> 99.9%
Missing (n)44067
Infinite (%)0.0%
Infinite (n)0
Mean17.338
Minimum1
Maximum51
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile7
Q19
Median15
Q323
95-th percentile39
Maximum51
Range50
Interquartile range14

Descriptive statistics

Standard deviation10.411
Coef of variation0.60048
Kurtosis0.69477
Mean17.338
MAD8.3381
Skewness1.116
Sum28763
Variance108.39
Memory size357.3 KiB
Histogram
ValueCountFrequency (%) 
23 297 < 0.1%
 
8 224 < 0.1%
 
11 222 < 0.1%
 
17 139 < 0.1%
 
7 120 < 0.1%
 
10 95 < 0.1%
 
9 87 < 0.1%
 
19 49 < 0.1%
 
20 40 < 0.1%
 
37 29 < 0.1%
 
Other values (35) 357 < 0.1%
 
(Missing) 44067 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1 7 < 0.1%
 
2 6 < 0.1%
 
3 9 < 0.1%
 
4 2 < 0.1%
 
5 6 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
51 4 < 0.1%
 
50 9 < 0.1%
 
49 5 < 0.1%
 
48 8 < 0.1%
 
47 11 < 0.1%
 

year
Date

Distinct count246
Unique (%)< 0.1%
Missing (%)< 0.1%
Missing (n)312
Infinite (%)0.0%
Infinite (n)0
Minimum1688-01-01 00:00:00
Maximum2101-01-01 00:00:00
Mini histogram
Histogram

Correlations

Pearson matrix
Spearman matrix
Kendall matrix
Phi<sub>k</sub> matrix
Recoded matrix

Missing values

Matrix

Matrix

Count

Bar

Heatmap

Heatmap

Dendrogram

Dendrogram

Sample

First rows

booleanCountiesfallGeoLocationidmass_(g)mixednamenametyperecclassreclatreclat_cityreclongsourceStatesyear
0TrueNaNFell(50.775, 6.08333)121.0AAachenValidL550.7750047.1268896.08333NASANaN1880-01-01
1TrueNaNFell(56.18333, 10.23333)2720.0AAarhusValidH656.1833353.28560110.23333NASANaN1951-01-01
2FalseNaNFell(54.21667, -113.0)6107000.0AAbeeValidEH454.2166755.501254-113.00000NASANaN1952-01-01
3FalseNaNFell(16.88333, -99.9)101914.01AcapulcoValidAcapulcoite16.8833319.987934-99.90000NASANaN1976-01-01
4TrueNaNFell(-33.16667, -64.95)370780.01AchirasValidL6-33.16667-38.389655-64.95000NASANaN1902-01-01
5FalseNaNFell(32.1, 71.8)3794239.01Adhi KotValidEH432.1000030.64010171.80000NASANaN1919-01-01
6TrueNaNFell(44.83333, 95.16667)390910.01Adzhi-Bogdo (stone)ValidLL3-644.8333340.93045395.16667NASANaN1949-01-01
7FalseNaNFell(44.21667, 0.61667)39230000.0AAgenValidH544.2166746.3552920.61667NASANaN1814-01-01
8FalseNaNFell(-31.6, -65.23333)3981620.01AguadaValidL6-31.60000-33.612112-65.23333NASANaN1930-01-01
9TrueNaNFell(-30.86667, -64.55)4171440.01Aguila BlancaValidL-30.86667-29.775441-64.55000NASANaN1920-01-01

Last rows

booleanCountiesfallGeoLocationidmass_(g)mixednamenametyperecclassreclatreclat_cityreclongsourceStatesyear
45716TrueNaNFell(50.775, 6.08333)121.0AAachen copyValidL550.7750047.1268896.08333NASANaN1880-01-01
45717TrueNaNFell(56.18333, 10.23333)2720.0AAarhus copyValidH656.1833353.28560110.23333NASANaN1951-01-01
45718FalseNaNFell(54.21667, -113.0)6107000.0AAbee copyValidEH454.2166755.501254-113.00000NASANaN1952-01-01
45719FalseNaNFell(16.88333, -99.9)101914.01Acapulco copyValidAcapulcoite16.8833319.987934-99.90000NASANaN1976-01-01
45720TrueNaNFell(-33.16667, -64.95)370780.01Achiras copyValidL6-33.16667-38.389655-64.95000NASANaN1902-01-01
45721FalseNaNFell(32.1, 71.8)3794239.01Adhi Kot copyValidEH432.1000030.64010171.80000NASANaN1919-01-01
45722TrueNaNFell(44.83333, 95.16667)390910.01Adzhi-Bogdo (stone) copyValidLL3-644.8333340.93045395.16667NASANaN1949-01-01
45723FalseNaNFell(44.21667, 0.61667)39230000.0AAgen copyValidH544.2166746.3552920.61667NASANaN1814-01-01
45724FalseNaNFell(-31.6, -65.23333)3981620.01Aguada copyValidL6-31.60000-33.612112-65.23333NASANaN1930-01-01
45725TrueNaNFell(-30.86667, -64.55)4171440.01Aguila Blanca copyValidL-30.86667-29.775441-64.55000NASANaN1920-01-01