Dataset info
Number of variables | 14 |
---|---|
Number of observations | 45726 |
Missing cells | 29703 (4.6%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 4.6 MiB |
Average record size in memory | 105.0 B |
Variables types
Numeric | 4 |
---|---|
Categorical | 5 |
Boolean | 1 |
Date | 1 |
URL | 0 |
Text (Unique) | 1 |
Rejected | 2 |
Unsupported | 0 |
Warnings
GeoLocation has a high cardinality: 17101 distinct values | Warning |
GeoLocation has 7315 (16.0%) missing values | Missing |
mass_(g) is highly skewed (γ1 = 76.91847245) | Skewed |
recclass has a high cardinality: 466 distinct values | Warning |
reclat has 6438 (14.1%) zeros | Zeros |
reclat has 7315 (16.0%) missing values | Missing |
reclat_city is highly correlated with reclat (ρ = 0.9942518712) | Rejected |
reclong has 6214 (13.6%) zeros | Zeros |
reclong has 7315 (16.0%) missing values | Missing |
source has constant value "NASA" | Rejected |
boolean
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
True | |
---|---|
False |
Value | Count | Frequency (%) | |
True | 23002 | 50.3% | |
False | 22724 | 49.7% |
fall
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Found | |
---|---|
Fell | 1117 |
Value | Count | Frequency (%) | |
Found | 44609 | 97.6% | |
Fell | 1117 | 2.4% |
Max length | 5 |
---|---|
Mean length | 4.975571885 |
Min length | 4 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
GeoLocation
Categorical
Distinct count | 17101 |
---|---|
Unique (%) | 37.4% |
Missing (%) | 16.0% |
Missing (n) | 7315 |
(0.0, 0.0) | |
---|---|
(-71.5, 35.66667) | 4761 |
(-84.0, 168.0) | 3040 |
Other values (17097) | |
(Missing) |
Value | Count | Frequency (%) | |
(0.0, 0.0) | 6214 | 13.6% | |
(-71.5, 35.66667) | 4761 | 10.4% | |
(-84.0, 168.0) | 3040 | 6.6% | |
(-72.0, 26.0) | 1505 | 3.3% | |
(-79.68333, 159.75) | 657 | 1.4% | |
(-76.71667, 159.66667) | 637 | 1.4% | |
(-76.18333, 157.16667) | 539 | 1.2% | |
(-79.68333, 155.75) | 473 | 1.0% | |
(-84.21667, 160.5) | 263 | 0.6% | |
(-86.36667, -70.0) | 226 | 0.5% | |
Other values (17090) | 20096 | 43.9% | |
(Missing) | 7315 | 16.0% |
Max length | 24 |
---|---|
Mean length | 15.01640205 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
id
Numeric
Distinct count | 45716 |
---|---|
Unique (%) | > 99.9% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 26883.9062 |
---|---|
Minimum | 1 |
Maximum | 57458 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2388.75 |
Q1 | 12681.25 |
Median | 24256.5 |
Q3 | 40653.5 |
95-th percentile | 54890.75 |
Maximum | 57458 |
Range | 57457 |
Interquartile range | 27972.25 |
Descriptive statistics
Standard deviation | 16863.44557 |
---|---|
Coef of variation | 0.6272691713 |
Kurtosis | -1.160130804 |
Mean | 26883.9062 |
MAD | 14489.93531 |
Skewness | 0.2665300704 |
Sum | 1229293495 |
Variance | 284375796.4 |
Memory size | 357.4 KiB |
Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[1.00000e+00 1.16250e+03 1.24350e+03 2.35650e+03 2.43150e+03 ... 5.49015e+04 5.49255e+04 5.72245e+04 5.72885e+04 5.74580e+04], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
417 | 2 | < 0.1% | |
398 | 2 | < 0.1% | |
1 | 2 | < 0.1% | |
6 | 2 | < 0.1% | |
392 | 2 | < 0.1% | |
370 | 2 | < 0.1% | |
379 | 2 | < 0.1% | |
2 | 2 | < 0.1% | |
390 | 2 | < 0.1% | |
10 | 2 | < 0.1% | |
Other values (45706) | 45706 | > 99.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 2 | < 0.1% | |
2 | 2 | < 0.1% | |
4 | 1 | < 0.1% | |
5 | 1 | < 0.1% | |
6 | 2 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
57458 | 1 | < 0.1% | |
57457 | 1 | < 0.1% | |
57456 | 1 | < 0.1% | |
57455 | 1 | < 0.1% | |
57454 | 1 | < 0.1% |
mass_(g)
Numeric
Distinct count | 12577 |
---|---|
Unique (%) | 27.5% |
Missing (%) | 0.3% |
Missing (n) | 131 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 13278.42646 |
---|---|
Minimum | 0 |
Maximum | 60000000 |
Zeros (%) | < 0.1% |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.1 |
Q1 | 7.2 |
Median | 32.61 |
Q3 | 202.9 |
95-th percentile | 4000 |
Maximum | 60000000 |
Range | 60000000 |
Interquartile range | 195.7 |
Descriptive statistics
Standard deviation | 574926.0121 |
---|---|
Coef of variation | 43.2977517 |
Kurtosis | 6798.398388 |
Mean | 13278.42646 |
MAD | 25112.89201 |
Skewness | 76.91847245 |
Sum | 605429854.6 |
Variance | 3.305399193e+11 |
Memory size | 357.4 KiB |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) | |
1.3 | 171 | 0.4% | |
1.2 | 140 | 0.3% | |
1.4 | 138 | 0.3% | |
2.1 | 130 | 0.3% | |
2.4 | 126 | 0.3% | |
1.6 | 120 | 0.3% | |
0.5 | 119 | 0.3% | |
1.1 | 116 | 0.3% | |
3.8 | 114 | 0.2% | |
0.7 | 111 | 0.2% | |
Other values (12566) | 44310 | 96.9% | |
(Missing) | 131 | 0.3% |
Minimum 5 values
Value | Count | Frequency (%) | |
0 | 19 | < 0.1% | |
0.01 | 2 | < 0.1% | |
0.013 | 1 | < 0.1% | |
0.02 | 1 | < 0.1% | |
0.03 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
60000000 | 1 | < 0.1% | |
58200000 | 1 | < 0.1% | |
50000000 | 1 | < 0.1% | |
30000000 | 1 | < 0.1% | |
28000000 | 1 | < 0.1% |
mixed
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
1 | |
---|---|
A |
Value | Count | Frequency (%) | |
1 | 22935 | 50.2% | |
A | 22791 | 49.8% |
Max length | 1 |
---|---|
Mean length | 1 |
Min length | 1 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | False |
name
Categorical, Unique
First 5 values |
---|
Aachen |
Aachen copy |
Aarhus |
Aarhus copy |
Abajo |
Last 5 values |
---|
Österplana 062 |
Österplana 063 |
Österplana 064 |
Łowicz |
Święcany |
First 5 values
Value | Count | Frequency (%) | |
Aachen | 1 | < 0.1% | |
Aachen copy | 1 | < 0.1% | |
Aarhus | 1 | < 0.1% | |
Aarhus copy | 1 | < 0.1% | |
Abajo | 1 | < 0.1% |
Last 5 values
Value | Count | Frequency (%) | |
Święcany | 1 | < 0.1% | |
Łowicz | 1 | < 0.1% | |
Österplana 064 | 1 | < 0.1% | |
Österplana 063 | 1 | < 0.1% | |
Österplana 062 | 1 | < 0.1% |
nametype
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Valid | |
---|---|
Relict | 75 |
Value | Count | Frequency (%) | |
Valid | 45651 | 99.8% | |
Relict | 75 | 0.2% |
Max length | 6 |
---|---|
Mean length | 5.001640205 |
Min length | 5 |
Contains chars | True |
Contains digits | False |
Contains spaces | False |
Contains non-words | False |
recclass
Categorical
Distinct count | 466 |
---|---|
Unique (%) | 1.0% |
Missing (%) | 0.0% |
Missing (n) | 0 |
L6 | |
---|---|
H5 | |
L5 | 4797 |
Other values (463) |
Value | Count | Frequency (%) | |
L6 | 8287 | 18.1% | |
H5 | 7143 | 15.6% | |
L5 | 4797 | 10.5% | |
H6 | 4529 | 9.9% | |
H4 | 4211 | 9.2% | |
LL5 | 2766 | 6.0% | |
LL6 | 2043 | 4.5% | |
L4 | 1253 | 2.7% | |
H4/5 | 428 | 0.9% | |
CM2 | 416 | 0.9% | |
Other values (456) | 9853 | 21.5% |
Max length | 26 |
---|---|
Mean length | 3.052530289 |
Min length | 1 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
reclat
Numeric
Distinct count | 12739 |
---|---|
Unique (%) | 27.9% |
Missing (%) | 16.0% |
Missing (n) | 7315 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | -39.10709514 |
---|---|
Minimum | -87.36667 |
Maximum | 81.16667 |
Zeros (%) | 14.1% |
Quantile statistics
Minimum | -87.36667 |
---|---|
5-th percentile | -84.35476 |
Q1 | -76.71377 |
Median | -71.5 |
Q3 | 0 |
95-th percentile | 34.494325 |
Maximum | 81.16667 |
Range | 168.53334 |
Interquartile range | 76.71377 |
Descriptive statistics
Standard deviation | 46.38601095 |
---|---|
Coef of variation | -1.186127755 |
Kurtosis | -1.476865084 |
Mean | -39.10709514 |
MAD | 43.93747025 |
Skewness | 0.4913157316 |
Sum | -1502142.632 |
Variance | 2151.662012 |
Memory size | 357.4 KiB |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) | |
0 | 6438 | 14.1% | |
-71.5 | 4761 | 10.4% | |
-84 | 3040 | 6.6% | |
-72 | 1506 | 3.3% | |
-79.68333 | 1130 | 2.5% | |
-76.71667 | 680 | 1.5% | |
-76.18333 | 539 | 1.2% | |
-84.21667 | 263 | 0.6% | |
-86.36667 | 226 | 0.5% | |
-86.71667 | 217 | 0.5% | |
Other values (12728) | 19611 | 42.9% | |
(Missing) | 7315 | 16.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
-87.36667 | 4 | < 0.1% | |
-87.03333 | 3 | < 0.1% | |
-86.93333 | 3 | < 0.1% | |
-86.71667 | 217 | 0.5% | |
-86.56667 | 17 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
81.16667 | 1 | < 0.1% | |
76.53333 | 1 | < 0.1% | |
76.13333 | 1 | < 0.1% | |
72.88333 | 1 | < 0.1% | |
72.68333 | 1 | < 0.1% |
reclat_city
Highly correlated
This variable is highly correlated with reclat
and should be ignored for analysis
Correlation | 0.9942518712 |
---|
reclong
Numeric
Distinct count | 14641 |
---|---|
Unique (%) | 32.0% |
Missing (%) | 16.0% |
Missing (n) | 7315 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 61.05259359 |
---|---|
Minimum | -165.43333 |
Maximum | 354.47333 |
Zeros (%) | 13.6% |
Quantile statistics
Minimum | -165.43333 |
---|---|
5-th percentile | -90.427 |
Q1 | 0 |
Median | 35.66667 |
Q3 | 157.16667 |
95-th percentile | 168 |
Maximum | 354.47333 |
Range | 519.90666 |
Interquartile range | 157.16667 |
Descriptive statistics
Standard deviation | 80.65525774 |
---|---|
Coef of variation | 1.321078319 |
Kurtosis | -0.7313935567 |
Mean | 61.05259359 |
MAD | 67.60562132 |
Skewness | -0.1743813291 |
Sum | 2345091.172 |
Variance | 6505.2706 |
Memory size | 357.4 KiB |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) | |
0 | 6214 | 13.6% | |
35.66667 | 4985 | 10.9% | |
168 | 3040 | 6.6% | |
26 | 1506 | 3.3% | |
159.75 | 657 | 1.4% | |
159.66667 | 637 | 1.4% | |
157.16667 | 542 | 1.2% | |
155.75 | 473 | 1.0% | |
160.5 | 263 | 0.6% | |
-70 | 228 | 0.5% | |
Other values (14630) | 19866 | 43.4% | |
(Missing) | 7315 | 16.0% |
Minimum 5 values
Value | Count | Frequency (%) | |
-165.43333 | 9 | < 0.1% | |
-165.11667 | 17 | < 0.1% | |
-163.16667 | 1 | < 0.1% | |
-162.55 | 1 | < 0.1% | |
-157.86667 | 1 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
354.47333 | 1 | < 0.1% | |
178.2 | 1 | < 0.1% | |
178.08333 | 1 | < 0.1% | |
175.73028 | 1 | < 0.1% | |
175.13333 | 1 | < 0.1% |
source
Constant
This variable is constant and should be ignored for analysis
Constant value | NASA |
---|
year
Date
Distinct count | 246 |
---|---|
Unique (%) | 0.5% |
Missing (%) | 0.7% |
Missing (n) | 312 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Minimum | 1688-01-01 00:00:00 |
---|---|
Maximum | 2101-01-01 00:00:00 |
Histogram of 'year' (bins=N)
First rows
boolean | fall | GeoLocation | id | mass_(g) | mixed | name | nametype | recclass | reclat | reclat_city | reclong | source | year | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | True | Fell | (50.775, 6.08333) | 1 | 21.0 | 1 | Aachen | Valid | L5 | 50.77500 | 42.564600 | 6.08333 | NASA | 1880-01-01 |
1 | False | Fell | (56.18333, 10.23333) | 2 | 720.0 | A | Aarhus | Valid | H6 | 56.18333 | 56.624204 | 10.23333 | NASA | 1951-01-01 |
2 | False | Fell | (54.21667, -113.0) | 6 | 107000.0 | 1 | Abee | Valid | EH4 | 54.21667 | 59.830879 | -113.00000 | NASA | 1952-01-01 |
3 | True | Fell | (16.88333, -99.9) | 10 | 1914.0 | A | Acapulco | Valid | Acapulcoite | 16.88333 | 17.463788 | -99.90000 | NASA | 1976-01-01 |
4 | False | Fell | (-33.16667, -64.95) | 370 | 780.0 | 1 | Achiras | Valid | L6 | -33.16667 | -41.940615 | -64.95000 | NASA | 1902-01-01 |
5 | False | Fell | (32.1, 71.8) | 379 | 4239.0 | A | Adhi Kot | Valid | EH4 | 32.10000 | 27.765932 | 71.80000 | NASA | 1919-01-01 |
6 | False | Fell | (44.83333, 95.16667) | 390 | 910.0 | A | Adzhi-Bogdo (stone) | Valid | LL3-6 | 44.83333 | 45.180523 | 95.16667 | NASA | 1949-01-01 |
7 | False | Fell | (44.21667, 0.61667) | 392 | 30000.0 | A | Agen | Valid | H5 | 44.21667 | 43.054335 | 0.61667 | NASA | 1814-01-01 |
8 | False | Fell | (-31.6, -65.23333) | 398 | 1620.0 | 1 | Aguada | Valid | L6 | -31.60000 | -25.557196 | -65.23333 | NASA | 1930-01-01 |
9 | False | Fell | (-30.86667, -64.55) | 417 | 1440.0 | A | Aguila Blanca | Valid | L | -30.86667 | -34.164097 | -64.55000 | NASA | 1920-01-01 |
Last rows
boolean | fall | GeoLocation | id | mass_(g) | mixed | name | nametype | recclass | reclat | reclat_city | reclong | source | year | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
45716 | True | Fell | (50.775, 6.08333) | 1 | 21.0 | 1 | Aachen copy | Valid | L5 | 50.77500 | 42.564600 | 6.08333 | NASA | 1880-01-01 |
45717 | False | Fell | (56.18333, 10.23333) | 2 | 720.0 | A | Aarhus copy | Valid | H6 | 56.18333 | 56.624204 | 10.23333 | NASA | 1951-01-01 |
45718 | False | Fell | (54.21667, -113.0) | 6 | 107000.0 | 1 | Abee copy | Valid | EH4 | 54.21667 | 59.830879 | -113.00000 | NASA | 1952-01-01 |
45719 | True | Fell | (16.88333, -99.9) | 10 | 1914.0 | A | Acapulco copy | Valid | Acapulcoite | 16.88333 | 17.463788 | -99.90000 | NASA | 1976-01-01 |
45720 | False | Fell | (-33.16667, -64.95) | 370 | 780.0 | 1 | Achiras copy | Valid | L6 | -33.16667 | -41.940615 | -64.95000 | NASA | 1902-01-01 |
45721 | False | Fell | (32.1, 71.8) | 379 | 4239.0 | A | Adhi Kot copy | Valid | EH4 | 32.10000 | 27.765932 | 71.80000 | NASA | 1919-01-01 |
45722 | False | Fell | (44.83333, 95.16667) | 390 | 910.0 | A | Adzhi-Bogdo (stone) copy | Valid | LL3-6 | 44.83333 | 45.180523 | 95.16667 | NASA | 1949-01-01 |
45723 | False | Fell | (44.21667, 0.61667) | 392 | 30000.0 | A | Agen copy | Valid | H5 | 44.21667 | 43.054335 | 0.61667 | NASA | 1814-01-01 |
45724 | False | Fell | (-31.6, -65.23333) | 398 | 1620.0 | 1 | Aguada copy | Valid | L6 | -31.60000 | -25.557196 | -65.23333 | NASA | 1930-01-01 |
45725 | False | Fell | (-30.86667, -64.55) | 417 | 1440.0 | A | Aguila Blanca copy | Valid | L | -30.86667 | -34.164097 | -64.55000 | NASA | 1920-01-01 |