Dataset statistics
Number of variables | 2 |
---|---|
Number of observations | 361 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.8 KiB |
Average record size in memory | 16.4 B |
Variable types
DATE | 1 |
---|---|
NUM | 1 |
Reproduction
Analysis started | 2020-02-14 00:01:30.282690 |
---|---|
Analysis finished | 2020-02-14 00:01:31.045214 |
Version | pandas-profiling v2.5.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
Distinct count | 361 |
---|---|
Unique (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Minimum | 1990-01-01 00:00:00 |
---|---|
Maximum | 2020-01-01 00:00:00 |
Histogram
PCOALAUUSDM
Real number (ℝ≥0)
Distinct count | 275 |
---|---|
Unique (%) | 76.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 61.06970978828314 |
---|---|
Minimum | 24.0 |
Maximum | 195.18633540372699 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 2.9 KiB |
Quantile statistics
Minimum | 24 |
---|---|
5-th percentile | 26.1 |
Q1 | 33.6 |
median | 52.43303571 |
Q3 | 85.56173469 |
95-th percentile | 125.0858766 |
Maximum | 195.1863354 |
Range | 171.1863354 |
Interquartile range (IQR) | 51.96173469 |
Descriptive statistics
Standard deviation | 33.60143246 |
---|---|
Coefficient of variation (CV) | 0.5502143793 |
Kurtosis | 0.4179963001 |
Mean | 61.06970979 |
Median Absolute Deviation (MAD) | 28.05645774 |
Skewness | 1.002984538 |
Sum | 22046.16523 |
Variance | 1129.056264 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 24. 26.09464286 26.12142857 30.875 31.09 ... 54.5187406 57.19212662 106.05620301 142.48182398 195.1863354 ], "bayesian blocks" binning strategy used)
Value | Count | Frequency (%) | |
39.5 | 19 | 5.3% | |
31 | 13 | 3.6% | |
26.1 | 11 | 3.0% | |
40.5 | 10 | 2.8% | |
25.1 | 6 | 1.7% | |
25.6 | 5 | 1.4% | |
33.1 | 5 | 1.4% | |
38 | 4 | 1.1% | |
35 | 4 | 1.1% | |
27.15 | 3 | 0.8% | |
Other values (265) | 281 | 77.8% |
Value | Count | Frequency (%) | |
24 | 1 | 0.3% | |
24.45 | 1 | 0.3% | |
24.9 | 1 | 0.3% | |
24.96428571 | 1 | 0.3% | |
25.1 | 6 | 1.7% |
Value | Count | Frequency (%) | |
195.1863354 | 1 | 0.3% | |
173.3035714 | 1 | 0.3% | |
166.9897959 | 1 | 0.3% | |
164.4983766 | 1 | 0.3% | |
143.0758929 | 1 | 0.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
DATE | PCOALAUUSDM | |
---|---|---|
0 | 1990-01-01 | 38.0 |
1 | 1990-02-01 | 38.0 |
2 | 1990-03-01 | 38.0 |
3 | 1990-04-01 | 38.0 |
4 | 1990-05-01 | 40.5 |
5 | 1990-06-01 | 40.5 |
6 | 1990-07-01 | 40.5 |
7 | 1990-08-01 | 40.5 |
8 | 1990-09-01 | 40.5 |
9 | 1990-10-01 | 40.5 |
Last rows
DATE | PCOALAUUSDM | |
---|---|---|
351 | 2019-04-01 | 88.764643 |
352 | 2019-05-01 | 89.564286 |
353 | 2019-06-01 | 77.629821 |
354 | 2019-07-01 | 77.845807 |
355 | 2019-08-01 | 69.739286 |
356 | 2019-09-01 | 66.958673 |
357 | 2019-10-01 | 69.194255 |
358 | 2019-11-01 | 69.729082 |
359 | 2019-12-01 | 70.464643 |
360 | 2020-01-01 | 72.106169 |