Dataset info
Number of variables | 14 |
---|---|
Number of observations | 220056 |
Missing cells | 41688 (1.4%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 23.5 MiB |
Average record size in memory | 112.0 B |
Variables types
Numeric | 5 |
---|---|
Categorical | 2 |
Boolean | 0 |
Date | 1 |
URL | 0 |
Text (Unique) | 0 |
Rejected | 6 |
Unsupported | 0 |
Warnings
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG (ρ = 0.9538974473) | Rejected |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC (ρ = 0.937317187) | Rejected |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD (ρ = 0.9295724875) | Rejected |
DATUM_BESTAND has constant value "2019-07-10" | Rejected |
GEMIDDELDE_VERKOOPPRIJS has 39256 (17.8%) missing values | Missing |
PEILDATUM has constant value "2019-07-01" | Rejected |
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1772 distinct values | Warning |
VERSIE has constant value "1.0" | Rejected |
ZORGPRODUCT_CD has a high cardinality: 5872 distinct values | Warning |
ZORGPRODUCT_CD has 2432 (1.1%) missing values | Missing |
AANTAL_PAT_PER_DIAG
Numeric
Distinct count | 6901 |
---|---|
Unique (%) | 3.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 7389.779034 |
---|---|
Minimum | 1 |
Maximum | 205513 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 32 |
Q1 | 354 |
Median | 1570 |
Q3 | 6061 |
95-th percentile | 35984 |
Maximum | 205513 |
Range | 205512 |
Interquartile range | 5707 |
Descriptive statistics
Standard deviation | 17269.46408 |
---|---|
Coef of variation | 2.336939169 |
Kurtosis | 31.30153635 |
Mean | 7389.779034 |
MAD | 9038.062863 |
Skewness | 4.920357863 |
Sum | 1626165215 |
Variance | 298234389.4 |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) | |
6 | 420 | 0.2% | |
32 | 398 | 0.2% | |
4 | 391 | 0.2% | |
12 | 388 | 0.2% | |
21 | 386 | 0.2% | |
19 | 386 | 0.2% | |
8 | 385 | 0.2% | |
23 | 385 | 0.2% | |
5 | 379 | 0.2% | |
17 | 378 | 0.2% | |
Other values (6891) | 216160 | 98.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 340 | 0.2% | |
2 | 374 | 0.2% | |
3 | 355 | 0.2% | |
4 | 391 | 0.2% | |
5 | 379 | 0.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
205513 | 19 | < 0.1% | |
200182 | 17 | < 0.1% | |
199981 | 16 | < 0.1% | |
197742 | 20 | < 0.1% | |
189114 | 19 | < 0.1% |
AANTAL_PAT_PER_SPC
Numeric
Distinct count | 217 |
---|---|
Unique (%) | 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 642319.0376 |
---|---|
Minimum | 83 |
Maximum | 1489568 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 83 |
---|---|
5-th percentile | 30777 |
Q1 | 242846 |
Median | 713937 |
Q3 | 977237 |
95-th percentile | 1328494 |
Maximum | 1489568 |
Range | 1489485 |
Interquartile range | 734391 |
Descriptive statistics
Standard deviation | 426698.7108 |
---|---|
Coef of variation | 0.6643096122 |
Kurtosis | -1.108280086 |
Mean | 642319.0376 |
MAD | 373675.5045 |
Skewness | 0.06739837272 |
Sum | 1.413461581e+11 |
Variance | 1.820717898e+11 |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) | |
881250 | 5107 | 2.3% | |
870026 | 4401 | 2.0% | |
871428 | 4372 | 2.0% | |
841375 | 4367 | 2.0% | |
1061055 | 3974 | 1.8% | |
1058528 | 3972 | 1.8% | |
693207 | 3970 | 1.8% | |
977237 | 3872 | 1.8% | |
1040393 | 3858 | 1.8% | |
995598 | 3724 | 1.7% | |
Other values (207) | 178439 | 81.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
83 | 13 | < 0.1% | |
102 | 6 | < 0.1% | |
132 | 3 | < 0.1% | |
362 | 57 | < 0.1% | |
583 | 102 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1489568 | 2981 | 1.4% | |
1450694 | 3057 | 1.4% | |
1421988 | 3588 | 1.6% | |
1328494 | 3616 | 1.6% | |
1307582 | 3590 | 1.6% |
AANTAL_PAT_PER_ZPD
Numeric
Distinct count | 8019 |
---|---|
Unique (%) | 3.6% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 484.0502236 |
---|---|
Minimum | 1 |
Maximum | 150256 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
Median | 12 |
Q3 | 92 |
95-th percentile | 1592 |
Maximum | 150256 |
Range | 150255 |
Interquartile range | 90 |
Descriptive statistics
Standard deviation | 3048.750513 |
---|---|
Coef of variation | 6.298417736 |
Kurtosis | 369.6471736 |
Mean | 484.0502236 |
MAD | 776.5827494 |
Skewness | 16.23883553 |
Sum | 106518156 |
Variance | 9294879.691 |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) | |
1 | 38238 | 17.4% | |
2 | 18353 | 8.3% | |
3 | 11855 | 5.4% | |
4 | 8822 | 4.0% | |
5 | 6850 | 3.1% | |
6 | 5699 | 2.6% | |
7 | 4726 | 2.1% | |
8 | 3958 | 1.8% | |
9 | 3623 | 1.6% | |
10 | 3190 | 1.4% | |
Other values (8009) | 114742 | 52.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 38238 | 17.4% | |
2 | 18353 | 8.3% | |
3 | 11855 | 5.4% | |
4 | 8822 | 4.0% | |
5 | 6850 | 3.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
150256 | 1 | < 0.1% | |
144234 | 1 | < 0.1% | |
122173 | 1 | < 0.1% | |
108889 | 1 | < 0.1% | |
108086 | 1 | < 0.1% |
AANTAL_SUBTRAJECT_PER_DIAG
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_DIAG
and should be ignored for analysis
Correlation | 0.9538974473 |
---|
AANTAL_SUBTRAJECT_PER_SPC
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_SPC
and should be ignored for analysis
Correlation | 0.937317187 |
---|
AANTAL_SUBTRAJECT_PER_ZPD
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_ZPD
and should be ignored for analysis
Correlation | 0.9295724875 |
---|
BEHANDELEND_SPECIALISME_CD
Numeric
Distinct count | 28 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 421.6114898 |
---|---|
Minimum | 100 |
Maximum | 8418 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 302 |
Q1 | 305 |
Median | 313 |
Q3 | 322 |
95-th percentile | 361 |
Maximum | 8418 |
Range | 8318 |
Interquartile range | 17 |
Descriptive statistics
Standard deviation | 919.7531312 |
---|---|
Coef of variation | 2.181518183 |
Kurtosis | 71.44324618 |
Mean | 421.6114898 |
MAD | 210.3102137 |
Skewness | 8.562981878 |
Sum | 92778138 |
Variance | 845945.8223 |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) | |
305 | 30805 | 14.0% | |
313 | 28755 | 13.1% | |
303 | 25331 | 11.5% | |
330 | 17841 | 8.1% | |
316 | 15076 | 6.9% | |
308 | 10523 | 4.8% | |
324 | 9070 | 4.1% | |
301 | 9007 | 4.1% | |
306 | 8937 | 4.1% | |
304 | 7145 | 3.2% | |
Other values (18) | 57566 | 26.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
100 | 9 | < 0.1% | |
301 | 9007 | 4.1% | |
302 | 4799 | 2.2% | |
303 | 25331 | 11.5% | |
304 | 7145 | 3.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
8418 | 2867 | 1.3% | |
1900 | 145 | 0.1% | |
390 | 534 | 0.2% | |
389 | 2439 | 1.1% | |
362 | 3700 | 1.7% |
DATUM_BESTAND
Constant
This variable is constant and should be ignored for analysis
Constant value | 2019-07-10 |
---|
GEMIDDELDE_VERKOOPPRIJS
Numeric
Distinct count | 2872 |
---|---|
Unique (%) | 1.3% |
Missing (%) | 17.8% |
Missing (n) | 39256 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 3403.963993 |
---|---|
Minimum | 70 |
Maximum | 287220 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 70 |
---|---|
5-th percentile | 135 |
Q1 | 445 |
Median | 1185 |
Q3 | 3860 |
95-th percentile | 12880.25 |
Maximum | 287220 |
Range | 287150 |
Interquartile range | 3415 |
Descriptive statistics
Standard deviation | 6577.433188 |
---|---|
Coef of variation | 1.932286358 |
Kurtosis | 197.0488928 |
Mean | 3403.963993 |
MAD | 3505.52706 |
Skewness | 8.509069144 |
Sum | 615436690 |
Variance | 43262627.34 |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) | |
160 | 1701 | 0.8% | |
105 | 1634 | 0.7% | |
180 | 1542 | 0.7% | |
110 | 1194 | 0.5% | |
300 | 1171 | 0.5% | |
140 | 1130 | 0.5% | |
295 | 959 | 0.4% | |
235 | 945 | 0.4% | |
115 | 944 | 0.4% | |
145 | 944 | 0.4% | |
Other values (2861) | 168636 | 76.6% | |
(Missing) | 39256 | 17.8% |
Minimum 5 values
Value | Count | Frequency (%) | |
70 | 226 | 0.1% | |
75 | 75 | < 0.1% | |
80 | 428 | 0.2% | |
85 | 714 | 0.3% | |
90 | 448 | 0.2% |
Maximum 5 values
Value | Count | Frequency (%) | |
287220 | 8 | < 0.1% | |
147535 | 3 | < 0.1% | |
122155 | 4 | < 0.1% | |
116910 | 3 | < 0.1% | |
108570 | 7 | < 0.1% |
JAAR
Date
Distinct count | 8 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Minimum | 2012-01-01 00:00:00 |
---|---|
Maximum | 2019-01-01 00:00:00 |
PEILDATUM
Constant
This variable is constant and should be ignored for analysis
Constant value | 2019-07-01 |
---|
TYPERENDE_DIAGNOSE_CD
Categorical
Distinct count | 1772 |
---|---|
Unique (%) | 0.8% |
Missing (%) | 0.0% |
Missing (n) | 0 |
101 | 929 |
---|---|
402 | 925 |
403 | 883 |
Other values (1769) |
Value | Count | Frequency (%) | |
101 | 929 | 0.4% | |
402 | 925 | 0.4% | |
403 | 883 | 0.4% | |
301 | 883 | 0.4% | |
203 | 841 | 0.4% | |
201 | 834 | 0.4% | |
401 | 765 | 0.3% | |
404 | 741 | 0.3% | |
409 | 727 | 0.3% | |
802 | 707 | 0.3% | |
Other values (1762) | 211821 | 96.3% |
Max length | 4 |
---|---|
Mean length | 3.344621369 |
Min length | 1 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | False |
VERSIE
Constant
This variable is constant and should be ignored for analysis
Constant value | 1.0 |
---|
ZORGPRODUCT_CD
Categorical
Distinct count | 5872 |
---|---|
Unique (%) | 2.7% |
Missing (%) | 1.1% |
Missing (n) | 2432 |
990004009 | 1656 |
---|---|
990004007 | 1598 |
990003004 | 1555 |
Other values (5868) | |
(Missing) | 2432 |
Value | Count | Frequency (%) | |
990004009 | 1656 | 0.8% | |
990004007 | 1598 | 0.7% | |
990003004 | 1555 | 0.7% | |
990004006 | 1236 | 0.6% | |
990356076 | 1059 | 0.5% | |
990003007 | 990 | 0.4% | |
131999228 | 986 | 0.4% | |
131999164 | 973 | 0.4% | |
990356073 | 970 | 0.4% | |
199299013 | 915 | 0.4% | |
Other values (5861) | 205686 | 93.5% | |
(Missing) | 2432 | 1.1% |
Max length | 9 |
---|---|
Mean length | 8.933689606 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
First rows
AANTAL_PAT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_SUBTRAJECT_PER_SPC | AANTAL_SUBTRAJECT_PER_ZPD | BEHANDELEND_SPECIALISME_CD | DATUM_BESTAND | GEMIDDELDE_VERKOOPPRIJS | JAAR | PEILDATUM | TYPERENDE_DIAGNOSE_CD | VERSIE | ZORGPRODUCT_CD | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2456 | 187330 | 478 | 3798 | 304742 | 588 | 327 | 2019-07-10 | 405.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027133 |
1 | 2456 | 187330 | 18 | 3798 | 304742 | 18 | 327 | 2019-07-10 | 24960.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027166 |
2 | 2456 | 187330 | 13 | 3798 | 304742 | 13 | 327 | 2019-07-10 | 33390.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027163 |
3 | 2456 | 187330 | 2 | 3798 | 304742 | 2 | 327 | 2019-07-10 | 2410.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027160 |
4 | 2456 | 187330 | 6 | 3798 | 304742 | 6 | 327 | 2019-07-10 | NaN | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027161 |
5 | 2456 | 187330 | 1 | 3798 | 304742 | 1 | 327 | 2019-07-10 | 2225.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027142 |
6 | 2456 | 187330 | 2 | 3798 | 304742 | 2 | 327 | 2019-07-10 | NaN | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027165 |
7 | 2456 | 187330 | 483 | 3798 | 304742 | 533 | 327 | 2019-07-10 | 2650.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027168 |
8 | 2456 | 187330 | 11 | 3798 | 304742 | 11 | 327 | 2019-07-10 | 56195.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027162 |
9 | 2456 | 187330 | 1 | 3798 | 304742 | 1 | 327 | 2019-07-10 | 51765.0 | 2013-01-01 | 2019-07-01 | 0415 | 1.0 | 990027153 |
Last rows
AANTAL_PAT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_SUBTRAJECT_PER_SPC | AANTAL_SUBTRAJECT_PER_ZPD | BEHANDELEND_SPECIALISME_CD | DATUM_BESTAND | GEMIDDELDE_VERKOOPPRIJS | JAAR | PEILDATUM | TYPERENDE_DIAGNOSE_CD | VERSIE | ZORGPRODUCT_CD | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
220046 | 116 | 354080 | 9 | 202 | 543156 | 12 | 316 | 2019-07-10 | 3040.0 | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116004 |
220047 | 116 | 354080 | 35 | 202 | 543156 | 38 | 316 | 2019-07-10 | 1080.0 | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116011 |
220048 | 116 | 354080 | 1 | 202 | 543156 | 1 | 316 | 2019-07-10 | NaN | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116007 |
220049 | 116 | 354080 | 5 | 202 | 543156 | 5 | 316 | 2019-07-10 | NaN | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116048 |
220050 | 116 | 354080 | 1 | 202 | 543156 | 1 | 316 | 2019-07-10 | NaN | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116055 |
220051 | 116 | 354080 | 48 | 202 | 543156 | 54 | 316 | 2019-07-10 | 385.0 | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116027 |
220052 | 116 | 354080 | 16 | 202 | 543156 | 16 | 316 | 2019-07-10 | 310.0 | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116018 |
220053 | 116 | 354080 | 5 | 202 | 543156 | 6 | 316 | 2019-07-10 | 14365.0 | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116008 |
220054 | 116 | 354080 | 1 | 202 | 543156 | 1 | 316 | 2019-07-10 | NaN | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116054 |
220055 | 116 | 354080 | 9 | 202 | 543156 | 10 | 316 | 2019-07-10 | NaN | 2018-01-01 | 2019-07-01 | 6119 | 1.0 | 990116049 |