Dataset info
Number of variables | 14 |
---|---|
Number of observations | 207555 |
Missing cells | 38068 (< 0.1%) |
Duplicate rows | 0 (0.0%) |
Total size in memory | 22.2 MiB |
Average record size in memory | 112.0 B |
Variables types
Numeric | 5 |
---|---|
Categorical | 2 |
Boolean | 0 |
Date | 1 |
URL | 0 |
Text (Unique) | 0 |
Rejected | 6 |
Unsupported | 0 |
Warnings
AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG (ρ = 0.95404) | Rejected |
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC (ρ = 0.93355) | Rejected |
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD (ρ = 0.93013) | Rejected |
DATUM_BESTAND has constant value "2019-05-20" | Rejected |
GEMIDDELDE_VERKOOPPRIJS has 35702 (17.2%) missing values | Missing |
PEILDATUM has constant value "2019-05-01" | Rejected |
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1771 distinct values | Warning |
VERSIE has constant value "1.0" | Rejected |
ZORGPRODUCT_CD has a high cardinality: 5859 distinct values | Warning |
ZORGPRODUCT_CD has 2366 (< 0.1%) missing values | Missing |
AANTAL_PAT_PER_DIAG
Numeric
Distinct count | 6779 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 7546.4 |
---|---|
Minimum | 1 |
Maximum | 1.9809e+05 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 37 |
Q1 | 401 |
Median | 1692 |
Q3 | 6415 |
95-th percentile | 36209 |
Maximum | 1.9809e+05 |
Range | 1.9809e+05 |
Interquartile range | 6014 |
Descriptive statistics
Standard deviation | 17219 |
---|---|
Coef of variation | 2.2817 |
Kurtosis | 29.942 |
Mean | 7546.4 |
MAD | 9104.1 |
Skewness | 4.8174 |
Sum | 1.5663e+09 |
Variance | 2.9648e+08 |
Memory size | 1.6 MiB |
Value | Count | Frequency (%) | |
1 | 447 | < 0.1% | |
2 | 446 | < 0.1% | |
3 | 353 | < 0.1% | |
6 | 341 | < 0.1% | |
4 | 338 | < 0.1% | |
25 | 334 | < 0.1% | |
21 | 327 | < 0.1% | |
11 | 323 | < 0.1% | |
5 | 322 | < 0.1% | |
12 | 312 | < 0.1% | |
Other values (6769) | 204012 | > 99.9% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 447 | < 0.1% | |
2 | 446 | < 0.1% | |
3 | 353 | < 0.1% | |
4 | 338 | < 0.1% | |
5 | 322 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1.9809e+05 | 16 | < 0.1% | |
1.9578e+05 | 17 | < 0.1% | |
1.9549e+05 | 19 | < 0.1% | |
1.9327e+05 | 20 | < 0.1% | |
1.8911e+05 | 19 | < 0.1% |
AANTAL_PAT_PER_SPC
Numeric
Distinct count | 215 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 6.5669e+05 |
---|---|
Minimum | 1 |
Maximum | 1.4896e+06 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 42205 |
Q1 | 2.7349e+05 |
Median | 7.3334e+05 |
Q3 | 9.5911e+05 |
95-th percentile | 1.3048e+06 |
Maximum | 1.4896e+06 |
Range | 1.4896e+06 |
Interquartile range | 6.8562e+05 |
Descriptive statistics
Standard deviation | 4.0861e+05 |
---|---|
Coef of variation | 0.62222 |
Kurtosis | -0.98819 |
Mean | 6.5669e+05 |
MAD | 3.5264e+05 |
Skewness | 0.068181 |
Sum | 1.363e+11 |
Variance | 1.6696e+11 |
Memory size | 1.6 MiB |
Value | Count | Frequency (%) | |
8.7667e+05 | 5100 | < 0.1% | |
8.5896e+05 | 4398 | < 0.1% | |
8.3173e+05 | 4357 | < 0.1% | |
8.4828e+05 | 4339 | < 0.1% | |
1.0456e+06 | 3968 | < 0.1% | |
1.026e+06 | 3946 | < 0.1% | |
9.5911e+05 | 3869 | < 0.1% | |
1.0312e+06 | 3858 | < 0.1% | |
6.0443e+05 | 3803 | < 0.1% | |
9.9561e+05 | 3724 | < 0.1% | |
Other values (205) | 166193 | 80.1% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 1 | < 0.1% | |
5 | 2 | < 0.1% | |
13 | 1 | < 0.1% | |
43 | 9 | < 0.1% | |
46 | 10 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1.4896e+06 | 2981 | < 0.1% | |
1.4506e+06 | 3055 | < 0.1% | |
1.4122e+06 | 3578 | < 0.1% | |
1.3048e+06 | 3590 | < 0.1% | |
1.2967e+06 | 1182 | < 0.1% |
AANTAL_PAT_PER_ZPD
Numeric
Distinct count | 7821 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 493.89 |
---|---|
Minimum | 1 |
Maximum | 1.4273e+05 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
Median | 13 |
Q3 | 97 |
95-th percentile | 1650 |
Maximum | 1.4273e+05 |
Range | 1.4273e+05 |
Interquartile range | 95 |
Descriptive statistics
Standard deviation | 3046.2 |
---|---|
Coef of variation | 6.1678 |
Kurtosis | 349.56 |
Mean | 493.89 |
MAD | 789.52 |
Skewness | 15.829 |
Sum | 1.0251e+08 |
Variance | 9.2795e+06 |
Memory size | 1.6 MiB |
Value | Count | Frequency (%) | |
1 | 35624 | 17.2% | |
2 | 17081 | 8.2% | |
3 | 11016 | 5.3% | |
4 | 8217 | < 0.1% | |
5 | 6406 | < 0.1% | |
6 | 5294 | < 0.1% | |
7 | 4339 | < 0.1% | |
8 | 3602 | < 0.1% | |
9 | 3425 | < 0.1% | |
10 | 3025 | < 0.1% | |
Other values (7811) | 109526 | 52.8% |
Minimum 5 values
Value | Count | Frequency (%) | |
1 | 35624 | 17.2% | |
2 | 17081 | 8.2% | |
3 | 11016 | 5.3% | |
4 | 8217 | < 0.1% | |
5 | 6406 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
1.4273e+05 | 1 | < 0.1% | |
1.4108e+05 | 1 | < 0.1% | |
1.0768e+05 | 1 | < 0.1% | |
1.056e+05 | 1 | < 0.1% | |
1.0468e+05 | 1 | < 0.1% |
AANTAL_SUBTRAJECT_PER_DIAG
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_DIAG
and should be ignored for analysis
Correlation | 0.95404 |
---|
AANTAL_SUBTRAJECT_PER_SPC
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_SPC
and should be ignored for analysis
Correlation | 0.93355 |
---|
AANTAL_SUBTRAJECT_PER_ZPD
Highly correlated
This variable is highly correlated with AANTAL_PAT_PER_ZPD
and should be ignored for analysis
Correlation | 0.93013 |
---|
BEHANDELEND_SPECIALISME_CD
Numeric
Distinct count | 28 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 420.51 |
---|---|
Minimum | 100 |
Maximum | 8418 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 302 |
Q1 | 305 |
Median | 313 |
Q3 | 322 |
95-th percentile | 361 |
Maximum | 8418 |
Range | 8318 |
Interquartile range | 17 |
Descriptive statistics
Standard deviation | 914.84 |
---|---|
Coef of variation | 2.1756 |
Kurtosis | 72.277 |
Mean | 420.51 |
MAD | 208.06 |
Skewness | 8.6115 |
Sum | 8.7278e+07 |
Variance | 8.3693e+05 |
Memory size | 1.6 MiB |
Value | Count | Frequency (%) | |
305 | 29207 | 14.1% | |
313 | 27061 | 13.0% | |
303 | 23803 | 11.5% | |
330 | 16794 | 8.1% | |
316 | 14294 | 6.9% | |
308 | 9854 | < 0.1% | |
324 | 8530 | < 0.1% | |
301 | 8462 | < 0.1% | |
306 | 8443 | < 0.1% | |
304 | 6669 | < 0.1% | |
Other values (18) | 54438 | 26.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
100 | 9 | < 0.1% | |
301 | 8462 | < 0.1% | |
302 | 4484 | < 0.1% | |
303 | 23803 | 11.5% | |
304 | 6669 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
8418 | 2675 | < 0.1% | |
1900 | 134 | < 0.1% | |
390 | 478 | < 0.1% | |
389 | 2320 | < 0.1% | |
362 | 3680 | < 0.1% |
DATUM_BESTAND
Constant
This variable is constant and should be ignored for analysis
Constant value | 2019-05-20 |
---|
GEMIDDELDE_VERKOOPPRIJS
Numeric
Distinct count | 2836 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 17.2% |
Missing (n) | 35702 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Mean | 3431.9 |
---|---|
Minimum | 70 |
Maximum | 2.8722e+05 |
Zeros (%) | 0.0% |
Quantile statistics
Minimum | 70 |
---|---|
5-th percentile | 140 |
Q1 | 455 |
Median | 1215 |
Q3 | 3920 |
95-th percentile | 12932 |
Maximum | 2.8722e+05 |
Range | 2.8715e+05 |
Interquartile range | 3465 |
Descriptive statistics
Standard deviation | 6617.1 |
---|---|
Coef of variation | 1.9281 |
Kurtosis | 201.15 |
Mean | 3431.9 |
MAD | 3520.3 |
Skewness | 8.6127 |
Sum | 5.8978e+08 |
Variance | 4.3786e+07 |
Memory size | 1.6 MiB |
Value | Count | Frequency (%) | |
105 | 1715 | < 0.1% | |
160 | 1640 | < 0.1% | |
180 | 1439 | < 0.1% | |
300 | 1377 | < 0.1% | |
115 | 1066 | < 0.1% | |
145 | 958 | < 0.1% | |
140 | 942 | < 0.1% | |
155 | 879 | < 0.1% | |
165 | 826 | < 0.1% | |
540 | 806 | < 0.1% | |
Other values (2825) | 160205 | 77.2% | |
(Missing) | 35702 | 17.2% |
Minimum 5 values
Value | Count | Frequency (%) | |
70 | 226 | < 0.1% | |
75 | 74 | < 0.1% | |
80 | 370 | < 0.1% | |
85 | 685 | < 0.1% | |
90 | 417 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) | |
2.8722e+05 | 8 | < 0.1% | |
1.4754e+05 | 3 | < 0.1% | |
1.2216e+05 | 4 | < 0.1% | |
1.1691e+05 | 3 | < 0.1% | |
1.0857e+05 | 7 | < 0.1% |
JAAR
Date
Distinct count | 8 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
Infinite (%) | 0.0% |
Infinite (n) | 0 |
Minimum | 2012-01-01 00:00:00 |
---|---|
Maximum | 2019-01-01 00:00:00 |
PEILDATUM
Constant
This variable is constant and should be ignored for analysis
Constant value | 2019-05-01 |
---|
TYPERENDE_DIAGNOSE_CD
Categorical
Distinct count | 1771 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | 0.0% |
Missing (n) | 0 |
101 | 881 |
---|---|
402 | 862 |
301 | 834 |
Other values (1768) |
Value | Count | Frequency (%) | |
101 | 881 | < 0.1% | |
402 | 862 | < 0.1% | |
301 | 834 | < 0.1% | |
403 | 820 | < 0.1% | |
203 | 791 | < 0.1% | |
201 | 776 | < 0.1% | |
401 | 708 | < 0.1% | |
404 | 695 | < 0.1% | |
409 | 681 | < 0.1% | |
802 | 670 | < 0.1% | |
Other values (1761) | 199837 | > 99.9% |
Max length | 4 |
---|---|
Mean length | 3.3471 |
Min length | 1 |
Contains chars | True |
Contains digits | True |
Contains spaces | False |
Contains non-words | False |
VERSIE
Constant
This variable is constant and should be ignored for analysis
Constant value | 1.0 |
---|
ZORGPRODUCT_CD
Categorical
Distinct count | 5859 |
---|---|
Unique (%) | < 0.1% |
Missing (%) | < 0.1% |
Missing (n) | 2366 |
990004009 | 1503 |
---|---|
990004007 | 1479 |
990003004 | 1475 |
Other values (5855) | |
(Missing) | 2366 |
Value | Count | Frequency (%) | |
990004009 | 1503 | < 0.1% | |
990004007 | 1479 | < 0.1% | |
990003004 | 1475 | < 0.1% | |
990004006 | 1180 | < 0.1% | |
990356076 | 986 | < 0.1% | |
990003007 | 920 | < 0.1% | |
990356073 | 918 | < 0.1% | |
131999228 | 883 | < 0.1% | |
131999164 | 877 | < 0.1% | |
199299013 | 863 | < 0.1% | |
Other values (5848) | 194105 | 93.5% | |
(Missing) | 2366 | < 0.1% |
Max length | 9 |
---|---|
Mean length | 8.9316 |
Min length | 3 |
Contains chars | True |
Contains digits | True |
Contains spaces | True |
Contains non-words | True |
AANTAL_PAT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_SUBTRAJECT_PER_SPC | AANTAL_SUBTRAJECT_PER_ZPD | BEHANDELEND_SPECIALISME_CD | DATUM_BESTAND | GEMIDDELDE_VERKOOPPRIJS | JAAR | PEILDATUM | TYPERENDE_DIAGNOSE_CD | VERSIE | ZORGPRODUCT_CD | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 3611 | 187296 | 52 | 4250 | 304621 | 52 | 327 | 2019-05-20 | 85.0 | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027131 |
1 | 3611 | 187296 | 1 | 4250 | 304621 | 1 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027183 |
2 | 3611 | 187296 | 16 | 4250 | 304621 | 16 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027179 |
3 | 3611 | 187296 | 1 | 4250 | 304621 | 1 | 327 | 2019-05-20 | 2225.0 | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027142 |
4 | 3611 | 187296 | 26 | 4250 | 304621 | 26 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027180 |
5 | 3611 | 187296 | 3 | 4250 | 304621 | 3 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027178 |
6 | 3611 | 187296 | 1 | 4250 | 304621 | 1 | 327 | 2019-05-20 | 2795.0 | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027177 |
7 | 3611 | 187296 | 2 | 4250 | 304621 | 2 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027184 |
8 | 3611 | 187296 | 33 | 4250 | 304621 | 33 | 327 | 2019-05-20 | NaN | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027182 |
9 | 3611 | 187296 | 132 | 4250 | 304621 | 137 | 327 | 2019-05-20 | 19890.0 | 2013-01-01 | 2019-05-01 | 0613 | 1.0 | 990027181 |
AANTAL_PAT_PER_DIAG | AANTAL_PAT_PER_SPC | AANTAL_PAT_PER_ZPD | AANTAL_SUBTRAJECT_PER_DIAG | AANTAL_SUBTRAJECT_PER_SPC | AANTAL_SUBTRAJECT_PER_ZPD | BEHANDELEND_SPECIALISME_CD | DATUM_BESTAND | GEMIDDELDE_VERKOOPPRIJS | JAAR | PEILDATUM | TYPERENDE_DIAGNOSE_CD | VERSIE | ZORGPRODUCT_CD | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
207545 | 219 | 301061 | 1 | 313 | 436331 | 1 | 316 | 2019-05-20 | NaN | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816013 |
207546 | 219 | 301061 | 63 | 313 | 436331 | 66 | 316 | 2019-05-20 | 925.0 | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816028 |
207547 | 219 | 301061 | 121 | 313 | 436331 | 152 | 316 | 2019-05-20 | 370.0 | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816027 |
207548 | 219 | 301061 | 21 | 313 | 436331 | 24 | 316 | 2019-05-20 | NaN | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816007 |
207549 | 219 | 301061 | 12 | 313 | 436331 | 12 | 316 | 2019-05-20 | NaN | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816018 |
207550 | 219 | 301061 | 2 | 313 | 436331 | 3 | 316 | 2019-05-20 | NaN | 2018-01-01 | 2019-05-01 | 6005 | 1.0 | 990816006 |
207551 | 62 | 301061 | 39 | 66 | 436331 | 40 | 316 | 2019-05-20 | 350.0 | 2018-01-01 | 2019-05-01 | 7807 | 1.0 | 991116028 |
207552 | 62 | 301061 | 9 | 66 | 436331 | 10 | 316 | 2019-05-20 | 910.0 | 2018-01-01 | 2019-05-01 | 7807 | 1.0 | 991116029 |
207553 | 62 | 301061 | 4 | 66 | 436331 | 4 | 316 | 2019-05-20 | 7895.0 | 2018-01-01 | 2019-05-01 | 7807 | 1.0 | 991116014 |
207554 | 62 | 301061 | 12 | 66 | 436331 | 12 | 316 | 2019-05-20 | 2605.0 | 2018-01-01 | 2019-05-01 | 7807 | 1.0 | 991116006 |