Overview

Dataset info

Number of variables14
Number of observations207555
Missing cells38068 (< 0.1%)
Duplicate rows0 (0.0%)
Total size in memory22.2 MiB
Average record size in memory112.0 B

Variables types

Numeric5
Categorical2
Boolean0
Date1
URL0
Text (Unique)0
Rejected6
Unsupported0

Warnings

AANTAL_SUBTRAJECT_PER_DIAG is highly correlated with AANTAL_PAT_PER_DIAG (ρ = 0.95404) Rejected
AANTAL_SUBTRAJECT_PER_SPC is highly correlated with AANTAL_PAT_PER_SPC (ρ = 0.93355) Rejected
AANTAL_SUBTRAJECT_PER_ZPD is highly correlated with AANTAL_PAT_PER_ZPD (ρ = 0.93013) Rejected
DATUM_BESTAND has constant value "2019-05-20" Rejected
GEMIDDELDE_VERKOOPPRIJS has 35702 (17.2%) missing values Missing
PEILDATUM has constant value "2019-05-01" Rejected
TYPERENDE_DIAGNOSE_CD has a high cardinality: 1771 distinct values Warning
VERSIE has constant value "1.0" Rejected
ZORGPRODUCT_CD has a high cardinality: 5859 distinct values Warning
ZORGPRODUCT_CD has 2366 (< 0.1%) missing values Missing

Variables

AANTAL_PAT_PER_DIAG
Numeric

Distinct count6779
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean7546.4
Minimum1
Maximum1.9809e+05
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile37
Q1401
Median1692
Q36415
95-th percentile36209
Maximum1.9809e+05
Range1.9809e+05
Interquartile range6014

Descriptive statistics

Standard deviation17219
Coef of variation2.2817
Kurtosis29.942
Mean7546.4
MAD9104.1
Skewness4.8174
Sum1.5663e+09
Variance2.9648e+08
Memory size1.6 MiB
Histogram
ValueCountFrequency (%) 
1 447 < 0.1%
 
2 446 < 0.1%
 
3 353 < 0.1%
 
6 341 < 0.1%
 
4 338 < 0.1%
 
25 334 < 0.1%
 
21 327 < 0.1%
 
11 323 < 0.1%
 
5 322 < 0.1%
 
12 312 < 0.1%
 
Other values (6769) 204012 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1 447 < 0.1%
 
2 446 < 0.1%
 
3 353 < 0.1%
 
4 338 < 0.1%
 
5 322 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1.9809e+05 16 < 0.1%
 
1.9578e+05 17 < 0.1%
 
1.9549e+05 19 < 0.1%
 
1.9327e+05 20 < 0.1%
 
1.8911e+05 19 < 0.1%
 

AANTAL_PAT_PER_SPC
Numeric

Distinct count215
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean6.5669e+05
Minimum1
Maximum1.4896e+06
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile42205
Q12.7349e+05
Median7.3334e+05
Q39.5911e+05
95-th percentile1.3048e+06
Maximum1.4896e+06
Range1.4896e+06
Interquartile range6.8562e+05

Descriptive statistics

Standard deviation4.0861e+05
Coef of variation0.62222
Kurtosis-0.98819
Mean6.5669e+05
MAD3.5264e+05
Skewness0.068181
Sum1.363e+11
Variance1.6696e+11
Memory size1.6 MiB
Histogram
ValueCountFrequency (%) 
8.7667e+05 5100 < 0.1%
 
8.5896e+05 4398 < 0.1%
 
8.3173e+05 4357 < 0.1%
 
8.4828e+05 4339 < 0.1%
 
1.0456e+06 3968 < 0.1%
 
1.026e+06 3946 < 0.1%
 
9.5911e+05 3869 < 0.1%
 
1.0312e+06 3858 < 0.1%
 
6.0443e+05 3803 < 0.1%
 
9.9561e+05 3724 < 0.1%
 
Other values (205) 166193 80.1%
 

Minimum 5 values

ValueCountFrequency (%) 
1 1 < 0.1%
 
5 2 < 0.1%
 
13 1 < 0.1%
 
43 9 < 0.1%
 
46 10 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1.4896e+06 2981 < 0.1%
 
1.4506e+06 3055 < 0.1%
 
1.4122e+06 3578 < 0.1%
 
1.3048e+06 3590 < 0.1%
 
1.2967e+06 1182 < 0.1%
 

AANTAL_PAT_PER_ZPD
Numeric

Distinct count7821
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean493.89
Minimum1
Maximum1.4273e+05
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile1
Q12
Median13
Q397
95-th percentile1650
Maximum1.4273e+05
Range1.4273e+05
Interquartile range95

Descriptive statistics

Standard deviation3046.2
Coef of variation6.1678
Kurtosis349.56
Mean493.89
MAD789.52
Skewness15.829
Sum1.0251e+08
Variance9.2795e+06
Memory size1.6 MiB
Histogram
ValueCountFrequency (%) 
1 35624 17.2%
 
2 17081 8.2%
 
3 11016 5.3%
 
4 8217 < 0.1%
 
5 6406 < 0.1%
 
6 5294 < 0.1%
 
7 4339 < 0.1%
 
8 3602 < 0.1%
 
9 3425 < 0.1%
 
10 3025 < 0.1%
 
Other values (7811) 109526 52.8%
 

Minimum 5 values

ValueCountFrequency (%) 
1 35624 17.2%
 
2 17081 8.2%
 
3 11016 5.3%
 
4 8217 < 0.1%
 
5 6406 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1.4273e+05 1 < 0.1%
 
1.4108e+05 1 < 0.1%
 
1.0768e+05 1 < 0.1%
 
1.056e+05 1 < 0.1%
 
1.0468e+05 1 < 0.1%
 

AANTAL_SUBTRAJECT_PER_DIAG
Highly correlated

This variable is highly correlated with AANTAL_PAT_PER_DIAG and should be ignored for analysis

Correlation0.95404

AANTAL_SUBTRAJECT_PER_SPC
Highly correlated

This variable is highly correlated with AANTAL_PAT_PER_SPC and should be ignored for analysis

Correlation0.93355

AANTAL_SUBTRAJECT_PER_ZPD
Highly correlated

This variable is highly correlated with AANTAL_PAT_PER_ZPD and should be ignored for analysis

Correlation0.93013

BEHANDELEND_SPECIALISME_CD
Numeric

Distinct count28
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean420.51
Minimum100
Maximum8418
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum100
5-th percentile302
Q1305
Median313
Q3322
95-th percentile361
Maximum8418
Range8318
Interquartile range17

Descriptive statistics

Standard deviation914.84
Coef of variation2.1756
Kurtosis72.277
Mean420.51
MAD208.06
Skewness8.6115
Sum8.7278e+07
Variance8.3693e+05
Memory size1.6 MiB
Histogram
ValueCountFrequency (%) 
305 29207 14.1%
 
313 27061 13.0%
 
303 23803 11.5%
 
330 16794 8.1%
 
316 14294 6.9%
 
308 9854 < 0.1%
 
324 8530 < 0.1%
 
301 8462 < 0.1%
 
306 8443 < 0.1%
 
304 6669 < 0.1%
 
Other values (18) 54438 26.2%
 

Minimum 5 values

ValueCountFrequency (%) 
100 9 < 0.1%
 
301 8462 < 0.1%
 
302 4484 < 0.1%
 
303 23803 11.5%
 
304 6669 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
8418 2675 < 0.1%
 
1900 134 < 0.1%
 
390 478 < 0.1%
 
389 2320 < 0.1%
 
362 3680 < 0.1%
 

DATUM_BESTAND
Constant

This variable is constant and should be ignored for analysis

Constant value2019-05-20

GEMIDDELDE_VERKOOPPRIJS
Numeric

Distinct count2836
Unique (%)< 0.1%
Missing (%)17.2%
Missing (n)35702
Infinite (%)0.0%
Infinite (n)0
Mean3431.9
Minimum70
Maximum2.8722e+05
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum70
5-th percentile140
Q1455
Median1215
Q33920
95-th percentile12932
Maximum2.8722e+05
Range2.8715e+05
Interquartile range3465

Descriptive statistics

Standard deviation6617.1
Coef of variation1.9281
Kurtosis201.15
Mean3431.9
MAD3520.3
Skewness8.6127
Sum5.8978e+08
Variance4.3786e+07
Memory size1.6 MiB
Histogram
ValueCountFrequency (%) 
105 1715 < 0.1%
 
160 1640 < 0.1%
 
180 1439 < 0.1%
 
300 1377 < 0.1%
 
115 1066 < 0.1%
 
145 958 < 0.1%
 
140 942 < 0.1%
 
155 879 < 0.1%
 
165 826 < 0.1%
 
540 806 < 0.1%
 
Other values (2825) 160205 77.2%
 
(Missing) 35702 17.2%
 

Minimum 5 values

ValueCountFrequency (%) 
70 226 < 0.1%
 
75 74 < 0.1%
 
80 370 < 0.1%
 
85 685 < 0.1%
 
90 417 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
2.8722e+05 8 < 0.1%
 
1.4754e+05 3 < 0.1%
 
1.2216e+05 4 < 0.1%
 
1.1691e+05 3 < 0.1%
 
1.0857e+05 7 < 0.1%
 

JAAR
Date

Distinct count8
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Minimum2012-01-01 00:00:00
Maximum2019-01-01 00:00:00
Mini histogram
Histogram

PEILDATUM
Constant

This variable is constant and should be ignored for analysis

Constant value2019-05-01

TYPERENDE_DIAGNOSE_CD
Categorical

Distinct count1771
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
101
 
881
402
 
862
301
 
834
Other values (1768)
204978
ValueCountFrequency (%) 
101 881 < 0.1%
 
402 862 < 0.1%
 
301 834 < 0.1%
 
403 820 < 0.1%
 
203 791 < 0.1%
 
201 776 < 0.1%
 
401 708 < 0.1%
 
404 695 < 0.1%
 
409 681 < 0.1%
 
802 670 < 0.1%
 
Other values (1761) 199837 > 99.9%
 
Max length4
Mean length3.3471
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

VERSIE
Constant

This variable is constant and should be ignored for analysis

Constant value1.0

ZORGPRODUCT_CD
Categorical

Distinct count5859
Unique (%)< 0.1%
Missing (%)< 0.1%
Missing (n)2366
990004009
 
1503
990004007
 
1479
990003004
 
1475
Other values (5855)
200732
(Missing)
 
2366
ValueCountFrequency (%) 
990004009 1503 < 0.1%
 
990004007 1479 < 0.1%
 
990003004 1475 < 0.1%
 
990004006 1180 < 0.1%
 
990356076 986 < 0.1%
 
990003007 920 < 0.1%
 
990356073 918 < 0.1%
 
131999228 883 < 0.1%
 
131999164 877 < 0.1%
 
199299013 863 < 0.1%
 
Other values (5848) 194105 93.5%
 
(Missing) 2366 < 0.1%
 
Max length9
Mean length8.9316
Min length3
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Correlations

Pearson matrix
Spearman matrix
Kendall matrix
Phi<sub>k</sub> matrix

Missing values

Matrix

Matrix

Count

Bar

Heatmap

Heatmap

Dendrogram

Dendrogram

Sample

First rows

AANTAL_PAT_PER_DIAGAANTAL_PAT_PER_SPCAANTAL_PAT_PER_ZPDAANTAL_SUBTRAJECT_PER_DIAGAANTAL_SUBTRAJECT_PER_SPCAANTAL_SUBTRAJECT_PER_ZPDBEHANDELEND_SPECIALISME_CDDATUM_BESTANDGEMIDDELDE_VERKOOPPRIJSJAARPEILDATUMTYPERENDE_DIAGNOSE_CDVERSIEZORGPRODUCT_CD
03611187296524250304621523272019-05-2085.02013-01-012019-05-0106131.0990027131
136111872961425030462113272019-05-20NaN2013-01-012019-05-0106131.0990027183
23611187296164250304621163272019-05-20NaN2013-01-012019-05-0106131.0990027179
336111872961425030462113272019-05-202225.02013-01-012019-05-0106131.0990027142
43611187296264250304621263272019-05-20NaN2013-01-012019-05-0106131.0990027180
536111872963425030462133272019-05-20NaN2013-01-012019-05-0106131.0990027178
636111872961425030462113272019-05-202795.02013-01-012019-05-0106131.0990027177
736111872962425030462123272019-05-20NaN2013-01-012019-05-0106131.0990027184
83611187296334250304621333272019-05-20NaN2013-01-012019-05-0106131.0990027182
9361118729613242503046211373272019-05-2019890.02013-01-012019-05-0106131.0990027181

Last rows

AANTAL_PAT_PER_DIAGAANTAL_PAT_PER_SPCAANTAL_PAT_PER_ZPDAANTAL_SUBTRAJECT_PER_DIAGAANTAL_SUBTRAJECT_PER_SPCAANTAL_SUBTRAJECT_PER_ZPDBEHANDELEND_SPECIALISME_CDDATUM_BESTANDGEMIDDELDE_VERKOOPPRIJSJAARPEILDATUMTYPERENDE_DIAGNOSE_CDVERSIEZORGPRODUCT_CD
207545219301061131343633113162019-05-20NaN2018-01-012019-05-0160051.0990816013
20754621930106163313436331663162019-05-20925.02018-01-012019-05-0160051.0990816028
2075472193010611213134363311523162019-05-20370.02018-01-012019-05-0160051.0990816027
20754821930106121313436331243162019-05-20NaN2018-01-012019-05-0160051.0990816007
20754921930106112313436331123162019-05-20NaN2018-01-012019-05-0160051.0990816018
207550219301061231343633133162019-05-20NaN2018-01-012019-05-0160051.0990816006
207551623010613966436331403162019-05-20350.02018-01-012019-05-0178071.0991116028
20755262301061966436331103162019-05-20910.02018-01-012019-05-0178071.0991116029
2075536230106146643633143162019-05-207895.02018-01-012019-05-0178071.0991116014
207554623010611266436331123162019-05-202605.02018-01-012019-05-0178071.0991116006