Insights Into Disability and Disability Progression in People With Multiple Sclerosis Using Large-Scale Healthcare Data
Funding: Christiane Gasperi received funding from the German Federal Ministry of Education and Research (BMBF-161L0216).
ABSTRACT
Background
Identifying predictors for disability progression is crucial for managing multiple sclerosis (MS). This study aims to explore levels of disability and informative factors for disability progression in people with MS (PwMS) using healthcare data without detailed clinical information.
Methods
We conducted a case–control/cohort study on data from Bavaria's largest health insurance organization. The dataset included records of assistive devices, nursing care, sick leaves, rehabilitation, drug therapies, and diagnoses for individuals with MS, Crohn's disease (CD), rheumatoid arthritis (RA), and controls (CTR) without these diseases. We used generalized linear models to compare healthcare service utilization between MS and other cohorts. A gradient-boosting algorithm identified informative healthcare-related factors associated with disability progression in PwMS, defined by increased nursing care utilization.
Results
PwMS (N = 11,961) demonstrated higher healthcare utilization than CD (N = 21,884), RA (N = 105,450), and CTR (N = 82,677) groups, even at young ages. Besides expected risk factors like age, smoking, diabetes, and psychiatric disorders, the prediction algorithm revealed that PwMS with specific gynecological disorders, upper tract infections, asthma, and thyroiditis were less likely to need higher levels of nursing care.
Conclusions
Leveraging healthcare data allows for an objective assessment of disability in PwMS and can identify informative factors for disability progression. Our approach can be applied to studies on disease progression in large cohorts without detailed clinical data and can be adapted to other diseases, disability measures, and healthcare systems. Higher utilization of healthcare resources even at young ages revealed an unmet need for improved treatment and management strategies for young adults with MS.
1 Introduction
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system and the most common non-traumatic cause of acquired disability in young adults [1]. The clinical course is diverse, from nearly asymptomatic to highly active forms with rapid disability progression. Predicting the disease course as early as possible is crucial [2], and the search for predictors of disease progression is a vital question in MS research.
Several approaches using genomic, proteomic, or clinical information have been proposed to predict the disease progression of people with MS (PwMS) [3-7]. Collecting longitudinal data for these models is challenging; they are often sampled non-periodically, and not all clinical measurements are performed at each patient visit. Therefore, such methods usually suffer from a limited number of individuals and missing information [8].
Most prediction algorithms use clinical measurement methods to depict the MS progression [3-7]. Quantitative measurement methods like the Expanded Disability Status Scale [9] (EDSS) have several limitations [10]. They require detailed clinical data, which is often unavailable, can be subjective, and are affected by the individual's daily condition. These challenges lead us to find alternative ways to determine disability progression.
For individuals with increasing disability, we can expect increasing utilization rates for assistive devices and remedies (e.g., walking aids, wheelchairs, physiotherapy), as well as more rehabilitation and sick leave requirements, or the establishment of nursing care. This study aimed to investigate whether disability progression of PwMS can be predicted using healthcare data.
Here, we first focused on revealing differences regarding healthcare factors between MS and other autoimmune diseases that affect young adults, Crohn's disease (CD) and rheumatoid arthritis (RA), and a control group (CTR) with none of these diseases. Following that, we utilized a gradient-boosting-based machine learning algorithm to identify informative healthcare factors in the disability progression of PwMS.
2 Methods
2.1 Data
Data from Allgemeine Ortskrankenkasse (AOK) Bayern, Bavaria's foremost health insurer, covers more than 35% of the population. Raw data include records for drug therapies, assistive devices, remedies for physiotherapy and ergotherapy (Table S1), nursing care levels, rehabilitation, sick leave reports, and ICD-10 (The International Statistical Classification of Diseases and Related Health Problems, 10th revision [11]) codes (Table S2) for 18,263 individuals with MS, 22827 individuals with CD, 113921 individuals with RA, and 111,246 controls, collected quarterly between 2010 and 2020.
We categorized DMTs by effectiveness as moderate, high, and very high efficacy DMTs [12] (Table S3). We included the following assistive devices in our analyses: mobility-related aids (walking aids, ambulances/vehicles, mobility aids, care aids for more independent living/mobility), incontinence-related aids, nursing items, visual aids, toilet aids, care aids to facilitate care, care aids for body care/hygiene, and for alleviating complaints. We used the residential region (urban or rural) information [13] in our prediction model, determined from the provided partial postal codes.
In Germany, individuals receive nursing care at varying levels depending on their health-related needs, if they last at least 6 months (Table S4). The nursing care levels range from one (minor impairments of independence) to five (heaviest impairment of autonomy) [14] (Table S5). We compared nursing care levels across different cohorts and used nursing care levels of PwMS to define disability progression as the outcome of the prediction model.
2.2 Cohort Definition, Data Filtering, and Cleaning
We used ICD-10 codes G35 for MS, K50 for CD, and M06 for RA to define the cohorts. We included individuals diagnosed with the respective code at least twice in an ambulatory care unit in different quarterly periods or at least once in inpatient care. We applied thorough data filtering and cleaning, considering reliability and specialist information for diagnoses, possible demyelinating diseases in CTR, insurance status, and observation duration. The details of the filtering process can be found in Supporting Information. The final dataset included 11,961 individuals with MS, 21,884 with CD, 105,450 with RA, and 82,677 controls (Table 1).
Phenotype | Number of patients | Age (mean [sd]) | Sex (Female) | Observation in days (mean [sd]) |
---|---|---|---|---|
Multiple sclerosis | 11,961 | 49.30 [14.00] | 69.67% | 3318 [745] |
Crohn's disease | 21,884 | 51.91 [17.64] | 56.48% | 3226 [839] |
Rheumatoid arthritis | 105,450 | 69.46 [15.31] | 69.58% | 3244 [819] |
Controls | 82,677 | 52.28 [15.90] | 70.13% | 3041 [1007] |
2.3 Generalized Linear Models for Statistical Comparisons of MS Versus Control Cohorts
We utilized generalized linear models (GLMs) to analyze differences in healthcare utilization between MS, CD, RA, and CTR. For binary outcomes, we utilized quasibinomial distributions, and for count data, we used negative binomial distributions. We used the quasibinomial distribution to address overdispersion in binary outcomes, providing a better fit to the data by adjusting the observed variability [15, 16].
Using the model coefficients obtained from the GLMs constructed for each healthcare parameter either using quasibinomial or negative binomial distributions, we calculated odds ratios (ORs) and corresponding 95% confidence intervals (CIs) and obtained significance levels (p-values) of each covariate. We used MS as the reference phenotype in the models. Consequently, odds ratios lower than one for CD, RA, and CTR indicate higher odds of observing the relevant outcome parameters in MS, while odds ratios higher than one indicate lower odds of observing the relevant outcome parameters in MS. To increase the interpretability, we report the inverse odds ratios (1/OR), where higher values represent higher odds of observing the relevant outcome parameters in MS. For each model, we report Bonferroni-adjusted p-values in cases involving multiple statistical comparisons. To analyze sick leave, we performed GLM analysis on a subset of individuals within active working ages (ages 25 to 55), as we could not accurately determine individuals' unemployment or retirement status. We also conducted separate analyses for different age groups (20–29, …,70–79, 80+) for each outcome.
2.4 Prediction Model
2.4.1 Feature Set Used in Prediction Model
We employed longitudinal healthcare records to predict disability progression defined by a nursing care level increase. In our training model, we utilized a 3-year patient trajectory and quarterly records with missing information of up to 30% of the observation period (see Supporting Information for the details). Table 2 lists all 1119 features included in our model.
Feature | Variable type | Values |
---|---|---|
Sex | Categorical | 1 = female, 2 = male |
Age at given quarter | Continuous | [20,107] |
Frequency of receiving rehabilitation | Continuous | ≥ 0 |
Duration of rehabilitation treatment in days | Continuous | ≥ 0 |
Number of doctor visits made at the given quarter | Continuous | ≥ 0 |
Year of the observation for the given quarter | Categorical | {2010, …,2019} |
Order of the observation for the given quarter | Categorical | {1 = earliest record, …, 12 = latest record} |
MS subtype: Last known diagnosis of an individual at the given quarter | Categorical | (1 = Unspecified, 2 = RRMS, 3 = SPMS, 4 = PPMS) |
DMT level: Level of DMT prescribed to an individual at the given quarter (Table S3) | Categorical | {1 = None, 2 = Moderate Efficacy, 3 = High Efficacy, 4 = Very high efficacy} |
Residential region: Information about the last known residential region of an individual | Categorical | {1 = Urban, 2 = Rural} |
Usage of an assistive device group at the given quarter: mobility-related aids, incontinence-related aids, nursing items, visual aids, toilet aids, care aids to facilitate care, care aids for body care/hygiene and for alleviating complaints | Binary | Yes, No |
Diagnosis of a disease at the given quarter. List of ICD-10 Codes can be found in Table S2 | Binary | Yes, No |
Usage of a physiotherapy or ergotherapy remedy. The list of physiotherapy and ergotherapy treatments can be found in Table S1 | Binary | Yes, No |
- Abbreviations: DMT, disease modifying therapy; ICD-10, The International Statistical Classification of Diseases and Related Health Problems, 10th revision; MS, multiple sclerosis; PPMS, primary-progressive MS; RRMS, relapsing–remitting MS; SPMS, secondary-progressive MS.
2.4.2 Prediction Task Definition
Our model predicts the disability progression of PwMS 2 years after the observation period by taking 3 years of healthcare records of PwMS, denoted as , and the disability progression of those individuals 2 years after the last observation, denoted as , as input (Figure S1). We classified individuals who required nursing care at level two or lower at the end of the observation period () but required nursing care at level three or higher 2 years after as individuals with disability progression, and individuals who still required nursing care at level two or lower at as individuals without disability progression. We defined as follows: If an individual required nursing care at level three or higher, we picked the last quarter before reaching level three for the first time as the end of the observation period, which ensures that individual was also receiving nursing care level three or higher after 2 years. If not, we picked 2 years before the last observation as . We excluded individuals with nursing care level three or higher during the observation period. The resulting dataset included 8360 PwMS, 687 of whom showed an increase in the nursing care level.
2.4.3 Learning Algorithm
We trained our prediction model using a gradient-boosting tree-based learning algorithm, LightGBM [17], implemented via LightGBM's R package (v3.2.2) [18].
2.4.4 Experimental Settings
Data include multiple records per individual, deviating from the independent and identically distributed (IID) structure due to correlated instances. We split the non-IID structured data as follows: We divided the dataset into training and test groups by randomly picking 80% of the individuals for training and 20% for the test set. By applying a person-level splitting procedure, we kept all samples belonging to the same individual either in training or test sets. We tried to keep the ratio between individuals who showed disability progression and those who did not the same in both sets. We repeated this process 50 times to obtain more robust predictive results and tuned the hyperparameters using 4-fold cross-validation on the training set, keeping all samples of an individual in the same fold. We tuned and updated one hyperparameter at a time by selecting the best value for the average area under the precision-recall curve (AUPR) through cross-validation. The hyperparameters and their search range are listed in Table S6.
We reported the predictive performance on the test partition over 50 replications using the area under the receiver operating characteristic (AUROC) and the AUPR curves, F1, and Average Balanced Accuracy metrics. We compared our predictive performance with: (i) a LightGBM algorithm trained on data including the same individuals with feature columns created for each time point, preserving IID structure since it includes one sample per individual; and (ii) GPBoost [19], a LightGBM-based Gaussian process and mixed effects model proposed for longitudinal data with non-IID structure. The GPBoost algorithm used the same data as our approach, and both algorithms were trained using the same experimental settings.
We calculated the contribution of each feature to the prediction by utilizing Shapley values [20] obtained through the model trained on our entire dataset. Details on performance metrics and the Shapley method can be found in the Supporting Information.
3 Results
3.1 Healthcare Needs of Individuals With MS Compared to Those With CD, RA, and Controls
PwMS showed a significantly higher need for assistive devices, nursing care, remedies, rehabilitative treatments, and sick leaves than all control groups (Figure 1). Utilization of all healthcare parameters was considerably higher in MS for all age groups, including young individuals (Table S7).

The odds of using assistive devices were significantly higher in MS compared to CD, RA, and CTR, with overall usage being 7.69, 10, and 14.29 times higher, respectively. MS also showed three to five times higher odds for different types of assistive devices across all cohorts. Specifically, for ages 30–39, MS had 1.8 and 2.7 times higher odds than CD and CTR, respectively (Figure S2). Usage of mobility, incontinence, and care-related aids was notably higher in MS across almost all age groups (Figure S3, Table S8). In the 30–39 age group, the odds of using mobility aids were 1.85 and 2.56 times higher for MS than CD and CTR, but similar to RA (1/OR = 1.1, 95% CI = [0.94, 1.28]). Odds for incontinence aids were three, six, and eight times higher in MS than in CD (1/OR = 3.33), RA (1/OR = 5.88), and CTR (1/OR = 8.33), respectively.
Nursing care utilization was more frequent in MS than CD (1/OR = 10), RA (1/OR = 11.11), and CTR (1/OR = 14.29) (Figure 1). For ages 30–39, MS showed three to six times higher odds compared to CD (1/OR = 5.6), RA (1/OR = 3.3), and CTR (1/OR = 5.6) (Figure S4). This difference was consistent across all nursing care levels, especially for higher levels (Figure S5). For ages 30–39, the odds of requiring level two nursing care were 8.13, 13.33, and 4.27 times higher for MS than CTR, CD, and RA, respectively (Table S9). After age 40, the odds of receiving nursing care remained significantly higher for MS across all levels compared to other cohorts.
The odds of participating in remedies were higher in MS compared to CD (1/OR = 4.35), RA (1/OR = 3.23), and CTR (1/OR = 5.88) (Figure 1). For ages 30–39, the odds of receiving physiotherapy were 2.27, 1.28, and 3.70 times higher for PwMS compared to CD, RA, and CTR (Figure S6A). Specifically, between ages 30 and 39, the odds of receiving ergotherapy were approximately 5.3, 2, and 14.3 times higher in MS than in CD, RA, and CTR, respectively. However, there was no significant difference compared to RA after adjusting for multiple testing (adjusted p-value = 1) (Figure S6B).
The odds of receiving rehabilitative treatments were also higher in MS than in CD (1/OR = 1.75), RA (1/OR = 1.43), and CTR (1/OR = 2.70) (Figure 1). Furthermore, the odds of a longer duration of rehabilitation were 2, 1.69, and 3.03 times higher in MS compared to CD, RA, and CTR, respectively.
While the odds of receiving sick leave were similar for CD compared to PwMS (1/OR = 1.03, 95% CI = [0.97, 1.09]) and were slightly higher for CTR (1/OR = 0.93), these odds were lower for the individuals with RA (1/OR = 1.89) compared to PwMS (Figure 1). However, the odds of receiving longer sick leaves were higher for PwMS compared to individuals with CD (1/OR = 1.32), RA (1/OR = 2.17), and CTR (1/OR = 1.47).
3.2 Informative Healthcare Factors in Predicting Disease Progression of PwMS
Our algorithm predicted the need for higher levels of nursing care after 2 years using the healthcare data with 0.94 mean AUROC and 0.71 mean AUPR values, outperforming baseline performance levels (i.e., AUROC = 0.5 and AUPR = 0.082 for the random classifier). Mean F1 and the Average Balanced Accuracy values are 0.52 and 0.87, respectively, indicating better predictions compared to random classifiers. Figure S7 and Table S10 summarize the predictive performance over 50 replications.
Our approach (LGB Non-IID in Figure 2) obtained better or comparable results against the IID version of our algorithm (LGB IID) and the LightGBM-based Gaussian process and mixed effects model (GPBoost). Our approach obtained significantly higher AUROC and AUPR values on the test data than both algorithms and demonstrated stronger generalization than LGB IID, with similar generalization to GPBoost, as evidenced by smaller AUROC and AUPR differences between training and test sets (Figure 2).

Figure 3 illustrates the top 40 model features ranked by mean absolute Shapley values, reflecting their relative contribution to the model. The full list of the feature contributions is given in Table S11 (details on Shapley values in Experimental Settings and Supporting Information).

Unsurprisingly, age was among the most significant risk factors affecting nursing care levels. Also, individuals diagnosed with secondary progressive MS (SPMS) or primary progressive MS (PPMS) were more likely to switch to nursing care level three or higher within 2 years. Mobility-related aids and high-efficacy DMTs were among the most crucial informative factors, while male sex was also a strong predictor for higher nursing care levels (Figure 3).
We found that ICD-10 codes linked to a higher risk of disability progression included nicotine dependence (F17), type 2 diabetes mellitus (E11), mental and anxiety disorders (F06, F41), as well as scoliosis, spondylosis, and disc disorders (M41, M47, and M51). Additionally, individuals with unspecified pain (R52) were more likely to switch to a higher nursing care level.
Several ICD-10 codes were linked to a lower risk of switching to higher nursing care levels, including bronchial asthma (J45) and thyroiditis (E06). Interestingly, two gynecological disorders—menopausal and other perimenopausal disorders (N95), and other noninflammatory disorders of the vagina (N89)—and two infections of the upper respiratory tract-acute upper respiratory infections (J06) and bronchitis (J40)—were also associated with a lower risk of needing a higher nursing care level.
4 Discussion
We analyzed healthcare service usage of PwMS in Bavaria, focusing on assistive devices, remedies, nursing care, medical and rehabilitative treatments, incapacity for work, and diagnostic records. We compared PwMS to a control cohort as well as to individuals with either CD or RA, two other chronic autoimmune diseases. While their disease characteristics and trajectories differ, all three conditions are associated with disability accumulation and often require the use of varying healthcare factors [21-24]. We investigated whether healthcare data can be used to predict disability progression and identify informative factors for disability progression in PwMS.
Analyses showed that although CD and RA can manifest at an early age and cause disability leading to high healthcare utilization [21, 22], PwMS need healthcare services more likely than CD and RA in almost all age groups. Even in young age groups, a relevant proportion of PwMS showed increased healthcare usage. To our knowledge, studies presenting differences between MS and other chronic autoimmune diseases regarding healthcare utilization are rare. The prominent use of healthcare factors among PwMS, even at an early age, emphasizes the need to diagnose and treat PwMS as early as possible to prevent disease progression causing disability.
Using nursing care to indicate disability is uncommon, as most studies focus on EDSS progression or progression to SPMS. We chose nursing care level two as the predictive threshold since level three exhibits severe independence impairment, often requiring extensive mobility device usage. In contrast, individuals at care level two can still move independently with assistive devices.
Since we introduced a new indicator for disease progression, a direct comparison with existing MS progression prediction algorithms was not possible. However, those algorithms reported AUROC and AUPR levels similar to ours, indicating that disability progression of PwMS can also be predicted without detailed clinical/omic data. Previous models often used genomic, proteomic, and detailed clinical data [3-7]. These are typically unavailable for large cohorts or suffer from high missingness and unstandardized recording. Assessing and predicting disability progression using healthcare data allows for analyzing large cohorts from different care settings without detailed clinical information.
Age, progressive MS, mobility and incontinence aids, physiotherapy, and very high efficacy DMTs are known risk factors for disease progression or associated with acquired disability [25-30]. Our findings demonstrate that changes in nursing care requirements can indicate future disability progression as our model also identified those factors as informative. Additionally, —among other comorbidities—nicotine dependence, type 2 diabetes mellitus, and psychiatric comorbidities were associated with a higher nursing care level, aligning with studies linking these conditions to increased disability in PwMS [31-33].
Interestingly, we found ICD-10 codes associated with a lower risk of needing a higher nursing care level, including two gynecological disorders—menopausal and other perimenopausal disorders and noninflammatory disorders of the vagina. Using a different dataset from Bavaria, we previously showed that gynecological disorders, including noninflammatory diseases of the female genital tract [34] were linked to reduced MS risk. Using different data and approaches, we found these disorders are both associated with lower MS risk and lower disability progression in PwMS. Our results hint at a relationship between gynecological disorders and MS susceptibility or the disease course that has yet to be understood.
Additionally, acute upper respiratory infections (J06) and bronchitis (J40) had a risk-lowering association with disability progression. Our previous study indicated that different upper respiratory tract infections were less frequently observed in PwMS before disease onset [35], possibly. These findings may reflect an interaction between upper respiratory infections and immunomodulatory treatments increasing infection risk. They may also indicate a true negative association between specific infections and MS risk and disease progression, warranting further investigation to better understand these associations.
Lastly, bronchial asthma and thyroiditis were associated with a reduced risk of increasing disability. Although these comorbidities were found to be more common in PwMS compared to the general population [36, 37], they were not linked to a higher risk of disability in MS [38, 39]. It was previously described that MS tends to be less severe in individuals with such comorbidities [40, 41], indicating the necessity for further analysis of the association between lower disability progression risks in PwMS and these conditions.
Missing data is typical in longitudinal healthcare records due to irregular medical visits and incomplete insurance records. We addressed this limitation using a learning algorithm capable of handling such gaps. Diagnoses were not externally audited, and even with secured MS diagnoses by neurologists, we observed conflicting records. To mitigate this, we only included individuals with at least two cohort-defining diagnoses, picked secured diagnoses when available, and excluded individuals with conflicting records. Additionally, the timing of first diagnoses was missing in the data, complicating analyses of pre- and post-diagnosis differences and the effects of early DMT use on progression. Including sick leave data in our model yielded conflicting findings (i.e., higher likelihood of disability progression in PwMS with low sick leave days), likely because unemployed individuals do not receive sick leave reports. Due to the absence of employment status information, we excluded incapacity for work data from our prediction model.
In Germany, over 85% of the population has statutory health insurance, while the remainder is privately insured. AOK, the largest statutory insurer, covers approximately 35% of Bavaria's population and predominantly serves individuals with lower socioeconomic status and greater healthcare needs than those with private or other statutory health insurers [42, 43]. While our dataset enables robust analyses within this significant subgroup, we acknowledge that these differences may influence healthcare utilization, necessitating further investigation using multi-insurer datasets for broader generalizability.
Overall, we demonstrated a higher utilization of healthcare resources like assistive devices, remedies, nursing care, rehabilitation, and sick leaves in PwMS compared not only to the general population but also to individuals with other chronic autoimmune diseases affecting young adults. Increased healthcare system utilization in young adults highlights the unmet need for the treatment of young adults with MS. We showed that healthcare data can identify informative factors for disability progression defined as increasing nursing care levels, offering an objective approach to assess disability and disease progression in PwMS. This approach can be applied to studies on disease progression in large cohorts where no detailed clinical data is available and to other diseases, disability measures, and healthcare systems with minor modifications. In addition to expected factors, we identified novel informative factors like gynecological disorders, upper respiratory tract infections, bronchial asthma, and thyroiditis requiring further validation and investigation.
Author Contributions
Onur Dereli: conceptualization, formal analysis, methodology, visualization, writing – original draft, writing – review and editing. Jochen Behringer: writing – review and editing, data curation, resources. Achim Berthele: writing – review and editing. Alexander Hapfelmeier: conceptualization, writing – review and editing, methodology. Bernhard Hemmer: writing – review and editing. Christiane Gasperi: conceptualization, funding acquisition, writing – original draft, writing – review and editing, supervision.
Acknowledgements
Christiane Gasperi received funding from the German Federal Ministry of Education and Research (BMBF—161L0216). Open Access funding enabled and organized by Projekt DEAL.
Consent
There was no need for written informed consent from participants.
Conflicts of Interest
Onur Dereli, Jochen Behringer, and Alexander Hapfelmeier report no disclosures relevant to the manuscript. Achim Berthele receives funding from the German Federal Ministry of Education and Research (BMBF; grant 01ZZ2102B). He has received consulting and/or speaker fees from Alexion, Biogen, Horizon, Novartis, Roche, and Sandoz/Hexal, and his institution has received compensation for clinical trials from Alexion, Biogen, Merck, Novartis, Roche, and Sanofi Genzyme; all outside the present work. Bernhard Hemmer has served on scientific advisory boards for Novartis; he has served as a DMSC member for AllergyCare, Sandoz, Polpharma, Biocon, and TG therapeutics; his institution received research grants from Roche for multiple sclerosis research. He has received honoraria for counseling (Gerson Lehrmann Group). He holds part of two patents: one for the detection of antibodies against KIR4.1 in a subpopulation of patients with multiple sclerosis and one for genetic determinants of neutralizing antibodies to interferon. All conflicts are not relevant to the topic of the study. He is associated with DIFUTURE (Data Integration for Future Medicine) [BMBF 01ZZ1804[A-I]]. He received funding for the study from the European Union's Horizon 2020 Research and Innovation Program [grant MultipleMS, EU RIA 733161] and the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy within the framework of the Munich Cluster for Systems Neurology [EXC 2145 SyNergy—ID 390857198]. Christiane Gasperi received funding from the BMBF, the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), the Hertie Foundation, and the Hans and Klementia Langmatz Stiftung.
Open Research
Data Availability Statement
This study analyzed anonymous healthcare claims data held by AOK Bayern. Approval by an ethics standards committee on human experimentation was not needed according to the Guidelines and Recommendations for Good Practice of Secondary Data Analysis [44]. Data protection regulations prohibit the open distribution of the underlying data. Approval was obtained from the AOK's data protection officer.