Introduction

Ear, nose and throat (ENT) training varies greatly both at the resident, fellow and post-certification level [1, 2]. However, it is challenging to critically evaluate and compare training programmes across nations as relevant studies rely entirely on trainee self-reports, without any objective comparators between groups. Furthermore, there is no evidence that directly compares the theoretical or practical attainment of trainees between nations.

An understanding of global training variation is important to ensure that practice is up-to-date as otolaryngology has changed greatly over the years. Since Hopkins’ patent on his first endoscope in the 1960s, surgery of the nose, sinuses and skull base has rapidly evolved [3]. The educational enterprise that supports trainee growth and development has, therefore, changed to accommodate such changes in clinical practice [4, 5]. However, there is still incomplete regulation and standardization of training—particularly notable at the fellowship level [6].

Furthermore, a number of training programmes have faced challenges to delivering adequate training in Otolaryngology [7•]. The European Working Time Directive has recently restricted surgical residents’ working hours to an average of 48 h per week. The effect of these restrictions has been to decrease the operative exposure of UK ENT residents by up to 38% [8]. Exposure to surgical sessions significantly vary even in developed countries. Residents in Germany reported difficulty in obtaining operative experience, which was felt to be allocated unfairly based on the approval of senior doctors. A 2015 survey-based study cites this reason for a disparity in trainees’ reports of their surgical competence between Germany and France [1,2,7•]. Historically, surgical training has been “on the job”, with surgeons and surgical trainees learning on live patients in the operating theatre and, when possible, through anatomic laboratory dissection. Recently, questions have been raised about the ethical foundation of exclusive reliance on intraoperative training [9]. In areas outside of medicine such as aviation, simulation has been successfully implemented. Repetitive practice of a well-defined task and feedback allow for an accelerated and ultimately safer learning curve [10]. In order to confront these challenges to cross-specialty surgical training, simulation training has gained increasing popularity both in terms of improving technical and non-technical skills [11].

The aim of this review was to examine existing evidence about training simulators in rhinology, their incorporation into real training programmes and the translation of these skills into operating room. The first part focuses on virtual and augmented reality simulators. The second part describes the role of physical (i.e. non-computer-based) models of endoscopic sinus surgery including animal models [12, 13].

Definition and Requirements of Rhinological Training Models

Successful performance in the operative environment requires surgeons to correctly execute a variety of technical and non-technical skills in parallel. These skills can be simulated either in isolation or concurrently and, in order to adapt to the level of behaviour being modelled in a particular simulation, an understanding of the level of behaviour being modelled is helpful. One model that has been previously utilized to analyse simulation models involves the breaking down of models into three domains: skill-based, rule-based and knowledge-based behaviours [14]. Furthermore, surgeons will proceed through stages in their learning that will likely take them through a cognitive stage where they learn to understand a task through an integrative stage where deliberate practice and feedback take place. The final stage is that of automation whereby little cognitive input is required and performance is refined [10].

The ideal endoscopic sinus surgery (ESS) training model would encompass these relevant behaviours and stages whilst being realistic, interactive and low-cost with high repeatability and feedback. Virtual reality (VR) models struggle to attain some of these goals due to their inherent cost and lack of realism in terms of the type of haptic feedback they provide. Therefore, alternative models are attractive ways to fill these gaps in simulation training whilst reducing costs [15].

Simulators can be assessed and compared by using standard measures of validity which can encompass various combinations of the three behaviours described above:

  • Face validity: Experts subjectively decide whether a model tests what it is designed to measure

  • Content validity: Experts subjectively decide whether the logical steps and skills used in the procedure are representative of the modelled task

  • Construct validity: An objective test that is able to discriminate between novices and experts

  • Concurrent validity: An objective test that compares the new paradigm with the gold standard

  • Discriminate validity: A subset of construct validity that discriminates objectively between candidates with similar experience rather than comparing experts to novices

  • Predictive validity: An objective test of whether performance on the task predicts performance in the operative environment [16]

An ideal model for a training or fellowship programme would score highly on all of the above measures whilst maintaining an attainable cost. This is a challenging goal that is worthy of pursuit given that simulation training has been shown to be a cost-effective option in other domains [15]. The different models outlined below tend to target different aspects of training which are reflected in the types of validity that each model attempts to attain and the behaviours that each model addresses (Table 1).

Table 1 Validation studies of VR simulators

Virtual Reality Simulators

Simulation and VR has become an essential part of training in medicine over the last 40 years [17, 18, 19••, 20]. ENT has been a very active field in which to implement these advancing technologies. In particular, tools for temporal bone surgery have been described in over 200 studies, as this anatomical location shows great complexity and variability [18,19••,21]. VR training for functional endoscopic sinus surgery might be appropriate in rhinology [22].

Wiet et al. reported their collaboration in develo** a VR system that provides interaction with volume data, by employing real-time volume rendering and haptic feedback [23]. The first endoscopic sinus surgery VR simulators studies, authors used a computer-generated 3D model of patients’ anatomy from CT or MR scans [24, 25]. The interaction between the anatomical simulations and applied instruments was archived through electromagnetic tracking systems. However, tactile feedback could was not achieved and acoustic signals were used when critical structures were injured [25]. Rudman et al. used cryo-sections to reconstruct relevant anatomy for their model. In a head model, electromagnetic forces were used to give haptic feedback to the user, who was asked to navigate the endoscope through the nasal cavity, as well as to perform surgical procedures like ethmoidectomy [24]. Another application of VR in rhinologic training is the establishment of a web-based resource. This would especially help residents in countries where specialists are rare [7•,26].

Madigan Endoscopic Sinus Surgery Simulator

From all available simulators in the early 2000s, the Lockheed Martin endoscopic sinus surgery simulator (ES3) became the most studied. The ES3 runs on a Silicon Graphics workstation, a personal computer-based haptic controller and an electromechanical haptic interface, with a model of an endoscope, a handle for surgical tools and a head model. The system provides haptic feedback, reproducing resistance and vibration on the surgical instrument’s handle. The ES3 can simulate bleeding, and image deterioration simulating a dirty lens. The simulator provides different instruments for each dissection step to reproduce a standard endoscopic sinus surgery procedure [27].

Arora et al. studied fundamental perceptual, visuospatial and psychomotor abilities in a group of students and otolaryngology residents (n = 38) using ES3, a minimally invasive surgical trainer VR (MISTVR), a pictorial surface orientation (PicSOr) and 3 visuospatial tests (cube comparison, card rotation and map planning) to validate the reliability of simulators. The scores of ES3 correlated well with previously validated measures. Hence, the ES3 was shown as reliable tool in rhinology training [28]. To identify anatomical structures after training with either medium, Solyar et al. tested 8 subjects with the ES3 against 7 others learning from textbooks. The ES3 group achieved higher scores in less time [29]. Fried et al. then translated ES3 training to the operating theatre, comparing a group of simulator-trained residents to residents who only had access to “conventional” learning material, where the latter was not specified. The first live surgery was recorded and evaluated by blinded expert surgeons. The results showed better outcomes for the ES3 group as regards time to completion of task (minutes), case difficulty, tool manipulation, tissue respect, task completion rate, surgical confidence (10-point scale) and number of errors. Errors were defined as dirty scope, wandering scope, failed attempt in orientation, lack of perspective in instrument view, collision, improper injection (of local anaesthesia), mucosal injury and bloody field [7•,30].

Dextroscope

The Dextroscope was developed in Singapore in 2003 and it is a commercial FDA approved simulator developed by volume interactions. It facilitates reconstruction of images from a patient’s computed tomographic scan. The workstation features stereoscopic glasses, a stylus and a control handle (a joystick). The Dextroscope does not use a model head. The joystick operates the endoscope and the stylus operates the microdebrider or drill. The 3D graphics reflect anatomical shapes; however, force feedback is lacking, as are physiological data such as injection blanching or bleeding. The model is rather stiff, preventing gentle displacement of certain structures. The Dextroscope simulator did not improve operating room performance and was poorly rated by the 3 participating surgeons in the study by Caversaccio et al. [31].

Flinders Sinus Surgery Simulator

Flinders sinus surgery simulator (FSSS) is a promising prototype that was developed at Flinders University. It is a high-resolution VR haptic simulator that utilises phantom haptic devices to mimic endoscopic sinus surgery [32]. FSSS has been validated for teaching sinus anatomy and nasendoscopy to medical students, interns and residents. However, FSSS requires further development before it can be regarded as a valid tool for more advanced surgical training [33]. Moreover, it is not currently available for purchase and is not being used as part of a training programme.

McGill Simulator for Endoscopic Sinus Surgery

In 2014, the most recent VR simulator for FESS, the McGill simulator, was introduced. Additional tools applied include sinus washing and the simulated use of microdebriders. Moreover, the system can calculate the amount of tissues/structures removed by the microdebrider [34]. Despite promising results of previous studies for applicability and validity, Dharmawadana et al. validated four different simulators for four different groups from students to registrars. Conclusion was that VR simulators are helpful for the initial steps of surgical training, but would not be ready for more advanced skills as required by registrars [7•,33].

Augmented Reality

There is often some confusion in understanding the difference between augmented reality (AR), VR and their different uses. AR uses technology to superimpose information (such as images and sounds) on the world we see—it adds to normal reality. VR creates a computer-generated environment for the user to interact with and be fully immersed in—it replaces normal reality. However, the newest AR software uses the features of both technologies to provide a better understanding of sinus anatomy which is probably the most important prerequisite for endoscopic sinus surgery.

A fundamental benefit of AR technology is the ability for surgeons to review preoperative imaging, “map out” a surgical approach and view this information during an operation. From an educational perspective, AR can benefit surgical trainees through platforms in anatomy education, simulations and intraoperative guidance [35•].

Preoperative Planning and Education

The most basic example of preoperative AR is the fusion of MRI scans and pictures of the patient’s sagittal photograph using the Android/iPhone overlay application. This technique was used to depict the lesion’s surface projection onto the skin of the head [36].

Advanced AR planning software allows the transformation of the sinus anatomy into 3-D models based on reconstruction from slices of a CT scan which is then combined with the second annotated scan highlighting the area of interest (tumour, frontal sinus outflow, vascular structure). Precise alignment of virtual datasets is critical. In the case of removing a tumour, for example, the surgeon would be able to not only visualize it but also move around it to better-understand relationships to local anatomical structures [37]. The experience can be maximized with 3-D glasses which enhance the perception of depth or the construct can be 3D-printed into a real model. A literature review did not identify applications of AR in head and neck anatomy education; however, previous studies suggested that 3D learning may enhance spatial awareness and improve learning contextualization [38]. This was supported by a large-scale study by Kugelmann et al. of 880 first-year medical students showing the impact of incorporating AR into a traditional anatomy course. The results showed increased motivation and engagement with enhanced student understanding of anatomy [39].

Intraoperative Guidance

Intraoperative navigation systems in endoscopic sinus surgery have emerged as a critical tool in the last decades [35•]. In some ways, surgical navigation systems are similar to the GPS (global positioning system) used by car drivers: both in providing step-by-step directions and showing users their precise location. However, a rhinologist in the operating room faces a challenge to transform the three-dimensional surgical field (3D) to two-dimensional (2D) sections on the navigation screen. The surgeon is, therefore, confronted with the mental task of integrating these two image data sets. AR technology might be a way to overcome this problem. The virtual data for intraoperative navigation can be simply informative (such as textual or numerical values relevant to what is under observation) or consisting of three-dimensional virtual objects inserted within the real environment in spatially defined positions.

There are several systems offering this unique feature but Scopis Hybrid Navigation (Scopis, GmbH, Berlin, Germany) has been pushing the boundaries of AR in endoscopic sinus surgery (ESS). The current software offers a novel preoperative planning phase, known as Building Blocks for frontal recess anatomy (as proposed by Wormald) [40]. Furthermore, this navigation system incorporates technology for the calibration of standard endoscopic images with CT scan-derived models, a variation of an approach originally proposed as image-enhanced endoscopy. Intraoperatively, the AR views provided useful information before starting the dissection, as the AR permitted the surgeon to “see” the relevant structures without formally removing the uncinated process or any other tissue (Fig. 1). This may have important implications for balloon catheter dilatation of sinus ostia. IGS with AR may also function as a “target avoidance” mechanism. AR may mark structures so that their location is clear, before they are exposed during surgery. Figure 2 shows this concept of ‘anti-targeting’ by illustrating the presence of the optic nerve without direct exposure of the optic nerve. The internal carotid artery could be highlighted in a similar fashion. Currently, Scopis Hybrid Navigation displays all images in 2D representations, not a holographic 3D image [41, 42].

Fig. 1
figure 1

Intraoperatively, the AR views provided useful information before starting the dissection, as the AR permitted the surgeon to “see” the relevant structures without formally removing the uncinated process or any other tissue. (With permission from Stryker Instruments, Portage, MI)

Fig. 2
figure 2

Concept of ‘anti-targeting’ is shown by illustrating the presence of the optic nerve without direct exposure of the optic nerve. (With permission from Stryker Instruments, Portage, MI)

AR in sinus surgery is still a novel navigational system; therefore, there are only few studies documenting its efficacy in this context [37, 41, 42]. Due to close proximity to dangerous structures, ESS in early years of training can be associated with increased mental demand, effort and frustration. AR seems to reduce the task workload for trainees performing ESS and may be a valuable intraoperative teaching aid [43]. In skull base surgery, AR overlays of both endoscope trajectory and neurovascular anatomy were highly valuable in reaching the sellar floor safely in revision procedures, with no additional operative time or hardware setup required [44, 45]. One of the limitations may be a single head-up display which can distract the user and cause blindness. This can be corrected by a sub-monitor [46•].

Another type of AR is the Google Glass (GG) which offers a quite different experience. GG is essentially a smartphone in the form of a pair of conventional spectacles with built-in small screen which allows hands-free access to electronic information through voice command [47, 48]. Its connectivity via WiFi makes GG potentially useful for the needs of telemedicine [49]. The main advantages intraoperatively are facilitating access to electronic medical literature including instructional videos or patient records for a quick reference, connection to the operating theatre’s information system to monitor a patient’s vital signs and speech-to-text dictation such as major intraoperative findings. Moreover, GG can be used as a tool for telemedicine where the members of the surgical team or other specialties could communicate, hence GG may facilitate intraoperative decision making [50].

Although GG represents a novel breakthrough technology in the field of surgery, several limitations remain. This includes a short battery life and poor image quality (screen resolution is 640 × 360 pixels) [51,52,53]. In telemedicine, the use of GG was limited by network delay and also line of sight where the operating surgeon had to tilt the head unnaturally to keep the image on the display so the surgeon B could see the surgical field [50].

Conclusion

Piromchai et al. showed in their meta-analysis that the VR simulators (temporal bone, sinonasal) could be implemented as an adjunct in training programmes but cannot replace conventional methods. This is mainly because the actual surgical outcome following VR training has not been studied in any great depth. This is the primary focus of future larger scale studies [19••]. A survey among participants of ESS courses showed that tasks related to spatial orientation are judged as the hardest, whereas manual tasks are considered less difficult. This suggests that simulators will not necessarily require haptic feedback to train the most important knowledge and skills needed for FESS [54]. The more affordable alternatives might be 3D printed anatomical models [7•,55,56,57].

AR seems to be a promising platform with growing number of applications in preoperative planning, intraoperative navigation and education. However, significant limitations remain, namely lack of robust validation and issues with registration error that prevent the widespread use of AR in clinical practice. Despite that, AR technology is still in its relative infancy, and these limitations will likely improve as research continues to develop.