1 Introduction

Progressive addition lenses (PALs) (also referred to as progressive power lenses) provide presbyopes with clear vision at different distances, with the use of a single ophthalmic lens (Poullain and Cornet 1911). Firstly patented in 1907 (Aves 1907), multifocal ophthalmic lenses have undergone a great deal of evolution (Sullivan and Fowler 1988). When the precursors of modern progressive lenses became commercially available in 1959 (Maitenaz 1969; Volk and Weinberg 1962), their introduction to the market was neither rapid nor widely accepted. Still, over the years, progressive lenses have become the most popular solution for presbyopia.

In general, the fabrication of progressive power lenses leaves inevitably residual surface astigmatism that leads to peripheral distortions (Minkwitz 1963; Esser et al. 2017). How this residual astigmatism is spread across the surface was usually employed to classify PALs into soft (more spread) and hard designs (more concentrated) (Atchison 1987). However, nowadays, these lenses navigate between both classes, diffusing the borders as to whether they belong to one or another category, with manufacturers aiming to obtain the best combination of parameters for each group of individuals.

Besides their popularity and the fact that they do seem to improve wearers’ quality of life (Ahmad Najmee et al. 2017), not everyone gets used to them. Scientific literature is scarce when it comes to gathering knowledge about why few "rookie" wearers suffer in adapting to these lenses or which could be the main factors contributing to this. Some papers mainly refer to spectacles in general and report errors in the prescription as the primary source of discomfort (Bist et al. 2021). However, assuming no errors from the optician, a small group still reports discomfort due to distortions and multifocality. Alvarez et al. (2017) proposed that non-adopters may have a weaker ability to modify their convergence and a reduced rate and magnitude of phoria adaptation. Yet, individuals who reject these lenses report blurred vision, headaches, perceived movement of the peripheral visual field (a.k.a. swim), balance issues and even nausea (Cho et al. 1991; Han et al. 2003), symptoms more likely associated with poor adaptation to novel visual conditions (Alvarez et al. 2009).

Traditionally, as reported by Ogle and Saunders W.B. (1950), one of the most significant handicaps when analysing the tolerance to distortions induced by ophthalmic lenses was the difficulty in separating them from blur (Barbero and González 2020). However, with the computational development of the last decades, it is entirely possible to present an image that is solely distorted (Habtegiorgis et al. 2017; Sauer et al. 2020) or blurred (Sawides et al. 2010; Vinas et al. 2012), helping to understand how our visual system adapts to these changes. Yet, these experiments are somewhat forced, built upon certain constraints, such as specific gazing or a fixed head position, and while they do drive knowledge forward, they often lose sight of the big picture.

On the other hand, one can analyse eye-tracking data while wearing PALs (Hutchings et al. 2007), bringing more natural conditions at the expense of not fully comprehending which features are affecting the visual experience since they can not be separated. Moreover, one can not ignore the potential source of error that arises when fitting PALs. Considering all these factors, virtual reality presents itself like a perfect test bench where one can replicate the effect of distortions and blurring apiece while maintaining the freedom of movement and immersion of virtual environments.

This pilot study aims to test the feasibility of virtual reality to assess, benchmark and rate the discomfort (as visual experience comfort) that four progressive addition lenses with various refractive powers can introduce, given the distortions they present. Subjective grading was performed continuously after finishing a simple task involving navigation, locomotion, scene exploration, and grasping to recognise the subjects’ ability to perceive such simulated distortions and assess their comfort when affected by them. Possible behavioural changes prompted by optical distortions were also sought, and a model was developed to predict what the discomfort might be like without requiring additional measurements.

2 Material and methods

2.1 Subjects and informed consent

A total of 18 naïve participants (9 males/9 females) took part in the study. The participants were aged between 19 and 30 years (mean = 24; SD = 3). None of the subjects presented a prior history of problems using a virtual reality headset or motion issues, nor did they present any refractive error that could have compromised their vision during the experiment or influenced their perception due to the usual wear of their lenses.

2.2 Ethics

The study adhered to the tenets of the Helsinki Declaration (2013) and subsequent amends. The ethics authorisation to perform the measurements was granted by the Ethics Committee at the Medical Faculty of the Eberhard-Karls University and the University Hospital Tübingen with the ID 986/2020BO2. Before data collection, the experiment was explained in detail to the participants, and they provided written informed consent. All data was pseudo-anonymised and stored in full compliance with the principles of the Data Protection Act GDPR 2016/679 of the European Union (The European Parliament and the Council of the European Union 2016).

2.3 Set-up

A cuboidal booth of \(3 \times 3\;{\text{m}}\) defined the physical space around which the participants were entitled to freely move. On every upper corner (4 in total), one HTC Vive lighthouse base station tracker v2 (HTC Cooperation, Xindian, Taiwan) recorded the position and orientation of the headset as well as the controller used. The StarVR One (StarVR, Taipei, Taiwan) head-mounted display was used to present the virtual environment to the participants. This head-mounted display (HMD) has an effective field of view (FoV) of 182\(^{\circ }\) by 99\(^{\circ }\) (Sauer et al. 2022), and an average eye relief (or distance from lens/display to the eye of 18.2mm (15.8 to 23.9mm). The display has a resolution of 2240px \(\times\) 1792px, a maximum luminance of 68.4cd m\(^{-2}\) and the refresh rate fixed at 90Hz. This device presents eye-tracking capabilities through Tobii Pro libraries. For reference, the display area covered by the eye-tracker, as reported by the manufacturer, is plotted in Fig. 1. To provide input and locate the hand position, a standard HTC Vive controller was used.

Fig. 1
figure 1

Area of display resolution of the VR headset (grey) as well as area in pixels covered by the eye tracker (orange) as provided by the manufacturer (colour figure online)

2.3.1 Virtual environment and computer

The virtual environment was constructed using the rendering engine Unity version 2019.4.25.f1 (Unity Technologies, California, USA) and Blender 3.0 (Stichting Blender Foundation, Amsterdam, Netherlands). The experiment was written in C# and the eye-tracking data was recorded using the Tobii Pro (Tobii Pro AB, Danderyd, Sweden) eye-tracking libraries. A view of the virtual reality environment used is depicted in Fig. 2.

Fig. 2
figure 2

View of the virtual environment. The blue square denotes where the instructions were located in the virtual environment, and the rating user interface (UI) was presented. The red circle is located where the cubes need to be placed. Finally, the green star is placed upon the cube spawns’ origin (colour figure online)

The whole experiment ran on a Windows 10 (20H2) PC with an Intel(R) Core i7-10700, 16 GB of RAM and an Nvidia 3080 GPU to ensure no lag or frame-drops were experienced. The FPS were not recorded, but future studies should consider reporting these values.

2.4 The task

After providing informed consent and having the experiment explained in detail, participants had time to ask questions. They then wander around the virtual environment (a replica of a lab room) while wearing the virtual reality headset to acclimate to the scene before the actual assignment started. During the task, participants were asked to position three virtual cubes from a random spawn point in a shelf unit to a designated target location (from point A, marked by a green star, to point B, marked by a red circle) (see Fig. 2). The colours and surface patterns of the cubes, as well as the target’s locations, were varied randomly across trials. Instructions on where the cubes had to be placed were provided on a virtual whiteboard (Blue square) opposite the cubes’ spawn origin. Thus, participants were required to move and look around in the virtual environment during each trial.

2.4.1 Ratings of discomfort

After correctly placing each cube on the target plate, a UI panel with a slider scale popped up in front of the participant. In this panel, the participants were asked to report their level of visual comfort (as how comfortable were they with how they perceive the environment, not to be confused with eye stain, dry eye, itching, irritation or fatigue) on a discrete visual analogue scale from 0 (no discomfort) to 7 (highest discomfort) by placing a cursor using the VR controller as a raycast pointer. Previous to providing this feedback, participants were instructed to rate only based on how the distortions affected their perception of their visual comfort and to not take into account factors such as possible frame drops or the image resolution. Given the rating and confirmation, the trial was completed, and a new trial began.

A total of 25 trials were answered (five per lens condition), with an average duration of a trial of 36s (SD = 12 s, range from 18 to 101 s).

2.5 Lens conditions—distortion simulation

Four progressive additional lenses of the same type, but different prescription power were used. Different addition powers or spherical power prescriptions do modify how distortions are present. The distortions profiles were provided by the manufacturer from a set of lenses with the following parameters fixed: a refractive index of 1.5, a pantoscopic angle of 7.5\(^\circ\), a 5.28 mm base curvature, a back vertex distance of 12 mm and a corridor length of 14 mm. Two of these lenses had non spherical power and addition powers from + 2 and + 3D, and the other two lenses had an addition power + 2D and a spherical refractive correction of + 2 and − 2D. In line with modern PALs, these lenses have their "umbilical line" tilted, with the near point shifted towards the nasal side to account for vergence.

A set of grid points (x, y) on a plane perpendicular to a straight gaze direction is perceived in a distorted position (\(x_{d}\), \(y_{d}\)) when seen through a progressive addition lens. The displacement between (\(x_{d}\), \(y_{d}\)) and (x, y) was pre-computed for each lens using ray tracing based on these lenses’ digital lens surface data. The displacement in the horizontal and vertical coordinates were encoded into a raster image (EXR format) where the red colour channel was used for the shift of the pixel in the horizontal direction and the green colour for the vertical component.

These distortions were then applied pixel-wise to the image displayed in the virtual reality headset through a custom shader in Unity 3D, written in HLSL. The amount of pixel displacement that these lens simulations prompted in visual angle, along with the magnification, skew, aspect or rotation, as seen by the participants, can be observed in Fig. 3.

Fig. 3
figure 3

Representation of how distortions affect a grid and the image observed (background), and maps of pixel displacement in visual angles as observed from the subject’s perspective for the OD (oculus dextra, i.e. right eye), as well as magnification, differences in aspect ratio, skew and rotation calculated from the first derivatives of the distortions

The angular displacement in the visual field as observed was computed using the following formula:

$$\begin{gathered} {\text{Displacement}}{\mkern 1mu} \left( {{\text{V}}_{{({\text{Visual}}{\kern 1pt} {\text{angle}}{\kern 1pt} {\text{in}}{\kern 1pt} {\text{degrees}})}} } \right) \hfill \\ \quad = \cos ^{{ - 1}} \left( {\frac{{x_{d} \times x + y_{d} \times y + 1}}{{\sqrt {(x_{d}^{2} + y_{d}^{2} + 1) \times (x^{2} + y^{2} + 1)} }}} \right); \hfill \\ \end{gathered}$$
(1)

where (x, y) are the coordinates of the original grid points and (\(x_{d}\), \(y_{d}\)) are the location of those points after the distortion is applied.

2.6 Tracking

On every frame, the position coordinates of the head-mounted display (HMD) along with the rotation quaternions and gaze vectors were stored.

2.6.1 Head-tracking

Specific eye and head pose coordination is known to enhance adaptation to progressive lenses(Rifai and Wahl 2016). Thus, head rotation values were recorded and converted using quat2eul into Euler angles following the MATLAB coordinates system for analysis and plotting. The Table 1 summarises the different coordinates systems that are used in the main libraries of this study.

Table 1 Coordinates system references for each software system

On every trial the yaw, pitch and roll were computed using the HMD quaternions as described in the Eq. (2).

$$\begin{gathered} {\text{Tait}}{\mkern 1mu} {\text{Bryan}}{\mkern 1mu} {\text{Angles}}{\mkern 1mu} ({\text{roll}},{\text{pitch}},{\text{yaw}}){\mkern 1mu} \hfill \\ \quad = {\text{quat2eul}}({\text{rot}}W,{\text{ rot}}Z, - {\text{rotX}},{\text{rot}}Y,XYZ); \hfill \\ \end{gathered}$$
(2)

The roll data (tilt of the head) was further characterised with a Von Misses distribution fit (See Eq. 3).

$$\begin{aligned} \begin{aligned} {\mathcal {M}}\,(\theta | \,\mu ,\kappa ) = {\left[ 2\pi I_{0}(\kappa )\right] }^{-1} \exp\{\kappa \, \cos(\theta -\mu )\}; \\ \end{aligned} \end{aligned}$$
(3)

where \(\mu\) is the mean direction, \(I_{0}\) is the bessel function of the first kind, \(\kappa\) is the concentration parameter, and \(\theta\) refers to the angle in radians from \(-\pi\) to \(\pi\).

The pitch (or head inclination up to down) was described using a combination of two Von Misses distributions. And finally, the yaw (or horizontal head rotation) was represented by the fit (See Eq. 4) of two inverse Power Batschelet distributions (Mulder 2019; Mulder et al. 2020), as the data was highly peaked towards the cube spawns origin and the horizontal location of the plate, whiteboard and rating UI panel.

$$\begin{aligned} \begin{aligned} &f_{\mathcal{P}\mathcal{B}}\,(\theta | \,\mu ,\kappa , \lambda ) = \left[ {\mathcal {K}}_{\kappa , \lambda }^{*}\right] ^{-1} \exp\{\kappa \, \cos \, t_{\lambda }^{*} (\theta -\mu )\};\\ & {\mathcal {K}}_{\kappa , \lambda }^{*} = \int _{-\pi }^{\pi } \exp\{\kappa \, \cos \, t_{\lambda }^{*} (\theta -\mu )\} \,d\theta ;\\ & t_{\lambda }^{*} \,(\theta ) = \text{sign}(\theta ) \pi \left( \frac{|\theta |}{\pi }\right) ^{\gamma (\lambda )};\\ &\gamma (\lambda ) = \frac{1- c \lambda }{1+ c \lambda };\\ \end{aligned} \end{aligned}$$
(4)

where c = 0.04082284 as noted by Mulder (2019).

2.6.2 Eye tracking

The eye-tracking data was obtained using the Tobii Pro SDK 1.10 for Unity. The SDK was modified to further record the hit-point of the gaze vector in the world coordinates.

2.6.3 Fixations

While recording the eye-tracking data, on every frame, the combined (head and gaze) origin and direction were used to cast a ray within Unity until it hit a collider in the scene. The coordinates of this hit-point (x, y, z) were later used to detect fixations. If a cluster of continuous hit-points stood within the volume of a sphere of 0.05 m radius, during more than 200 ms (See van der Lans et al. (2011) for a review on average fixations duration while visual search and scene viewing), it was considered a fixation. The amounts of fixations performed per minute and their average duration were recorded on every trial.

2.6.4 Saccades

To estimate saccadic eye movements, the gaze ray from the right eye (in headset relative coordinates) was transformed into polar (\(\theta\)) and azimuthal angles (\(\phi\)). The median angular distance travelled (\(\psi\)) between the original frame vector, and the five subsequent frames was computed following the Eq. (5). Then, the angular speed was estimated as the angular distance per second, on which a Savitzky-Golay filter (Dai et al. 2016) was applied.

$$\begin{gathered} \phi = \arctan \left( {\frac{y}{x}} \right); \hfill \\ \theta = \arccos (z); \hfill \\ \psi = \arccos ((\sin (\theta _{{n + i}} ) \times \sin (\theta _{n} ) \times \cos (\phi _{{n + i}} - \phi _{n} ) \hfill \\ \quad \quad + \cos (\theta _{n} ) \times \cos (\theta _{{n + i}} )); \hfill \\ \end{gathered}$$
(5)

A speed threshold of 50\(^{\circ }\text {s}^{-1}\) and a minimum distance (\(\psi\)) of 1\(^{\circ }\) were defined as thresholds for defining a saccade. The number of saccades per minute, the average amplitude and maximum peak velocity performed on every trial were recorded.

2.6.5 Eye tracking to measure average observed displacement, magnification, aspect, skew and rotation

Every gaze position was projected over the displacement, magnification, skew, rotation, and differences on aspect ratio maps, and the observed absolute and relative ("foveally") pixel displacement, magnification, skew, rotation and aspect ratio parameters were computed by summing the values at gaze position over all the frames of the trial. The mean value and standard deviation per trial were recorded and compared.

3 Analysis and preprocessing of the data

All the statistical analysis and plotting were performed in MATLAB 2020b (Mathworks Inc, Natick, CA, USA) and Python 3.10 with sci-kit learn 1.0.2 (Pedregosa et al. 2011).

4 Results

4.1 Visual experience and comfort

On every trial, the participants rated their visual experience and comfort on a scale from 0 to 7, with zero being normal and seven being the most uncomfortable. Discomfort is, in fact, a subjective parameter that is highly susceptible to vary between subjects and depends on where the participant sets the threshold towards unpleasantness. Given the same distortion and conditions, two subjects can be uncomfortable or not. To standardise our answers and limit the subject’s susceptibility, the ratings were normalised using the ratings where no lens distortion was presented as a baseline. The formula used can be observed in the Eq. (6).

$$\begin{gathered} R_{{\{ i\} }} {\mkern 1mu} = R_{{\{ i\} }} {\mkern 1mu} {\text{orig}} \times \frac{{10}}{7} + 1; \hfill \\ R_{{\{ i\} }} {\mkern 1mu} {\text{norm}} = \frac{{R_{{\{ i\} }} }}{{\bar{R}_{{\{ {\text{baseline}}\} }} }}; \hfill \\ \end{gathered}$$
(6)

The distribution of visual experience and comfort ratings was not parametric in any of the conditions tested. Thus, a non-parametric test (Kruskal–Wallis) was used to compare lens condition discomfort ratings. Significant differences were found between the distributions of visual comfort ratings across conditions (p < 0.01; \({\tilde{\chi }}^2\) = 69). Bonferroni’s Post-Hoc test indicated that the lens distortions from the lens + 2D and addition + 2 were more disturbing than the rest. Likewise, the lens with a spherical power of − 2 and addition + 2 had the smallest discomfort. Figure 4 shows the distribution of ratings and the statistical differences.

Fig. 4
figure 4

Visual experience and comfort ratings’ distribution across conditions, all values have been normalised. The horizontal bars on top indicate significant differences in Post-Hoc analysis

The Cronbach‘s alpha value (\(\rho _{\tau }\)) for the rating scale was found to be statistically consistent (0.88).

4.2 Gender and age

No bias was found in the ratings of discomfort due to gender (Mann–Withney-U; \(N_\text{females}\) = 9 (MdN = 0.29); \(N_\text{males}\) = 9 (MdN = 0.29); U = 51917; z-val = 0.87; p = 0.39).

No correlation was found between the ratings and the age of the participants (Spearman–\(R^{2}\) = 0.0018; p = 0.97).

4.3 Duration

A small correlation (\(R^{2}\) = 0.11, p = 0.02) was found between the duration of the trials and the rating given. In general, subjects seemingly were faster deciding whether to give the highest and the lowest ratings of discomfort. However, subjects may have taken longer times during the trial if the decision was unclear, as they perceived some level of discomfort but not the highest.

4.4 Euler angles (Yaw, pitch and roll)—Head movements

Tables 2, 3 and 4 present the main characteristics of the fitted distributions for pitch, yaw and roll per trial with their mean and standard deviations. The statistical differences of each parameter (individual, lens or rating (rounded to the closest integer)) were computed using the Kruskal-Wallis test, and are also reported in the tables and their fits are illustrated in Fig.  5

Fig. 5
figure 5

Examples of distributions fitted for Yaw, Pitch and Roll. The green line denotes the final distribution fitted, and the dashed red and blue lines indicate the individual distributions (inverse power Batschelet or von Mises, contributing to the final one). The icons on the Yaw plot represent the same locations as in the Fig. 2 (colour figure online)

Table 2 Yaw. Two inverse power Batschelet \({\mathcal {F}}\) PB were fitted
Table 3 Pitch. Two Von Mises (\({\mathcal {M}}\)) were fitted
Table 4 Roll. A single Von Mises distribution (\({\mathcal {M}}\))

None of the parameters above mentioned was significantly different across lens conditions, but the full-width half maximum (FWHM) of the first inverse power Bachelet distribution on yaw. The same FWHM, as well as the mean directions of pitch, and the FWHM of the roll changed across rating groups.

4.5 Gaze behaviour

4.5.1 Fixations

The number of fixations per minute and duration of fixations were significantly different across subjects (Kruskal–Wallis, H(17) = 255, p < 0.001) and (H(17) = 197, p < 0.001). Significant differences were also found in the number of fixations per minute across ratings (KW, H(5) = 25.7, p < 0.001), where the greater discomfort is perceived, the higher amount of fixations are found, with the exception of the group with the highest discomfort (4–5 ratings).

4.5.2 Saccades

The number of saccades per minute, as well as the peak velocity and the average distance, travelled on a saccade, were significantly different across participants (Kruskal-Wallis, H(17) = 269, p < 0.001),(KW, H(17) = 301, p < 0.001) and (KW, H17) = 285, p < 0.001). In the trials with more discomfort, the saccades were on average larger (KW, H(5) = 14, p < 0.05) and had a higher peak velocity (KW, H(5) = 18, p < 0.01). Significantly (KW, H(4) = 9.6, p < 0.05) more saccades were performed when the lens presented had higher amount of distortions.

4.5.3 Observed displacement, magnification, rotation, skew and aspect

All the components that defined the experienced distortion computed from the movement data of each participant and the lens worn were significantly different (KW, H(4) < 434, p < 0.001) for every lens condition tested in their mean and standard deviation. The ranking order for it can be observed in Table 5. Significantly more mean pixel displacement, magnification, differences on aspect ratio, skew, and rotation were observed when the subjects indicated higher discomfort (KW, H(5) < 103, p < 0.001). The standard deviation of all the observed parameters was significantly higher when more discomfort was perceived. No differences across subjects were found for any of the mean values or standard deviation, but rotation standard deviation differences across subjects was close to significant (KW, H(17) = 27, p = 0.057).

Table 5 Matrix draw showing the lens conditions ordered from highest (darker) to lowest (brighter), if the same colour appeared no significant differences were found between these two conditions (colour figure online)

5 Model—gradient boosting regressor

Understanding what features might have influenced the decision for specific discomfort ratings can be a herculean task if deferred to traditional methods. However, thanks to the advances in computational power and machine learning algorithms, it is possible nowadays. Using scikit learn package in Python (Pedregosa et al. 2011), we built a gradient boosting regressor (Friedman 2002), which ensembles several decision trees, using the residuals to fit the next iteration and weighting them to obtain a final model, capable of not only predicting new ratings given the estimators but also providing the relevance of the estimators.

The dataset used comprised a total of 450 trials, and contained age, gender, duration, baseline, maximum and minimum ratings of the subject, absolute mean and standard deviations of the "foveally" observed factors as described before, each of the parameters fitted to pitch, yaw and roll, and the already analysed data from the eye-tracking (i.e. fixations and saccades).

70% of the dataset was used to train the model, and the remaining 30% was only used to test the model’s accuracy, i.e. the data was never shown to the model until the testing phase. The split was performed with stratification on subjects and lens conditions and testing for its robustness. The best hyper-parameters were decided using a bayesian optimisation search from skopt (Head et al. 2020), and a repeated stratified group cross-validation fold within a pipeline.

The best features were selected from the model using the meta-transformer SelectFromModel and their relevance to the model was estimated using the "SHAP" or SHapley Additive exPlanation values (Lundberg and Lee 2017).

Fig. 6
figure 6

Visual experience and comfort ratings’ distribution across lenses, all values have been normalised. Different hues show the training dataset (true values), test dataset and the predictions from the model

Over 400 seeds of random splits were tested to assure the model robustness, the table 6 shows the results of it.

Table 6 Performance metrics from the model

Figure 6 presents the predictions of the model classified for different lenses and compared with the real values. Figure 7 shows the residuals of the model for the train and test dataset.

Fig. 7
figure 7

Residuals of the model depicted using Bengfort and Bilbro (2019 yellowbrick library) for one of the best outcomes of the model

The Fig. 8 presents the "SHAP" values for the selected features of the model.

Fig. 8
figure 8

SHAP values for the selected features of the model. Being those from top to bottom, the minimum rating given by the participant, the baseline rating, the standard deviation of observed skew and aspect on the trial, the fitting concentration parameter describing the first curve of yaw, the standard deviation of the observed magnification effect, the full-width half maximum describing the second curve of yaw and the standard deviation of the observed pixel displacement

6 Discussion

In this study distortions of PALs were simulated in Virtual Reality. Simulations of different lenses holding various degrees of distortions were perceived by the participants as having different levels of vexation, being those that presented a higher level of distortions experienced as more uncomfortable. Specifically, the lens with spherical power +2D and addition +2D was rated the worst of all, followed by those without spherical power and addition +3D and +2D. One would expect the baseline (i.e. no distortions) to be rated as the most comfortable, but nevertheless, the lens distortions due to the negative spherical power and the +2 addition presented levels of discomfort equal to the baseline. A sum of several factors could have contributed to this result. On the one hand, these lens distortions have low levels of displacement in the central area, as can be seen in Fig. 3, which could explain similar ratings to the baseline. We also cannot be entirely sure that the manufacturer has fully corrected in the software the distortions caused by the VR headset lenses, and considering this headset uses a canted display, magnification from negative lenses may have made the image look more natural.

The subjective input scale was consistent given the Cronbach’s index.

Although no correlation was found between the age of the subjects and the ratings given, it is important to mention that we have a reduced sample of young participants with a small variance. It can not be discarded that ratings might be different if an older group was tested, including presbyopes. Moreover, all of our subjects were emmetropes with an aim to avoid bias in our reduce sample, but usual ophthalmic wearers might like more the distortions that look alike to those they are habituated to.

Subjects’ heading and gazing presented a significant variance across individuals, independently of the distortions applied, which is expected as it is known that these parameters are tied to each individual. Only one of the inverse power Batschelet distributions fitted for yaw varied significantly between the lenses and ratings, with a wider "+ 2/Add2" distribution, implying the more distortions, the less accurate our horizontal head movement is. The statistical results comparing against clusters of ratings should be taken with care. Although non-parametric tests were used, the number of measurements on the worse/higher ratings (4–5, after normalising) were quite low. For example, some parameters of yaw and roll were significantly different across rating groups, but no clear pattern was found, and no additional differences were found in the post-hoc analysis.

Regarding gaze movements, fewer fixations per minute lead to better ratings, this was not found across lens conditions, where there were no statistical differences, but + 2/Add2 presented more fixations per minute. In this case, a pattern could be observed from the ratings, where the more fixations are made per minute, the more distress is perceived, and only in the 4–5 groups it was disrupted, probably due to the reduced sample of this groups. Thus, more distortions may have led to more fixations, and the more one fixates, the more one perceives those distortions, giving them worse ratings. Furthermore, more saccades were performed in the textures with greater distortions, and ratings followed the same pattern but did not reach significance. Otherwise, the worse ratings presented shorter saccadic lengths and slower peak velocities in the saccades, but, in this case, nothing could be found for the different lens conditions. In general, gazing behaviour is highly tied to each individual, like the heading movements. However, more saccades and more fixations led to higher rates of discomfort, although that might have been coupled with the lens in use and future studies would need to clarify this.

Generally, the lens condition dictated how much of the mean and standard deviation of displacement, magnification, rotation, differences in the aspect ratio, and skew were observed, a factor which influenced the final rating. However, no differences across subjects were found, probably because even if they gaze differently, the area they cover and locations they gaze through were pretty similar.

6.1 Limitations of the study

There are several limiting factors that may have constrained the study’s outcome. One of the main hiccups of the data was the reduced amount of high/worse discomfort ratings (see Fig. 7), precluding the drawing of some conclusions and probably limiting the model’s accuracy. To gather more data with high ratings, the sample size must be increased by presenting more trials with higher distortions lenses, more lens conditions with a high amount of distortions, or simply measuring more subjects. In fact, including more lenses, with not only different prescription powers but also different designs may provide more information on individual characteristics and their influence on discomfort.

Another limitation of our study is that despite measuring a baseline to correct for the potential confounding factor of discomfort caused by simulated distortions versus HMD related discomfort (?), it remains challenging to distinguish between the two. Therefore, caution should be exercised when interpreting our absolute discomfort rating values as they may have been influenced by this confounding factor.

In this study, only "static distortions" were measured, i.e. the distortions of the PALs were only measured on the lens surface and presented as if looking through the main line of sight. Although the perceived distortions varied when looking through different areas, the distortions applied through the custom shader did not update with the gaze. Presenting "dynamic distortions" entails having previously measured and stored a texture of how each pixel shifts on the texture at each possible gaze point. Considering the FoV reported for this HMD, one would need roughly 18000 measurements per lens to do it on every degree of the visual field. This task becomes unfeasible unless a model is built that, from lesser measurements, interpolates the rest of the textures.

As already mentioned, the lenses used by the HMD present several distortions that must be corrected through software. Therefore, the image presented on display is already warped to compensate for the lenses. It sets a limitation as only the manufacturer usually knows how well it is corrected. Additionally, this compensation only takes one fixed pupil position (’static distortions’), and only a recent paper (Chan et al. 2022) has started to look at how dynamic distortions affect the whole VR experience and discomfort by inducing unintended optic flow.

Beyond distortions, progressive lenses present different blurring across the visual field, usually requiring a change in gaze behaviour to avoid blurry vision, which can additionally influence the comfort while wearing these types of lenses. In fact, the visual system is thought to be more tolerant to distortions than it is to blur (Barbero and Portilla 2015). Nevertheless, a future study should assess the discomfort that these lenses present due to blurring alone and perhaps a combination of both.

6.2 The model

A gradient boosting regressor model was built. Although the model’s accuracy on a single prediction is not outstanding, the model does behave as expected for the different lenses. It is capable of predicting ratings well within ranges defined for each lens condition. As mentioned in the limitations, the model could be further enhanced if more data was acquired, mostly to support the high discomfort ratings.

Although SHAP values (Lundberg and Lee 2017) were calculated to look for how model features act on predictions of discomfort, these values should be taken with care, especially when trying to understand decision tree models. The minimum rating given by the subject can be found as a main contributor. This could have occurred because the model may have learned to identify the subject from the minimum rating given, hence putting all individual preference weights into that decision, i.e. basing its decisions on its idea of which subject was estimating, even though this information was not explicitly given. Other factors contributing to the predictions were the baseline values reported by the subject, which also could have served to acknowledge the participant, the SD of the observed skew, aspect ratio differences, magnification and a minor role of displacement. The concentration parameter of the first distribution fitted in yaw also contributed, which might be an indirect consequence of how straight was the head relative to a gazed target, upon a yaw movement, due to the different lenses.

It is not strange that skew was found to be a contributing factor to discomfort while "wearing" PALs. In fact, other studies already connected skew and adaptation (Habtegiorgis et al. 2017; Rifai et al. 2020). Magnification was also previously connected to the so-called swim effect, which relates to the illusory and variable seesaw-like movement of the visual field that originates with lateral head movements. This is further perceived with the nonuniform optical magnification effects between near and far objects (Han et al. 2003). Magnification in the adaptation to spectacles was also mentioned to induce changes in the vestibular ocular reflex and discomfort (Cannon et al. 1985), which might induce more discomfort in VR (Chang et al. 2020).

7 Conclusions

Similar to how using free-form technologies impacted the PALs market, reducing stock, costs, and expanding the fitting possibilities (Alonso et al. 2019), virtual reality holds the potential to become a testing bench for acceptance of new optical consumer solutions and thus to decrease the innovation costs of developing such lenses. VR combines the naturality of performing a daily task with the ability to tweak specific conditions which might not be possible in the real world.

This is the first study of this type that simulates different progressive addition lens distortions, in virtual reality (up to a certain degree) and measures how participants subjectively perceive them. Besides the high inter-subject variability, every participant perceived different levels of discomfort for different amounts of distortions. Other traits were found beyond subjects’ susceptibility or the distortions quantity defining the discomfort level that one can perceive with a specific lens in certain conditions, such as certain gazing or heading behaviours.

Finally, a machine learning model may help reduce the amount of testing required and help to understand what features contribute the most or why this sensation of discomfort does appear for some but not everyone. However, an ampler amount of data is required to obtain a more applicable model.