Transmission Efficiency Limit for Nonlocal Metalenses
Abstract
The rapidly advancing capabilities in nanophotonic design are enabling complex functionalities limited mainly by physical bounds. The efficiency of transmission is a major consideration, but its ultimate limit remains unknown for most systems. This study introduces a matrix formalism that puts a fundamental bound on the channel-averaged transmission efficiency of any passive multi-channel optical system based only on energy conservation and the desired functionality, independent of the interior structure and material composition. Applying this formalism to diffraction-limited nonlocal metalenses with a wide field of view shows that the transmission efficiency must decrease with the numerical aperture for the commonly adopted designs with equal entrance and output aperture diameters. It also shows that reducing the size of the entrance aperture can raise the efficiency bound. This study reveals a fundamental limit on the transmission efficiency as well as provides guidance for the design of high-efficiency multi-channel optical systems.
1 Introduction
Over the past decade, nanophotonic design and fabrication became more and more advanced. Oftentimes, what limits the device's performance is no longer fabrication constraints or the cleverness of the design, but fundamental physical bounds. Furthermore, the design process typically requires time-consuming development, simulation, and optimization. It is invaluable to know beforehand what the fundamental bounds are and how they are related to the design choices.[1] Such knowledge can significantly reduce the time spent in blind explorations and also point to better design choices that are not otherwise obvious. Of particular interest are multi-channel optical systems, such as metalenses[2-6] with a wide field of view (FOV). Their large angular diversity puts a bound on the performance and spatial footprint, such as bandwidth,[7] resolution,[8] and thickness.[9] Meanwhile, another major consideration of metalenses is the efficiency. For example, one would like the incident wave from each angle to be focused to the corresponding focal spot with unity efficiency. But is a uniformly perfect efficiency across all angles compatible with the angle dependence in the desired response? What is the highest efficiency allowed by fundamental laws?
Here, we introduce a matrix-based formalism that sets a fundamental bound on the channel-averaged transmission efficiency of any passive linear multi-channel system given its functionality. Then we apply it to diffraction-limited lenses as an example. The bound applies to any design that realizes the target functionality, so it is applicable not only to metasurfaces but also to conventional refractive and diffractive optical systems.
Metalenses are compact lenses made with metasurfaces, which show great potential for thinner and lighter imaging systems with performances comparable to or exceeding conventional lenses.[2-6] Metalenses designed from a library of unit cells have limited focusing efficiency, which can be overcome by more flexible designs.[10, 11] Inverse design,[10-18] grating averaging technique[19], and stitching separately designed sections together[20-22] are effective approaches. However, achieving high focusing efficiency at large numerical aperture (NA) remains difficult, as all such “local” metasurfaces have limited deflection efficiency at large angles.[23, 24] Since local metasurfaces have a spatial impulse response close to a delta function, they provide the same response for different incident angles, so they are also limited in their angular FOV.[9]
Nonlocal metalenses with tailored interactions between adjacent building blocks (i.e., the spatial impulse response is extended beyond a delta function) can overcome the limited angular diversity of local metalenses to enable diffraction-limited focusing over a large FOV.[9, 25, 26] Their strong nonlocality can be realized by the interaction among adjacent meta-atoms[27] or the excitation of guided resonances that travel across the device over large distances.[28, 29] Nonlocal metalenses based on doublets[30, 31] or aperture stops[32, 33] can support focusing efficiencies higher than 50% over a wide FOV, but with NA lower than 0.5. Multi-layer structures obtained from inverse design have achieved diffraction-limited focusing with NA = 0.7 over FOV = 80°, but the averaged focusing efficiency is only about 25%.[34] However, there was no guidance on the efficiency bound of these nonlocal metalenses.
From the desired response of a multi-channel optical system, we can write down its transmission matrix that relates the input to the output. Here, we rigorously bound the channel-averaged transmission efficiency using the singular values of the transmission matrix and the fact that the transmitted energy must not exceed the input energy. For commonly adopted designs with equal entrance and output apertures, we find the transmission efficiency bound of a nonlocal metalens to drop with the NA. We also find that reducing the entrance aperture size can raise the efficiency bound to close to unity.
2 Results
2.1 Nonlocality
As schematically illustrated in Figure 1a, subwavelength-thick local metalenses (such as metalenses with the hyperbolic phase profile[35]) perform well only over a limited input angular range. Nonlocal metalenses can achieve diffraction-limited focusing over a much wider FOV but need a minimal thickness to provide the nonlocality.[9, 36] Since a large FOV requires an angle-dependent response (i.e., angular dispersion), a spatially localized incident wave must spread as it propagates through the metalens under space-angle Fourier transform. Thus, more angular diversity necessitates more nonlocality. As shown in Figure S1 (Supporting Information), nonlocal effects become important when the FOV is larger than a threshold. For FOV above such a threshold, a sufficient thickness is needed for light to spread and create nonlocality, so the input diameter Din and output diameter Dout can be different [Figure 1b]. The different aperture sizes provide an additional degree of freedom compared to local metalenses with subwavelength thicknesses (for which intrinsically).

2.2 Transmission Matrix from the Target Response
Fourier transforming the output field of the ideal metalens in Equation (2) from y to yields the transmission matrix ; see Section S2 (Supporting Information) for details. Such transmission matrix applies to any lens with diffraction-limited focusing over this FOV. The squared amplitude of the transmission matrix for a metalens with , and FOV = 140° is shown in Figure 1c.
For an ideal lens with no vignetting, here we choose the amplitude in Equation (2) such that the transmission efficiency is the same for all incident angles within the FOV. However, an overall amplitude prefactor of the transmission matrix that determines this is yet to be specified. Larger transmission efficiency is desired. But as shown below, setting for all a will violate energy conservation because doing so leads to more transmitted flux than the incident flux for some non-plane-wave incident wavefronts, which is not allowed in a passive system. Next, we will derive a bound on the transmission efficiency for systems that yield the desired angle-dependent response while satisfying energy conservation.
The paragraphs above consider metalenses. One can similarly write down the desired transmission matrix for other multi-channel optical systems of interest.
2.3 Transmission Efficiency Bound
For any system of interest, the transmission efficiency averaged over inputs within the prescribed FOV is . To obtain an upper bound on without specifying the overall amplitude prefactor of the transmission matrix, we consider the singular values of the transmission matrix.[39] With the singular value decomposition, the transmission matrix is factorized into orthonormal inputs and outputs as , where each right-singular column vector is a normalized incident wavefront being a linear superposition of the input channels. The corresponding transmitted wavefront is , with a transmission efficiency of and with the normalized transmitted wavefront being column vector . Per min-max theorem, and are the highest-possible and lowest-possible transmission efficiency among all possible input states (singular or not). Figure 1d shows the normalized singular values of the transmission matrix in Figure 1c. Since tr() = equals the sum of the eigenvalues of matrix , we have . The obstacle is that the overall amplitude prefactor of is not known.
Given any angle-dependent response of interest, one can write down the corresponding transmission matrix (without specifying the overall amplitude prefactor), evaluate its Nin singular values and the associated Neff, and the efficiency bound follows as . Equation (4) is rigorously derived, with the only assumption being passivity, . Therefore, this bound is fundamental and applies to any passive linear optical system for which the desired response is specified. Unlike the thickness bound of References [9, 36], Equation (4) does not rely on any empirical relation or any heuristic argument.
A bound is useful only when it is sufficiently tight. Figure S2 (Supporting Information) shows from full-wave numerical simulations[41] that Equation (4) indeed provides a reasonably tight upper bound for the averaged transmission efficiency , considering hyperbolic and quadratic metalenses as examples.
Figure 2a shows the transmission efficiency bound as a function of NA for aberration-free nonlocal metalenses with FOV larger than the threshold shown in Figure S1 (Supporting Information). In general, both the FOV and the output diameter Dout can affect the transmission efficiency. For the metalens example considered here, Figures S3 and S4 (Supporting Information) show that they happen to have a small influence on , so in Figure 2a we map out how the efficiency bound depends on the NA while averaging over FOV and Dout. We see that using equal entrance and output diameters, i.e., , results in an efficiency bound that drops approximately as . This bound applies regardless of how complicated or optimized the design is, what materials are used, and how thick the device is.

In Figure 2b,c, we show the distribution (among different FOV and output diameters) of the transmission efficiency bound with NA = 0.7 and 0.9, considering 120 combinations of different FOV and Dout for each NA. The bound is consistent with the inverse design results of Reference [34], where the achieved average absolute focusing efficiency (considering only the transmitted power within three full-widths at half-maximum around the focal peak) is 25% for a nonlocal metalens with NA = 0.7 and FOV = 80°.
In general, the transmission efficiency limit can depend on the details of the target response. In Figure S5 (Supporting Information), we show the efficiency bound for telecentric lenses where only positions within an effective output diameter are used for focusing, which is comparable to the non-telecentric bound when the FOV is small. In Figure S6 (Supporting Information), we consider non-telecentric systems described by Equation (2) and show that the choice of the angle-dependent amplitude has little effect on the efficiency bound.
2.4 Optimal Aperture Size
In addition to establishing a transmission efficiency limit, it would be even more useful to know what strategy one may adopt to raise such an efficiency limit. To this end, we examine the singular values of the transmission matrix. In Figure 1d where and NA = 0.8, we see there are many zero singular values. These zero singular values lower Neff and the transmission efficiency. Removing these zero singular values (i.e., eliminating non-transmitting wavefronts) can raise the transmission efficiency bound . A zero singular value corresponds to a superposition of the columns of the transmission matrix that yields a zero vector, meaning those columns are linearly dependent. Therefore, we can eliminate zero singular values by reducing the number of columns in the transmission matrix. Because the input wave vectors are sampled with momentum spacing at the Nyquist rate, we expect that reducing Din can lower the number of input columns in the transmission matrix to raise the transmission efficiency bound. Figure 3 shows this strategy indeed works: reducing the input aperture size Din increases the efficiency bound .

The next question is: what would be an optimal input diameter Din to use? While reducing Din raises the transmission efficiency bound, doing so also reduces the amount of light that can enter the metalens, which is not desirable. To find a balance, we examine the condition number κ, defined as the ratio between the maximal and the minimal singular values. When zero exist, the condition number κ diverges to infinity (subject to numerical precision). The right-axis of Figure 3 shows that κ abruptly shoots up by many orders of magnitude when the input diameter Din raises above a threshold that we label as (black dotted line). When , κ is of order unity, all singular values are comparable with no zero-transmission wavefronts, so we have , and the transmission efficiency bound is close to unity. When , near-zero-transmission wavefronts start to appear, which results in a fast reduction of the transmission efficiency bound. Therefore, this threshold value is an optimal input diameter to use, providing maximal entrance flux while keeping a near-unity transmission efficiency bound.
To automate the determination of , we examine the slope , which transitions from near zero to a very large number at . Table S1 (Supporting Information) shows that different threshold values for yield almost identical and . Note that the choice of the global phase in Equation (1) does not influence and the transmission efficiency bound, as shown in Figure S7 (Supporting Information).

To demonstrate the increased transmission efficiency bound, we also show in Figure 2a–c the transmission efficiency bound when the input aperture size Din is set to the optimal in Equation (5). We observe a large transmission efficiency bound that overcomes the limit when equal entrance and output apertures are used.
3 Discussion
While we use the efficiency of high-NA nonlocal metalenses as the example here, the formalism we introduce is very general and applies to any passive linear optical system. Once the target response is specified, this formalism can be used to put a bound on the channel-averaged transmission efficiency. The bound holds for any structural design and material composition, including not only metasurfaces but also volumetric nanophotonic structures and conventional optical systems.
We note that even though the transmission efficiency limit here depends only on the target functionality and not on the device thickness, this does not mean that thin and thick structures will perform similarly. Thin structures have a limited degree of nonlocality, which may not be sufficient for realizing the target angle-dependent response of interest[9]—not even with a low transmission efficiency. The efficiency bound here and the thickness bound of Reference [9] are independent constraints.
In establishing a target-driven thickness bound, Reference [9] had to use an intuition-based and empirically established relation between lateral spreading and thickness, and Reference [36] had to use a heuristic relation between channel number and thickness and the estimation of a maximal internal angle that is not clearly defined in the presence of wave scattering. In contrast, the efficiency bound here is rigorously derived with no heuristics or empirical assumption and is clearly defined.
When the input diameter is above its threshold value , the efficiency loss may be associated with vignetting but is not necessarily so. We also note that even an ideal system with 100% transmission efficiency for all incident angles will still experience a natural falloff of the illumination flux density governed by the law due to projection,[42] which is distinct from the transmission efficiency limit here.
Nonlocal metasurfaces open up a wide range of applications and tailored angle-dependent responses that are impossible for traditional local metasurfaces.[26] The efficiency bound in this work provides valuable guidance for this rapidly evolving field. For example, this work suggests that inverse-designed multi-layer wide-FOV metalenses such as in Reference [34] may benefit from expanding the size of each successive layer.
Acknowledgements
The authors thank X. Gao for helpful discussions. This work was supported by the National Science Foundation CAREER award (ECCS-2146021) and the Sony Research Award Program.
Conflict of Interest
The authors declare no conflict of interest.
Open Research
Data Availability Statement
All data needed to evaluate the conclusions in this study are presented in the paper and in the supplemental document.