Linearization: Geometric, Complex, and Conditional
Abstract
Lie symmetry analysis provides a systematic method of obtaining exact solutions of nonlinear (systems of) differential equations, whether partial or ordinary. Of special interest is the procedure that Lie developed to transform scalar nonlinear second-order ordinary differential equations to linear form. Not much work was done in this direction to start with, but recently there have been various developments. Here, first the original work of Lie (and the early developments on it), and then more recent developments based on geometry and complex analysis, apart from Lie’s own method of algebra (namely, Lie group theory), are reviewed. It is relevant to mention that much of the work is not linearization but uses the base of linearization.
1. Introduction
Symmetry has not only been one of the criteria of aesthetics and beauty but has repeatedly proved extremely useful. It lies at the base of the geometry of the Greeks and is at the base of modern developments in high energy physics and in gravity. It was used by Evariste Galois in 1830 [1] for proving that quartic equations are solvable by means of radicals but that it is impossible to canonically solve higher order polynomial equations by means of radicals. This led to the concept of groups. The groups used are now called Galois groups. Lie wanted to extend the approach of Galois to deal with differential equations (DEs). Of course, this is a vastly more ambitious programme. Apart from the order of the DEs there are ordinary DEs (ODEs) and partial DEs (PDEs); scalar DEs and vector DEs; initial and boundary conditions to be satisfied. Worse follows; while polynomial equations generically have at most as many solutions as their order, DEs have infinitely many. For ODEs the infinity is tamed because there are arbitrary parameters (constants) that appear, and they are as many as the order of the ODEs. However, they remain untamed for PDEs. To extend the use of symmetry to differential equations, Lie (1880/83/91) had to extend from finite groups to continuously infinite groups that could be (at least twice) differentiated [2–5]. These are now called Lie groups.
One method Lie adopted was a generalization of the methods for some specific first-order ODEs, changing them to linear form by using an invertible transformation of the dependent and independent variables. He showed that all order ODEs can be transformed to linear form by such transformations. He then obtained general criteria for such transformations to exist for second-order ODEs. Such transformations are called point transformations, and the transformed equation is said to be linearized. Equations so transformed are said to be linearizable. Lie proved that the necessary and sufficient condition for a scalar nonlinear ODE to be linearizable is that it must have 8 Lie point symmetries. He exploited the fact that all scalar linear second-order ODEs are equivalent under point transformations; that is every linearizable scalar ODE is reducible to the free particle equation. He showed that the ODE had to be cubically semilinear and was able to state criteria that the coefficients must satisfy for the equation to be linearizable. These consisted of a set of four consistency conditions for the four coefficients of the first-derivative terms, involving two auxiliary functions. In 1894 Tresse [6] eliminated the auxiliary functions and reduced them to two conditions. It turns out that all second-order ODEs are transformable to linear form by contact transformations (that involve firstderivatives of the dependent variables as well). We will not be concerned with them here.
Lie took this approach no further but considerably later (1937/40) Chern [7, 8] extended the analysis to a class of scalar third-order ODEs by using contact transformations. It was not till much later (1996/97) that the same results were obtained using Lie’s classical method by Grebot [9, 10]. In 1990 Mahomed and Leach [11] proved that for mth-order ODEs, m ≥ 3, there was no unique class of linearizable ODEs. Instead there were three equivalence classes with m + 1, m + 2, or m + 4 infinitesimal symmetry generators. Notice that whereas for second-order there are 8 symmetry generators as the maximal number, for third-order there are only 7, but even with 5 or 4 generators the ODE may be linearizable. However, there was no procedure provided to linearize them. The general third-order linearizable ODEs were dealt with only recently (2002, 2005) by Neut and Petitot [12] and independently by Ibragimov and Meleshko [13]. Though the procedure followed was the same as that used by Lie, the calculations become much more complicated, and algebraic computation was needed. The calculation for order 4 is even more complicated and was achieved by Ibragimov et al. [14] only in 2008. Though very useful, they did not provide means of actually solving the equations.
All the work mentioned so far has been for scalar ODEs. Nothing has been said about systems of ODEs. In 1988 systems of two second-order ODEs, linearizable to constant coefficient systems, were proved to have three equivalence classes [15] with 7, 8, or 15 dimensional Lie algebras. This result was extended to the general case in 2000 by Wafo Soh and Mahomed [16], giving 5, 6, 7, 8, or 15 dimensional Lie algebras, and was further extended by them in 2001 to m ≥ 2 second-order ODEs [17], giving m + 3, m + 4,…,2(m + 2) or (m + 1)(m + 2)/2 dimensional Lie algebras. Thus there are a total of m + 3 classes. (Notice that the formula would not hold for m = 1.) However, once again, there was no procedure provided to obtain the linearizing transformations, the linearized equations, or the solutions in general. While very useful as existence theorems, the methods gave little practical development beyond the general symmetry analysis methods developed by Lie himself.
This is as far as the earlier developments using the classical procedures went. The next step came from the attempt to connect the symmetries of ODEs with the symmetries of geometry. The connection was found by considering geodesic equations, independently by Aminova and Aminov [18, 19] in 2000/06 and by Ferozeet al.[20] in 2006. This connection proved very fruitful. The geodesic equations are systems of second-order ODEs, so it dealt with systems. It had been noted in passing by the former authors that the system is linearizable if the space is flat. Independently it was taken much further in 2007 by Mahomed and Qadir [21], who considered a second-order system of ODEs of geodesic type and found that the consistency conditions for the system to be linearizable are to treat the coefficients of the system of ODEs as if they were Christoffel symbols and require that the curvature tensor constructed from them be zero. By projecting the equations down one dimension, using the invariance of the geodesics under translations of the geodetic parameter, one obtains a system of cubically semilinear ODEs that are linearizable if they are obtainable by projection from a system of geodesic-type equations in a flat space [22]. The projection procedure had been discussed by Aminova and Aminov, but its usefulness for linearization had not been noticed by them. There is some further work on linearization that follows from the algebraic linearization and group classification of Lie and from the geometric developments mentioned [23–25], that I will not be going into here. However, these works would be well worth to be followed up.
Very recently it was noted [26, 27] that whereas Lie had used complex DEs for complex functions, he had not explicitly used their analyticity in real terms. The fact that the equations are in the complex domain is very relevant for geometric purposes, as the topology of the manifold is changed by going from the real to the complex. In fact, as pointed out by Penrose [28], the topology becomes simpler. Thus for example, if a point is removed from the real line the manifold is broken into two pieces. Hence, if we want to construct a Lie group under multiplication from it we are unable to do so (as there is no inverse for the element 0 available). However, for the complex “line”, taking out a single point leaves a multiply connected space. We can then make a Lie group under multiplication from it by leaving out the complex number 0. This makes the complex domain for Lie groups very important for differential geometry, but the corresponding aspect for DEs had not been exploited. It was realized that the dependent variables must be analytic for a DE. As such, when considering the scalar complex DEs broken into a system of real DEs, we have to include the Cauchy-Riemann equations (CREs) in the system. This will change the symmetry structure of the system substantially, thus leading to many unforeseen consequences.
There have been many new developments using geometry and complex analysis, with linearization proper and with methods developed that do not give linearization but use it to get solutions. The main thrust of this paper is to discuss these recent developments. We limit ourselves to point transformations and do not discuss the developments involving contact or higher order transformations or discuss the developments for PDEs, except in so far as the new methods give some results for them. The plan of the paper is as follows: in the next section we provide some preliminaries giving the notation and terminology used for symmetry analysis of systems of ODEs. In the subsequent section the original method of Lie and the algebraic methods used are discussed. In Section 4 the recent geometric methods and in Section 5 the complex methods are presented. In the next section after that some other developments regarding conditional linearization are given. These lead to a proposal for the systems of ODEs along the lines of Lie’s original intention. However, it is found that the proposal needs other ingredients. This is discussed in Section 7. Finally, in Section 8 a summary and discussion of some ongoing work are provided.
2. Preliminaries
Two DEs are said to be equivalent if one can be mapped into the other by point transformations. All first-order ODEs are equivalent under point transformations and hence all can be linearized. This is not true for second-order ODEs. However, all linear scalar second-order ODEs are equivalent [29]. Further, there are different classes of linear ODEs of order m ≥ 3. As such, there are different classes of higher order linearizable nonlinear ODEs. For systems of ODEs we have to regard the space as (m + 1)-dimensional (m independent variables and one dependent variable) instead of 2-dimensional. Clearly the earlier results for ODEs of higher order (than 3) would not generally apply to systems. In fact little is known about them. In particular, it is not clear how the numbers will change with dimensions and order, though one can make guesses by naively putting the two modifications together. The problem is that the number of symmetry generators keeps pulling new surprises. (Remember the change in going from second to third order.) A rigorous proof for the general formulae would be very useful.
As mentioned before, a connection was found between the symmetries of systems of second-order ODEs and isometries of a space by considering systems of geodesic equations. As such, it is worthwhile to briefly review the differential geometry involved in isometries and systems of geodesic equations.
For differential geometry one normally considers a manifold of dimension n with a Riemannian metric, g defined on it. However, it is not necessary to limit ourselves to a positive definite metric, and we can consider indefinite metrics as well. For a manifold it is necessary that locally there is a homeomorphism from the space to ℝn. Using the homeomorphism we can assign n coordinates on it, xi. In general one can transform coordinates at will and often needs to change coordinates in going from one element of the open cover of the manifold to another. In any chosen coordinates the metric tensor can then be written as a symmetric matrix gij(x). For our purposes we need differentiable manifolds and hence need that there be diffeomorphisms instead of only homeomorphisms.
Since it is a fourth-rank tensor one can take different traces. Two are zero and four are either equal or the negative of each other. That is called the Ricci tensor . Its trace is called the Ricci scalar. In 2 dimensions it is twice, and in n-dimensions it is n! times, the Gaussian curvature.
The geometric methods developed for the study and linearization of ODEs use these quantities extensively. Further background geometry is provided where needed.
3. Algebraic Linearization
3.1. Lie’s Original Approach
Note the symmetry between the two equations by interchanging (c, g, x) and (d, h, y). Note also that these equations, however, complicated they may look, are only constraint equations to be checked and not DEs to be solved.
Symmetry generators were not only useful for the purposes of linearization but could be directly used to reduce the order of the ODE by 1. Thus, if we have enough symmetry generators we can solve any ODE. Even if there are not enough, we can reduce the order. If, for example, a second-order ODE has a single infinitesimal symmetry generator that maps this generator to a translation generator, by a point transformation it can be reduced to a first-order ODE. Now, by the fundamental theorem of calculus, it can be solved if it is semilinear. As such, we would have proved that it can be solved and could then use numerical methods to solve it. Lie developed the methods further. In the process he used the power of group theory to derive very general results for the solution of ODEs by means of symmetries. In fact, he needed to classify the ODEs by their groups. If the ODE had a solvable Lie group of the correct order it could be solved. Group classification became one of the staples of symmetry analysis. This played a role for the purpose of linearization later.
Presumably, Lie would have noted that the number of symmetry generators for third-order linear ODEs was not the same. For the ODE y′′′(x) = 0 the number is 7. For y′′′(x) + λy(x) = 0, where λ is a complex number, the number of generators is 5, and for y′′′(x) + f(x)y(x) = 0, where f(x) is a nonconstant function, the number is 4. As such, Lie’s own argument did not work, and he did not give any results on the linearization of third-order ODEs.
The extension to the fourth-order follows exactly the same procedure as does the third-order. Here, again, there are the two types of linearizable equations, but this time there are four classes. The point is that for the third order we can get rid of the first second-derivative terms by appropriate choice of the linearizing transformations. The extra class here comes because the second and third derivatives can be gotten rid of, but the first derivative cannot. Laguerre [30, 31] says that the same formula applies for relating all fourth- and higher order linear ODEs. As such, they argue that there is no need to extend beyond the fourth-order. Again, they did not connect their work with the number or algebra of the symmetry generators. It turns out that the 8 symmetries are for the simpler type, and all the other three classes are contained in the more complicated case. We will not go into further details with it here.
3.2. Group Classification of Higher Order ODEs
It is worthwhile, at this stage, to go back to the original problem from which group theory arose: the solution of algebraic equations. When solving a quadratic equation we can think of y as the quadratic function of x: we translate along the x-axis to the point about which the function is symmetric and then translate along the y-axis to the point at which there is exactly one root. Transforming back to the original variables gives the solution. When solving for cubic equations there is a problem. There are three classes of function: one of which is monotonically increasing or decreasing; the second of which has a point of inflection, and the third-has two turnovers. We cannot solve by reducing the equation to quadratic form directly and first transform variables so as to eliminate the second-order term. We now follow the previous strategy and go to the point about which the function has symmetry under x reflections and then use the requirement that the graph of the curve is invariant under rotation through π radians. This can always be done for the cubic, as may be seen by looking at its graph in any of the three cases. One can find the point by looking for the maxima and minima of the function and finding the central point by translating along the x- and y-axes.
The natural next step was to solve the quintic. However, all attempts failed. To try to find the general solution Abel and Galois independently considered the function as a product of 5 (complex) factors, which are the roots of the quintic equation, (x − r1)(x − r2)(x − r3)(x − r4)(x − r5). It is clear that the function “does not care” which root is called which, as multiplication is commutative. It seems that we could then reduce this to a quartic by taking away one of the factors and solve the quartic. The hurdle is the requirement of symmetry under interchange of the roots. It was Galois who showed under what conditions on the coefficients the required symmetry would hold to allow this procedure to work. Both of them demonstrated that the general quintic cannot be solved by means of radicals. Galois was also able to show that this hurdle would apply to all higher order polynomial equations. The lesson for higher order scalar ODEs should be clear. Many classes are generically to be expected. Only for the second order there is no problem of classes. As regards systems, it may be noted that a system of two first-order ODEs “is equivalent” (in some sense) to a scalar second-order ODE. As such, the generic problem of many classes can again be expected to recur. Indeed it does!
Note the use of groups to classify ODEs according to the number of their symmetries. For algebraic equations the permutation group sufficed. Here we need Lie groups. This means that we now consider the number of symmetry generators instead of the order of the permutation group. Generally we do not know the topology of the associated manifold. As such we have only a local realization of the group. It is the Lie algebra associated to the group that is relevant. Recall that there is a unique Lie algebra associated with any Lie group, but there can be many Lie groups (with different topologies) for a given Lie algebra. Mahomed and Leach [11] classified mth order ODEs and found that there remain 3 (as for second order), with m + 1, m + 2, m + 4 generators. Though technically much more complicated, the principle remains the same; invariance under a point transformation depends on the nature of the ODE, independent of the “coordinates” used. Thus we obtain a canonical form into which all equivalent ODEs can be transformed (as we factorized the algebraic function). Now we can check the symmetries of the different canonical forms of ODEs according to their symmetry algebras. Notice that different Lie algebras can have the same number of generators, and hence the number of classes does not need be the distinctive feature. The Lie algebra of the mth order ODE is [11] ℝm−1⊕sgl(2, ℝ), (where ⊕s is the semidirect sum, meaning that the two subalgebras do not commute).
3.3. Meleshko’s “Linearization” of Third-Order ODEs
For solving an autonomous third-order ODE, Meleshko [32] provided an alternative method that he called “linearization”. This seemed odd, because shortly before it Ibragimov and he had characterized all possible third-order linearizable ODEs, and this new method dealt with equations that did not satisfy those criteria. What Meleshko does is to convert this third-order ODE to a second-order ODE and then linearize that (if it is linearizable). Of course, it cannot be linearizable if it does not satisfy the IM linearizability criteria as IM have pointed out. This use of linearization, without being linearization itself, is what this review paper is about and will be relevant for a proposal for classifying ODEs by their solvability by transformations according to the number of initial conditions they satisfy. I give the essence of Meleshko’s method here.
3.4. Systems of ODEs
The number of classes is another matter. We need to then consider the symmetries of the equation. It can generally be proved, see for example [33], that the symmetry algebra for the n-dimensional system of free-particle equations, y′′ = 0, is sl(n + 2, ℝ), and hence it has n2 + 4n + 3 generators. For n = 2 it is 15. The group classification has to be resorted to again, and we need to distinguish between the various algebras that have the same dimension but are different. For example, the algebra of rotations in 3 dimensions is so(3), of symmetries of the plane is so(2)⊕s ℝ2, and the group of translations in 3 dimensions is ℝ3. The first is a simple algebra, the second is semisimple, and the third is abelian. There are 5 classes of different dimensions in this case.
Gonzalez Gascon and Gonzalez-Lopez[34] gave the maximal symmetry for linearizable n × n systems. Gorringe and Leach [15] considered general systems and showed that linearizable 2-dimensional second-order systems with constant coefficients lie in one of three classes with 7, 8, and 15 generators. (The canonical form of the last one has no coefficients as it has the maximal Lie algebra.) Wafo Soh and Mahomed [16] allowed variable coefficients and found two more classes (with 5- and 6-dimensional Lie algebras). They then generalized to n-dimensional systems [16] by using group classification and found that the number of classes increases by one with each increased dimension. The number of generators in the minimal case is n + 3 and for the highest submaximal case is 2n + 4. The maximal, of course, has n2 + 3n + 4 generators, as mentioned earlier.
4. Geometric Linearization
The use of geometry for Lie symmetry analysis dates back to Lie’s own work. Also, Lie’s work lies at the base of modern differential geometry. However, the two areas diverged after Lie. The methods of modern differential geometry can be very effectively used for systems of second-order ODEs. This use is based on the connection noted between symmetries of differential equations and systems of geodesic equations projected down one dimension, by Aminova and Aminov and by Feroze, Mahomed, and Qadir.
One could ask for the consistency conditions that they could come from geometry. It turns out that explicit criteria are not so easy to state. If the criteria are fulfilled then one should be able to write down the metric corresponding to the coefficients of the system of ODEs. Thus, given the system of ODEs one should be able to construct the space on which they can be regarded as systems of geodesics. In other words, given the geodesics, one should be able to construct the space on which they lie. A mathematica code for this purpose has been written [35].
4.1. Linearization of Second-Order Systems of Geodesic Type
A procedure is provided [36] to determine the given the gij. This enables us to write down the solution directly.
Example 4.1. The 2-dimensional system,
Example 4.2. The system,
Example 4.3. The 3-dimensional system,
4.2. Linearization of Cubically Semilinear Second-Order Systems
Since the geodesic equations are invariant under translation and rescaling of the geodetic parameter, we can use the translational symmetry to project down one dimension, replacing the geodetic parameter by one of the dependent variables (say xn) [18, 19]. One might have thought that the other symmetry generator could be used to project down one more dimension, but that is not so. The reason is that the two generators are degenerate, as they only depend on one and the same variable. When I was thinking about the problem I had not considered this method. Instead, I wanted to embed a general n-dimensional system into an (n + 1)-dimensional manifold and ask that the embedded equations be of geodesic type. When I later found that Aminova and Aminov had already got the same results by projection, I did not publish the other idea. However, there is no bar that I can see on a double embedding, which may lead to interesting systems of equations.
The linearization conditions are that the curvature tensor constructed from the Christoffel symbols is zero.
Remark 4.4. There are more Christoffel symbols than coefficients in the system. This means that there is an arbitrariness of choice of Christoffel symbols for a given system. However, one can provide a canonical procedure for making the choice. It is not clear that sometimes another choice may not be more convenient.
Remark 4.5. By taking geodesics in n = 2 and projecting down we obtain a scalar cubically semilinear ODE, and the linearization conditions are precisely the Lie conditions. The auxiliary functions here appear because of the degeneracy in the choice of Christoffel symbols. Obviously, the Tressé conditions follow.
Remark 4.6. The most general cubically semilinear system is
5. Complex Linearization
Whereas Lie had used complex DEs of complex variables in his analysis, he did not use the analyticity properties embodied in the CREs. At first sight it may be expected that this would not give anything new. However, there was one curious fact that attracted my attention. The algebra of the real linearizable scalar second-order ODEs is sl(3, ℝ), which has 8 real linearly independent generators. For the complex case it would be sl(3, ℂ), which has 8 complex linearly independent generators and hence 16 real independent operators. As explained later, these operators do not form a Lie algebra and are therefore not symmetry generators. The corresponding canonical ODE is the 2-dimensional real second-order system of ODEs with symmetry algebra sl(4, ℝ), which has 15 generators and not 16. Where did the extra generator go? One might think that one could drop one of the 16 generators to get the required subalgebra. This expectation is based on the intuition of dropping one generator of gl(4, ℝ) to get sl(4, ℝ). However, the real system cannot get the symmetry generators in this naive way, as the generators of the real system appear in pairs. What happens is that one leaves out one of the complex generators gets a set of only 14 generators, and then requiring the closure of the algebra, obtains the 15-generator algebra. This strange behaviour seemed to me to deserve more attention. On investigation [26, 27] it was found that splitting complex generators into real and imaginary parts yields interesting insights. For example, the complex scaling symmetry contains the real scaling and the real rotation in 2 dimensions!
Of course, a complex dependent variable will split into two real dependent variables and the complex independent variable into two real independent variables. Thus the scalar ODE would split into a system of two PDEs [26, 37]. The CREs would apply not only between the independent and the dependent variables but also between the independent variables and the derivatives of the dependent variables, to the relevant order. Here we are only concerned with the splitting of the complex scalar ODEs into systems of two real ODEs. I briefly explain the basics of the splitting procedure.
5.1. Complex Symmetry Analysis
The point is that we can now ask for the function f to be analytic. As such, its derivatives with respect to the dependent variables and their derivatives must satisfy the CREs.
Thus, to every symmetry generator of the complex scalar ODE, there correspond two symmetry operators of the real 2-dimensional system of ODEs. This is where the problem of the lost extra generator comes from. The 8 for the complex equation w′′ = 0, which has the symmetry algebra sl(3, ℂ), will split into 16 operators. However, for the corresponding system, y′′ = 0, z′′ = 0, the maximal algebra is sl(4, ℝ), which has 15 generators. It should now be obvious why one cannot just remove one of the 16, as it will take “its partner” out with it.
The scalar second-order ODE may come from a variational principle. Classification may be done not only for the Lie symmetries but also for the symmetries of the action integral are called Noether symmetries. (Notice that these are distinct from the symmetries of the Lagrangian.) A connection between geometry and Noether symmetries has been explored, for example, in [38]. Noether symmetries are of special interest as they give double reduction of the Euler-Lagrange equation. It would be of interest to study the linearizability of these ODEs. The question then arises as to what happens to the Lagrangian when we split into real and imaginary parts. In general, we should then get a complex Lagrangian. However, that seems to become meaningless, as the variational procedure requires an ordered set for the action to be minimized, but the complex field is only partially ordered. Also, it might be thought that the physical quantity is a real Lagrangian. However, it turns out that complex Lagrangians, and correspondingly complex Hamiltonians, have been used in the literature, for example by Bender and others, in which they also explain some anomalies in atomic phenomena [39–44]. The variational principle has been used in complex symmetry analysis [45] but is not reviewed here, due to space considerations.
While the complex splitting may be an intriguing curiosity, at first sight it does not seem to provide a method for solving systems of ODEs. However, it can be used by converting a complex scalar ODE that can be easily solved to a system of two real ODEs and then reading off the solution of the system. It seems trivial as stated, but one can start with a general 2-dimensional system and check to see if it corresponds to a scalar ODE. One might feel that this inverse procedure “is cheating” and that it lacks generality, but the same objection could be raised on basic analytic integration methods that just invert the differentiation procedure. It is to be borne in mind that there is an enormous reduction in the amount of work to be done to solve a system of ODEs by symmetry methods if one can convert it to a scalar ODE. It is especially useful for second-order systems as they can be linearized and the power of geometry used. Bear in mind that the prescriptions for solving by other symmetry methods are tedious and cumbersome.
Example 5.1. The system of two ODEs,
This is not where it ends. One can start with a 2-dimensional complex system of ODEs and split it into a 4-dimensional real system [46] and now ask for the 2-dimensional system to be linearizable. Using the power of geometry for the 2-dimensional system one can write down the solution for the 4-dimensional system.
5.2. Classes of Complex Linearizable Systems
There are 5 classes of 2-dimensional linearizable systems of ODEs with 5, 6, 7, 8, or 15 infinitesimal symmetry generators [16]. There was further work done on systems regarding further details of their algebraic classification [23, 47] that I will not discuss further. The class of geometrically linearizable ODEs must have an sl(3, ℂ) symmetry algebra with 15 generators. As such, it misses the other 4 classes of lower symmetry. Where did they go? The answer is that they are linearizable but not geometrically so. It would have been great to be able to get all the linearizable classes accessible to the power of geometry. At first sight, it appears to be impossible. It turns out that this is not quite true. Though we have not been able to get at all the classes, we can make two more classes accessible to geometry by the use of complex symmetry analysis. To explain this it is necessary to briefly state a result used for the equivalence of systems of ODEs which reduces the number of coefficients to n2 − 1, and then to show that for n = 2 with complex methods we get 2 of them instead of 3.
This procedure of reduction of arbitrary coefficients for linearizable systems simplifies the classification problem enormously. Recall that a general 2-dimensional system of ODEs has 10 arbitrary coefficients which is reduced to 3, a 3-dimensional system of ODEs has 21 which get reduced to 8, a 4-dimensional system has 36 which reduce to 15, and so on. System (5.11) is called the optimal canonical form for linear systems of two second-order ODEs, as it has the least number of arbitrary coefficients.
Theorem 5.2. Any system of the form (5.15) corresponds to a scalar complex second-order ODE if and only if the coefficients αij, βik, γil, and δi satisfy the conditions:
The reason that we can reduce to just two functions is that we are dealing with the special class of linear systems of ODEs that correspond to scalar complex linearizable ODS. This gives the following theorem [49].
Theorem 5.3. A linear system of two second-order ODEs,
Since we have only one (complex) coefficient function involved; there are now the usual three cases: (a) β(x) is an arbitrary function; (b) it is a constant function; (c) it is zero. On working through it we found that (a) gives 6 symmetry generators, (b) 7, and (c) 15. The cases of 5 and 8 symmetry generators were not accessed by complex symmetry analysis, but it did get the other two missing classes! One would really like to find some way of characterizing them.
I now give some illustrative examples taken from [49].
Example 5.4. The system corresponding to β = 0,
It corresponds to the scalar linearizable ODE,
Example 5.5. The system corresponding to constant β,
Example 5.6. The system corresponding to variable β,
5.3. Beyond the Wafo Soh-Mahomed Theorem
Though we cannot get the 5-dimensional case of linearizable systems for two variables, we can go still further in a novel way. Here is an example from [50] that has a 4-dimensional Lie algebra only.
Example 5.7. We have
Though this system could be solved by the normal (tedious) symmetry methods, as it does have the minimum number of symmetry generators required for the purpose, it could not be solved by linearization.
What happened to the Wafo Soh-Mahomed theorem which said that such systems cannot be linearized as that would require 5 generators?
There is worse (or perhaps one should say “better”) to follow. It was found that one can even go down below the minimum number required for symmetry solutions of systems of ODEs. There are examples of 3 and 2 and even of one symmetry generator; the last of which is given here.
Example 5.8. We have
Now what happened not only to the Wafo Soh-Mahomed theorem but also to all the limitations of the general (tedious or not) symmetry methods? They seem to have been all bypassed.
The Wafo Soh-Mahomed theorem continues to hold. The system never got linearized. It only corresponded to a complex scalar ODE that was linearizable. The point is that the linearizing transformation for the complex scalar ODE converts the real independent variable to a complex one. Now we no longer generally have a system of ODEs, it is a system of PDEs. It turns out that the solution of the original system does “go through the filter” of complexifying and then reducing to the real, but the rest need not. In the previously mentioned example with 4 generators we got 4 arbitrary constants, which could (in principle) have solved a linear 2-dimensional system, but in the example with only one symmetry generator there are only two arbitrary constants. This could not be the general solution of a system of 2 linear second order ODEs. This is not a complete answer to how the limitations got bypassed but may be regarded as a partial answer. It is an example of Penrose’s “complex magic” [28].
6. Conditional Linearizability
A major problem with the geometric approach is that it is limited to the second order. For other orders it would appear that we have to forego the power of geometry. Recall that the connection to geometry is through the geodesic equations, which are second order and limited to those of geodesic type. The latter restriction was avoided by using projections. However, there is no geometrical quantity that comes naturally from the third or higher derivatives. How, then, can we go beyond this restriction?
One way is to differentiate the second-order ODE that can be dealt with geometrically but that seems to be trivial. Once again, the appearance of triviality may be misleading. Consider the symmetries of the free particle equation and its derivative. Instead of increasing the number of symmetry generators, we have reduced it! Further, the structure of the symmetry algebra is totally different. It is largely abelian, with a small nonabelian subgroup. Differentiating once again would restore the number of generators, but the symmetry algebra would be different. Again, something odd is happening. One also takes it for granted that the solution of the original equation will be the solution of the new equation. Implicit in there is the assumption that requiring that inserting the original equation into the differentiated equation will make no difference. However, the symmetry structure of the system of equations will be vastly different. As such, it is worthwhile to explicitly insert the original equation into the differentiated one to see what happens.
We can now compare coefficients to determine the second-order ODE from which it could have arisen. Of course, this equation has 6 coefficients, while a cubically semilinear second-order ODE could only have 4. Therefore, there is no guarantee that the third-order ODE could have arisen from the candidate second-order ODE. Consequently there have to be consisten criteria to be satisfied. These are given in detail in [51].
This procedure is called conditional linearization, and the second-order ODE from which it can be obtained is called the root equation. It can yield the solution of higher order ODEs that have only 2 arbitrary constants in them, coming from the second-order ODE that could be differentiated to obtain them by replacement. It may be that such equations are not amenable to the classical (Lie) linearization procedure. An illustrative example is presented.
Example 6.1. The third-order ODE taken is
The previous procedure can be taken further [52]. One can differentiate the third-order ODE obtained by differentiating the root equation and then inserting the root equation in it, inserting the third-order ODE in, first inserting the root equation into the third-order and then differentiating again. or even inserting the root equation in twice. If we follow the last mentioned procedure, we get a fourth-order septically semilinear ODE. Thus, given such an ODE in general, one can check if it can come from a second-order ODE. If so, we can construct the root equation, and if one can solve it one has the solution of the fourth-order ODE. A couple of examples are provided that do not satisfy the Ibragimov-Meleshko-Suksern criteria for fourth-order ODEs [14].
Example 6.2. The fourth-order ODE
Example 6.3. The fourth-order ODE,
7. Proposal for Classification of Systems of ODEs
How far can we take this procedure? In principle, we can go to any higher order ODE with a second-order root equation and retain the power of geometry. There are, of course, many more possibilities for the replacement of the derivatives here. Though the calculations get extremely complicated and messy; in principle one could do the same for any system of ODEs [53]. However, if one wants to take a different root equation one gets restricted to scalar equations only as the power of geometry is lost. That is not to say that the concept of conditional linearizability will not apply. Rather, the procedure for directly writing down the solution of the equation is now lacking.
At the moment ignoring this problem, it is clear that one can say something nontrivial about the original motivation of Lie in introducing Lie groups. The key point was not to provide solutions but to classify equations according to their solvability by point transformations a’la Galois. A handle has been provided by conditional linearizability [54] as I now explain.
One can start with a root equation of any order m, including the first. In fact, the first-order ODEs are all linearizable, and hence we are not restricted to any special class there. For the second order we have seen how Lie established linearizability. For higher order scalar ODEs the linearizability criteria can, in principle, be obtained by algebraic computer codes. One can then differentiate to any other (higher) order, m, and go through some replacement procedure. For example, the fifth-order ODE that only depends on the first derivative and is conditionally linearizable with a second-order root equation has a ninth-order polynomial dependence on the first derivative. Similarly, starting with one of the Ibragimov-Meleshko classes yields a fourth-order conditionally linearizable ODE. The resulting equation will have at least one solution, that is, the general solution of the root equation. That must have m arbitrary constants. Depending on the replacement procedure, the number of arbitrary constants, p, appearing in the general solution will lie somewhere between m and n. It can be hoped that we would be able to determine p by the replacement procedure. As such, there would be p arbitrary initial conditions that could be required for the ODE. We can then give the following definition.
Definition 7.1. An mth order ODE (m > 1) will be said to be conditionally classifiable by a symmetry algebra 𝒜 with respect to a pth order root ODE (p < m) if and only if the mth order ODE jointly with the pth order ODE forms an overdetermined compatible system (so the solutions of the mth order ODE reduce to the solutions of the pth order ODE), and the pth order ODE has symmetry algebra 𝒜. The most general class of linearizable, conditionally linearizable, or conditionally classifiable ODEs will be denoted by .
In the context of this definition the following conjecture was stated [54].
Conjecture 7.2. All ODEs, or systems of ODEs, of order p ≥ 2 are linearizable, conditionally linearizable, symmetry classifiable, or conditionally classifiable by symmetry, that is in terms of .
This leads to the proposal that Lie’s programme could, perhaps, be completed by using linearizability and classifiability presented previously. Of course, this would not apply to the singular case p = 1, which is, why the conjecture was stated for p ≥ 2.
8. Summary and Discussion
Inspired by Galois’ success with algebraic (polynomial) equations, Lie tried to replicate it for differential equations. While Galois developed groups, Lie developed Lie groups. In one sense, then, Galois’ development seems more fundamental than Lie’s. It led to more definitive results, namely, the non solvability of quintic and higher order equations by means of radicals. However, Lie’s attempt was far more ambitious, given the much greater complexity of the problem of dealing with DEs as a whole. The attempt paid rich dividends. Not only did he provides a systematic procedure for solving DEs by transformation of variables, his groups led to a much deeper understanding of differential geometry and provided it with much greater power. In fact, Lie’s original aim seems to have got lost in the process. Recently Lie’s symmetry analysis of DEs has gained popularity because it has made it possible to solve nonlinear problems arising in engineering, such as those in non-Newtonian fluid dynamics. Those problems had been trivialized so as to make them solvable. With the advent of electronic computing the tendency was to find numerical solutions. However, these could be misleading as one may appear to get a solution that does not exist. Also, the approximations involved could often “throw the baby out with the bath water.” Now it has begun to appear that geometry can pay back for its benefits with interest. In this paper the “pay-back” has been reviewed.
The benefit of geometry rests on the connection between the Lie symmetries of systems of geodesic equations and the isometries of the manifold in which they lie. The connection seems trivial if one is only looking at geodesic equations. However, it becomes a nontrivial method for solving systems of ODEs that could be regarded as geodesic equations. Further, it becomes thoroughly nontrivial when one extends to systems of ODEs that correspond to projected systems of geodesic equations. One then gets a method for directly writing down solutions of linearizable systems based on the beautiful result that the linearizability criteria are equivalent to the requirement that the underlying manifold is to be flat. There is a caveat here. For systems the requirements are not generally equivalent but only for the maximally symmetric linearizable cases. The geometrization of the other classes is a problem.
The problem was partially overcome by using the development of complex symmetry analysis. For a system of two ODEs one obtains 3 of the 5 linearizable classes. At present that is a very minor part of the total. However, one can split a system of 2 complex ODEs to obtain a system of 4 real ODEs, a system of 3 to obtain a system of 6, and so on. This remains limited to only even dimensional systems. How about the odd ones? There is a development to obtain odd dimensional systems, but that is not adequately explored to discuss here. The bigger problem is that even for the 2-dimensional case we are missing 2 classes. Where did they go? As yet there is no answer. One can speculate that the procedure of projection has limited the number of classes. If my original idea of embedding was used we may be able to bypass the restriction of only shifting by one step. One could perhaps embed twice. If so, we may recover the two missing classes. The level of computational complication that arises makes it difficult to actually apply the methods being talked of. With improved algebraic computing one may be able to routinely use these methods with a fast computer. As a bonus the complex methods provided a means of solving systems of ODEs that were not amenable to solution by symmetry methods. An interesting point arises in connection with complex linearization. We have found examples of systems of two ODEs that can be solved if they have even one symmetry generator. Does there exist a system with no symmetry generator. That would be worth looking for. Ali, Safdar, and I conjecture that no such systems of ODEs exists. However, we have no proof of this conjecture. It would be worth either finding one or finding a counter example to it.
The extension from ODEs of geodesic type has gone fairly far with the use of projection and complexification. However, all said and done it is only second-order ODEs that we are talking of. The geometric method cannot be used to linearize higher order systems of ODEs. An inroad is made with conditional linearization, by using a second-order root equation to obtain a higher system of ODEs. This does not necessarily provide a linearizable higher order system but does provide a (limited) solution. As a bonus it provides a possible path to the “holy grail” of classification of ODEs according to their solvability that had been Lie’s original motivation. We might be able to classify them according to the nature of the initial value problem that can be solved as regards to the number of initial conditions that could be met in general. There is a caveat here as well. It has been found that the method used by Meleshko [32] for linearizing third-order autonomous ODEs does not fall in the Ibragimov-Meleshko classes or the conditionally linearizable classes. In fact, the method can be generalized to higher orders [55], and it is generally found that other classes of this type emerge. As such, there is at least one other method for linearization. There again, it is not linearization in the sense of Lie but will provide another general classifiable class. How many such classes are there? If the number is finite, the proposal can be meaningfully completed, but if there are infinitely many the proposal must fail.
Even with all of the previously mentioned pious hopes being met, we have still only scratched the surface of the problem. ODEs are a very limited class of DEs. What about PDEs? One needs to find a connection between geometry and PDEs. The natural expectation would be to look for a generalization of geodesics for higher dimensional subspaces. (Bear in mind that a geodesic is a 1-dimensional subspace.) They arise by solving the problem of minimizing the arc length. The natural extension would appear to be minimal surfaces. However, so far attempts to use them have not led to any great success. This is perhaps the most important development in this direction is needed.
Acknowledgments
The author gratefully acknowledges useful comments and suggestions by Sajid Ali, Hina Dutt, and Fazal Mahomed. He is also grateful to DECMA and the School of Computational and Applied Mathematics, University of the Witwatersrand, Johannesburg, South Africa, for support at the Galois Bicentenary Conference (SDEA-I), where this work was presented and to the Abdus Salam International Centre for Theoretical Physics, Trieste, Italy, for support at Trieste where this paper was written.