The Penrose transform in the language of sheaves and D-modules
Andrea D’Agnolo (after a joint work with Pierre Schapira)
Abstract
The methods of D-module and sheaf theory are applied here to the study of integral transforms, such as the twistor correspondence of Penrose. This approach has the advantage of distinguishing the topological obstructions from the analytic ones. A section devoted to a short review of the classical paper [4] shows how this is effective.
AMS subject classification: 35C15, 32L25.
1 Historical introduction
Let TT be a complex four-dimensional vector space, and denote by IM the Grassmannian of two-planes in TT. Recall that IM is a compact complex manifold of dimension four, which is identified — according to Penrose — to a conformal compactification of the complexified Minkowski space. It fol- lows in particular that the conformally invariant equations of physics, which are defined on the Minkowski space, nicely extend as differential operators acting on holomorphic vector bundles of IM. A distinguished family of such equations is given by the “massless field equations”, parameterized by a half- integer called “helicity”: these include the Dirac-Weyl neutrino equations, the linearized zero rest mass Einstein equations, and Maxwell’s wave equa- tion. For simplicity, in these notes we will denote by h the massless field equation of helicity h (so that 0 is the classical wave equation).
The starting point of twistor theory was Penrose’s discovery that the holomorphic solutions φ ∈ Γ(U ; ker h) of the homogeneous massless field equations of positive helicity, on topologically “good” open subsets U ⊂ IM,
Appeared in: Bull. Soc. Roy. Sci. Li`ege 63 (1994), no. 3-4, 323–334, Algebraic Analysis Meeting (Li`ege, 1993).
may be written as contour integrals φ =
Z
γ
ψ (1.1)
of free holomorphic functions ψ defined on corresponding open subsets bU of the projective three-space IP def= IP(TT). This is closely related to a similar integral formula that Bateman found in 1904 for the solutions of the wave equation. It was then realized that the freedom in the choice of ψ corresponds to the freedom in the choice of a representative for a ˇCech cohomology class of a line bundle on IP, and that the contour integral R
γ corresponds to the functor of direct image for sheaves.
In the classical paper [4], Eastwood, Penrose and Wells established the rig- orous mathematical setting for this integro-geometric construction in terms of sheaf cohomology. Let us explain in some details their construction.
Let IFdef= F12(TT) denote the flag manifold of type (1, 2) of TT. Recall that this is a compact complex manifold of dimension five, whose points are given by pairs (L1, L2) of linear subspaces L1 ⊂ L2 ⊂ TT of dimension one and two respectively. The so-called “twistor correspondence” is the double fibration
IF
g.&f IP IM,
(1.2)
where the projections are given by f (L1, L2) = L2, g(L1, L2) = L1. The cohomological interpretation of formula (1.1) is as follows: bU def= g(f−1(U )) ⊂ IP is the transform of U via (1.2); ψ is the pull back on IF of a representative of a ˇCech cohomology class in H1( bU ; L) for a line bundle L of IP; and the contour integral corresponds to the first derived direct image functor R1f∗. On the projective space IP the line bundles are classified by their first Chern class k. As usual, we denote by OIP(k) the −k-th tensor product H⊗−k of the tautological line bundle H of IP (recall that H is the sub-bundle of the trivial bundle IP × TT, whose fiber over the point L1 of IP is the line L1 itself).
The holomorphic solutions φ ∈ ker(U ; h) of the homogeneous equation for
h are thus obtained by integrating along f the pull back by g of sections in H1( bU ; OIP(k)). Here, the correspondence between the helicity h and the Chern class k is given by the formula:
h(k) = −(1 + k 2).
For instance, the solutions of the wave equation on U correspond to coho- mology classes in H1( bU ; OIP(−2)).
As for the notations, in the same way as bU ⊂ IP, let us denote by bx, cW and bz the transform of x ∈ IM, W ⊂ IP and z ∈ IP respectively, via (1.2).
The above discussion is summarized in the following theorem.
Theorem 1.1. ([4]) Let U ⊂ IM be an open subset such that:
U ∩bz is contractible for every z ∈ bU . (1.3) Then the natural morphism:
P : H1( bU ; OIP(k)) −→ ker(U ; h(k)) is an isomorphism.
(In [4], hypothesis (1.3) is actually weaker, since U ∩bz is only assumed connected and simply connected. Here, we restricted ourselves to the con- tractible case in order to simplify our discussion in the framework of derived categories.)
This result was the corner stone of Penrose’s “twistor programme” whose aim was to derive laws of physics from the rigidity of complex geometry.
An enormous literature on the subject is now available. From the point of view of the linear Penrose transform, the widest generalization of the above theorem is probably found in Baston and Eastwood’s book [1], where they treat the correspondence
G/Q .&
G/R G/P,
for G a complex semisimple Lie group, and P , R and Q = P ∩ R parabolic subgroups. In this framework, one gets isomorphisms between cohomology groups of homogeneous vector bundles on G/R and kernels and cokernels of invariant differential operators on G/P .
Our aim here is to present the new approach of [2] to the (linear) Penrose transform, in terms of (derived categories of) sheaves and D-modules.
The plan is the following.
In the next section, we will give a proof of Theorem 1.1 which, although simplified by the use of derived categories, is very close to the original proof of [4]. This will allow us to point out some problems that one encounters within the classical approach, thus providing a motivation for our work [2].
In section 3 we present our description of the Penrose transform, dis- cussing the extent to which it is possible to recover results such as Theo- rem 1.1 in this new framework.
We then describe the microlocal geometry underlying the Penrose trans- form. This will allow us to prove some interesting results which, in particular,
clarify the computational problems usually encountered. As an application, we show how to easily recover the work of Wells [14] concerning the hyper- function solutions of the massless field equations with positive helicity.
2 The case of positive helicity
Here, we will present a slight variation of the classical proof of Theorem 1.1 given in [4], for the case of positive helicity.
The twistor correspondence (1.2) satisfies the following properties:
(i) the maps f and g are smooth and proper,
(ii) the pair (g, f ) gives a closed embedding IF ,→ IP × TT.
In particular, for x ∈ IM the set f−1(x) is a closed submanifold of IF, and f sends isomorphically f−1(x) to the submanifold bx ' IP1 of IP. Similarly, for z ∈ IP the set bz ' g−1(z) ' IP2 is a closed submanifold of X. It follows that hypothesis (1.3) of Theorem 1.1 asserts that for every z ∈ bU the fibers g−1(z) are topologically trivial, and thus
H1( bU ; OIP(k)) ' H1(f−1(U ); g−1OIP(k)). (2.1) Moreover,
H1(f−1(U ); g−1OIP(k)) def= H1(RΓ(f−1(U ); g−1OIP(k)))
' H0(RΓ(U ; Rf∗g−1OIP(k)[1])), (2.2) and so, in order to prove Theorem 1.1 in the case of positive helicity (i.e. for k < −2), we are left to analyze the complex Rf∗g−1OIP(k)[1].
To the smooth map g is associated the relative de Rham sequence:
0 −→ g−1OIP −→ OIF dg
−→ Ω1g −→ Ωdg 2g −→ 0, (2.3) where dg denotes the relative exterior differential, and Ωig denotes the sheaf of relative i forms (i = 1, 2). Since the transition functions of the line bundle g∗OIP(k) are invariant along the fibers of g, we deduce from (2.3) the exact sequence:
0 −→ g−1OIP(k) −→ OIF(k)−→ Ωdg 1g(k)−→ Ωdg 2g(k) −→ 0,
where we wrote Ωig(k) = Ωig ⊗OIF g∗OIP(k). In terms of derived categories, this gives an identification:
g−1OIP(k) ' [OIF(k) −→ Ωdg 1g(k) −→ Ωdg 2g(k)]
| | |
0 1 2
which we write for convenience sake:
g−1OIP(k) ' [H0(k) −→ Hdg 1(k + 1)−→ Hdg 2(k + 2)]. (2.4) On a small neighborhood V of x ∈ U , the map f is identified to the first projection V × IP1 −→ V . Roughly speaking, for i = 0, 1, 2, the “twist”
(k + i) attached to Hi is there to recall that the vector bundle Hi(k + i) behaves like OIP1(k + i) when restricted to the fibers of f , f−1(x) ∼= IP1.
In order to compute Rf∗g−1OIP(k) using the isomorphism (2.4), recall that
Hj(IP1; OIP1(k)) =
6= 0 if j = 1, k ≤ −2 6= 0 if j = 0, k ≥ 0
= 0 otherwise.
In our case of positive helicity, we have k < −2, and so Rjf∗Hi(k + i) = 0 for j 6= 1. The sheaves eHi = R1f∗Hi(k +i) are coherent OIM-modules by Grauert theorem, and in fact they turn out to be locally free (i.e. holomorphic vector bundles). We thus get:
Rf∗g−1OIP(k) ' [ eH0
R1f∗dg
−→ He1
R1f∗dg
−→ He2].
| | |
1 2 3
(2.5)
An explicit computation as in [4] shows that R1f∗dg = h(k). Applying the functor H1(RΓ(U ; ·)) to the above sequence we obtain the isomorphism:
H0(RΓ(U ; Rf∗g−1OIP(k))[1]) ' ker[Γ(U ; eH0) −→ Γ(U; eh(k) H1)], which, together with (2.1), (2.2), gives the desired identification:
P : H1( bU ; OIP(k))−→ ker(U;∼ h(k))
This method does not apply, as simply as above, to the case of non positive helicity. For instance, in the case h = 0 (which corresponds to the classical wave equation), when applying the functor Rf∗ to (2.4), we notice that the middle term on the right hand side vanishes, while the third term is concentrated in degree zero. The computation of Rf∗g−1OIP(−2) is not as straightforward as it was for (2.5), since the associated spectral sequence degenerate only at the second order. Similar technical difficulties are encountered for h < 0, when trying to apply the above reasoning.
3 Sheaves and D-modules
We refer to [6] for an exposition of the theory of sheaves in the framework of derived categories: here we will just fix the notations.
Let X be a complex manifold of dimension dX, and denote by Db(X) the derived category of the category of bounded complexes of sheaves of C-vector spaces. Let f : Y −→ X be a morphism of complex manifolds, and set dY /X = dY − dX. We will consider the six classical functors in the derived category f−1, Rf!, ⊗, and Rf∗, f!, RHom. If A ⊂ X is a locally closed subset, we denote by CAthe constant sheaf on A. We denote by ωY /X
the relative dualizing complex defined by ωY /X = f!CX ' orY /X[d], where orY /X is the relative orientation sheaf on Y , and d = dimRY − dimRX.
The dual to F , denoted DX0 F , is defined by DX0 F = RHom(F, CX). Fi- nally, we denote by DbR−c(X) the full triangulated subcategory of Db(X) consisting of R-constructible objects (i.e. objects whose cohomology groups are R-constructible sheaves).
Here we will consider the correspondence:
Y
g.&f
Z X,
(3.1)
where X, Y and Z are complex manifolds, and where we make the following hypotheses:
(H.1) f and g are smooth and proper,
(H.2) (g, f ) : Y ,→ Z × X is a closed embedding.
Definition 3.1. Let F ∈ Ob(Db(X)) and H ∈ Ob(Db(Z)). We set:
PH def= Rf∗g−1(H), (3.2)
PFe def= Rg!(f−1(F ) ⊗ ωY /Z), (3.3) and call them “Penrose transform” and “adjoint Penrose transform” respec- tively.
The name “adjoint Penrose transform” is justified by the following propo- sition, which is a straightforward consequence of classical adjunction formulas of sheaf theory such as the Poincar´e-Verdier duality formula.
Proposition 3.2. (i) Let F ∈ Ob(Db(X)) and H ∈ Ob(Db(Z)). Then HomDb(Z)( ePF, H)−→ Hom∼ Db(X)(F, PH).
(ii) Let K ∈ Ob(DbR−c(X)), H ∈ Ob(Db(Z)), and assume K has compact support. Then we have a commutative diagram whose rows are isomor- phisms:
RaZ ∗(PK ⊗ H) −→∼ RaX ∗(K ⊗ PH)
↓ ↓
RaZ ∗RHom(DZ0 PK, H) −→ Ra∼ X ∗RHom(D0XK, PH)
Let us recall some notions and notations on D-modules for which we refer to Schneiders’s lectures [11].
Denote by OX the sheaf of holomorphic functions on X, by ΩX the sheaf of holomorphic forms of maximal degree, and by DX the sheaf of rings of linear differential operators with holomorphic coefficients. Denote by Db(DX) the derived category of the category of bounded complexes of left DX-modules.
We shall say that a coherent DX-module M is good, if for each compact subset K ⊂ X, there exists a coherent OX-module defined in a neighborhood of K, contained in M, and which generates M (see [8]). We denote by Dbgood(DX) the full triangulated subcategory of Db(DX) consisting of objects M such that Hj(M) is coherent and good for every j ∈ Z. If M is a coherent DX-module (or an object of Db(DX) whose cohomology groups are coherent) we denote by char(M) its characteristic variety, a closed complex analytic involutive set of the cotangent bundle T∗X. Let f : Y −→ X be a morphism of manifolds. Denote by f∗ and f−1 the direct and inverse images in the sense of D-modules.
Let us now consider the double fibration (3.1).
Definition 3.3. For N ∈ Ob(Dbgood(DZ)) we set PN def= f∗g−1N , and call it the D-module Penrose transform of N .
As a corollary of the Cauchy-Kowalevski-Kashiwara theorem applied to g, and the direct image theorem applied to f (cf. e.g. [11]), we get the following result.
Proposition 3.4. Let N ∈ Ob(Dgoodb (DZ)). Then the canonical morphism P RHomD
Z(N , OZ) −→ RHomD
X(PN , OX)[−dY /X] is an isomorphism.
Let us set, as usual:
RHomD
Z(·, ·) = RΓ(Z; RHomD
Z(·, ·)).
Combining Propositions 3.2 and 3.4 we obtain the following result.
Theorem 3.5. (i) Let N ∈ Ob(Dbgood(DZ)), and let F ∈ Ob(Db(X)).
Then there is a canonical isomorphism RHomD
Z(N ⊗ ePF, OZ)−→ RHom∼ D
X(PN ⊗ F, OX)[−dY /X].
(i) Let N ∈ Ob(Dgoodb (DZ)), and let K ∈ Ob(DR−cb (X)) have compact sup- port. Then there is a canonical diagram whose rows are isomorphisms
RHomDZ(N , PK ⊗ OZ) −→∼ RHomDX(PN , K ⊗ OX)[−dY /X]
↓ ↓
RHomD
Z(N ⊗ DZ0 PK, OZ) −→ RHom∼ DX(PN ⊗ D0XK,OX)[−dY /X].
For x ∈ X, z ∈ Z and U ⊂ X, let bx = gf−1(x), bz = fg−1(z), bU = gf−1(U ).
Corollary 3.6. Let N ∈ Ob(Dbgood(DZ)). Then (i) [Germ formula] For x ∈ X one has
RΓ(bx; RHomDZ(N , OZ)) ' RHomD
X(PN , OX)x[−dY /X];
(ii) [Holomorphic solutions] Let U ⊂ X be an open subset such that U ∩bz is contractible for every z ∈ bU . (3.4) Then
RΓ( bU ; RHomD
Z(N , OZ)) ' RΓ(U ; RHomD
X(PN , OX))[−dY /X].
Proof. It is easy to check that PCx ' Cxb for x ∈ X. On the other hand, hypothesis (H.2) gives an identification U ∩bz ' f−1(U ) ∩ g−1(z), and hence (3.4) implies that the fibers of g over f−1(U ) are topologically trivial. One then has ePCU ' CUb.
Let L be a holomorphic vector bundle on Z, L∗ = HomO
Z(L, OZ) its dual, and consider the locally free DZ-module:
DL∗ = DZ⊗OZL∗.
One recovers L as the sheaf of holomorphic solutions to DL∗: L ' RHomD
Z(DL∗, OZ).
In the framework of correspondence (1.2), applying Corollary 3.6 with L = OIP(k), we get the isomorphisms:
RΓ(bx; OIP(k)) ' RHomDX(PDIP(−k), OX)x[1],
RΓ( bU ; OIP(k)) ' RΓ(U ; RHomDX(PDIP(−k), OX))[1],
where we set DIP(−k) = DOIP(−k). It is then possible to give the following interpretation to Theorem 1.1: the Penrose transform of the locally free D- module DIP(−k) is the D-module associated to the operator h.
4 Microlocal geometry
To the double fibration (3.1) are associated the correspondences of conormal bundles
T∗Y ←− Y ×tf0
X T∗X −→ Tfπ ∗X, T∗Y ←− Y ×tg0
Z T∗Z −→ Tgπ ∗Z.
Since the maps f and g are smooth, tf0 and tg0 are closed embeddings. Let Vf
def= tf0(Y ×
X ×T∗X), Vg
def= tg0(Y ×
Z×T∗Z).
It is then easy to check that the correspondence Vf ∩ Vg
gπ.&fπ T∗Z T∗X is identified to the correspondence:
TY∗(Z × X)
pa1.&p2 T∗Z T∗X,
(4.1)
where TY∗(Z × X) denotes the cotangent bundle to Y ,→ Z × X, where p1, p2
are the first and second projection, and where pa1 = −p1. Since TY∗(Z × X) is Lagrangian, it is well known that p1 is submersive if and only if pa2 is immersive. In the following we will assume:
(H.3) the map p2 in (4.1) is a closed embedding (and so pa1 is submersive).
Let
V def= fπ(Vf ∩ Vg).
We will call V the characteristic set of the transform (3.1).
Theorem 4.1. Assume (H.1)–(H.3), and assume that X is connected. Then:
(i) Hj(PDL) is a holomorphic flat connection for j 6= 0,
(ii) char(H0(PDL)) = V , and H0(PDL) has simple characteristics along V ,
(iii) PDL is concentrated in degree zero if and only if there exists x ∈ X such that Hj(bx; L) = 0 for every j 6= dY − dX.
Proof. Statements (i) and (ii) follow immediately if one notices that P “com- mutes to microlocalization”. By this we mean the following. Let EZ be the sheaf of finite order microdifferential operators on T∗X (see [9], [5], or [10] for a detailed exposition). If M is a coherent DX-module, set EN = EZ ⊗˙π−1DZ ˙π−1N , where ˙π : ˙T∗Z −→ Z is the cotangent bundle to Z with the zero-section removed. Then one proves that
EPN = P(EN ).
The conclusion follows if one recalls that a DX-module whose characteristic variety is contained in the zero section is a locally free OX-module of finite rank endowed with a flat connection (cf. e.g. [11]).
The statement (iii) is easily proved using (i) and the germ formula.
In order to apply the last theorem to the twistor correspondence (1.2), recall that bx is identified with a projective one-space IP1 linearly embedded in IP. It is then a straightforward computation to check that for x ∈ X one has
H0(bx; OZ(k)) =
0 for k < 0,
6= 0 and finite dimensional for k ≥ 0, H1(bx; OZ(k)) is infinite dimensional for any k, Hj(bx; OZ(k)) = 0 for j 6= 0, 1 and for any k.
Corollary 4.2. The complex PDZ(−k) is concentrated in degree zero if and only if k < 0.
This explains why many authors just treat the case of positive helicity
“for simplicity’s sake”.
5 Hyperfunction solutions
Let φ be a Hermitian form on TT of signature (+, +, −, −) and choose a basis for TT such that:
φ =
0 iI2
−iI2 0
. Consider the local chart:
C4 −→ IM
(x1, x2, x3, x4) 7→
x3− x4 x1+ ix2
x1− ix2 x3+ x4
1 0
0 1
and notice that
(A∗, I2) φ
A I2
= 0
if and only if A is Hermitian, i.e. if and only if it is the image of a point (x1, x2, x3, x4) in R4 ⊂ C4 via the above local chart. The manifold
M = {L2 ∈ IM; φ(v) = 0 ∀v ∈ L2}
is thus a compact, completely real submanifold of IM, which is identified to a conformal compactification of the real Minkowski space. We similarly define
F = {(L1, L2) ∈ IF; φ(v) = 0 ∀v ∈ L2}, P = {L1 ∈ IP; φ(v) = 0 ∀v ∈ L1},
and consider the embedding of diagrams F
e g.&fe
P M
,→
IF
g.&f
IP IM
Recall after [14], that P is a real hypersurface of IP, and that eg : F −→ P is a sphere bundle (i.e. locally isomorphic to P × S1 −→ P ).
We denote by
AM = CM ⊗ OIM,
BM = RHom(DIM0 CM, OIM),
the sheaves of analytic functions and Sato hyperfunctions on M respectively.
Theorem 3.5 applied with K = CM, N = DIP(−k), gives:
RΓ(IP; PCM ⊗ OIP(k)) −→ RHom∼ D
IM(PDIP(−k), AM)[−1]
↓ ↓
RHom(D0PCM, OIP(k)) −→ RHom∼ D
IM(PDIP(−k), BM)[−1].
(5.1)
Since eg is locally isomorphic to P × S1 −→ P , and P is simply connected, we find:
HjPCM ∼=
CP, for j = 0, 1, 0, otherwise.
Under the hypothesis k < 0, taking the zero-th cohomology in (5.1) we get the following diagram in which the horizontal arrows are isomorphisms
H1(P ; OIP(k)) −→ Hom∼ DIM(PDIP(−k), AM)
↓ ↓
HP2(IP; OIP(k)) −→ Hom∼ DIM(PDIP(−k), BM).
This is Theorem 6.1 of [14].
6 Acknowledgments
I wish to thank Jean-Pierre Schneiders for the fruitful discussions we had together during the preparation of this work.
References
[1] R. J. Baston, M. G. Eastwood, The Penrose transform: its interaction with representation theory. Oxford Univ. Press (1989)
[2] A. D’Agnolo, P. Schapira, I: Correspondences for D-modules and Pen- rose transform. To appear; II: Correspondance de D-modules et trans- formation de Penrose.Expos´e XII, S´em. EDP Ec. Polytechnique (1993) [3] M. G. Eastwood, The generalized Penrose-Ward transform. Math. Proc.
Camb. Phil. Soc. 97 (1985)
[4] M. G. Eastwood, R. Penrose, R. O. Wells Jr., Cohomology and massless fields. Comm. Math. Phys. 78 (1981)
[5] M. Kashiwara, B-functions and holonomic systems. Invent. Math. 38 (1976)
[6] M. Kashiwara, P. Schapira, Sheaves on manifolds. Springer Berlin Hei- delberg New York 292 (1990)
[7] Yu. I. Manin, Gauge field theory and complex geometry. Springer Berlin Heidelberg New York 289 (1988)
[8] M. Saito, Induced D-modules and differential complexes. Bull. Soc.
math. France 117 (1989)
[9] M. Sato, T. Kawai, M. Kashiwara, Hyperfunctions and pseudo-differen- tial equations. In Hyperfunctions and pseudo-differential equations. H.
Komatsu ed., Proceedings Katata 1971. Lect. Notes in Math. Springer Berlin Heidelberg New York 287 (1973)
[10] P. Schapira, Microdifferential systems in the complex domain. Springer Berlin Heidelberg New York 269 (1985)
[11] J.-P. Schneiders, Introduction to D-modules. In these proceedings.
[12] R. S. Ward, R. O. Wells Jr., Twistor geometry and field theory Cam- bridge monographs on mathematical physics (1990)
[13] R. O. Wells Jr., Complex manifolds and mathematical physics. Bulletin of the A.M.S. 1, 2 (1979)
[14] R. O. Wells Jr., Hyperfunction solutions of the zero-rest-mass field equa- tions Commun. Math. Phys. 78 (1981)