return

Celebratio Mathematica

Barry C. Mazur

Deforming Galois Representations

by Carl Wang-Erickson

Barry Mazur’s art­icle “De­form­ing Galois rep­res­ent­a­tions[1] is re­garded today as open­ing up a new dir­ec­tion in al­geb­ra­ic num­ber the­ory. Ap­pear­ing in print in 1989 fol­low­ing a con­fer­ence talk at the Math­em­at­ic­al Sci­ences Re­search In­sti­tute (MSRI) in Berke­ley in March 1987, this art­icle in­tro­duced mod­uli the­ory of Galois rep­res­ent­a­tions in­to the con­scious­ness of num­ber the­or­ists. By the time when this au­thor began as a PhD stu­dent about twenty years later, Mazur’s art­icle was pro­moted to my gen­er­a­tion of stu­dents as the prop­er start­ing point to grasp one of the strands of math­em­at­ic­al de­vel­op­ments needed to ap­pre­ci­ate the “\( R = \mathbb{T} \) the­or­ems” that were un­der­stood as the pin­nacle of \( p \)-ad­ic tech­niques to es­tab­lish arith­met­ic Lang­lands cor­res­pond­ences. To the novice, it was at least easy to ap­pre­ci­ate that it was im­port­ant to un­der­stand both “\( R \)” and “\( \mathbb{T} \)” to start to grok “\( R = \mathbb{T} \),” and we un­der­stood that the start of the story of \( R \) was Mazur’s pa­per. In­deed, “\( R \)” stands for a uni­ver­sal de­form­a­tion ring for \( p \)-ad­ic Galois rep­res­ent­a­tions sat­is­fy­ing some con­di­tion of in­terest, as Mazur first for­mu­lated it. Then ho­mo­morph­isms \( R \to \overline{\mathbb{Q}}_p \) are in bijec­tion with the Galois rep­res­ent­a­tions of in­terest. And \( \mathbb{T} \) is the im­age ring of an ac­tion of Hecke op­er­at­ors on some nat­ur­al Hecke mod­ule, so that ho­mo­morph­isms \( \mathbb{T} \to \overline{\mathbb{Q}}_p \) are in bijec­tion with Hecke ei­gensys­tems. Thus “\( R = \mathbb{T} \)” not only sup­plies a de­sired bijec­tion between \( p \)-ad­ic Galois rep­res­ent­a­tions and Hecke ei­gensys­tems, but es­tab­lishes this bijec­tion along with a com­mon \( p \)-ad­ic in­ter­pol­a­tion.

That said, the in­flu­ence of this pa­per seems to not have been ini­tially pre­dicted. Some at­tendees of the MSRI con­fer­ence felt that Mazur had gone down a rab­bit hole, won­der­ing what this work could pos­sibly be good for. The an­swer emerged about six years later in An­drew Wiles’s lec­tures and pre­print: you could prove mod­u­lar­ity of el­lipt­ic curves us­ing \( R \)!

As Wiles re­marks in his Fer­mat pa­per [e11], by the time of the MSRI con­fer­ence in March 1987,

Hida had con­struc­ted some ex­pli­cit one-para­met­er fam­il­ies of Galois rep­res­ent­a­tions. In an at­tempt to un­der­stand this, Mazur had been de­vel­op­ing the lan­guage of de­form­a­tions of Galois rep­res­ent­a­tions. Moreover, Mazur real­ized that the uni­ver­sal de­form­a­tion rings he found should be giv­en by Hecke rings, at least in cer­tain spe­cial cases ([e11], p. 450).

Wiles points out here that the first in­stances of “\( R = \mathbb{T} \)” ap­peared in Mazur’s art­icle, ar­tic­u­lat­ing a vis­ion to es­tab­lish mod­u­lar­ity of Galois rep­res­ent­a­tions that Wiles’s Fer­mat pa­per brought to fruition. Not only that, but Mazur’s art­icle, at once, in­tro­duced the de­form­a­tion the­ory of Galois rep­res­ent­a­tions thor­oughly enough that it can still be re­garded as an ex­cel­lent ref­er­ence today; char­ac­ter­ized it us­ing ho­mo­lo­gic­al tools; and showed how to im­pose loc­al con­di­tions cor­res­pond­ing to vari­ous fla­vors of mod­u­lar forms. These top­ics of Mazur’s art­icle, and a small sample of dir­ec­tions of in­quiry re­flect­ing the in­flu­ence of his art­icle, are what we will dis­cuss in what fol­lows.

1.  The motivation for and formulation of R

Haruzo Hida’s art­icle Galois rep­res­ent­a­tions in­to \( \operatorname{GL}_2(\mathbb{Z}_p[\![ X ]\!]) \) at­tached to or­din­ary cusp forms ap­peared in print in 1986 [e3]. It was the primary in­spir­a­tion for Mazur’s art­icle. By this time, 2-di­men­sion­al \( p \)-ad­ic Galois rep­res­ent­a­tions as­so­ci­ated to single cusp forms were well-un­der­stood. Let us be­gin by re­view­ing them on our way to in­tro­du­cing Hida’s con­struc­tion of “big” Galois rep­res­ent­a­tions.

1.1.  The p-adic Galois representations associated to cuspidal Hecke eigenforms

Let \( G_{\mathbb Q} = \operatorname{Gal}(\overline{\mathbb{Q}}/{\mathbb Q}) \) and choose Frobeni­us ele­ments \( \operatorname{Fr}_\ell \in G_{\mathbb Q} \) for all prime num­bers \( \ell \). Let \( f \) de­note a nor­mal­ized cuspid­al Hecke ei­gen­form \( f \) of weight \( k \in \mathbb{Z}_{\geq 1} \) that is new of level \( N \in \mathbb{Z}_{\geq 1} \). Rep­res­ent­ing its Four­i­er ex­pan­sion as \[ f(z) = \sum_{\geq 1} a_n(f) q^n \qquad\text{where }q = e^{2 \pi i z} ,\]

in fact the \( a_n(f) \) are al­geb­ra­ic and gen­er­ate a num­ber field, the Hecke field of \( f \), de­noted \[ {\mathbb Q}(f) = {\mathbb Q}(a_n(f)_{n \geq 1}).\]

Giv­en a choice of \( p \)-ad­ic norm on \( {\mathbb Q}(f) \), thought of as an em­bed­ding \( \sigma_v : {\mathbb Q}(f) \hookrightarrow \overline{\mathbb{Q}}_p \), there ex­ists a unique (up to iso­morph­ism) ir­re­du­cible con­tinu­ous rep­res­ent­a­tion \[ \rho_{f,v} : G_{\mathbb Q} \to \operatorname{GL}_2(\overline{\mathbb{Q}}_p) \]

char­ac­ter­ized by \[ \operatorname{Tr} \rho_{f,v}(\operatorname{Fr}_\ell) = \sigma_v(a_\ell(f)) \quad \text{ for all primes }\ell \nmid Np. \]

This \( \rho_{f,v} \) was con­struc­ted by Shimura (when \( k=2 \)), De­ligne (when \( k > 2 \)), and De­ligne–Serre (when \( k=1 \)). Then \( \rho_{f,v} \) fur­ther has prop­er­ties, as fol­lows.

  1. \( \det \rho_{f,v}(c) = -1 \) for any choice of com­plex con­jug­a­tion ele­ment \( c \in G_{\mathbb Q} \). One calls such 2-di­men­sion­al rep­res­ent­a­tions of \( G_{\mathbb Q} \) odd.
  2. \( \det \rho_{f,v} = \chi_{f,v} \cdot \kappa^{k-1} \), where \( \kappa : G_{\mathbb Q} \to \mathbb{Z}_p^\times \) de­notes the \( p \)-ad­ic cyc­lo­tom­ic char­ac­ter and \[ \chi_f : (\mathbb{Z}/N\mathbb{Z})^\times \to \overline{\mathbb{Q}}^\times \]

    de­notes the neben­char­ac­ter of \( f \); we also think of \( \chi_f \) as a char­ac­ter of \( G_{\mathbb Q} \) us­ing class field the­ory, and write \( \chi_{f,v} := \sigma_v \circ \chi_f \).

  3. \( \rho_{f,v} \) is rami­fied only at places di­vid­ing \( Np \infty \), which is why \[ \operatorname{Tr} \rho_{f,v}(\operatorname{Fr}_\ell) = \sigma_v(a_\ell(f)) \]

    is well-defined.

  4. The re­stric­tion of \( \rho_{f,v} \) to a de­com­pos­i­tion group at \( p \), \[ G_p = \operatorname{Gal}(\overline{\mathbb{Q}}_p/{\mathbb{Q}}_p) \subset G_{\mathbb Q} ,\]

    is de Rham and has Hodge–Tate weights \( [0,k-1] \); it is also crys­tal­line if \( p \nmid N \).

  5. At primes \( q \) di­vid­ing \( N \), the re­stric­tion of \( \rho_{f,v} \) to the de­com­pos­i­tion group \( G_q \) at \( q \) is rami­fied, and its re­stric­tion to an in­er­tia sub­group \( I_q \subset G_q \) is re­stric­ted ac­cord­ing to the \( q \)-typ­ic­al part of the level group for which \( f \) is mod­u­lar.

It is con­jec­tured that all Galois rep­res­ent­a­tions arising “in nature” (that is, from arith­met­ic geo­metry) also come from mod­u­lar forms and their gen­er­al­iz­a­tions. A clas­sic ex­ample of a Galois rep­res­ent­a­tion arising from arith­met­ic geo­metry is fur­nished by the \( p \)-ad­ic Tate mod­ule of an el­lipt­ic curve \( E \) defined over \( q \), \[ T_p E = \varprojlim_n E[p^n] = \Bigl[\dotsm \buildrel{\cdot p}\over\to E[p^{n+1}] \buildrel{\cdot p}\over\to E[p^n] \buildrel{\cdot p}\over\to \dotsm \buildrel{\cdot p}\over\to E[p^2] \buildrel{\cdot p}\over\to E[p] \Bigr]. \]

Be­cause \( E(\mathbb{C}) \) is a com­plex tor­us, mak­ing \[ E(\mathbb{C})[p^n] \simeq \mathbb{Z}/p^n \mathbb{Z} \times \mathbb{Z}/p^n \mathbb{Z} \]

as groups, one finds that \( T_p E \simeq \mathbb{Z}_p \times \mathbb{Z}_p \). The Galois ac­tion on the al­geb­ra­ic points of \( E[p^n] \) is com­pat­ible with the lim­it, giv­ing a con­tinu­ous ac­tion of \( G_{\mathbb Q} \) on \( T_p E \), or with a choice of basis, \[ \rho_{E,p} : G_{\mathbb Q} \to \operatorname{GL}_2(\mathbb{Z}_p). \]

This has the prop­erty that for all primes \( \ell \) at which \( E \) has good re­duc­tion, \[ \operatorname{Tr} \rho_E(\operatorname{Fr}_\ell) = \ell + 1 - \# E(\mathbb{F}_\ell). \]

Moreover, \( \rho_E \otimes_{\mathbb{Z}_p} \overline{\mathbb{Q}}_p \) is ir­re­du­cible and odd, and its re­stric­tion to \( G_p \) has Hodge–Tate weights \( [0,1] \). Then the mod­u­lar­ity con­jec­ture for el­lipt­ic curves can be ex­pressed as the ex­ist­ence of a cuspid­al Hecke ei­gen­form \( f = f_E \) with Hecke field \( {\mathbb Q}(f) = {\mathbb Q} \) such that \( \rho_{E,p} \simeq \rho_{f,p} \), or, equi­val­ently, such that \( a_\ell(f) = \ell + 1 - \# E(\mathbb{F}_\ell) \) for all but fi­nitely many primes \( \ell \).

1.2.  Hida’s interpolation of ordinary cusp forms and their Galois representations

Hida es­tab­lished [e4] 1-para­met­er \( p \)-ad­ic in­ter­pol­a­tions of cuspid­al Hecke ei­gen­forms that are or­din­ary, in a sense we now ex­plain.

Fix an in­teger \( N \in \mathbb{Z}_{\geq 1} \). We call this \( N \) a tame level be­cause we as­sume \( p \nmid N \) and con­sider mod­u­lar forms of level \( \Gamma_1(Np^r) \) for \( r \in \mathbb{Z}_{\geq 1} \); we also may let \( r=0 \) and let \( \Gamma_1(Np^0) := \Gamma_1(N) \cap \Gamma_0(p) \). Let \( U_p \) refer to the usu­al Hecke op­er­at­or at \( p \) at these levels. We con­sider pairs \( (f,v) \) as above — that is, \( f \) is a nor­mal­ized cuspid­al Hecke ei­gen­form, now of level \( \Gamma_1(Np^r) \) for \( r \in \mathbb{Z}_{\geq 0} \), and \( v \) is a place over \( p \) of \( {\mathbb Q}(f) \). Such an \( (f,v) \) is called \( p \)-or­din­ary or simply or­din­ary provided that \( a_p(f) \) is a \( v \)-unit, or, equi­val­ently, that the \( v \)-ad­ic valu­ation of the \( U_p \)-ei­gen­value of \( f \) is 0.

Let­ting \[ S_k(p^r) := S_k(\Gamma_1(Np^r)) \otimes_\mathbb{Z} \mathbb{Z}_p ,\]

the mod­ule of mod­u­lar forms of weight \( k \) and level \( \Gamma_1(Np^r) \) over \( \mathbb{Z}_p \), there is a ca­non­ic­al dir­ect sum­mand \( S_k^{\operatorname{ord}}(p^r) \subset S_k(p^r) \) on which \( U_p \) acts by \( p \)-units. Sim­il­arly, de­not­ing by \( \mathbb{T}_k(p^r) \) the Hecke al­gebra over \( \mathbb{Z}_p \) gen­er­ated by Hecke en­do­morph­isms with­in \( \operatorname{End}_{\mathbb{Z}_p}(S_k(p^r)) \), there is a ca­non­ic­al quo­tient ring \( \mathbb{T}_k^{\operatorname{ord}}(p^r) \) of \( \mathbb{T}_k(p^r) \) on which \( U_p \) acts by a \( p \)-unit (via its reg­u­lar ac­tion). While ho­mo­morph­isms \( \mathbb{T}_k(p^r) \to \overline{\mathbb{Q}}_p \) are in bijec­tion with pairs \( (f,v) \) of level \( \Gamma_1(Np^r) \), it is ex­actly those \( (f,v) \) that are or­din­ary that factor through \( \mathbb{T}_k(p^r) \twoheadrightarrow \mathbb{T}_k^{\operatorname{ord}}(p^r) \).

Hida es­tab­lished the fol­low­ing \( p \)-in­teg­ral \( p \)-ad­ic in­ter­pol­a­tion of or­din­ary \( (f,v) \). For sim­pli­city let­ting \( p \) be odd, let \begin{equation} \label{eq: Lambda decomp} \Lambda := \mathbb{Z}_p[\![ \mathbb{Z}_p^\times ]\!] \simeq \prod_{\nu : \mathbb{F}_p^\times \to \mathbb{Z}_p^\times} \mathbb{Z}_p[\![ X]\!], \tag{1.2.1} \end{equation}

the Iwas­awa al­gebra of \( \mathbb{Z}_p^\times \). The iso­morph­ism at the right comes from the de­com­pos­i­tion \[\mathbb{Z}_p^\times \simeq \mathbb{F}_p^\times \times (1 + p\mathbb{Z}_p), \]

and sends the to­po­lo­gic­al gen­er­at­or \( [1+p] \in 1 + p\mathbb{Z}_p \) to \( 1+X \). The spec­trum of \( \Lambda \) may be thought of as a weight space, where \( k \in \mathbb{Z} \) cor­res­ponds to a ho­mo­morph­ism \[ \varphi_k : \Lambda \to \mathbb{Z}_p, \quad \mathbb{Z}_p^\times \ni [z] \mapsto z^{k-1}, \]

and the ad­di­tion­al data of (the \( p \)-typ­ic­al part of a) prim­it­ive neben­char­ac­ter \[ \chi_v : (\mathbb{Z}/Np^r\mathbb{Z})^\times \to \overline{\mathbb{Q}}_p^\times \]

of mod­u­lus \( Np^r \) gives rise to a ho­mo­morph­ism \[ \varphi_{k,\chi} : \Lambda \to \mathbb{Z}_p[\zeta_{p^{r-1}}] .\]

All of the Hecke al­geb­ras above are nat­ur­ally \( \Lambda \)-al­geb­ras along the map \[ \Lambda \to \mathbb{T}_k(p^r), \qquad \mathbb{Z}_p^\times \ni [z] \mapsto p^{k-1}\cdot \langle z\rangle_{p^r}, \]

where \( \langle z\rangle_{p^r} \) refers to the level \( p^r \) dia­mond Hecke op­er­at­or eval­u­ated at \( z \pmod{p^r} \).

The­or­em 1.2.2 (Hida [e4]): There ex­ists a fi­nite flat \( \Lambda \)-al­gebra \( \mathbb{T}_N^{\operatorname{ord}} \) gen­er­ated by Hecke op­er­at­ors such that for all weights \( k \geq 2 \) and char­ac­ters \( \chi_v \) that are prim­it­ive of mod­u­lus \( p^r \), there is a Hecke-com­pat­ible iso­morph­ism \[ \mathbb{T}_N^{\operatorname{ord}} \otimes_{\Lambda, \varphi_{k,\chi_v}} \mathbb{Z}_p[\zeta_{p^{r-1}}] \stackrel{\sim}{\rightarrow} \mathbb{T}_k^{\operatorname{ord}}(p^r)_{\chi_v}, \]

\( \mathbb{T}_k^{\operatorname{ord}}(p^r)_{\chi_v} \) de­not­ing the pushout along \[ \chi_v : \mathbb{Z}_p[(\mathbb{Z}/p^r\mathbb{Z})^\times] \to \mathbb{Z}_p[\zeta_{p^{r-1}}] \]

of \( \mathbb{T}_k^{\operatorname{ord}}(p^r) \).

One thinks of the \( \mathbb{T}_N^{\operatorname{ord}} \) as a uni­ver­sal 1-di­men­sion­al fam­ily of Hecke ei­gensys­tems be­cause it is fi­nite and flat over \( \Lambda \), and be­cause \( \Lambda \) has one para­met­er in the sense of \eqref{eq: Lambda decomp}. On many loc­al com­pon­ent rings of \( \mathbb{T}_N^{\operatorname{ord}} \) (which is semi­loc­al), the rank re­l­at­ive to \( \Lambda \) is 1, mean­ing that \( \mathbb{T}_N^{\operatorname{ord}} \) has a loc­al factor that is iso­morph­ic to \( \mathbb{Z}_p[\![ X]\!] \) com­pat­ibly with the weight map \( \Lambda \to \mathbb{T}_N^{\operatorname{ord}} \) and one of the factors of the de­com­pos­i­tion \eqref{eq: Lambda decomp}. For nota­tion­al con­veni­ence, let us write \( \bar f \) for a con­gru­ence class of or­din­ary \( (f,v) \) of tame level \( N \), which biject­ively la­bel the max­im­al ideals of \( \mathbb{T}_N^{\operatorname{ord}} \). On such loc­al com­pon­ents, which we write as \( \mathbb{T}_{N,\bar f}^{\operatorname{ord}} \), there is a smooth 1-para­met­er fam­ily where the para­met­er is the weight, and one can think of this fam­ily as para­met­er­iz­ing an or­din­ary cusp form \( (f_k,v_k) \in S_k^{\operatorname{ord}}(p^0) \) of weight \( k \geq 2 \) by spe­cial­iz­ing along each weight map \( \varphi_k : \Lambda \to \mathbb{Z}_p \) that factors through \( \Lambda \twoheadrightarrow \mathbb{Z}_p[\![ X]\!] \).

Hida fur­ther found that Galois rep­res­ent­a­tions in­ter­pol­ate along with the Hecke ei­gensys­tems. For sim­pli­city, we state the case of rank 1 re­l­at­ive to \( \Lambda \), where we have the or­din­ary cusp forms \( (f_k,v_k) \) para­met­er­ized by the weight \( k \).

The­or­em 1.2.3 (Hida [e3]): For any quo­tient \( \mathbb{T}_N^{\operatorname{ord}} \twoheadrightarrow \mathbb{Z}_p[\![ X]\!] \) such that the com­pos­ite map \( \Lambda \to \mathbb{Z}_p[\![ X]\!] \) has loc­al rank 1, there is a Galois rep­res­ent­a­tion \( \rho : G_{\mathbb Q} \to \operatorname{GL}_2(\mathbb{Z}_p[\![ X]\!]) \) that spe­cial­izes in weight \( k \) to the Galois rep­res­ent­a­tion \( \rho_{f_k,v_k} \). That is, there is an iso­morph­ism of 2-di­men­sion­al ir­re­du­cible rep­res­ent­a­tions \[ \bigl(\rho \otimes_{\Lambda, \varphi_k} \mathbb{Z}_p \bigr) \otimes_{\mathbb{Z}_p} \overline{\mathbb{Q}}_p \simeq \rho_{f_k,v_k}. \]

In par­tic­u­lar, the rep­res­ent­a­tion \[ \rho \otimes_{\mathbb{Z}_p[\![ X]\!]} \overline{\mathrm{Frac}(\mathbb{Z}_p[\![ X]\!])} \]

is ir­re­du­cible and \( \operatorname{Tr} \rho(\operatorname{Fr}_\ell) \) is equal to the im­age of the Hecke op­er­at­or \( T_\ell \) in \( \mathbb{Z}_p[\![ X]\!] \), for all primes \( \ell \nmid Np \). Un­der a mild as­sump­tion, there is an iso­morph­ism \begin{equation} \label{eq: ord shape} \rho\vert_{G_p} \simeq \biggl( \begin{matrix} \ast \,&\,\ast \\ {0}&{\nu(U_p)} \end{matrix} \biggr) : G_p \to \operatorname{GL}_2\bigl(\,\overline{\mathrm{Frac}(\mathbb{Z}_p[\![ X]\!])}\,\bigr). \tag{1.2.4} \end{equation}

1.3.  The motivating question

How should one think about these “big” Galois rep­res­ent­a­tions con­struc­ted by Hida, in­ter­pol­at­ing all of the \( \rho_{f,v} \) cor­res­pond­ing to or­din­ary \( (f,v) \) in the con­gru­ence class \( \bar f \)? The fol­low­ing ques­tion ex­presses some mo­tiv­a­tion for Mazur’s in­vest­ig­a­tion.

Ques­tion 1.3.1: Ima­gine the largest pos­sible Galois rep­res­ent­a­tion \( \rho^{\operatorname{ord}} \) that could pos­sibly arise from or­din­ary ei­gen­forms in the con­gru­ence class \( \bar f \) and write \( R^{\operatorname{ord}} \) for its coef­fi­cient ring; that is, we have \( \rho^{\operatorname{ord}} : G_{\mathbb Q} \to \operatorname{GL}_2(R^{\operatorname{ord}}) \). In cases where \[ \mathbb{T}_N^{\operatorname{ord}} \twoheadrightarrow \mathbb{T}_{N,\bar f}^{\operatorname{ord}} \stackrel{\sim}{\rightarrow} \mathbb{Z}_p[\![ X]\!] \]

is a pro­jec­tion onto a loc­al factor of \( \mathbb{T}_N^{\operatorname{ord}} \) as in The­or­em 1.2.3 is this \( \rho^{\operatorname{ord}} \) identic­al to \( \rho \)?

As a first step to trans­late the mo­tiv­at­ing ques­tion in­to a pre­cise ques­tion, we ad­dress the re­sid­ual case. In­deed, let­ting \( \mathbb{F} \) de­note the residue field of \( \smash{\mathbb{T}_{N,\bar f}^{\operatorname{ord}}} \), the con­gru­ence class \( \bar f \) can be thought of as the \( \mathbb{F} \)-val­ued Hecke ei­gensys­tem arising via the residue map \( \mathbb{T}_{N,\bar f}^{\operatorname{ord}} \twoheadrightarrow \mathbb{F} \). There ex­ists a unique semisimple Galois rep­res­ent­a­tion \[ \bar\rho = \rho_{\bar f} : G_{\mathbb Q} \to \operatorname{GL}_2(\mathbb{F}) \]

char­ac­ter­ized in the same way as in The­or­em 1.2.3: \[ \operatorname{Tr} \bar\rho(\operatorname{Fr}_\ell) = T_\ell, \qquad \text{for all primes } \ell \nmid Np, \]

where we think of \( T_\ell \in \mathbb{F} \) via the residue map. We call this the re­sid­ual (Galois) rep­res­ent­a­tion as­so­ci­ated to \( \bar f \).

Next, hav­ing fixed this \( \bar\rho \), we may un­der­stand the pos­sible Galois rep­res­ent­a­tions that can arise from the or­din­ary \( (f,v) \) in the con­gru­ence class \( \bar f \) by fol­low­ing Mazur in the open­ing lines of his art­icle [1]:

Giv­en a con­tinu­ous ho­mo­morph­ism \[ G_{{\mathbb Q},S} \stackrel{\bar{\rho}}{\longrightarrow} \operatorname{GL}_2(\mathbb{F}_p), \]

… the mo­tiv­at­ing prob­lem of this pa­per is to study, in a sys­tem­at­ic way, the pos­sible lift­ings of \( \bar\rho \) to \( p \)-ad­ic rep­res­ent­a­tions \[ G_{{\mathbb Q},S} \stackrel{\bar{\rho}}{\longrightarrow} \operatorname{GL}_2(\mathbb{Z}_p). \]

This no­tion of lift can be gen­er­al­ized to any com­plete loc­al \( \mathbb{Z}_p \)-al­gebra \( (A,\mathfrak{m}_A) \) with residue field \( \mathbb{F}_p \). A lift of \( \bar\rho \) to \( A \) is a con­tinu­ous ho­mo­morph­ism \( \rho_A : G_{{\mathbb Q},S} \to \operatorname{GL}_2(A) \) such that there is an equal­ity of ho­mo­morph­isms \( \rho_A \pmod{\mathfrak{m}_A} = \bar\rho \) from \( G_{{\mathbb Q},S} \) to \( \operatorname{GL}_2(\mathbb{F}_p) \). In par­tic­u­lar, the pu­tat­ive “largest pos­sible” \( \rho^{\operatorname{ord}} \) re­lated to the con­gru­ence class \( \bar f \), as in Ques­tion 1.3.1, should be real­iz­able as a lift of \( \bar\rho \) to a loc­al ring \( R^{\operatorname{ord}} = R_{\bar\rho}^{\operatorname{ord}} \).

These de­vel­op­ments lead to re­fined ques­tions, which Mazur ad­dresses.

Ques­tion 1.3.2: Is there in­deed a “largest” or­din­ary lift of \( \bar \rho \) val­ued in some com­plete loc­al \( \mathbb{Z}_p \)-al­gebra? How can we get a handle on this largest lift? And should it in­deed be com­par­able with Hida’s \( \rho^{\operatorname{ord}} \) val­ued in loc­al factors \( \mathbb{T}_{N,\bar f}^{\operatorname{ord}} \) of \( \mathbb{T}_N^{\operatorname{ord}} \)?
1.4.  Mazur’s introduction of deformation theory and its dimension

Mazur goes on to in­tro­duce his sys­tem­at­ic study of lifts, in­tro­du­cing de­form­a­tion the­ory and ini­tially ig­nor­ing the or­din­ary con­di­tion.

We use the tech­niques of de­form­a­tion the­ory. … We prove that if \( \bar\rho \) is ab­so­lutely ir­re­du­cible, there is a uni­ver­sal de­form­a­tion of \( \rho \), i.e., a com­plete no­eth­eri­an loc­al ring \( R = R(\bar\rho) \) with residue field \( \mathbb{F}_p \), and a con­tinu­ous ho­mo­morph­ism \[ G_{{\mathbb Q},S} \stackrel{\bar{\rho}}{\longrightarrow} \operatorname{GL}_2(R) \]

(well-defined up to con­jug­a­tion by an ele­ment in \( \operatorname{GL}_2(R) \) which re­duces to the iden­tity mat­rix mod­ulo the max­im­al ideal in \( R \)) which is uni­ver­sal in an evid­ent sense. Un­der the as­sump­tion that \( p > 2 \) and that \( S \) con­tains the primes \( p \) and \( \infty \), we show that the Krull di­men­sion of \( R/pR \) is \( \geq 1 \) if \( \det(\rho) \) is even, and it is \( \geq 3 \) if \( \det(\rho) \) is odd, with equal­ity hold­ing if the de­form­a­tion prob­lem is un­ob­struc­ted.

As the text in­dic­ates, there is a no­tion of de­form­a­tion of \( \bar\rho \), which is an equi­val­ence class of lifts un­der a con­di­tion some­times called strict equi­val­ence: two lifts \( \rho_A \) and \( \rho^{\prime}_A \) are strictly equi­val­ent when there ex­ists a mat­rix \( x \in 1 + M_2(\mathfrak{m}_A) \subset \operatorname{GL}_2(A) \) such that \( \rho_A = x \cdot \rho^{\prime}_A \cdot x^{-1} \).

Here is a more form­al and de­tailed ver­sion of what Mazur has sum­mar­ized in the in­tro­duc­tion above.

The­or­em 1.4.1 (Mazur): Let \( \bar\rho : G_{{\mathbb Q},S} \to \operatorname{GL}_2(\mathbb{F}_p) \) be an ab­so­lutely ir­re­du­cible rep­res­ent­a­tion.
  1. There is a uni­ver­sal de­form­a­tion of \( \bar\rho \), con­sist­ing of the data of a com­plete No­eth­eri­an loc­al \( \mathbb{Z}_p \)-al­gebra \( R_{\bar\rho} \) with residue field \( \mathbb{F}_p \) and a ho­mo­morph­ism \( \rho^\mathrm{univ}_{\bar\rho} : G_{{\mathbb Q},S} \to \operatorname{GL}_2(R_{\bar\rho}) \). It is uni­ver­sal in the sense that, for any lift \( \rho_A : G_{{\mathbb Q},S} \to \operatorname{GL}_2(A) \) there ex­ists a unique ho­mo­morph­ism \( \phi : R_{\bar\rho} \to A \) such that \( \operatorname{GL}_2(\phi) \circ \rho_{\bar\rho}^\mathrm{univ} \) and \( \rho_A \) are strictly equi­val­ent.
  2. If \( p > 2 \) and \( S \) con­tains the primes \( p \) and \( \infty \), then there are bounds on Krull di­men­sion ac­cord­ing to the par­ity of \( \det \bar\rho \).
    1. If \( \bar\rho \) is odd, then \( \dim R_{\bar\rho} \geq 3 \).
    2. If \( \bar\rho \) is even, then \( \dim R_{\bar\rho} \geq 1 \).

Mazur’s lower bounds on Krull di­men­sion come from the fol­low­ing re­la­tions between de­form­a­tions and Galois co­homo­logy.

  • Lifts \( \tilde \rho \) of \( \bar\rho : G_{{\mathbb Q},S} \to \operatorname{GL}_d(\mathbb{F}_p) \) to the dual num­bers \( \mathbb{F}_p[\epsilon]/(\epsilon^2) \) are in bijec­tion with con­tinu­ous (in­homo­gen­eous) 1-cocycles on \( G_{{\mathbb Q},S} \) val­ued in the ad­joint rep­res­ent­a­tion of \( \bar\rho \), which is a \( \mathbb{F}_p \)-vec­tor space de­noted \( Z^1(G_{{\mathbb Q},S}, \operatorname{ad}\bar\rho) \). This fol­lows from the straight­for­ward bijec­tion between such 1-cocycles \( f \) and their real­iz­a­tion in a \( 2d \)-di­men­sion­al (over \( \mathbb{F}_p \)) rep­res­ent­a­tion, which amounts to \( \tilde \rho \), as \[ \tilde\rho = \begin{pmatrix} \bar\rho& \text{``}\epsilon\text{"} f \bar\rho\\ {0}&\ \bar\rho \end{pmatrix} . \]

  • Strict equi­val­ences of such lifts amounts to con­jug­a­tion by matrices of the form \( \bigl(\begin{smallmatrix}{1}&{\ast}\\{0}&{1}\end{smallmatrix}\bigr) \), un­der which the or­bit of \( f \) is pre­cisely a co­homo­logy class. Con­sequently, de­form­a­tions to the dual num­bers are in bijec­tion with first ad­joint co­homo­logy, \( H^1(G_{{\mathbb Q},S},\operatorname{ad}\bar\rho) \).
  • Ob­struc­tions are real­ized in second ad­joint co­homo­logy by a sort of product map. The simplest ex­ample (as Mazur de­scribes in ([1], Sec­tion 1.6, Re­mark)) arises when con­sid­er­ing wheth­er some \( \tilde \rho \) as above, with match­ing co­homo­logy class \( [f] \), ex­tends from the dual num­bers to \( \mathbb{F}_p[\epsilon]/(\epsilon^3) \). It ex­tends to \( \mathbb{F}_p[\epsilon]/(\epsilon^3) \) if and only if the cup product \begin{equation} \label{eq: quadratic} H^1(\dotsm) \otimes H^1(\dotsm) \to H^2(G_{{\mathbb Q},S},\operatorname{ad}\bar\rho) \tag{1.4.2} \end{equation}

    van­ishes on \( [f] \otimes [f] \).

Here is the gen­er­al con­clu­sion that Mazur de­rives from these ar­gu­ments. Let \( d^i := \dim_{\mathbb{F}_p} H^i(G_{{\mathbb Q},S}, \operatorname{ad}\bar\rho) \).

Pro­pos­i­tion 1.4.3 ([1], Pro­pos­i­tion 2, Sec­tion 1.6): Let \( \bar\rho : G_{{\mathbb Q},S} \to \operatorname{GL}_2(\mathbb{F}_p) \) be ab­so­lutely ir­re­du­cible. The tan­gent space of \( R_{\bar\rho}/pR_{\bar\rho} \), \[ \operatorname{Hom}_{\mathbb{F}_p}(\mathfrak{m}_{\bar\rho}/(p, \mathfrak{m}_{\bar\rho}^2),\mathbb{F}_p), \]

is ca­non­ic­ally iso­morph­ic to \( H^1(G_{{\mathbb Q},S},\operatorname{ad}\bar\rho) \). Choos­ing an ar­bit­rary sur­jec­tion \( \sigma : \mathbb{Z}_p[\![ x_1, \dotsc, x_{d^1}]\!] \twoheadrightarrow R_{\bar\rho} \) that is an iso­morph­ism on tan­gent spaces, there is an in­jec­tion \begin{equation} \label{eq: obstruction map} \operatorname{Hom}_{\mathbb{F}_p}\biggl(\frac{\ker \sigma}{(x_1, \dotsc, x_{d^1}) \cdot \ker \sigma} \biggr) \hookrightarrow H^2(G_{{\mathbb Q},S}, \operatorname{ad}\bar\rho). \tag{1.4.4} \end{equation}

Fol­low­ing Mazur, let \[ \delta = d^1 - d^2 \]

de­note the ex­pec­ted di­men­sion of \( R_{\bar\rho}/pR_{\bar\rho} \). Upon the ex­pect­a­tion that \( R_{\bar\rho} \) is \( \mathbb{Z}_p \)-flat, this \( \delta \) is also the ex­pec­ted re­l­at­ive di­men­sion of \( R_{\bar\rho} \) over \( \mathbb{Z}_p \). The fol­low­ing co­rol­lary ex­presses the ex­tent to which we can ap­proach this ex­pect­a­tion merely from the de­form­a­tion the­ory of Pro­pos­i­tion 1.4.3.

Co­rol­lary 1.4.5: The Krull di­men­sion of \( R_{\bar\rho} \) sat­is­fies \[ \delta + 1\leq \dim R_{\bar\rho} \leq d^1+ 1. \]

If the ex­pec­ted di­men­sion is achieved by \( R_{\bar\rho}/pR_{\bar\rho} \), then \( R_{\bar\rho}/pR_{\bar\rho} \) is a loc­al com­plete in­ter­sec­tion ring. If \( d^2=0 \), then \( R_{\bar\rho} \) is iso­morph­ic to a power series ring over \( \mathbb{Z}_p \) of the ex­pec­ted di­men­sion.

The bounds on Krull di­men­sion giv­en in The­or­em 1.4.1 fol­low from Co­rol­lary 1.4.5 and the glob­al Euler char­ac­ter­ist­ic for­mula, as Mazur ar­gues in great­er gen­er­al­ity in ([1], Sec­tion 1.10).

1.5.  Expected dimension and flatness conjectures

Along the way to­ward present­ing the res­ults of Sec­tion 1.4, Mazur in­tro­duces the fol­low­ing per­spect­ives and ques­tions re­gard­ing \( R_{\bar\rho} \) that con­tin­ue to be of es­sen­tial in­terest today.

  1. The no­eth­eri­an­ness of \( R_{\bar\rho} \) fol­lows from an ab­stract group-the­or­et­ic con­di­tion, which he calls \( \Phi_p \) and is typ­ic­ally called Mazur’s \( \Phi_p \) fi­nite­ness con­di­tion. The \( \Phi_p \) con­di­tion on a profin­ite group \( G \) is that, for any fi­nite in­dex sub­group of \( G \), its max­im­al pro-\( p \) quo­tient is to­po­lo­gic­ally fi­nitely gen­er­ated. Thanks to the Hermite–Minkowski the­or­em, \( G_{{\mathbb Q},S} \) sat­is­fies \( \Phi_p \) for all primes \( p \).
  2. The the­ory of Sec­tion 1.4 works per­fectly well un­der the fol­low­ing gen­er­al­iz­a­tions.
    • For all di­men­sions \( d \in \mathbb{Z}_{\geq 1} \) with \( \operatorname{GL}_d \) gen­er­al­iz­ing \( \operatorname{GL}_2 \) as above.
    • For all Galois groups \( G_{F,S} \) for num­ber fields \( F \) and a fi­nite set of places \( S \) of \( F \), which sat­is­fies \( \Phi_p \) for the same reas­on as does \( G_{{\mathbb Q},S} \).
    • For all fi­nite residue fields \( k \), gen­er­al­iz­ing the \( \mathbb{F}_p \) above.

    Since Mazur’s art­icle, oth­er al­geb­ra­ic groups have been con­sidered in place of \( \operatorname{GL}_d \).

  1. For rep­res­ent­a­tions of \( G_{{\mathbb Q},S} \) or oth­er ab­so­lute Galois groups, it is nat­ur­al to ask wheth­er the ex­pec­ted di­men­sion and com­plete in­ter­sec­tion prop­erty of Co­rol­lary 1.4.5 al­ways holds.
  2. The ques­tion of Krull di­men­sion is subtle and ex­tra­pol­ates from ques­tions that are known to be dif­fi­cult: in the case \( d=1 \) for \( G_{F,S} \) and any \( \rho \), the ex­pec­ted di­men­sion con­jec­ture (Con­jec­ture 1.5.1, be­low), is equi­val­ent to the \( p \)-ad­ic Leo­poldt con­jec­ture for \( F \) ([1], Lemma 4, Sec­tion 1.10). In this case, \( \delta = r_2 + 1 \), where \( r_2 \) is the num­ber of com­plex places of \( F \).
  3. Mazur had no ex­ample of a ir­re­du­cible com­pon­ent of \( \operatorname{Spec} R \) on which \( p \) is nil­po­tent, i.e., which “doesn’t lift to char­ac­ter­ist­ic zero” ([1], Sec­tion 1.10, Rem.). This is con­sidered today to be a flat­ness con­jec­ture (that is, flat­ness over \( \mathbb{Z}_p \)).

Mazur re­marked that he did not have any counter­examples to the fol­low­ing state­ment, which since has de­veloped in­to a con­jec­ture en­com­passing the ques­tions raised by (iii)–(v).

Con­jec­ture 1.5.1 (Ex­pec­ted di­men­sion): As­sume that \( p \) is odd and let \( R_{\bar\rho} \) de­note the de­form­a­tion ring of some ab­so­lutely ir­re­du­cible \( \bar\rho : G_{{\mathbb Q},S} \to \operatorname{GL}_d(\mathbb{F}_p) \). The \( \mathbb{Z}_p \)-al­gebra \( R_{\bar\rho} \) is flat of re­l­at­ive di­men­sion \( \delta \). In par­tic­u­lar, by Co­rol­lary 1.4.5, the in­ject­ive map \eqref{eq: obstruction map} is an iso­morph­ism and \( R_{\bar\rho} \) is a com­plete in­ter­sec­tion ring.
Re­mark 1.5.2: Be­cause ab­so­lute Galois groups \( G_K \) of \( \ell \)-ad­ic loc­al fields \( K \) are to­po­lo­gic­ally fi­nitely gen­er­ated, their de­form­a­tion rings are No­eth­eri­an by the ar­gu­ment for The­or­em 1.4.1. The ana­logue of Con­jec­ture 1.5.1, where \( G_K \) re­places \( G_{{\mathbb Q},S} \), has re­cently been proved; see Sec­tion 3.
1.6. Intellectual lineage

While we have mainly em­phas­ized the nov­elty of Mazur’s use of de­form­a­tion the­ory, its his­tor­ic­al pre­ced­ents bear men­tion­ing. In­deed, Mazur men­tions two in the elided por­tion of the quote that be­gins Sec­tion 1.4:

We use the tech­niques of de­form­a­tion the­ory. There have been nu­mer­ous stud­ies of the glob­al vari­ation of rep­res­ent­a­tions over \( \mathbb{C} \) of fi­nitely gen­er­ated groups; cf. the mem­oir of Lub­otzky and Ma­gid [e2] or the re­cent pre­print of Gold­man and Mill­son [e6]. The view­point we ad­opt here is sim­il­ar, with the ex­cep­tion that in our con­text (our groups are profin­ite and our rep­res­ent­a­tions are \( p \)-ad­ic) it makes sense only to con­sider form­al de­form­a­tions.

Gold­man and Mill­son showed that de­form­a­tions of rep­res­ent­a­tions of fun­da­ment­al groups of com­pact Kähler man­i­folds have at most quad­rat­ic sin­gu­lar­it­ies, i.e., the quad­rat­ic term de­scribed in \eqref{eq: quadratic} suf­fices as a present­a­tion of the de­form­a­tion ring. Lub­otzky and Ma­gid deal with mod­uli vari­et­ies of rep­res­ent­a­tions of fi­nitely gen­er­ated groups. In Sec­tion 7 of [e2], en­titled “His­tor­ic­al re­marks” (which touches on a sur­pris­ingly broad ar­ray of math­em­at­ic­al prob­lems with links to the mod­uli the­ory of rep­res­ent­a­tions), they point out that Weil, in his study of dis­crete sub­groups \( \Gamma \) of a Lie group with Lie al­gebra \( \mathfrak{g} \), seems to have been the first to no­tice that de­form­a­tions are con­trolled by \( H^1(\Gamma, \mathfrak{g}) \) [e1]. This is an im­port­ant in­sight that Mazur draws upon.

2.  Local conditions and the first “R \({}= \mathbb{T} \)

Hav­ing es­tab­lished the above found­a­tion­al res­ults of Galois de­form­a­tion the­ory, Mazur re­turns to the mo­tiv­at­ing case, im­pos­ing a \( p \)-loc­al or­din­ary con­di­tion on Galois rep­res­ent­a­tions and mod­u­lar forms. The rest of his pa­per im­poses suf­fi­ciently nar­row re­stric­tions on \( \bar\rho \) so that he can prove that a de­form­a­tion ring for or­din­ary Galois rep­res­ent­a­tion \( R_{\bar\rho}^{\operatorname{ord}} \) is nat­ur­ally iso­morph­ic to an or­din­ary Hecke al­gebra. This out­come is the first his­tor­ic­al ex­ample of an \( R = \mathbb{T} \) the­or­em ([1], Sec­tion 2.5, Pro­pos­i­tion 14), set­ting the stage for the en­su­ing dec­ades of ex­plor­a­tion of these de­form­a­tion spaces and proofs of mod­u­lar­ity the­or­ems. For ex­ample, Mazur an­ti­cip­ated one dir­ec­tion of in­quiry: after pre­view­ing his proof that cer­tain \( R_{\bar\rho}^{\operatorname{ord}} \) are nat­ur­ally iso­morph­ic to an or­din­ary Hecke al­gebra, Mazur asks, “Are all rep­res­ent­a­tions [in the de­form­a­tion space] sim­il­arly ap­prox­im­able” even when no or­din­ary con­di­tion is im­posed? ([1], p. 386).

2.1. The ordinary deformation ring

Be­cause Galois rep­res­ent­a­tions \( \bar\rho: G_{{\mathbb Q},S} \to \operatorname{GL}_2(\mathbb{F}_p) \) as­so­ci­ated to holo­morph­ic mod­u­lar forms are odd (Sec­tion 1.1), by Co­rol­lary 1.4.5, there are too many di­men­sions — at least 3, which is ex­pec­ted — com­pared to Hida’s 1-di­men­sion­al fam­il­ies of \( p \)-or­din­ary Hecke ei­gensys­tems (The­or­em 1.2.2). Mazur real­ized that this dif­fer­ence in di­men­sion can be at­trib­uted to the \( p \)-loc­al con­di­tion that \( p \)-ad­ic Galois rep­res­ent­a­tions \( \rho \) as­so­ci­ated to \( p \)-or­din­ary mod­u­lar forms must sat­is­fy: the re­stric­tion \( \rho\vert_{G_p} \) to the de­com­pos­i­tion sub­group at \( p \), \( G_p \subset G_{{\mathbb Q},S} \), must be re­du­cible with an un­rami­fied quo­tient as in \eqref{eq: ord shape}.

To make this rig­or­ous, Mazur de­scribes the or­din­ary loc­al con­di­tion: giv­en \( \rho_A : G_{{\mathbb Q},S} \to \operatorname{GL}_2(A) \) and think­ing of \( A \oplus A \) as the nat­ur­al mod­ule for the \( G_{{\mathbb Q},S} \)-ac­tion via \( \rho : G_{{\mathbb Q},S} \to \operatorname{GL}_2(A) \), one calls \( \rho_A \) or­din­ary when there ex­ists a \( A \)-rank 1 free sum­mand \( M \subset A \oplus A \) that is in­vari­ant un­der the in­er­tia sub­group \( I_p \subset G_{{\mathbb Q},S} \).

Mazur claims that there ex­ists a uni­ver­sal or­din­ary de­form­a­tion ([1], Pro­pos­i­tion 3, Sec­tion 1.7) and finds its co­ordin­ate ring, the uni­ver­sal or­din­ary de­form­a­tion ring \( R_{\bar\rho}^{\operatorname{ord}} \), as a quo­tient of \( R_{\bar\rho} \) un­der a sup­ple­ment­al as­sump­tion ([1], Pro­pos­i­tion 12, Sec­tion 2.5). The Za­r­iski-closed­ness of a loc­al con­di­tion gained re­cog­ni­tion as an im­port­ant goal in the the­ory.

2.2  “Neat” residual representations

Now that \( R_{\bar\rho}^{\operatorname{ord}} \) ex­ists, Mazur’s next chal­lenge is to show that it is small enough to be iso­morph­ic to an or­din­ary Hecke al­gebra. To do this, he chooses a situ­ation where the am­bi­ent de­form­a­tion ring \( R_{\bar\rho} \) is as nice as pos­sible, re­l­at­ive to the re­stric­tions of Co­rol­lary 1.4.5, when \( \bar\rho \) is ir­re­du­cible and odd: this is the case where \( R_{\bar\rho} \) is iso­morph­ic to a power series ring over \( \mathbb{Z}_p \) in 3 vari­ables. He shows that this case oc­curs when \( \bar\rho \) sat­is­fies a con­di­tion called “neat,” which we will de­vel­op shortly.

Re­mark 2.2.1: All of the rest of Mazur’s work oc­curs in the set­ting where \( \bar\rho \) is neat, lever­aging the nice­ness of \( R_{\bar\rho} \). It seems that the main res­ults go­ing bey­ond Mazur’s ap­proach did not ap­pear un­til Wiles’s work.

Mazur con­siders the case where \( S = \{p, \infty\} \), so \( G_{{\mathbb Q},S} \) is the Galois group of \( {\mathbb Q} \) rami­fied at only \( p \) and \( \infty \); and se­lects the re­sid­ual rep­res­ent­a­tion \( \bar\rho \) to be an in­duced rep­res­ent­a­tion \[ \bar\rho = \operatorname{Ind}_{{\mathbb Q}(\sqrt{-p})}^{\mathbb Q} \bar\chi : G_{{\mathbb Q},S} \to \operatorname{GL}_2(\mathbb{F}_p) \]

from a non­trivi­al char­ac­ter \( \bar\chi : G_{{\mathbb Q}(\sqrt{-p})} \to \mathbb{F}_p^\times \) of the ima­gin­ary quad­rat­ic field \( {\mathbb Q}(\sqrt{-p}) \) con­tained in the \( p \)-th cyc­lo­tom­ic field; thus one must have \( p \equiv 3\pmod{4} \). Mazur im­poses ad­di­tion­al as­sump­tions, as fol­lows, and also lists some primes \( p \) for which these as­sump­tions are sat­is­fied:

  • the class num­ber of \( L = \overline{\mathbb{Q}}^{\ker \bar\rho} \) is re­l­at­ively prime to \( p \);
  • the abeli­an cyc­lic ex­ten­sion \( L/{\mathbb Q}(\sqrt{-p}) \) is rami­fied only at the prime of \( {\mathbb Q}(\sqrt{-p}) \) over \( p \);
  • \( \bar\rho \) sat­is­fies a neat­ness con­di­tion of ([1], Sec­tion 1.12), which means that cer­tain \( \operatorname{Gal}(L/{\mathbb Q}) \)-mod­ules con­struc­ted from the unit group \( L^\times \) have no non­trivi­al \( \operatorname{ad}\bar\rho \)-iso­typ­ic­al parts.

The neat­ness as­sump­tion is good enough to de­duce that \( H^2(G_{{\mathbb Q},S}, \operatorname{ad}\bar\rho) = 0 \), so that the un­res­tric­ted Galois de­form­a­tion ring \( R_{\bar\rho} \) is iso­morph­ic to a power series ring \( \mathbb{Z}_p[\![ X_1,X_2,X_3 ]\!] \). Then Mazur checks that there is a unique lift of \( \bar\rho \) to \( \rho : G_{{\mathbb Q}, S} \to \operatorname{GL}_2(\mathbb{Z}_p) \) such that the im­age of \( \rho \) is iso­morph­ic to the im­age of \( \bar\rho \) via re­duc­tion mod­ulo \( p \). He can also identi­fy this as the Galois rep­res­ent­a­tion \( \rho_f \) at­tached to a weight 1 \( p \)-or­din­ary cuspid­al ei­gen­form \( f \) of level \( \Gamma_0(p) \).

Be­cause \( \bar\rho \) is in­duced, its im­age is a di­hed­ral group. Such rep­res­ent­a­tions will be called glob­ally di­hed­ral in what fol­lows.

2.3.  Establishing R\( {}= \mathbb{T} \)

Now we have enough in­form­a­tion to ex­press Mazur’s or­din­ary \( R = \mathbb{T} \) the­or­em and sketch its proof. Let \( R_{\bar\rho}^{\operatorname{ord}} \) de­note the uni­ver­sal or­din­ary de­form­a­tion ring for \( \bar\rho \), which is nat­ur­ally a quo­tient \( R_{\bar\rho} \twoheadrightarrow R_{\bar\rho}^{\operatorname{ord}} \). Let \( \mathbb{T}^{\operatorname{ord}} \) de­note Hida’s or­din­ary Hecke al­gebra of tame level \( N=1 \). The weight 1 ei­gen­form \( f \) gives a ho­mo­morph­ism \( \phi_f : \mathbb{T}^{\operatorname{ord}} \to \mathbb{Z}_p \) send­ing each Hecke op­er­at­or to its ei­gen­value on \( f \). Let \( \mathbb{T}_{\bar f}^{\operatorname{ord}} \) de­note the com­ple­tion of \( \mathbb{T}^{\operatorname{ord}} \) at the max­im­al ideal that is the ker­nel of the com­pos­ite \[ \phi_{\bar f} : \mathbb{T}^{\operatorname{ord}} \stackrel{\phi_f}{\longrightarrow} \mathbb{Z}_p \twoheadrightarrow \mathbb{F}_p. \]

Mazur’s ar­gu­ment in ([1], Sec­tions 2.1–2.5) to es­tab­lish a nat­ur­al iso­morph­ism \( R^{\operatorname{ord}}_{\bar\rho} \stackrel{\sim}{\rightarrow} \mathbb{T}^{\operatorname{ord}}_{\bar f} \) pro­ceeds through the fol­low­ing steps.

  1. The lift \( \rho_f \) of \( \bar\rho \) is the only lift (among all coef­fi­cient rings \( (A,\mathfrak{m}_A) \), prop­erly in­ter­preted) un­der which the im­age of the in­er­tia sub­group \( I_p \) is con­stant, i.e., maps iso­morph­ic­ally to \( \bar\rho(I_p) \) mod­ulo \( \mathfrak{m}_A \) (Pro­pos­i­tion 9 and Lemma 5, [1]).
  2. There is a quo­tient \( R_{\bar\rho} \twoheadrightarrow R^\mathrm{dih}_{\bar\rho} \) factor­ing all of the maps \( R_{\bar\rho} \to A \) such that the as­so­ci­ated Galois rep­res­ent­a­tion \( \rho_A \) is di­hed­ral, i.e., it has pro-di­hed­ral im­age. Moreover, there is an iso­morph­ism \( R^{\mathrm{dih}}_{\bar\rho} \simeq \mathbb{Z}_p [\![ t_1, t_2]\!] \) (Lemma 6, Pro­pos­i­tion 11, [1]). Be­cause \( \bar\rho_f \) and \( \rho_f \) have di­hed­ral im­age, the as­so­ci­ated map \( R_{\bar\rho} \to \mathbb{Z}_p \) factors through \( R_{\bar\rho} \twoheadrightarrow R^{\mathrm{dih}}_{\bar\rho} \).
  3. The or­din­ary and glob­ally di­hed­ral loci with­in \( \operatorname{Spf} R_{\bar\rho} \) are trans­verse in the strongest pos­sible way, not­with­stand­ing the fact that the \( \mathbb{Z}_p \)-point as­so­ci­ated to \( \rho_f \) lies in both of these two sub­loci (Lemma 7, [1]). Namely, the in­ter­sec­tion of the glob­ally di­hed­ral and or­din­ary sub­spaces of the tan­gent space \( \mathfrak{t}_{\bar\rho} \) of \( \operatorname{Spf} R_{\bar\rho} \), \[ \mathfrak{t}_{\bar\rho} = \operatorname{Hom}_{\mathbb{F}_p} \biggl(\frac{\mathfrak{m}_{\bar\rho}}{(\mathfrak{m}_{\bar\rho}^2, p)}, \mathbb{F}_p\biggr) \cong H^1(G_{{\mathbb Q},S}, \operatorname{ad}\bar\rho) \simeq \mathbb{F}_p^{\oplus 3}, \]

    is the zero sub­space.

  4. Be­cause \( R^\mathrm{dih}_{\bar\rho} \simeq \mathbb{Z}_p [\![ t_1, t_2]\!] \), the glob­ally di­hed­ral sub­space of \( \mathfrak{t}_{\bar\rho} \) has di­men­sion 2. Thus the trans­vers­al­ity res­ult (iii) im­plies that hence the tan­gent space of the or­din­ary sub­locus has dim \( \leq 1 \) (Pro­pos­i­tion 13, [1]). This is equi­val­ent to the ex­ist­ence of some sur­jec­tion \( \mathbb{Z}_p[\![ t]\!] \twoheadrightarrow R_{\bar\rho}^{\operatorname{ord}} \).
  5. By in­ter­pol­at­ing the Galois rep­res­ent­a­tions as­so­ci­ated to the or­din­ary ei­gen­forms para­met­er­ized by \( \mathbb{T}_{\bar f}^{\operatorname{ord}} \) fol­low­ing Hida, there is a rep­res­ent­a­tion \[\rho_{\mathbb{T}_{\bar f}^{\operatorname{ord}}} : G_{{\mathbb Q},S} \to \operatorname{GL}_2(\mathbb{T}_{\bar f}^{\operatorname{ord}}). \]

    This rep­res­ent­a­tion is or­din­ary and de­forms \( \bar\rho \), and there­fore there is a sur­ject­ive map \( R_{\bar\rho}^{\operatorname{ord}} \twoheadrightarrow \mathbb{T}_{\bar f}^{\operatorname{ord}} \).

  6. Be­cause \( \mathbb{T}_{\bar f}^{\operatorname{ord}} \) is \( \Lambda \)-flat and \( R_{\bar\rho}^{\operatorname{ord}} \) is a quo­tient of \( \mathbb{Z}_p[\![ t]\!] \), the sur­jec­tion provides an iso­morph­ism \[R_{\bar\rho}^{\operatorname{ord}}\cong \mathbb{T}_{\bar f}^{\operatorname{ord}}. \]

2.4.  Complements

Among the parts of his pa­per that we have not dis­cussed in de­tail, Mazur en­gages in pres­ci­ent dis­cus­sions of 2-di­men­sion­al Galois rep­res­ent­a­tions de­form­ing the neat re­sid­ual rep­res­ent­a­tion \( \bar\rho \) fixed above, as fol­lows.

  • The closed locus of \( \operatorname{Spf} R_{\bar\rho} \) in which the ac­tion of \( I_p \) is re­du­cible is the uni­on of two hy­per­sur­faces, each of which con­sists of rep­res­ent­a­tions whose re­stric­tion to \( I_p \) sta­bil­izes a fixed flag. The in­ter­sec­tion of these two hy­per­sur­faces is a line in which the ac­tion of \( I_p \) is re­du­cible and de­com­pos­able. Such loci figured in­to stud­ies of Ghate–Vat­sal [e18], [e24], who proved (un­der mild as­sump­tions) that the \( I_p \)-re­du­cible and de­com­pos­able locus gen­er­ic­ally con­sists of di­hed­ral rep­res­ent­a­tions of \( G_{{\mathbb Q},S} \).
  • The ana­lyt­ic space \( (\operatorname{Spf} R_{\bar\rho})^\mathrm{an} \) as­so­ci­ated to \( \operatorname{Spf} R_{\bar\rho} \) ad­mits a map, due to Sen, to the 2-di­men­sion­al ana­lyt­ic af­fine space of quad­rat­ic mon­ic poly­no­mi­als in \( {\mathbb{Q}}_p[t] \). Loosely speak­ing, this keeps track of Hodge–Tate weights \( p \)-ad­ic­ally in­ter­pol­at­ing over \( (\operatorname{Spf} R_{\bar\rho})^\mathrm{an} \). Mazur asks sev­er­al pro­voc­at­ive ques­tions about loci with weights \( (0,0) \). The al­geb­ra­iz­a­tion and char­ac­ter­iz­a­tion of loci with fixed such weights played a sub­stan­tial role in sub­sequent stud­ies and ap­plic­a­tions of Galois de­form­a­tion the­ory.

3.  The influence of Galois deformation theory

The vast in­flu­ence of Mazur’s pa­per mani­fests in many ways. To con­clude, what fol­lows is an at­tempt to men­tion some of the most prom­in­ent dir­ec­tions rep­res­en­ted among de­vel­op­ments sparked by Mazur’s pa­per, though un­doubtedly there are more.

(1)  There are many fur­ther Za­r­iski-closed­ness res­ults and char­ac­ter­iz­a­tions re­gard­ing loci with­in Galois de­form­a­tion spaces with loc­al prop­er­ties of in­terest as in Sec­tion 2.4, one of the first be­ing the PhD thes­is of Ra­makrishna [e9]. Later, Kisin ex­pan­ded such res­ults to a broad­er ar­ray of con­di­tions from \( p \)-ad­ic Hodge the­ory [e22], [e19].

(2)  Wiles [e11] and Taylor–Wiles [e12] de­veloped meth­ods — the nu­mer­ic­al cri­terion and the Taylor–Wiles meth­od — to com­pare Galois de­form­a­tion rings with Hecke al­geb­ras and es­tab­lish \( R = \mathbb{T} \) the­or­ems, fol­lowed by works of many oth­ers to ex­pand the scope of these meth­ods. Among such ef­forts, in­nov­a­tions to the Taylor–Wiles meth­od due to Kisin [e22], [e21] stand out, as he ap­plied these in­nov­a­tions to es­tab­lish the mod­u­lar­ity con­jec­ture (Fon­taine–Mazur con­jec­ture for \( \operatorname{GL}_2/{\mathbb Q} \)) in a broad ar­ray of cases.

(3)  Skin­ner–Wiles [e15] ap­plied the Taylor–Wiles meth­od to ad­dress re­sid­ually re­du­cible cases of the Fon­taine–Mazur con­jec­ture for \( \operatorname{GL}_2/{\mathbb Q} \).

(4)  There are ex­pli­cit stud­ies of spaces of de­form­a­tions of Galois rep­res­ent­a­tions, in­clud­ing those of Mazur’s PhD su­per­visees Bo­ston [e7] and Böckle [e16].

(5)  The “in­fin­ite fern” of mod­u­lar points with­in Galois de­form­a­tion spaces was iden­ti­fied by Mazur [3] and Gouvêa–Mazur [2]. This work, along with Cole­man’s de­vel­op­ment of the the­ory of over­con­ver­gent \( p \)-ad­ic mod­u­lar forms [e13], led to the de­vel­op­ment of \( p \)-ad­ic ana­lyt­ic fam­il­ies of mod­u­lar forms, the first ex­ample giv­en by the Cole­man–Mazur ei­gen­curve [4].

(6)  Clozel–Har­ris–Taylor [e20] ex­pan­ded the scope of de­form­a­tion-the­or­et­ic meth­ods (e.g., Taylor–Wiles meth­od) to prove auto­morphy the­or­ems for \( d \)-di­men­sion­al Galois rep­res­ent­a­tions sat­is­fy­ing con­di­tions such that they ought to arise from auto­morph­ic rep­res­ent­a­tions of a unit­ary groups, gen­er­al­iz­ing to \( d > 2 \) the case of \( d=2 \) and \( \operatorname{GL}_2 \). There were oth­er earli­er stud­ies of de­form­a­tion the­ory for Galois rep­res­ent­a­tions val­ued in high­er rank re­duct­ive groups, for ex­ample due to Tilouine [e14].

(7)  There are ef­forts such as those of Dia­mond–Flach–Guo [e17] and Bellaïche–Chenevi­er [e23] to use Galois de­form­a­tion spaces, and their re­la­tion to Hecke al­geb­ras, to draw con­clu­sions about Selmer groups and Bloch–Kato con­jec­tures.

(8)  The de­vel­op­ment of the the­ory of pseudochar­ac­ters due to Wiles [e5], Taylor [e8], Caray­ol [e10], and oth­ers ap­ply­ing de­form­a­tion the­ory to the trace func­tion arising from rep­res­ent­a­tions. This was fur­ther de­veloped and ap­plied by Bellaïche–Chenevi­er [e23] and Chenevi­er [e25]. A de­cis­ive de­vel­op­ment of the the­ory of pseudochar­ac­ters, ex­tend­ing the no­tion to \( G \)-val­ued rep­res­ent­a­tions where \( G \) is a re­duct­ive group, was provided by V. Laf­forgue in the course of his proof of the un­rami­fied Lang­lands cor­res­pond­ence over func­tion fields [e26].

(9)  The in­tro­duc­tion of stacks of Galois rep­res­ent­a­tions such as in work of the au­thor [e29] and of Emer­ton–Gee [e34], vary­ing among all re­sid­ual rep­res­ent­a­tions. In par­tic­u­lar, these no longer re­quire the re­stric­tion, ne­ces­sary for Mazur’s de­form­a­tion the­ory, that the re­sid­ual rep­res­ent­a­tion has no non­scal­ar auto­morph­isms.

(10)  The proof of the ex­pec­ted di­men­sion and \( \mathbb{Z}_p \)-flat­ness prop­er­ties, Con­jec­ture 1.5.1, is due in the \( \ell \)-ad­ic loc­al case to Böckle–Iy­engar–Pašk&umacrnas [e35] (when \( \ell = p \)), and to Helm [e30], Dat–Helm–Kur­inczuk–Moss [e37] and Zhu [e38] (when \( \ell \neq p \)). It is cur­rently open in the glob­al case.

(11)  Ven­kateshGala­ti­us’s ex­pan­sion of the scope of Galois de­form­a­tion tech­niques in­to de­rived set­tings [e28], set­ting up the Galois side of de­rived en­rich­ments first set up on the auto­morph­ic side by Calegari–Ger­aghty [e27].

(12)  Re­cently there have been spec­tac­u­lar ap­plic­a­tions of high­er di­men­sion­al Galois rep­res­ent­a­tions and their de­form­a­tion the­ory, such as New­ton–Thorne’s work on sym­met­ric power func­tor­al­ity for mod­u­lar forms [e31], [e32] and Hil­bert mod­u­lar forms [e39], and of Box­er–Calegari–Gee–Pil­loni on the mod­u­lar­ity of abeli­an sur­faces [e33], [e36].

There is now a lot of at­ten­tion dir­ec­ted at the po­ten­tial for geo­met­riz­a­tions of the arith­met­ic Lang­lands cor­res­pond­ences ana­log­ous to those of the geo­met­ric Lang­lands pro­gram. Such geo­met­riz­a­tion em­phas­izes sheaves on mod­uli stacks of Galois rep­res­ent­a­tions over the points of this stack, sug­gest­ing that there are even more yet-un­ex­plored lay­ers of sig­ni­fic­ance of the Galois de­form­a­tion spaces pi­on­eered by Mazur.

Carl Wang-Er­ick­son is cur­rently an as­sist­ant pro­fess­or of math­em­at­ics at the Uni­versity of Pitt­s­burgh. His re­search fo­cuses on the re­la­tion­ship between \( p \)-ad­ic fam­il­ies of Galois rep­res­ent­a­tions and the co­homo­logy of mod­u­lar curves.

Works

[1]B. Mazur: “De­form­ing Galois rep­res­ent­a­tions,” pp. 385–​437 in Galois groups over \( \mathbb{Q} \) (Berke­ley, CA, 23–27 March 1987). Edi­ted by Y. Ihara, K. Ribet, and J.-P. Serre. Math­em­at­ic­al Sci­ences Re­search In­sti­tute Pub­lic­a­tions 16. Spring­er (New York), 1989. For A. Mazur. Avail­able open access here. MR 1012172 Zbl 0714.​11076 incollection

[2]F. Gouvêa and B. Mazur: “Fam­il­ies of mod­u­lar ei­gen­forms,” Math. Com­put. 58 : 198 (April 1992), pp. 793–​805. MR 1122070 Zbl 0773.​11030 article

[3]B. Mazur: “An ‘in­fin­ite fern’ in the uni­ver­sal de­form­a­tion space of Galois rep­res­ent­a­tions,” pp. 155–​193 in Journées arith­métiques [Arith­met­ic days] (Bar­celona, Ju­ly 1995), published as Col­lect. Math. 48 : 1–​2. Spring­er (Ber­lin), 1997. In memory of R. H. Bing. MR 1464022 Zbl 0865.​11046 incollection

[4]R. Cole­man and B. Mazur: “The ei­gen­curve,” pp. 1–​113 in Galois rep­res­ent­a­tions in arith­met­ic al­geb­ra­ic geo­metry (Durham, UK, 9–18 Ju­ly 1996). Edi­ted by A. J. Scholl and R. L. Taylor. Lon­don Math­em­at­ic­al So­ci­ety Lec­ture Note Series 254. Cam­bridge Uni­versity Press, 1998. MR 1696469 Zbl 0932.​11030 incollection