Statistical mechanics of spin systems on diluted random structures

(1)

SISSA

ISAS

Scuola Internazionale Superiore di Studi Avanzati

International School for Advanced Studies

Statisti al me hani s of spin systems

on diluted random stru tures

Thesis submitted for the degree of

Do tor Philosophi

Candidate: Supervisors:

Mi hele Leone Prof. Amos Maritan

(2)

(3)

Introdu tion 7

1 General te hniques for diluted random models 11

1.1 Graphs and Hyper-graphs: preliminarydenitions . . . 11

1.2 Spin models ondiluted stru tures . . . 14

1.2.1 Disorder . . . 15

1.2.2 Frustration . . . 15

1.2.3 Combinatorialoptimizationproblems as spin models . . . 18

1.2.4 Quen hed disorderaverages and general omputational strategies . . . . 19

1.2.5 Repli as . . . 19

1.2.6 Cavity . . . 21

1.2.7 Phase spa e stru ture . . . 23

2 The generalized diluted p-spin model 27 2.1 Combinatorialoptimizationinterpretation of p-spinmodels: the XOR-SAT . . . 28

2.2 From the partitionfun tion to the average free-energy. . . 30

2.2.1 Some onsiderationson normalization. . . 34

2.3 The Repli a Symmetri Results . . . 37

2.3.1 Vanishingelds . . . 39

2.3.2 Analyti al Ansatz for T =0 solutionswith non vanishingelds. . . 40

2.3.3 The ferromagneti solution . . . 41

2.3.4 The spin-glass statesand the RSenergy lines . . . 44

2.4 The 1RSB al ulations . . . 45

2.4.1 The variationalfa torizedAnsatz . . . 47

2.4.2 The onstru tion of the phase diagram . . . 51

2.4.3 On the physi alir-relevan e of fra tionalelds . . . 58

2.4.4 A parti ular exa t ase: hyper-graphs with xed degree distribution . . . 60

2.5 The general 1RSBequations . . . 65

2.5.1 General SolutionatT =0 . . . 67

2.6 "Ferromagneti omplexity" . . . 73

2.6.1 Hiding solutions inrandom satisabilityproblems . . . 76

2.7 An exa t alternative solutionof the p-spin modelat T =0 . . . 78

2.7.1 The onset of frustration: hyper-loops in the graph . . . 78

2.7.2 Leaf removalalgorithm . . . 79

2.7.3 The ore and the al ulation ofthe threshold . . . 83

(4)

3 Some parti ular ases of interest 89

3.1 The 2+p-XOR-SATmodel: role of phase oexisten e and nite-size s aling . . . 89

3.1.1 Model denition and outlineof someresults . . . 90

3.1.2 Numeri al simulations . . . 92

3.1.3 Con lusions . . . 95

3.2 Ferromagneti orderingon randomgraphs . . . 96

3.2.1 Introdu tion . . . 96

3.2.2 The repli a approa hon generalrandom graphs . . . 97

3.2.3 Ferromagneti phasetransition . . . 99

3.2.4 Criti al behavioraround . . . 100

3.2.5 Power lawdistributed graphs . . . 101

3.2.6 2< 3 . . . 102

3.2.7 3< 5 . . . 103

4 Two examples of NP optimization problems 105 4.1 The Hyper-Graph Bi oloring Problem. . . 105

4.1.1 The RS results . . . 106

4.1.2 The RSB Cal ulations . . . 108

4.2 Results of the variationalRSB al ulations for the random 3-SAT . . . 111

5 Phase and omputational omplexity transitions 115 5.1 Global algorithmstransitions inlinear systems over niteelds. . . 115

5.1.1 Introdu tion . . . 115

5.1.2 Random Linear systems in GF(2): rigorous results and statisti al me- hani s analysis . . . 116

5.1.3 Algorithmsbehavior . . . 120

5.1.4 The RSA ryptosystem and fa torization . . . 123

5.2 The dynami phase transitionfor de oding algorithms. . . 132

5.2.1 Introdu tion . . . 132

5.2.2 Error orre ting odes, de oding algorithmsand the avity equations . . 133

5.2.3 Statisti alme hani s formulationand the repli aapproa h . . . 138

5.2.4 Binary erasure hannel: analyti al and numeri alresults . . . 139

5.2.5 The general hannel: analyti al and numeri alresults . . . 147

5.2.6 Con lusions . . . 154

6 Determining bounds 157 6.1 Variationalbounds for optimizationproblems and spin systems. . . 157

6.1.1 Notations . . . 158

6.1.2 The general strategy . . . 160

6.1.3 The RS bound . . . 164

6.1.4 The 1RSB Bound . . . 168

6.1.5 Summary and on lusions . . . 171

Con lusions and perspe tives 173

(5)

C On the hoi e of the fun tional order parameter 179

C.1 The degree sub-distributions: an alternative al ulation . . . 181

C.1.1 RS results . . . 181

C.1.2 Fa torized 1RSB results: 1 . . . 181

C.2 Fa torized 1RSB results: 2 . . . 182

D Criti alexponents and non universal amplitudes 183 D.1 Case <k 4 >nite . . . 183

D.2 S ale free networks: ase 3< <5 . . . 184

E E.C.Codes: BSC, A T=0 variational al ulation 187 F Details of the al ulations of Chapter 6 189 F.1 p-spin . . . 189

F.1.1 Che k of the positivesign of R p spin 1RSB . . . 189 F.1.2 Che k of F p spin var [P℄=F p spin 1RSB [P℄ . . . 191 F.2 K-SAT . . . 192

F.2.1 Che k of the positivesign of R K SAT RS ... . . 192 F.2.2 ...and of R K SAT 1RSB . . . 194 F.2.3 Che k of F K SAT var [P℄=F K SAT 1RSB [P℄ . . . 195

F.2.4 Existen e of the free-energy of the p-spin model . . . 196

Bibliography 204

(6)

(7)

Random ombinatorial problems and diluted spin systems

During the last two de ades, in spite of many pioneering fundamental ontributions ([1℄ and

referen es therein), the mainstream ofanalyti alresults inthe eld of statisti alme hani s of

spin-glasses anddisorderedsystems fo usedmainlyonmean-eldmodelsof largedegree

1

([2,3,

4, 5,6℄ and referen es therein).

Inthe morere entyears, amajor eorthas been devoted tothe study ofmodels that ould

retain,atleastinastatisti alway,somefeatures ofnitedimensionality,likenitedegrees and

presen e of geometri al onstraints in uen ing both the stati and the dynami al properties

of the systems. Spin glass models over diluted random graphs onstitute by now the natural

frameworkforthemostadvan edanalyti alstudies on erningtheglasstransitionindisordered

systems.

The interest in diluted spin system is by far not limited to physi s. As we shall dis uss

in great detail in this thesis, there exists a huge lass of open root problems in theoreti al

omputer s ien e and in dis rete mathemati s whi h have a simple representation as diluted

spin system.

From the point ofview of pure physi s,the study of dilutedsystems represents onlya rst

step towards the treatment of nite dimensionality or geometri ally stru tured models, and

one ould think for instan e to even more omplex or \semi-random" stru tures where some

regularity reminis ent of a real latti e geometry is progressively introdu ed into the random

adja en ymatrix. Buteven ifonelimitstheinvestigationtopurelyrandomdilutedgraphsand

to lassi al spin models dened onthem, the questions that arise are still of a deep kind both

from afundamental and froman appli ationoriented pointof view.

Why are this models interesting? The main reasons an be summarized inthe following:

Fromafundamentalpointofview: theyarestillessentiallymeaneld,howevertheyretain

nite intera tion degrees that isreminis ent of nite dimensional ases. The presen e of

large s ale stru tures like large loops has to be taken into onsideration as a rst step

in the understanding the role of topology and geometry for the olle tive behavior of

omplex systems.

Moreover, they are widely a epted as prototype models in the study of fundamental

phenomena inthe theory of ComputationalComplexity.

From an appli ative (but not less important) point of view: they have a natural wide

range of appli ations toa lass of systems that span over the following elds

1

Inthefollowingwe will all degreewhat usuallyphysi ists all onne tivity, i.e. thenumberof neighbors

of avertex ofthe latti eorgraphthemodel isdened on. We hose The rsttermin order to be onsistent

(8)

{ Statisti al analysis of the behavior of realisti neural networks ([2, 6, 27, 28℄ and

referen es therein)

{ Combinatorial optimization problems ([27, 29, 30, 31, 32, 34, 35, 36, 37, 117℄ and

referen es therein)

{ Error orre ting odes and ryptography ([27, 38,39℄ and referen es therein)

{ Models of statisti al information pro essing and image restoration ([27℄ and

refer-en es therein)

{ Statisti almodelsof olle tivephenomenainbiology(forinstan e. geneandprotein

regulatory networks, networks of ellular signalling pathways et .) ([40, 41℄ and

referen es therein)

{ Statisti al analysis and optimal design in omplex arti ial networks su h as the

Internet or the World Wide Web ([43, 44,45,46℄ and referen es therein)

In whatfollowsweare goingtodeal both withbasi theoreti alaspe ts and withsome

spe- i appli ations belonging to omputer s ien e ( ombinatorial optimization, error orre ting

odes and ryptography). We are going to study the low temperatureequilibrium and

out-of-equilibriumphasesofdilutedspin-glasses withthe aimof elu idatingthe geometri alstru ture

of ground states underlying stati and dynami transitions. The omputational ounterpart

of su h a study arises from the elementary observation that (hard) ombinatorial

optimiza-tion problems an beeasilyreformulatedasproblems ofndinggroundstatesinspin-glass-like

Hamiltonians. In this sense the idea of studying their topologi al stru ture is quite a natural

one [2℄. The re ent eorts in developing a mathemati al and physi al understanding of su h

systems over diluted stru tures have opened new perspe tives and new roads to solutions to

those problems, ta kling them in their natural milieu. The models reviewed and studied in

this thesis are all of random nature. While this is usually the most natural thing to do in

physi s, the study of random ombinatorial problems has been revealing itself useful also in

omputer s ien ewhereitallows tobroaden tothe typi al- asethe lassi alworst- ase notions

of omputational omplexity [34℄.

This thesis isdevoted tothe analyti al study of the dynami and stati transitions

numer-i allyobserved in this whole lass of models, withspe i fo us on ombinatorialoptimization

problems and error orre ting odes, on e mapped on spe i diluted spin systems. Stress is

posed on the onne tion between the slowing down pro esses in algorithms behavior and

sta-tisti alphase transitionsdue tosome intrinsi property onthespin model, that anusually be

tra ked down to the emergen e of non trivial frustrated topologi alstru tures in the

underly-ing graph. The re ent a hievements [29, 30, 56℄ of a promising new lass of algorithms that

seems to outperform the other state-of-the-art sear h pro edures for typi ally hard

ombina-torial problems are based ontheoreti al understanding rooted inthe on epts reviewed in this

work. This result, among others, seems to show how statisti alphysi s of disordered systems

has stillalot totea h uswhen appliedto the eld of omputational omplexity.

The thesis willbe organized inthe following way: in the rst hapter some general

guid-ing on epts of random graphs and modern statisti al physi s of disordered systems will be

presented, and the onne tion with relevant problems in theoreti al omputer s ien e will be

stressed. In hapter two wewillintrodu eindetailthe mathemati alte hniquesused todeal

with the analyti omputation of relevant physi al quantities for a wide lass of spin models

(9)

arbi- omplete al ulations will be shown in the ase of a generalization of the p-spin model over

su h stru tures. Their validity an be seen to hold for a mu h wider family of random

om-binatorial optimization problembelongingto the NP lass in the worst ase, su h as K-SAT,

Q- oloring[57, 58℄ and many others. Some appli ations to spe i prototype examples will be

shown inthe third hapter, while hapter four willdeal with spe i examples oftwo

om-binatorial optimization problems, namely the 3-SAT and the bi oloring problem of graphs of

uniform rank 3

2

. Chapter ve will be devoted to two relevant examples of the relation

be-tween the algorithmi omputational omplexity of a problem and the presen e (and nature)

of dynami and stati phase transitions in the asso iated spin model. In the rst example

the mapping will essentially be between the sear h for solutions of large random sparse

lin-ear system over nite elds

3

and the sear h of the zero temperature ground states of some

ad ho dened multiple rank intera tion diluted ferromagnet. In the se ond part essentially

the samemapping willbe used tostudy the dynami slowing down of parity he k algorithms

for error orre ting odes - with the onsequent onset of omputational omplexity - and the

orrespondent dynami phase transitionin spin glasses.

The mathemati al language used throughout this work will be that of repli a theory: we

are well aware that this is a very ontroversial eld, due to the lankness of lear and rigorous

foundations that makes its mathemati alinterpretationobs ure and its results \unreasonably

su essful"[42 ℄. And this even after more than 20 years after the original formulation of the

theory [59℄. Inthe ne essary attempttoover omethis problema al ulationin hapter sixis

presented withtheaimofshowinghowrepli atheory anbeatleastinterpretedasasystemati

variationalmethodalsointhe aseondilutedmodels. Thetreatmentwillbeageneralizationof

the methodre entlyproposedbyGuerra[60℄forfully onne tedmodels. Moreover, veryre ent

work[23,24,30℄has lariedtheequivalen ebetweenthe avitymethodandthe repli aresults

also in the diluted systems ase. Sin e the rst one deals with usual probabilisti obje ts, it

has a learer and more dire t interpretation that ould lend itself to further rigorous studies.

Some dire tions for future work are summarized in the on lusions.

The al ulationsand theresultspresented inthisthesisare theoutputofathreeyears

ollabo-rationwiththeI.C.T.P. ondensedmatterandstatisti alphysi sgroupinthenamesofRi ardo

Ze hina,SilvioFranz,AlfredoBraunsteinand Federi oRi i-Tersenghi(nowinRome). Great

partoftheworkwasalsotheoutputofa ollaborationwithAndreaMontanari(

E oleNormale,

Paris). Thisworkwouldnothavebeenpossiblewithoutthem,andIwishtothankthemdeeply.

2

thedenitionofgraphandofrankwillbegivenatthebeginningof hapter1.

(10)

Disordered Diluted Spin Systems

Connection to Random

Combinatorial Optimization

for one class of models

Detailed replica calculation

Alternative solutions in

one specific case

The RS and 1RSB pictures

CHAPTER 2 Detailed Calculations

General Concepts:

CHAPTER 1

CHAPTER 6 Rigorous results

Bicoloring and 3−SAT

CHAPTER 4 Ising model on arbitrary

degree networks

structures

Models on heterogeneous

CHAPTER 3/5

Random Graphs and Hyper−graphs

Replicas

Transitions in Global Algorithms

Error Correcting Codes

APPLICATIONS

CHAPTER 5

(11)

General te hniques for diluted random

models

1.1 Graphs and Hyper-graphs: preliminary denitions

Duringthe wholelengthof thisthesis weare goingtodeal withspin models dened ondiluted

random stru tures su h as simple random graphs or hyper-graphs [61, 62, 63℄. A graph G is

ommonlydened asanon-emptynitesetV(G)ofelementaryunits alledverti es ornodes

or sites inour ommonnotation, and a niteset E(G) of distin t unorderedpairs of distin t

nodes alled edges or links . We allV(G)the vertex set and E(G)the edge set of G. In our

notationthe i

th

site willbe denoted by itsLatin index i andanedge between sites iand j will

bedenoted as the ouple ij. We willwork with undire ted edges (graphs). Wewill dene the

size or orderof the graph asthe ardinalityof the vertex set or the number N of sites, we will

allM the the ardinalityofthe edge set. A omplete graphisagraphwhose edgeset ismade

of all possible links between nodes. In that ase one has M = N(N 1)=2 O(N

2

). Many

interesting models an be dened on a generalization of graph stru tures that go under the

nameofhyper-graphs. LetX=fx

1

;::::;x

N

gbeaniteset., andletE =fE

i

ji2Igbeafamily

of subsets of X. E is said to be a hyper-graph on X if E

i 6= ; 8 i 2 I and S i2I E i = X. The

stru ture H = (X;E) is alled hyper-graph. Again, jXj = N is the order of the hyper-graph.

It is easy ro see how a graph is simply a parti ular ase of hyper-graph with E restri ted to

subsets ofexa tlytwoelements. E willbe the generalizededge set (orhyper-edgeset) of H . Is

it possible to draw a hyper-graph in many equivalentways. One possibility is shown in gure

(1.1),whereedgesareshownasmultipleverti esplaquettes. Thismaynotbetheorthodoxway

to represent ageneral hyper-graph, but is reminis ent of the usualway torepresentmulti-spin

orplaquetteintera tioninlatti eeldtheoryorstatisti alme hani s,sowewilladoptitinthe

following. In the future hapters we willo asionallyneed the on ept of in iden e matrix as

the matrix

^

A=((a

j

i

))with M rows thatrepresent the edgesof Hand N olumnsrepresenting

its verti es, su h that:

a j i = 1 if x j 2E i a j i = 0 if x j = 2E i

In a hyper-graph H , the rank r(S)of a set S X is dened as

r(S)=max

i

jS\E

i

(12)

E

9

10

11

8

7 E

E

1 E

₂

4

3

5 E

₆

E

1 E

₂

3

4

5

4

5 E

₂

E

1

3

6

7

8 E

E

Figure1.1: Trivialexamplesof asimplegraph,two hyper-graphs ofxed rank3and an

hyper-graph of rank4 and minimalrank 2. alsorank 2edges are expressed in hyper-graph notation.

All theseexample haveN verysmall ompared with the stru tures wewillbe interested in,so

they are onlyto be intended, alongwith others in the text, as api torial guide.

The rank of the hype-graphis therefore

r(X)=max i jX\E i j (1.2) Ifr(X)=E i

8i, thenthe hyper-graph issaid tobeof uniformrank. Asimple graphwillthen

be a hype-graph of uniform rank2.

Toea hhyper-graphH=(X;E

1

;:::;E

M

)there orrespondsadualhyper-graphH

=(E;X 1 ;:::;X N )

whose verti es arepointse

1 ;:::;e M representing E 1 ;:::;E M

and whoseedges are setsX

1 ;:::;X N representing x 1 ;:::;x N where8j, X j =fe i jiM ; E i 3x j g (1.3)

When dealing with the graphi al interpretation of error orre ting odes, we will swit h to a

representation of hyper-graphs interms of fa tor graphs [64, 65,66℄(see alsothe appendix for

a graphi alexample),more familiarto omputer s ientists, and where duality is madeevident

and expli itlyexploited. Any hyper-graph an bereadasabipartitegraphwhereone subsetis

X andtheotherE,andwherethereisaedgepointingfromx

i

toe

l

ifthe orrespondentelement

ofthe in iden e matrixofthe originalhyper-graph isnon-zero. Su hparti ular bipartitegraph

is alledfa tor graph. Given a hyper-graph H , a hain of length q isdened [63℄as asequen e

(x 1 ;E 1 ;x 2 ;E 2 ;::::;E q ;x q+1 ) s.t. x 1 ;::::;x q

are distin t verti es

E

1

;::::;E

q

are distin t edges

x k ;x k+1 2E k 8 k =1;:::;q

In the physi s jargon, hainsare nothing but onne ted omponentsof the hyper-graph H .

(13)

rank 2is nothingbut aloop. Inthe physi s of disorderedand frustratedsystems a parti ular

role turn out to play those y les where every vertex belongs to an even number of edges.

We will all those y les \ ompa t y les" , \hyper- y les" or \hyper-loops", for the similarity

with the graphs ase where loops always have this property. Two examples of very parti ular

y les (the rst is alsoa ompa t y le) are shown ing. (1.1) forthe ase of ahyper-graph of

uniformrank3(see also[19℄for therst appli ationtohyper-loops on epts tospinglasses, to

myknowledge). A hyper-graph isthe said toberandom[62, 69,68℄wheneverthe presen e or

absen e ofea hof itsedgesisgiven with adenedarbitraryprobability. Traditionally,random

graphs were introdu ed as those where the probability of having an edge between two given

verti es is a onstant r

1 :

8i;j Prob(ij 2E(G))=r (1.4)

If r/ O(1) then the graph is said tobedense as wellas its in iden e matrix. If r/O(1=N)

then the graph willbe thin or diluted and its in iden e matrix willbe a sparse one. In this

last ase M / N. This will be the ase we'll onsidered it the rest of the thesis. Complete

hyper-graphs of nite rank l have C

N l / N l 1 edges 2

. Therefore, in order to have a number

of edges proportional to N every plaquette ontaining l, verti es must have a probability to

be present proportional to 1=N

(l 1)

. If ea h edge is present with the same xed probability,

properly res aled with N, the hyper-graph will be diluted and ea h vertex i will have a nite

degree k

i

drawn from apoissonianprobabilitydegree distribution

k =e k k! (1.5)

where isafreeparameterdeterminingmeanvalue andvarian eofthe distribution. A

parti -ular \self similar" form is pe uliar of the poissoniandistribution: In this ase the probability

k

of nding a vertex of degree k is equal to the probability q

k

of nding a nearest neighbor

vertex with degree k+1, as an be seen applyingeq. (1.5) tothe denition [69℄

q k (k+1) k+1 P k k k = (k+1) k+1 <k > : (1.6)

This note is very important in pra ti al al ulations and is the origin of major simpli ations

inthe repli aandthe avity equations[23,24℄wewillseelater on. Thisisre e ted by thefa t

1

Averyri hphenomenologyofstru turesappearinginthegraphasafun tionofthedegreeanda omplete

study of graphsbehaviorasr in reases withN hasbeen performed on arigorousmathemati a basisstaring

from the seminal paper of

Erdos and Renyi[61℄. Forasystemati introdu tionsee for instan e [62℄ and [67℄

and referen estherein. Tomyknowledgeno omparablesystemati study hasbeenundertakenin the aseof

rank>2hyper-graphsyet. Fora learintrodu tionto hyper-graphssee[68℄.

2 C

N

(14)

that in the poissonian ase the mean value uniquely determines the varian e and vi e-versa.

As a onsequen e, a lot of simpli ations and parti ular behaviors of poissonian hyper-graphs

annot be applied in wider families of random stru tures. However, itis possible to onstrain

the probability of the value of the number of edges in ident on a xed vertex in order to

draw diluted hyper-graphs from ensembles with arbitrary degree distributions (arbitrary

k ).

The onstraints will be of global nature and will not introdu e vertex-vertex orrelations, as

it will be seen in the following. Sin e the repli a as well as the avity equation for spin

models dened over diluted hyper-graphs will be on erned in the omputation of the lo al

ee tive elds a ting on ea h spin variable in absen e of a parti ular edge in ident to the

vertex under onsideration, The natural ensemble we are going to work with will indeed be

that of the q

k

and not that of the

k

. We stress again that this hange is immaterial in the

poissonian ase, where one falls ba k into the same ensemble, but not in the general one.

Moreover, one ould of oursethinktomore omplex or\semi-random"stru tures wheresome

regularity reminis ent of a real latti e geometry is progressively introdu ed into the random

matrix through the presen e of orrelations of various kind (see [41℄ for one among possible

examples) or through the presen e of regular sub-hyper-graphs merged in the whole one in a

random way. But even if one limits the investigation to purely random diluted hyper-graphs

and tosimple lassi alspin models dened onthem,the questions thatarise are stillof adeep

kind both from a fundamental and from an appli ation oriented point of view. Nevertheless,

theimmediatefuturedire tionsforinvestigationswillne essarilyhavetodealwiththepresen e

ofsu h orrelations[72,46,70,71,73℄,aswellaswith modelswheretheintera tion onstraints

are non lo alin nature[74℄ or non purely lassi al.

1.2 Spin models on diluted stru tures

In all ases, we are going to work with models that an be des ribed, under appropriate

map-ping, via some spin Hamiltonian H(J;s), where fJg represents an ensemble of disordered

intera tion energy variables taking non zero values onthe edges of the hyper-graph ordened

as ombinationsofmoreelementaryintera tion termsasinthe aseoftheK-SAT. s)are N 1

spinvariables(0or1Booleanvariablesintheusual ombinatorialproblemsen oding)livingon

the verti esofthe hyper-graph. Wewilldealinthefollowingwith ases wheretheHamiltonian

an bewritten asa sum of lo al energeti ontributions

as H(J;s)= X (fJ g;fs g); (1.7)

whereindi atesea hsubset(usuallyanedgeoftheunderlyinghyper-graphora lauseinSAT

-likeformulation)that ontains asmallnumber of both onstraintsand spin variables, relative

to the total number N of variables

3

. As atitle of example,the simplest possible Hamiltonian

is the Viana-Bray (see the original arti le by Vianaand Bray in[1℄):

H(J;s)= X i<j J ij s i s j : (1.8)

The diluted hyper-graphs that dene the underlying topologi alstru ture will be drawn from

the appropriate hosenstatisti alensemble, fullydeterminingtheprobabilitydistributionP(k)

3

(15)

of edge degrees and the probability distributionQ(l)of ranks. The rankonea h single

hyper-graphedgeisequalinthe asesunder onsiderationtothenumberofspin variablesinthe lo al

energeti term

. Therefore the distributionQ(l) isgoingto bestri tly relatedtothe fra tion

of l-variables intera tion terms Hamiltonianssummingup to the total H(J;s)

4 .

1.2.1 Disorder

On e the set fJg of non zero ouplings (equivalent to the set of present edges) is set, its

elements an take values a ording to an a priori arbitrary distribution (J). For disordered

pure ferromagneti -typemodels (J)willread

(J)=Æ(J

^

1): (1.9)

For disordered pure anti-ferromagnets

(J)=Æ(J+

^

1): (1.10)

Finally, forthe pure generalized1 spin-glass ase

5 : (J)= 1 2 Æ(J ^ 1)+Æ(J+ ^ 1) (1.11)

More in general, the same models an be studied for other forms of the oupling distribution

(J): ontinuous,mixturesofa ontinuousandadeltapeaked part,mixturesof pure

ferromag-neti and spin-glassterms,andsoon. In hapter5wewillworkwith modelsthatareoriginally

dened asa mixture of the previous ferromagneti and spin-glassone

(J)= 1 2 pÆ(J ^ 1)+(1 p)Æ(J+ ^ 1) (1.12)

where p is a parameter tuning the amount of \average frustration" or \average glassiness"

present into the system. Finally,the rigorousresults presented in hapter6 willbe derived for

general formsof symmetri (J).

1.2.2 Frustration

It was observed right at the beginningof spin-glasstheory by Toulouse[2, 75℄ that a mixture

of ferromagneti and anti-ferromagneti ouplings an give rise to on i ting onstraints,su h

that it is in general impossible to minimize lo ally all the energy terms

. This property

is widely known as frustration. In spin glass-models on diluted stru tures (Viana-Bray), this

typi allyhappens when the density of the graph allows parti ular ompa t stru tures su h as

loops to per olate in the system. In the ase of higher rank hyper-graphs, loops per olation

turns outnot tobeasuÆ ient ondition forthe existen eof anextensive fra tionoffrustrated

onstraints,essentiallybe ause the extra degrees of freedomdue tothe possibilityof adjusting

the spin variables belonging to the edges but not to the loop. In fa t, even more ompa t

stru tures su h as hyper-loops must per olate in the underlying matrix. The phenomenon is

exemplied in g. (1.2). The ommon presen e of disorder and frustration allows for a phase

4

Infa tthetwofra tionswill oin ideinthegeneralizedp-spinmodel.

5

Histori allythespin-glassmodelshavebeendenedonlyinthe aseoftwobodyintera tions,asaphysi ally

sensible model for real magneti materials. However,a generalizedmulti-spin intera tionfamily of spin-glass

type models anbejustied notonly fortheir usein random ombinatorial optimization, but asanee tive

(16)

−

+ +

+

−

(unsatisfied)

+

−

+

?

to satisfy the constraint

one can always choose

frustrated

constraint

or

?

=

hyper−loop

+

−

Figure1.2: Frustration ingraphs and hyper-graphs. This Pi ture isvery similartothe one we

will draw in hapter 2 for the ore resolution under the a tion of the Leaf Removal algorithm

(see se tion 2.7 for details). It is important to keep in mind this similarity, be ause it will be

the main ause of the ee tiveness of the algorithmin lo ating the spin-glasstransitionin the

(17)

0

0.2

0.4

0.6

0.8

1

0

0.2

0.4

0.6

0.8

1

Figure1.3: Pi torial one dimensional proje tion of rough energy lands ape.

spa estru ture withalarge-typi allyexponential-numberofdegenerateglobalgroundstates

as well as denite energy metastable states. Pi torially, one ould think that these systems

show an energy lands ape of the kind exemplied in g. (1.3). This pi ture is however often

verymisleadingbe ausethex-axisinthe pi tureis infa taproje tion ofthe highdimensional

phase spa e, where all the topologi al stru ture is hidden. The degree and nature of the

inner stru ture of the phase spa e an vary in prin iple from problem to problem. A better

understanding of thistopologyinthe aseof some modelsinterestinginrandom ombinatorial

optimization, oding theory and more in general disordered systems physi s, is the main aim

of the present work.

We would like to mention that the families of models studied in the following hapters is

by no means exhaustive. For instan e, a natural generalizationof the te hniques explained is

urrently been appliedto Potts-likemodels[76℄ and ouldbeadapted in prin ipleto Classi al

Heisenberg models and so on. However, in these last ases, the te hni al al ulations are

more involved [57, 58℄ be ause, as it turns out, one is for ed to work with fun tional order

parameters whi h an be written as distributions of ee tive lo al elds of ve torial instead

of s alar nature, as it is the ase of the examples treated in the present work. This leads

to self onsistent equations for the order parameters that are inter-wined in the various eld

omponents [58℄

6

. The body of the hapter will deal with the analyti al repli a te hniques

devised to omputethermodynami alquantitiesofphysi alsystemsondilutedhyper-graphsin

absen eof orrelationsandinpresen eofquen heddisorder. Letusnowexpli itthe onne tion

between this lass of models and ombinatorialoptimization theory.

6

The oloringdegreestati thresholdobviouslydepend onthe numberof olorsavailable for the oloring,

at xed graph. The urrently best rigorous upper bound forthe 3-COL/UNCOL transition (three olors) in

poissoniandegreedistributed randomgraphsis5:06[77℄. Itwasobtainedusingarenedrstmomentmethod,

equivalent to an improved annealed approximation in statisti al physi s. The RS 5:1 threshold obtained in

[57℄ ex eeds the rigorous bound, while at the 1RSB level the authors of [58℄ were able to nd a dynami al

threshold foranaverage degreeequalto 4:42,followedby the3-COL/UNCOLtransition at 4:69. The values

are onje turedtobeexa tbytheauthors, andareindeed inverygoodagreementwithnumeri alsimulations

[78℄. Thegeneraldis ussiononthemeaningofthedynami al thresholdis donein hapter2. Noti ealso that

(18)

1.2.3 Combinatorial optimization problems as spin models

Classi al omplexitytheory[34℄,asarisingfromCook'stheoremof1971[47℄,dealswiththeissue

of lassifying ombinatorialoptimizationproblemsa ordingtothe omputational ostrequired

for their solution. The hard problems are grouped in a lass named NP, where NP stands for

`non-deterministi polynomial time'. These problems are su h that a potential solution an

be he ked rapidly whereas nding one solution may require an exponential time in the worst

ase. In turn, the hardest problems in NP belong to a sub- lass alled NP- omplete whi h is

at the root of omputational omplexity. The ompleteness property refers tothe fa t that if

an eÆ ient algorithm for solving just one of these problems ould be found, then one would

have an eÆ ient algorithm for solving all problems in NP. By now, a huge number of

NP- omplete problemshavebeen identied[34℄, andthe la kofaneÆ ientalgorithm orroborates

the widespread onje ture that NP6=P, i.e.that nosu h algorithmexists.

Complexity theory is based on a worst- ase analysis and therefore does not depend on

the properties of the parti ular instan es of the problems under onsideration. In pra ti e

algorithmsdisplaya huge variabilityof runningtimes, rangingfromlinear toexponential,and

therefore a theory for their most probable behavior represents the natural omplement to the

worst- ase s enario.

The most ommonproblems en ounteredin omputer s ien eand issue of theoreti al

anal-ysis studieswithin omputational omplexity theoryare of atype. A de ision-makingproblem

isoftenformulatedasthatofthe maximizationorminimizationofamulti-variablefun tion,an

optimization problem

7

. The fun tion tobe minimized(maximized) is alled obje tive fun tion

or ost fun tion, and basi ally ounts the number of violated onstraints , given a parti ular

ongurational assignmentsto the variableson the problem. An exampleof ombinatorial

op-timizationproblemfamiliarto physi ists is that of ndingthe ground state of anIsing model.

More ingeneral,any sear h ofgroundstates inanyspin modelonagiven geometri al or

topo-logi al stru ture an be seen as parti ular optimization problem. On the other hand, a large

lass of purely ombinatorial optimizationproblems in prin iplenot related tophysi s an be

seen equivalent to the sear h for zero temperature ground states of ad ho onstru ted spin

models (often spin-glasses) on parti ular topologi al stru tures. Among others we an ount

the number partitioning problem, the graph partitioning, the graph and hyper-graph

olor-ing, the knapsa k problem, the s heduling problem and the satisability (SAT) one. A lear

overview ofsome ofthese examples an beseen in[27℄ andanintrodu tiontothe study

statis-ti al me hani sstudy ofrandom ombinatorialoptimizationproblems seenasspinsystems an

be found in [35℄. In parti ular, SAT has been extensively studied due toits NP- ompleteness

and general nature. Its mapping on a parti ular spin-glass model has been elu idated in [9℄.

As well as in many of its variations, the SAT ost fun tion an be read as the olle tion of

M logi al onstraints that have to be satised by N boolean variables. It turns out [9℄ that

any SATformula ost fun tion an bewritten asthe Hamiltonianofa spin modelwhere0 1

variables arerepla ed by 1spins, andthe onstraintsare welldetermined olle tionsof edges

orplaquettes of various rankof agiven hyper-graph that ompletely hara terizes the formula

under study. If M O(N), whi h is the ase the most interesting formulas belongs to - i.e.

those losetothe satisabilitythreshold- thenthe underlyinghyper-graph isadilutedone. In

ordertostudy theSATprobleminitsspin-glassformulationitisthereforene essary todevelop

ageneralformalismtobeabletodealwithtopologi alstru turessu hasdilutedhyper-graphs.

7

(19)

In the followingthis willbe expli itlydonefor the ase of the generaldiluted p-spinlike

mod-els, but it will be then further generalized in order to ta kle problems like the K-SAT one.

Whenever the onstraints forming the formulato be satisedare drawn randomly froma

pre-viouslydened ensemble, thenthe optimizationproblemwillhavearandomnature. Insteadof

working on a parti ular hyper-graph, this willamount to averaging over the hosen ensemble

of\quen hed"stru tures. Itwillbetheninterestingtodis ernwhi hproperties(oneforallthe

inner omplexity of the problem) survive in shifting the sear h for solution from a parti ular

to a random ase. The following hapters willalmost allbe devoted tothe appli ation of the

general analyti al te hniques developed here to various optimization problems, some of them

used astoy models, asin hapter3, someothers ofmore omplexanalysis, asinthe remaining

hapters.

1.2.4 Quen heddisorder averages andgeneral omputational

strate-gies

Evaluation of a physi al quantity using a spin Hamiltonian of the type (1.8) or any more

ompli ated ase onsidered in this work starts from the tra e over the spin variables for a

givenxed(quen hed)set of ouplings. Forus,this orrespondstorandomly hoosingadiluted

hyper-graphfromadesiredensembleandaxedformofthe(J). Thefree-energyofthesystem

F[J℄= 1 logTr s e H(J;s) (1.13)

an be harmlesslyaveragedoverthe quen hed disorderinthe thermodynami limit,ifthe

self-averaging ondition overextensive thermodynami alquantities (like indeedthe free-energy)is

satised as we assume to be true throughout the whose treatment. This averaging pro edure

goesunder the name of ongurationalaverage :

hFi

Z

D(J)F[J℄ (1.14)

However, the dependen e ofthe partitionfun tiononJ isingeneralvery ompli atedanditis

noteasyto al ulateexpression (1.14)dire tly. Moreover, inthe aseofrealworld optimization

problems, the thermodynami limit ondition does not always hold, and more subtle single

sample analysis alsoin the typi al ase have to betaken into onsideration[30℄.

1.2.5 Repli as

The al ulations are arried out via a\subtle tri k": itmu h easier to ompute

hlogZi= lim n!0 Z n 1 n (1.15)

The last equation is an identity for ontinuous n, but the tri k onsists in al ulating rst

Z n

for integer n, taking the 0 limit is a se ond time. The repli ated partition fun tion, after

averagingoverthe samedisorder realizationbe omesapartitionfun tionof n systems,without

disorder, but with anee tive attra tive intera tion between the various repli as. The reason

for this attra tion is intuitively quite simple [79℄: be ause they share the same Hamiltonian

(20)

with basi ally a singlelarge deepest valley,then the repli aswillall fallin that, and the order

parameter willbea numberq whi hwillmeasure the average distan e between repli aswithin

this single valley. But in a systems with several metastable states, the situation an be more

ompli ated, with some repli as trapped in a valley and some inanother. This ee t is alled

repli a symmetry breaking (RSB).Te hni ally it appears as a standard spontaneous breaking

of a symmetry, the permutation symmetry S

n

of the n repli as. The problem is that this

symmetry is broken only when one onsiders some number of repli as whi h is non-integer

and in fa t smaller than one. Even though this point has been elegantly solved by Parisi [2℄,

the validity of taking the n ! 0 limitstill la ks a general rigorous justi ation ([60℄, [59℄ and

referen es therein). The last hapter will try to deal with this problem in an indire t way.

We will not show (unfortunately) that the physi al quantities dened on the n ! 0 ve tor

spa es are well dened mathemati al obje ts, but at least that, onthe lass of systems we are

interested in, the ee tive repli a Hamiltonian an lead to rigorous variationalresults. In the

ase of fully onne ted models,the repli amean eldtheory an bestated interms of asingle

s alarnn matrix,whoseelementsare theoverlaps hosenviaadeterminates hemeand that

play the role of orderparameters. In the ase ofdiluted systems,however, it emerges the need

for the determinationof a full distributionof multi-spinoverlaps [1,9℄ that an be ompletely

hara terized via the introdu tion of a lass of fun tional order parameters[8, 9,12℄

(~)

that essentially enumerate the fra tion of repli ated spins in a parti ular repli a state, as it

will be ome lear in the ourse of the al ulations

8

. These fun tional order parameters have

stillamean eldnatureand willbeexpressed interms of seriesof multi-spinoverlapfun tions

averagedoverthemeanlo aleldsdistributionsseenbyanaveragevertex inaparti ularstate.

The free-energy of the system an be writtenas a fun tion of (~), and will thereforebea

fun tional of the lo al elds probability distributions, averaged over all possible states of the

system. In the N ! 1 limit

9

, the dominant ontribution to the partition fun tion is found

extremizing the freeenergy with respe t to the fun tionalorder parameter

One isleft with a set of self onsistent integral equationsfor the ee tive elds probability

distributions,that an besolved analyti allyornumeri ally(dependingonthe ases). Insome

spe ial ases, namely on the T = 0 line for some lasses of models, these equations further

simplifydue to the ollapsingofthe fun tionalform ofthe probabilitydistributions intoseries

of weighted deltafun tions. Whenthis happens, oneis leftwith aset ofalgebrai equations in

the weightsthatoftenadmitanalyti solutionsina losedform. This isparti ularlyinteresting

in the eld of ombinatorial optimization, thanks to the existing general mapping pro edure

between the solutionsof the random ombinatorial problemand the zero temperatureground

states of the asso iated spin model. Moreover, the assignments of the problem variables with

a given (typi ally low) number of violated onstraints orrespond to metastable lo alground

states with positive energy. The logarithm of the number N(e) of su h metastable states is

a fun tion of the their energy density e = E=N and is known as Congurational Entropy or

Complexity[7℄ : (e)= 1 N log[N(e)℄ (1.16) 8

Thereareatleasttwowaystodenetheorderparameter,dependingonwhetheronefo uses onthewhole

graphsoronsomepartofit. Seeappendi esC,C.1andC.2 forsomedetails.

(21)

Therefore, the measure of the extensivity of the Complexity willbeanimportantindire tway

to study the hardness of the random ombinatorial problemdepending on the position in the

phasespa eoftheasso iatedpinmodel,andwillstressthedeep onne tionbetween thetheory

of the glassy transitions in disordered systems and the on epts of omputational omplexity

in theoreti al omputer s ien e.

Aswe just said, the ingredientsof frustrationand/or disordertypi ally indu ethe onset of

transitions from a uniform paramagneti to one or more kind of glassy phases in the ontrol

parameters 10

spa e ofthe model.

1.2.6 Cavity

Thesamesolutionsandthesamephysi alinsights anberea hedviathe avitymethod. Cavity

was invented in1986 forthe solutionof theSK model[2℄, but wasre ently reformulatedinthe

diluted systems framework [23℄ and related to an algorithmi understanding of the pro ess in

the aseofitsdire tzerotemperatureformulationin[24,30,29℄. Thebasi ideaof themethod

applied tospin models on dilutedhyper-graphs is the following:

Assume that, likeinthe repli amethod,due tothelo altree-likestru ture of the

hyper-graphandthemeaneldnatureofthemodel,spinvariablesbe omeun orrelatedatlarge

distan es if the system is in a single state. The in uen e of the graph on a single spin

an betherefore easilywritten interms of un orrelated lo alee tive elds a tingon it.

Startingwith asystem ofN variables,addnow avariableS

0

ofdegreek (on average one

will add a fra tion of spins

k

) and onne t it to the rest of the hyper-graph in order to

ompletek lausesoffun tionnodesofthe N verti esgraphswithvariablesfS

1 a ;:::;S k a g a ,

where a is the rank index, i.e. it indexes all the variables other than S

0

belonging to a

given lause (or energy onstraint, asequivalently indi ated throughoutthis thesis).

AssumethatfS 1 a ;:::;S k a g a

wherepreviouslydis onne tedwithprobabilityone inthe

ther-modynami limit(no short loops) and therefore un orrelated:

P (N) (fS 1 a ;:::;S k a g a )'P (N) 1 (fS 1 a g a ):::P (N) k (fS k a g a )' Y a P (N) 1;a (S 1 a ):::P (N) k;a (S k a ) (1.17)

Then it ispossible to ompute the new P

(N+1) (S 0 ;fS a 1 ;:::;S a k g a

) via Bayes theorem as:

P (N+1) (S 0 ;fS 1 a ;:::;S k a g a )' k Y =1 P (N) (fS a g a )e (0) (S 0 ;fS a g a ) ' k Y =1 Y a P (N) ;a (S a )e (0) (S 0 ;fS a g a ) (1.18) where (0) (S 0 ;fS a g a

) is the lo alenergy onstraint

of eq. (1.7) where the dependen e

on the -th lause spin variables has been made expli it, as well as the referen e spin

index 0. Integrating overthe variablesfS

1 a ;:::;S k a g a

one nally obtains:

P (N+1) (S 0 )' k Y =1 Y a X S a =1 P (N) ;a (S a )e (0) (S 0 ;fS a g a ) : (1.19) 10

Were all that therelevant ontrolparametersare in thesemodelsthetemperatureT,the graphdilution

(22)

This last equation denes an iterative method to al ulate P (N+1) 0 (S 0 ) from fP (N) ;a (S a )g ;a .

Thankstotherstassumption,theequationsforP

(N) j (S j )andP (N+1) (S 0

) anbeeasilywritten

as P (N) j (S j ) = e hjSj 2 osh(h j ) P (N+1) 0 (S 0 ) = e h 0 S 0 2 osh(h 0 ) (1.20)

Writing self onsistent equations for the avity elds is then possible inserting (1.20) in(1.19)

and iterating. In the typi al ase, one an then average over all spins, getting an expression

for the distribution P(h)weighted over the degree and rank distribution of the typi al

hyper-graphs.

In fa t, this assumption is globally valid only if the system is in a single pure state. In

many states =1;:::;N

states

are present,the previousequations are validwithin agiven state

, i.e. a luster of solution separated by other lusters. Equations (1.20) will then be state

dependentand theself onsistent ondition(1.19)willhavetobeaveragedbothoverthe sitesi

andover thestates . Thispi ture orrespondstothe one step repli asymmetry breakingone

and is frequent in disordered spin systems. The avity method formulated in this way works

essentially by indu tion and assumes no non trivial orrelations within lusters or inside the

same lusterthat ouldoriginfromthegeometryofthegraph,eventhoughtrivial orrelationsof

a hierar hi al nature an be taken into a ount

11

. The disregarding of orrelations is ommon

with the repli a approa h, as it should be if we laim the two to be equivalent, and it is a

limitationof thetheory thatwillhavetobeover omeinthenearfuture ifone wantstobeable

to systemati allyatta k problems with more omplex geometri alstru ture.

During the avity iterationpro ess, one is bound tomake asmallerrorof order 1=N, sin e

the ensemble of random graphs one is working with hanges slightly under the N ! N +1

avityiterations. Thiserror anbehealedviaa leverbalan ingofverti esandedgesadditions

and erasures. More in detail:

A hyper-graph H

N;M

withN verti es and M edges isdrawn from the desired ensemble.

A avity is arved in it, where q verti es are left with degree equal to minus one their

initialone, through a proper erasure of surrounding verti es and edges. q is hosen as a

fun tion of the degree of the erased verti es and the rankof the erased edges

12 .

Lo al avity magneti elds h

i

a ting on the avity spins S

i

are supposed to follow an

initially unknown probability distribution P

i (h

i

), in prin iple dierent from site to site

and with eld values dependent on the state of the system.

Undertheadditionofanew avityspinS

j

ofdegreek

j

onne tedwithsomeoftheprevious

avitysitesthroughgivennumberofhyper-edgesofsuitably hosenranks,theprobability

distributionsof thenew avity elds are al ulated inaself onsistentiterativeway. The

11

Trivial orrelations are takenin prin ipleinto a ountvia further lusterization stepsthe same\ lusters

within lusters"hierar hyimpliedbytheRSBParisi's onstru tion.

12

(23)

iteration isbuilt in order toself onsistently stabilize the avity elds distributions on e

the original hyper-graph isretrieved

13 .

The pro edure is repeated adding and deleting edges and verti es in a balan ed way, in

order toretrieve the hyper-graph belonging tothe desired startingensemble.

Energy shifts are al ulated under the iteration, allowing to al ulate the free energy,

the energy density and other physi al quantities (for example the omplexity) in the

thermodynami limit.

Averages overthe hyper-graphs ensembleand the model ouplings are performed.

If applied to single spins, the avity method is mean eld in nature. In order to possibly

ee tivelyextendittonitedimensionalmodelsorwithlatti eswithsomenontrivialgeometry

one would need to onsider the in uen e onthe iteration ofmore omplex groupsof variables.

Thishasbeenpartiallydonewiththe lustervariationmethod(CVM)forferromagneti models,

but the extensionin presen e of frustration isstillan open issue.

1.2.7 Phase spa e stru ture

twopossible s enarios have been en ountered inthe models studied:

ARepli aSymmetri phase (RS):generi ally,thedistan eonthelatti ebetweentwo

spins is large, at least of the order of random loops forming in the topology, i.e. of the

order O(logN). It is thereforereasonable to assume that the spins remain un orrelated.

In the avitylanguagethis meansthat theGlobal Ground State(GGS)energy ofagraph

with a referen e avity of q spins arved in it an be written as an additive fun tion of

the valuesofthe avityspins S

i=1;:::;q

,weightedbythe lo alelds h

i=1;:::;q

a tingonthem.

When onsidering the ensemble of random avity graphs, the lo alelds turn out to be

i.i.d. random variables, and their distribution is denoted with P(h). The lo aleld will

therefore not u tuate from state to state be ause of the presen e on only one GGS a,

and the distribution P(h)will be anaverage overall sites. In the repli a formalism, the

sameP(h)willbetheone determiningthemulti-spinoverlaps ontainedinthe fun tional

order parameter (~).

A One Step Repli a Symmetry Broken phase (1RSB): The phase spa e splits in

an exponential number of metastable Lo al Ground States (LGS), dened as states in

whi h the energy annot be lowered by ipping a nite number of spins. In presen e of

several groundstates,the assumptionisthatthereisaone-to-one orresponden eamong

the LGS before and after the addition of spins or edges (at least for the LGS with low

energies). Equivalently we assume that the perturbation due to the hange of the value

of a avity spin propagates (in the limit N going to innity) only to an innitesimal

fra tion of the latti e. Therefore it is possible to write an iteration pro edure for the

wholepopulationofLGSwithgiven energy. Howeveritmaywellbethattheorderof the

LGSenergies hange duringthe graphoperations,and the GGS afteriterationisnot the

same LGS as the one before. The problem is to take into a ount these level rossings,

whi his not donein the RSsolution andturns out tobeautomati ally done inthe RSB

13

(24)

Figure 1.4: Pi torialview of the energy lands apein the phase spa e of asystem inthe 1RSB

phase. The energy is on the verti al axis. utting the pi ture at denite values of the energy

one nds lustersof solutions in reasingin number and dimension.

e

d

e

c

e = 0

γ

<

_d

γ

<

γ

_c

e = 0

e

e = 0

e

d

γ

>

_c

d ~ O(N)

out

_{d ~ o(N)}

in

Figure1.5: 1RSB lustering in phasespa e.

solutions via the repli a formalism. One is then for ed to follow a large population of

the LGSof lowest energy,large enough sothat one an be sure toobtain the GGS when

iterating.

In g.(1.4)weshowapi torialviewofthe energy lustersinthe1RSBphase. Typi alxed

energysli esofthispi tureshowhow,in reasingtheenergy,dierent lustersaresele ted. This

is shown is g. (1.5). This qualitative pi ture as been extensively studied for p > 2-spin and

K >2-SAT modelsin re ent years [11, 21,53℄,mainlywith variationalte hniques. This is the

intuitiveidea we'llhave inmind inthe rest ofthis work. In some lu ky ases- and indeed the

p-spin model will be one of them - this pi ture will turn out to have an exa t interpretation.

In some others, as for instan e the random 3-SAT, no rigorous proof is present. However

this pi ture is highly probable to be orre t and what is more important it turned out to be

extremely useful inthe development of a new lass of algorithmof potentialvast use. Indeed

very re ently [30, 29℄ exa t solutions of the p-spin modeland the K-SAT at zero temperature

in a ertainrange of thephase spa e parametershave been a hieved under this assumption on

the form of the energy levelsdistribution.

(25)

the ase of the Viana-Bray model on diluted graphs [23, 24℄. In this models typi ally slow

de ayinglong range orrelationsare present. However, froma omputationalpointof viewthe

Viana-Bray models turns out to be easy in the phase spa e regions of interest. Indeed, it is

also due to the presen e of this long range orrelations that Viana-Bray like models turn out

tobe omputationallysimple,be ausethe solutionspa e willingeneralbe onne tedby paths

allowingthe rea hingofany phasespa epointwitha leverbutsub-extensivesequen e of lo al

adjustments.

The main al ulation steps reviewed in the last paragraphs are then the ones indi ated in

s hema (1.6), where the onne tionto the eld of ombinatorialoptimizationis made evident.

Finally, some dieren es between the repli a and the avity methods are listed in the

fol-lowing:

While repli as for e the introdu tion of an order parameter that has already undergone

average over the quen hed disorder, the avity equations an also be written ona single

sample(hyper-graph). This makesthe avity approa hmoreapttobeappliedtospe i

real world problems.

The avity approa h deals with well dened mathemati al quantities and is manifestly

variational,while thewelldenitenessof the repli amethod (namelyinthe RSB ase)is

stillun lear.

On the other hand,the repli a methodis mu hmore elegantand ompa t (espe ially at

nite temperature), itdoesnot requirefurtherpostulatesandassumptionsonthe energy

level distribution, that in prin iple depend on the model onsidered, and its equations

an behandled infull generality.

Wewilldevelopthe repli amethod,o asionallytaking advantage ofphysi alinsights

om-ing from the avity pi ture. A throughout treatment on the state of the artof the avity

(26)

Model Hamiltonian H

couplings configuration {J}

Fixed hyper−graph H(V,E) and

Partition Function Z

Replication

Z n

Z

<

n

>

n −> 0 limit

Functional Order parameter

ρ(σ)

Local Fields Distributions

i

Proper Definition of

the Cavity Hyper−Graph

Depending on Ensemble

Single Sample Free−Energy

Single Sample Properties

Solution Search Algorithms

...

Disorder Average

Hypothesis on LGS Energy

Distribution

Cavity Calculation

Stationary Condition on the Potential

Ν −> ο

o limit

on Single Samples P (h )

_i

Single Sample Phase Space

Iterative Equations for

Replica Calculation

Phase Transitions

Physical Observables

Prediction on Average Behavior

of Algorithms

...

Search on Complex Structures

Cost Functions

Combinatorial Optimization

Computer Science

Disordered Systems Physics

Energy Shifts

Comlexity

P(h) of Local Effective Fields

In terms of Average Probability Distributions

Self Consistent Equation for

ρ(σ)

< F >, Average Complexity

Computational Complexity

Figure 1.6: General al ulation strategy via repli a and avity methods and onne tion to

(27)

The generalized diluted p-spin model

Wearenowgoingtodevelopindetailstheanalyti alrepli ate hniquespreviouslydes ribed. In

doingsowewill hoose aspe i lassofmodelsofgeneralimportan e,namelyageneralization

of the p-spin model on un orrelated random hyper-graphs with arbitrary degree and rank

distribution.

Ifwedene

k

asthefra tionofspinvariableswithdegreekandv

l

thefra tionofintera tions

of rank l, the resulting mixed hyper-graph stru ture will be hara terized by the following

distributions: Q( ^ l) = X l v l Æ( ^ l l) (2.1) P( ^ k) = X k k Æ( ^ k k) (2.2) <l > = X l lv l (2.3) <k > = X l k k (2.4)

We noti ethat we ould introdu e two generating fun tions

v(x) = X l v l x l (2.5) (x) = X k k x k (2.6) (2.7)

where (2.6) is the same of [69℄, but the (2.5) generalizes it to more omplex stru tures su h

as the mixed rank hyper-graphs we work with. The generating fun tion formalism is not

stri tly ne essary, but an be very helpful when one is interested in omputing more omplex

topologi al properties of the hyper-graph and indeed will be expli itly used in some ases. In

the generalizedp-spin modelthe Hamiltoniantherefore reads

H = M X l H l (2.8) H l = X i1<:::<i l J i1;:::;i l s i1 ;:::;s i l (2.9) with M = <k > <l > N (2.10)

(28)

and P(fJ i 1 ;:::;i l g l ) = Y l Y i 1 <:::<i l ((1 l! l N l 1 )Æ(J i 1 ;:::;i l )+ l! l N l 1 Æ(J i 1 ;:::;i l 1)) (2.11) P(fJ i 1 ;:::;i l g l ) = Y l Y i 1 <:::<i l ((1 l! l N l 1 )Æ(J i 1 ;:::;i l )+ (2.12) l! l 2N l 1 (Æ(J i 1 ;:::;i l 1)+Æ(J i 1 ;:::;i l +1))

respe tively for the ferromagneti and for the frustrated ase,and

l

=(<k >=<l >)v

l .

2.1 Combinatorial optimization interpretation of p-spin

models: the XOR-SAT

Wenoti ethattheseHamiltonians anbealsoseenasthe ostfun tionofa lassa ombinatorial

optimizationproblemsknown underthe nameofXOR-SAT([19,21,82℄,alsoextended in[25℄).

The XOR-SAT problem is not NP, but it is nevertheless a very useful prototype treatable

optimization problem in order to test the power of statisti al physi s tools. Beside this, its

diluted p-spin version bears many interesting properties from the point of view of stru tural

glasses and granular physi s [79, 20, 15, 80, 81℄, so its study is interesting for a transversal

number of dis iplines. We show the ase of the K-XOR-SAT model with K variables per

onstraint,whi h anbeviewedasaperfe tlybalan edversionoftherandomK-SATproblem

1 .

GivenasetofN Booleanvariablesfx

i

=0;1g

i=1;:::;N

,we onstru taninstan e ofK-XOR-SAT

as follows: given original K-SAT lauses (x

i 1 _x i 2 _x i k ) or (x i 1 _x i 2 _x i k ), every sub- lause

ontainedinone ofthem must appear dire ted andnegated inthe orrespondingK-XOR-SAT

onstraint an even number of times. In the K = 2 ase we'll therefore dene the following

elementary onstraints(2- lauses sets with 50% satisfyingassignments)

C(ijj+1) = (x i _x j )^(x i _x j ) C(ijj 1) = (x i _x j )^(x i _x j ) ; (2.13)

In the K =3 ase we'll have onstraints(4- lauses sets with 50% satisfyingassignments)

C(ijkj+1) = (x i _x j _x k )^(x i _x j _x k ) ^ (x i _x j _x k )^(x i _x j _x k ) C(ijkj 1) = (x i _x j _x k )^(x i _x j _x k ) ^ (x i _x j _x k )^(x i _x j _x k ) ; (2.14)

in the K =4 ase we'llhave 8- lauses of type

C(ijklj+1) = (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) 1

(29)

C(ijklj 1) = (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) ^ (x i _x j _x k _x l )^(x i _x j _x k _x l ) (2.15)

and so on. Here ^ and _ stand for the logi al AND and OR operations respe tively and

the over-bar is the logi al negation. Let's on entrate on the K = 3 ase, that ontains all

general elements. In the next hapter we will study a mixed version of this modelin the ase

of a mixture of 2 and 4 lauses, whi h we will all 2+p-XOR-SATas it shares many ommon

features to the 2+p-SAT model studied in [11℄. By randomly hoosing

2

a set E of M triples

fi;j;kg among the N possible indi es and M asso iated unbiased and independent random

variablesJ

ijk

=1,we onstru t a Boolean expression inConjun tive NormalForm (CNF)as

F = ^ fi;j;kg2E C(ijkjJ ijk ) : (2.16)

A logi alassignmentofthe fx

i

g'ssatisfyingall lauses,that isevaluatingF totrue, is alleda

solution of the XOR-SATproblem. If nosu h assignmentexists, F issaid tobeunsatisable.

A slightlydierent hoi eof J

ijk

allowsto onstru t XOR-SATformulwhi hare randombut

guaranteed to be satisable. This will lead to the ferromagneti spin ase: to every Boolean

variablewe asso iate independently drawn randomvariables "

i =1, and dene J ijk =" i " j " k

forallfi;j;kg2E. Forthis hoi e,CNFformulaineq.(2.16)issatisedbyfx

i jx i =+1if "= +1; x i

= 0 if " = 1g. As we shall dis uss in great detail, these formul provide a uniform

ensembleof hard satisable instan es for lo alsear h methods. We referto this version of the

modelasthesatisablehSAT.Indeed,therandomsignsofJ

ijk

anberemovedinthissatisable

ase by negating all Boolean variables x

i

asso iated to negative "

i

. The resulting model has

J ijk

= +1 for all fi;j;kg 2 E, and the for ed satisfying solution is x

i

= 1; 8i = 1;:::;N.

The use of the f"

i

g is a way of hiding the latter solution by a random gauge transformation

without hanging the properties of the model. The impossibility of inverting eÆ iently the

gauge transformation by lo al methodsis a onsequen e of the bran hing pro ess arising form

the presen e of K =3variables inea h onstraint. Forany K >3 the same result would hold

whereas for K =2the problemtrivializes. The XOR-SAT model an be easily des ribed asa

minimization problem of a ost-energy fun tion over a random hyper-graph. Given a random

hyper-graph G

N;M

=(V;E),whereV isthe set ofN verti esandE isthe setof M hyper-edges

joiningtriples of verti es,the energy fun tionto be minimizedreads

H J [S℄=M X fi;j;kg2E J ijk S i S j S k ; (2.17)

where ea hvertex i bears a binary \spin" variableS

i

=1,and the weights J

ijk

asso iated to

the randombonds anbe either1atrandom,intheso alled frustrated ase, orsimplyequal

to 1 in the unfrustrated model. We see that this is indeed the parti ular ase of the 3-spin of

eq.(2.9). On ethe mappingS

i =1if x i =1and S i = 1if x i

=0isestablished, one an easily

noti ethat theenergyfun tionineq.(2.17)simply ountsthenumberofviolated lausesinthe

previously dened CNFformul withthe sameset of J's. The frustratedandthe unfrustrated

ases orrespond tothe XOR-SAT and tothe satisable XOR-SAT formul respe tively.

2

(30)

2.2 From the partition fun tion to the average free-energy

The onstraints onthe degreedistributionwillhave tobeintrodu edalongthe omputationof

the logarithm of the partitionfun tion.

Z =e M X si e P k H k (2.18)

Following the approa h of ref. [19℄, we ompute the free energy of the model with the repli a

method,exploitingthe identitylog<Z

n >=1+n<logZ >+O(n 2 ). The n th momentof the

partition fun tion isobtained by repli ating n times the sum over the spin ongurations and

then averagingoverthe quen hed disorderTheaveragedn-th momentofthe partitionfun tion

takes therefore the followingform,a part from anormalization fa tor:

<Z n >=e nM X ~ s i <e P n a=1 P l P i 1 ;:::;i l Ji 1 ;:::;i l s a i 1 :::s a i l > (2.19)

where the average value of anobservable is given by:

<> = [P(k)℄ Z Y i 1 <:::<i l P(J i 1 ;:::;i l )() N Y i=1 Æ( X l X i 1 <:::<i l jsign(J i 1 ;:::;i l )j k i ) (2.20) Y l Æ(Mv l X i 1 <:::<i l jsign(J i 1 ;:::;i l )j) (2.21)

[P(k)℄ is a normalization fa tor ne essary to res ale to one the sum over the onstrained

probability distribution of the ouplings:

[P(k)℄ Z Y i 1 <:::<i l P(J i1;:::;i l ) N Y i=1 Æ( X l X i 1 <:::<i l jsign(J i1;:::;i l )j k i ) Y l Æ(Mv l X i 1 <:::<i l jsign(J i1;:::;i l )j) (2.22)

The two inserted delta fun tions are there toensure the onstraints on degreeand intera tion

terms distribution. In fa t,the onstraint overthe fra tion of xed rank plaquettes is already

taken into a ount in the parti ular form of the distribution, therefore the normalization an

be limited tothe term

[P(k)℄ Z Y i 1 <:::<i l P(J i 1 ;:::;i l ) N Y i=1 Æ( X l X i 1 <:::<i l jsign(J i 1 ;:::;i l )j k i ) (2.23)

Its value is al ulated in the appendix inthis ase. The nal value in the large N limitis:

[P(k)℄e N <k> P k k log <k > k k ! (2.24)

Even more in generality,H

l

ould be inthe form

H l = X i 1 <:::<i l J i 1 ;:::;i l G (l ) [~s℄ (2.25)

where thefun tions G

(l )

depend onthe parti ularmodelunder onsideration. In this work,for

instan e,we onsiderotherrelevantexamplessu hastheBi oloringproblemofarandomrank3

hyper-graph, whereonlyG

(3) = (s i1 s i2 +s i1 s i3 +s i2 s i3

(31)

model, where G (3) = Q 3 l =1 Æ(s i l

; 1) and the oupling variables follow a dierent probability

distribution too. Going ba k to our generalized p-spin modelwe an write the delta fun tions

in their integral form

Y l Æ(M l X i 1 <:::<i l J i 1 ;:::;i l )= Z Y l ( d l 2 )exp( iM X l l v l )exp(i X l l X i 1 <:::<i l jsign(J i 1 ;:::;i l )j) (2.26) Y i Æ( X l X <i 2 ;:::;i l > i jsign(J i 1 ;:::;i l )j k i )= Z Y i ( d i 2 )exp( i X i i k i )exp(i X i 1 <:::<i l ( k X j=1 i j )jsign(J i 1 ;:::;i l )j) (2.27)

Infa t,aswesaid,the hoi ewehavemadeontheprobabilitydistributionofthe ouplings(and

the onsequent value of the quantities

k

) already impliesthe rst onstraint tobe satisedin

the large N limit. Indeed, if we expli itlyinsert the rst delta fun tion(also the normalization

fa tor willa ordingly hange),we areleft withone supplementaryseries ofsaddlepoint

equa-tions inthe variables

l

. Insertingthe solutionsforthelattervariablesintothe ommonsaddle

point equations, we retrieve equivalent expressions. The averaged n-th power of the partition

fun tion be omes, inthe ase of the dilutedferromagnet,

<Z n > exp ( nM) X ~ s i Z Y i ( d i 2 )exp i N X i=1 i k i ! exp 0 <k > <l > N + X l v l N l 1 exp 0 X i1;:::;i l X a s a i 1 :::s a i l +i l X j=1 ij 1 A 1 A (2.28)

We ould now go on in the al ulation treading a path similar to the one followed for the

fully onne ted models, tra ing out the repli ated spin variables through the introdu tion of

a whole series of overlap and multi-overlap quantities. Due to the distribution of ranks and

degrees, however, the numberofoverlapfun tionthatwewouldneedtotakeinto onsideration

is innite, and we are better o if we exploit a more ompa t mathemati al notation via a

generating fun tion formalism of the overlap series

3

. The orre tgenerating fun tion for this

kindofproblemsturnsouttobewritableasafun tionalorderparameterinthe form[19,9,12℄

(~)= 1 N X i Æ(~ ~s i )e i i : (2.29)

In the ase of Poissonian hyper-graphs, due to the self similarity property of eq. (1.5) that

substituted inteq. (1.6) leads to q

poiss k = poiss k , the elds i

are redundant and re-absorbed in

the degree distribution (Poissonian degree hyper-graphs are the ones obtained in the

thermo-dynami limitwhen using the \free" ouplings probability distributions (2.11) and (2.12)). In

this asethe quantity (2.29)dire tlyrepresentsthe fra tionofrepli atedspins s

a

i

intherepli a

state

a

in the whole graph as well as in the avity one. However, in the general ase it is

ne essary to add a eld

i

that an be physi ally interpreted in the following way: e

i i

is an

3