• Non ci sono risultati.

Elements of Mechanics

N/A
N/A
Protected

Academic year: 2021

Condividi "Elements of Mechanics"

Copied!
593
0
0

Testo completo

(1)

Giovanni Gallavotti

The Elements of Mechanics

(2)

I

Giovanni Gallavotti Dipartimento di Fisica

Universit`a di Roma “La Sapienza” Pl. Moro 2

00185, Roma, Italy

e-mail: giovanni.gallavotti@roma1.infn.it web: http://ipparco.roma1.infn.it

c

2007 Giovanni Gallavotti, II Edition c

(3)
(4)

III

Giovanni Gallavotti Dipartimento di Fisica

Universit`a di Roma “La Sapienza” Pl. Moro 2

00185, Roma, Italy

e-mail: giovanni.gallavotti@roma1.infn.it web: http://ipparco.roma1.infn.it

c

2007 Giovanni Gallavotti, II Edition c

(5)
(6)

Preface

Preface to the Second English edition (2007).

c

This is Version 1.3: October 9, 2012

In 2007 I recovered the Copyright. This is a new version that follows closely the first edition by Springer-Verlag. I made very few changes. Among them the Gauss’ method, already inserted in the second Italian edition, has been included here. Believing that my knowledge of the English language has im-proved since the late ’970’s I have changed some words and constructions.

This version has been reproduced electronically (from the first edition) and quite a few errors might have crept in; they are compensated by the corrections that I have been able to introduce. This version will be updated regularly and typos or errors found will be amended: it is therefore wise to wait sometime before printing the file; the versions will be updated and numbered. The ones labeled 2.* or higher will have been entirely proofread at least once.

As owner of the Copyright I leave this book on my website for free down-loading and distribution. Optionally the colleagues who download the book could send me a one line message (saying “downloaded”, at least): I will be grateful. Please signal any errors, or sources of unhappiness, you spot.

On the web site I also put the codes that generate the non trivial figures and which provide rough attempts at reproducing results whose originals are in the quoted literature. Discovering the phenomena was a remarkable achieve-ment: but reproducing them, having learnt what to do from the original works, is not really difficult if a reasonably good computer is available.

Typeset with the public Springer-Latex macros.

Giovanni Gallavotti Roma 18, August 2007

(7)

Preface to the first English edition

The word ”elements” in the title of this book does not convey the impli-cation that its contents are ”elementary” in the sense of ”easy”: it mainly means that no prerequisites are required, with the exception of some basic background in classical physics and calculus.

It also signifies ”devoted to the foundations”. In fact, the arguments chosen are all very classical, and the formal or technical developments of this century are absent, as well as a detailed treatment of such problems as the theory of the planetary motions and other very concrete mechanical problems. This second meaning, however, is the result of the necessity of finishing this work in a reasonable amount of time rather than an a priori choice.

Therefore a detailed review of the ”few” results of ergodic theory, of the ”many” results of statistical mechanics, of the classical theory of fields (elas-ticity and waves), and of quantum mechanics are also totally absent; they could constitute the subject of two additional volumes on mechanics.

This book grew out of several courses on “Meccanica Razionale”, i.e., essentially, Theoretical Mechanics, which I gave at the University of Rome during the years 1975-1978.

The subjects cover a wide range. Chapter 2, for example, could be used in an undergraduate course by students who have had basic training in classical physics; Chapters 3 and 4 could be used in an advanced course; while Chapter 5 might interest students who wish to delve more deeply into the subject, and fit could be used in a graduate course.

My desire to write a self-contained book that gradually proceeds from the very simple problems on the qualitative theory of ordinary differential equations to the more modem theory of stability led me to include arguments of mathematical analysis, in order to avoid having to refer too much to existing textbooks (e.g., see the basic theory of the ordinary differential equations in §2.2-§2.6 or the Fourier analysis in §2.13, etc.).

I have inserted many exercises, problems, and complements which are meant to illustrate and expand the theory proposed in the text, both to avoid excessive size of the book and to help the student to learn how to solve theoret-ical problems by himself. In Chapters 2-4, I have marked with an asterisk the problems which should be developed with the help of a teacher; the difficulty of the exercises and problems grows steadily throughout the book, together with the conciseness of the discussion.

The exercises include some very concrete ones which sometimes require the help of a programmable computer and the knowledge of some physical data. An algorithm for the solution of differential equations and some data tables are in Appendix O and Appendix P, respectively.

The exercises, problems, and complements must be considered as an im-portant part of the book, necessary to a complete understanding of the theory.

(8)

Preface VII

In some sense they are even more important than the propositions selected for the proofs, since they illustrate several aspects and several examples and counterexamples that emerge from the proofs or that are naturally associated with them.

I have separated the proofs from the text: this has been done to facilitate reading comprehension by those who wish to skip all the proofs without los-ing continuity. This is particularly true for the more mathematically oriented sections. Too often students tend to confuse the understanding of a mathemat-ical proposition with the logmathemat-ical contortions needed to put it into an objective, written form. So, before studying the proof of a statement, the student should meditate on its meaning with the help (if necessary) of the observations that follow it, possibly trying to read also the text of the exercises and problems at the end of each section (particularly in studying Chapters 3-5).

The student should bear in mind that he will have understood a theorem only when it appears to be self-evident and as needing no proof at all (which means that its proof should be present in its entirety in his mind, obvious and natural in all its aspects and, if necessary, describable in all details). This level of understanding can be reached only slowly through an analysis of several exercises, problem, examples, and careful thought.

I have illustrated various problems of classical mechanics, guided by the desire to propose always the analysis of simple rather than general cases. I have carefully avoided formulating ”optimal” results and, in particular, have always stressed (by using them almost exclusively) my sympathy for the only ”functions” that bear this name with dignity, i.e., the C∞-functions and the elementary theory of integration (”Riemann integration”).

I have tried to deal only with concrete problems which could be ”construc-tively” solved (i.e., involving estimates of quantities which could actually be computed, at least in principle) and I hope to have avoided indulging in purely speculative or mathematical considerations. I realize that I have not been en-tirely successful and I apologize to those readers who agree with this point of view without, at the same time, accepting mathematically non rigorous treatments.

Finally, let me comment on the conspicuous absence of the basic elements of the classical theory of fluids. The only excuse that I can offer, other than that of non pertinence (which might seem a pretext to many), is that, perhaps, the contents of this book (and of Chapter 5 in particular) may serve as an introduction to this fascinating topic of mathematical physics.

The final sections, §5.9-§5.12, may be of some interest also to non stu-dents since they provide a self-contained exposition of Arnold’s version of the Kolmogorov-Arnold-Moser theorem.

This book is an almost faithful translation of the Italian edition, with the addition of many problems and§5.12 and with §5.5, §5.7, and §5.12 rewritten. I wish to thank my colleagues who helped me in the revision of the manuscript and I am indebted to Professor V. Franceschini for providing (from his files) the very nice graphs of§5.8.

(9)

I am grateful to Professor Luigi Radicati for the interest he showed in inviting me to write this book and providing the financial help from the Italian printer P. Boringhieri.

The English translation of this work was partially supported by the ”Stiftung Volkswagenwerk” through the IHES.

Giovanni Gallavotti Roma, 27 December 1981

(10)

Contents

1 Phenomena Reality and models . . . 1

1.1 Statements . . . 1

1.2 An example of a Model . . . 3

1.3 The Laws of Mechanics. . . 5

1.4 General Thoughts on Models . . . 8

2 Qualitative Aspects of One-Dimensional Motion . . . 11

2.1 Energy Conservation . . . 11

2.2 General Properties of Motion. Uniqueness . . . 13

2.2.1 Problems for§2.2 . . . 16

2.3 General Properties of Motion. Existence . . . 18

2.3.1 Problems . . . 21

2.4 General Properties of Motion. Regularity. . . 22

2.4.1 Exercises and Problems . . . 26

2.5 Local and Global Solutions of Differential Equations . . . 26

2.5.1 Exercises and Problems . . . 31

2.6 More on Differential Equations. Autonomous Equations . . . 32

2.6.1 Exercises and Problems . . . 35

2.7 One-Dimensional Conservative Periodic and Aperiodic Motions 36 2.7.1 Exercises and Problems . . . 39

2.8 Equilibrium: Stability in the Absence of Friction . . . 40

2.8.1 Exercises and Problems . . . 43

2.9 Stability and Friction . . . 43

2.9.1 Exercises and Problems . . . 46

2.10 Period and Amplitude: Harmonic Oscillators . . . 47

2.10.1 Exercises and Problems . . . 50

2.11 The Damped oscillator: Euler’s Formulae . . . 52

2.11.1 Exercises and Problems . . . 55

2.12 Forced Harmonic Oscillations in Presence of Friction . . . 56

2.13 Fourier’s series for C∞-Periodic Functions . . . 60

(11)

2.14 Forced Oscillations of a Pendulum . . . 64

2.14.1 Exercises and Problems . . . 69

2.15 Damped Pendulum: Small Forced Oscillations . . . 70

2.15.1 Problems . . . 73

2.16 Small Damping: Resonances . . . 74

2.16.1 Exercises and Problems . . . 77

2.17 Rigorously Periodic Oscillators and friction . . . 78

2.17.1 Exercises . . . 82

2.18 Compatibility Conditions for the Anchor Escapement . . . 83

2.19 Encore on Anchor Escapement: Stability of Periodic motion . . 87

2.19.1 Problems . . . 92

2.20 Frictionless Forced Oscillations: Quasi-Periodic Motions . . . 92

2.20.1 Exercises and Problems . . . 96

2.21 Quasi-Periodic Functions.Tori. Fourier traansform . . . 99

2.21.1 Exercises and Problems . . . 107

2.22 Observables and Their Time Averages. . . 108

2.22.1 Exercises and Problems . . . 112

2.23 Averages. Errors and Probability. Stochastic Phenomena . . . 114

2.23.1 Exercises and Problems . . . 123

2.24 Extremal Properties. Action Principle . . . 126

2.24.1 Exercises and Problems . . . 135

3 Constraints. Analytical Mechanics . . . 141

3.1 Systems of Points . . . 141

Exercises . . . 144

3.2 Work. Linear and Angular Momentum . . . 144

Exercises . . . 150

3.3 The Least Action Principle . . . 151

3.4 Introduction to the Constrained Motion Theory . . . 153

3.4.1 Exercises . . . 156

3.5 Ideal Constraints as Mathematical Entities . . . 157

3.5.1 Problems . . . 166

3.6 Real and Ideal Constraints . . . 168

3.6.1 Exercises and Problems . . . 175

3.7 Approximate Constraints. Perfection criteria. . . 176

3.7.1 Exercises and Problems . . . 185

3.8 A Perfection Criterion for Approximate Constraints . . . 186

3.8.1 Problems . . . 196

3.9 Application to Rigid Motion. K¨onig’s Theorem . . . 198

3.9.1 Exercises and Problems . . . 207

3.10 General Considerations on the Theory of Constraints . . . 208

3.11 Equations of Hamilton and Lagrange. Analytical Mechanics . . . 211

3.11.1 Exercises, Problems and Complements . . . 227

3.12 Completely Canonical Transformations: Their Structure . . . 233

(12)

Contents XI

4 Special Mechanical Systems . . . 245

4.1 Systems of Linear Oscillators . . . 245

4.1.1 Exercises . . . 249

4.2 Irrational Rotations on ℓ-Dimensional Tori . . . 250

4.3 Ordered Systems of Oscillators. Heuristics. . . 252

4.4 Oscillator Chains and the Vibrating String . . . 258

4.5 Vibrating String. Wave equation . . . 264

4.5.1 Exercises . . . 269

4.6 Vibrating String: General Case. Dirichlet Problem in [0, L] . . . . 271

4.7 Elastic Film. Wave velocity . . . 278

4.8 Anharmonic Oscillators.Small Oscillations and Integrability . . . 284

4.8.1 Problems . . . 290

4.9 Central Motions . . . 291

4.9.1 Problems . . . 296

4.10 Kepler’s Marvelous Laws . . . 298

4.10.1 Exercises and Problems . . . 302

4.11 Integrable Systems. Solid with a Fixed Point . . . 307

4.11.1 Problems and Complements . . . 317

4.12 Other Integrable Systems. . . 325

4.12.1 Exercises and Problems . . . 331

4.13 Some Integrability Criteria . . . 333

4.14 Integrability. Frequency of visit. Ergodicity. . . 341

4.14.1 Exercises and Problems . . . 351

4.15 Integrability Criteria. Complexity. Entropy . . . 353

4.15.1 Exercises and Problems . . . 360

5 Stability for Dissipative and Conservative Systems . . . 365

5.1 A Mathematical Model for illstration puroses . . . 365

5.2 Stationary Motions for a Dissipative Gyroscope . . . 369

5.2.1 Exercises . . . 373

5.3 Attractors and Stability . . . 374

5.3.1 Exercises . . . 381

5.4 The Stability Criterion of Lyapunov . . . 382

5.4.1 Exercises . . . 386

5.5 Vague Attractivity of a Stationary Point . . . 389

5.5.1 Exercises . . . 406

5.6 Vague-Attractivity Properties. The Attractive Manifold . . . 408

5.6.1 An Equivalent Problem . . . 413

5.6.2 B: Some Useful Estimates of Derivatives. . . 414

5.6.3 C: Definition of the Approximate Surfaces. . . 415

5.6.4 D: Approximate Surfaces are Well Defined. . . 416

5.6.5 E: Alternative Existence Proof . . . 416

5.6.6 F: Check of Validity of Eq.(5.6.49). . . 419

5.6.7 G: Existence limt→+∞πnt . . . 420

(13)

5.6.9 I: Attractivity of σ(π∞). . . 423

5.6.10 L: Order of Tangency. . . 423

5.6.11 M: Regularity in α. . . 425

5.6.12 N: General Case. . . 427

5.6.13 Exercises . . . 428

5.7 The Hopf Bifurcation . . . 431

5.7.1 Exercises and Problems . . . 438

5.8 Stability of periodic or more complex orbits . . . 440

5.8.1 A. Example 1: The “Lorenz Model”. . . 444

5.8.2 B. Example 2: Navier-Stokes equation with 5 modes. . . . 446

5.8.3 C. Navier-Stokes equations with 7 modes . . . 452

5.8.4 Problems and Complements . . . 454

5.9 Stability in Conservative Systems: Introduction . . . 458

5.10 Formal Theory of Perturbations. Hamilton–Jacobi Method . . . . 464

5.10.1 Exercises and Problems . . . 476

5.11 Analytic Implicit Functions theorems . . . 479

5.11.1 Problems and Exercises . . . 486

5.12 Perturbations of Trajectories. Small Denominators Theorem . . 487

5.12.1 Problems . . . 511

6 Appendices . . . 519

6.1 A: The Cauchy-Schwartz Inequality . . . 519

6.2 B: The Lagrange-Taylor Expansion . . . 520

6.3 C: Functions with Bounded Support . . . 521

6.4 D: Principle of the Vanishing Integrals . . . 522

6.5 E: Matrices, Eigenvalues, Eigenvectors . . . 523

6.6 F: Positive-Definite Matrices. . . 525

6.7 G: Implicit Functions Theorems . . . 527

6.8 H: The Ascoli-Arzel´a Convergence Criterion . . . 534

6.9 I: Fourier Series for Functions in C∞([0, L]). . . 536

6.10 L: Proof of Eq. (5.6.20) . . . 537

6.11 M: Proof of Eq. (5.6.63) . . . 539

6.12 N: Analytic Implicit Functions . . . 540

6.13 O: Finite-Difference Method . . . 544

6.14 P: Astronomical Data . . . 546

6.15 Q: Gauss Method for Planetary Orbits . . . 548

6.16 S: Definitions and Symbols . . . 565

6.17 T: Suggested Books and Complements . . . 566

References . . . 569

(14)

1

Phenomena Reality and models

1.1 Statements

The results of physical experiments are determined by observations based on the measurement of various entities, i.e. the association of well defined sequences of numbers with well defined sequences of events.

The physical entities are “operationally defined”. This means that they are defined in terms of the operations used to construct the numbers that provide their “measure”.

For instance, the sequence of operations necessary to measure the “dis-tance” between two given points P and Q in space consists in choosing a particular ruler and placing it on the straight line joining points P and Q, starting from P . Taking the endpoint of the ruler as the new starting point, the procedure is repeated n times until the endpoint of the ruler is superim-posed on Q. If the distance P Q is not an exact multiple of the length of the ruler, one may, after n such operations, reach a point Qn 6= Q preceding Q on the line P Q; and after n + 1 operations one may reach point Qn+1following Q on the line P Q. Then one takes a new ruler “ten times shorter” and puts it on QnQ trying to match, as before, the second endpoint with Q. When this turns out to be impossible, one can, as in the first case, define a new point Qn1 on QnQ and, then, take a third ruler ten times shorter than the second

and repeat the operation.

Thus, inductively, a number n + 0.n1n2. . . (in decimal representation) is built which, by definition, is the measure of the distance between P and Q.

The above sequence of operations appears well defined but, in fact, a care-ful analysis shows that it does not have the prerequisites to be considered a

(15)

mathematically precise definition. What, for instance is “space”, what is a “point”, what is a “ruler”? Is it possible to “divide” a ruler into parts, and infinitely often?

The physicist is not too concerned (or, rather, not at all concerned) with such aspects of the question: he considers a physical entity well defined when-ever the empirical procedure necessary for its measurement is clear.

A measurement procedure is considered to be clear when every observer is led to the same result when measuring the same physical entity. It should be stressed, however, that this is an empirical criterion perpetually subject to critique; thus physical entities which today are considered to be well defined may no longer be so in the future.

Hence, the physicist, from his observations of nature, obtains a set of num-bers corresponding to the performance of some operations which are consid-ered to be “objectively defined”. Trying to organize such numbers coherently, the physicist often formulates “models”.

In the attempt to organize coherently such numbers, the physicist formu-lates “models”: i.e. he associates well-defined mathematical structures with his measurements, and he tries to establish a (small) number of mathematical relationship among them. From such relationships new ones logically follow, which reinterpreted through the model, used inversely, may serve to predict new relations between various empirical measurements.

The belief in the existence of good models motivated Galileo to write: “Philosophy is written in the great book which is always open before our eyes (I mean the universe) but it cannot be understood unless one first learns the language and distinguishes the characters in which it is written. It is a mathe-matical language and the characters are triangles, circles and other geometri-cal figures, without which it cannot be understood by the human mind; without them one would vainly wonder through a dark labyrinth”.1

A mathematical model is considered satisfactory whenever it does not lead to contradictions with the experiments. If a contradiction occurs, the physicist dismisses the model as “wrong”; nevertheless, the mathematical construction built with it remains valid and is witness to an imperfect representation of nature.

Strictly speaking there is no model which is not wrong: only models that have not yet been shown to be wrong exist. However, all “serious” models (such as the dynamics of point masses, the theory of relativity, quantum mechanics, electromagnetism, thermodynamics, statistical mechanics, etc.) have led, and still lead, to the formulation of extremely interesting mathematical problems. Furthermore, it often happens that the analysis of the mathematical properties of a “wrong” model helps in the formulation of the new “more elaborate” model that the physicist tries to set up as a substitute.

A link between phenomena reality and mathematics can therefore be es-tablished as just described, through what has been called “a model”. However,

(16)

1.2 Example of a Model 3

it would be impossible to give a precise mathematical definition of the notion of a model because it is a rather empirical notion which can only be well understood through the analysis of several concrete cases.

1.2 An example of a Model

Consider the historically particularly important and significant case of the “mechanics of point masses”. Its construction from empirical observations will be briefly and concretely analyzed, presenting it as a model of one or several point masses subject to forces.

The first statement (or “axiom”, to use a mathematical term) says that the point masses are in a three-dimensional Euclidean spaceR3 in which any point can be represented by its three coordinates with respect to an orthogonal reference system (O; i, j, k). The notation means that O is the origin and i, j, k are the three orthogonal unit vectors pointing along the x, y, z coordinate axes, respectively.

Such an idealization has a clear mathematical meaning, but it appears to be unprovable in mathematical terms: it just renders the following empirical observation.

In practice, a point in space is determined by measuring (often only in principle and with the ruler method described in§1.1) its distance from three orthogonal walls. It is to be remarked that all such operations are ordinarily considered well defined.

A second statement (or “axiom”) concerns “time” which, for the physicist, is the physical entity measured by a “clock” (classically described as a pen-dulum, although any more modern device will do as well). One assumes that time is an absolute “entity”: in other words, one states that, at least in prin-ciple it is possible to associate with every point in space a clock mechanically identical at every point, and, furthermore, to coordinate (“synchronize”) the clocks.

This means that if P, P′ are two points and t, tare two chosen time instants t < t′ it is then possible to send a signal from P towards Pleaving P at time t and reaching P′ at time t(as indicated by the local clocks in P and in P′, respectively); while, vice versa, if t > t, the above operation should be impossible.

A little thought makes it clear that the operational definition of a “system of synchronized clocks” is based on the empirical fact that it is possible to send signals with arbitrary speed. It is also clear that the notion of time is a phenomenological notion, far from being mathematically well posed.

Accepting the point of view so far discussed, one is led to say that the math-ematical scheme, or model, representing the space-time continuum,where our observations take place, consists of a four-dimensional space: each of its points (x, y, z, t) represents a point seen in a Cartesian coordinate frame (O; i, j, k)

(17)

(“laboratory”) and observed at the instant t (as measured by the formerly introduced universal clocks).

Empirically, a point mass is any object which, at least as far as our obser-vations are concerned, can be assimilated with a point in space (for instance, a planet or a star in the universe, a stone falling in a ravine, a ship sailing in the ocean, etc.). Such a point preserves its identity over the course of time; hence, it is possible to define its trajectory through a function of time t→ x(t), where x(t) = (x(t), y(t), z(t)) is the vector whose components are the coordinates of the point at time t, in the chosen reference frame (O; i, j, k).

Mathematically, a point mass moving in the reference frame (O; i, j, k) observed as t varies over an interval I is represented as a curve C inR3 by the vector equations P (t)− O = x(t), t ∈ I; and the parameter t has the interpretation of time (i.e., it is called “time”).

Given a point mass moving as t varies in I, one can associate with it its “velocity” at time t ∈ I. Operationally, velocity is defined by fixing t0 ∈ I, finding the positions P (t0) and P (t0+ ε), and setting

v(t0) =

P (t0+ ε)− P (t0)

ε , (1.2.1)

where the parameter ε > 0 is to be chosen “suitably small” (according to well-defined criteria which, however, depend on the concrete cases). The mathe-matical model defines the point mass velocity at time t0∈ I as the derivative of the function t→ x(t) at t = t0.

To complete the mathematical model of a point mass, it is important to define the “force” acting on it.

Operationally, the force acting at a given instant on the point mass con-sists of three scalar quantities which together define a vector f (t). The force acting on the point mass moving inR3 and observed in the frame (O; i, j, k) is measured through a “dynamometer” which is an instrument whose use is convenient to describe in a strongly idealized form. It is, basically, a suitably built spring which will be imagined as a very thin, light segment with a hook.

Consider a point mass moving in R3, with a velocity v = (v

x, vy, vz) relative to the reference frame (O; i, j, k) at time t0. To measure the force acting upon it, hook it to the dynamometer to which the same velocity v has been imparted and which will be kept fixed during the measurement. Then try to adjust the spring length and direction so that the acceleration at time t0+ ε is 0, where ε > 0 is chosen “suitably small”. (The empirical notion of acceleration and the corresponding mathematical model of it, as the second derivative with respect to t of the point position, is discussed along the same lines as the notion of velocity.)

The force is then the vector f whose direction is that of the dynamometer at time t0+ ε, whose orientation is that parallel to hook but pointing away from it and whose modulus is the size of the spring elongation.

Summarizing: a point mass subject to forces and observed in a frame (O; i, j, k) in R3 as time varies within an interval I is, in its mathematical

(18)

1.3 Example of a Model 5

model, described by a curve in seven-dimensional space: one of its points (t, x, y, z, fx, fy, fz) represents a point mass which at time t has coordinates (x, y, z) in (O; i, j, k) and, in the same frame, is subject to a force (fx, fy, fz). The curve representing this situation can be parameterized by the parameter t itself, as t varies in some time interval I; it shall also be assumed that in this parametric representation the functions t → (x(t), y(t), z(t)) are twice continuously differentiable so that a mathematical definition of velocity and acceleration is meaningful.

1.3 The Laws of Mechanics

Once it is established what is meant by a point mass subject to forces and studied in a given frame of reference in R3 as the time varies in an interval I (briefly, “a point mass subject to forces”), it is possible to complete the mathematical model of the point mechanics. For this purpose, the “laws of dynamics” and their mathematical interpretation have to be discussed.

Experimentally, given a point mass, a simple relation is observed between its acceleration a at time t (in a given frame of reference) and the force f acting on it at that time (observed in the same frame). Such a relation is called the Second Law of Mechanics and establishes the existence of a constant m > 0, characteristic of the point mass and independent of the frame of reference used for the observations, such that:

ma = f . (1.3.1)

This law introduces, via the properties of the differential equations, many relations among the quantities x, v, t, and such relations can sometimes be experimentally checked. For instance, if it is known a priori which force will act on the point mass whenever it is at the point (x, y, z) at time t with velocity (vx, vy, vz), then, denoting such force as f (vx, vy, vz, x, y, z, t) = f (v, x, t), the differential equation

m ¨x = f ( ˙x, x, t) (1.3.2)

allows the determination of the motion following an initial state, in which the velocity v0 and the position x0are given at time t0, at least for a small time interval around t0 if f is a smooth function, see Chapter 2.

The First Principle of Mechanics postulates the existence of at least one reference frame (O; i, j, k), called “inertial frame”, in R3 where a point mass “far” from the other objects in the universe appears to be subjected to a null force in (O; i, j, k). Such a frame is experimentally identified with a frame with origin in a fixed star and with axes oriented towards three more fixed stars. It is to such a frame that motion is often referred.

Of course the notions of “far” and of “fixed star” are empirical notions rather than mathematical ones.

(19)

Mathematically, the first principle is used to grant to a particular frame of reference in the space-time continuum a privileged role and to define the “absolute force” or the “true force” as that acting on the point mass in this frame. This frame has to be chosen once and for all and is called the “fixed reference frame” (as opposed to “moving reference frame”).

It is possible and sometimes convenient to introduce frames whose ori-gin and axes vary with time with respect to the “fixed” frame (O; i, j, k) : (0(t); i(r), j(t), k(t)).

Since f = ma, it follows that if the moving frame is in uniform rectilinear translational motion with respect to the fixed frame, then the force acting upon the point is the same whether observed in the fixed frame or in the moving frame: hence, in this moving frame, the “inertia principle”, i.e., the first principle, is valid: a point mass which is “very far” from the other objects in the universe is subject to a null force, since the acceleration is the same in the two frames. All frames in rectilinear uniform motion with respect to a fixed frame are called “inertial frames”.

The mathematical model of a point mass with mass m subject to forces and obeying the laws of dynamics is then, simply, a point mass subject to forces, in the sense of the preceding section, and such that the relation

m a = f (1.3.3)

holds and, furthermore, f is a function of the point velocity, position, and time; i.e., the following relation holds:

f = f (v, x, t). (1.3.4)

Clearly, from such a mathematical viewpoint (where f is imagined as given a priori), the first principle is deprived of its deep physical meaning.

An important extension of the point mass model is a model for the me-chanics of a “system of N point masses”. Mathematically, such a system con-sists of N point masses with mass m1, . . . , mN, in the above sense, satisfying the Third Principle of Mechanics. This means that it should be possible to represent the force fi acting on the i-th point as

fi= X j6=i

fj→i, (1.3.5)

where fj→iare such that

(a) fj→i=−fi→j, j, i = 1, 2, ..., N, i6= j;

(b) fj→iis parallel to Pj− Pi, i.e., to the line joining the positions Pi and Pj of the i-th and j-th points;

(c) fj→idepends solely upon the positions and velocities of the it-h and j-th points and on time:

(20)

1.3 Example of a Model 7

This assumption corresponds to a precise empirical fact: it is possible to define operationally what should be understood by fj→i “the force exerted by the point Pj on the point Pi”.

For instance, the force fj→i could be measured as follows: one measures, in the given inertial frame of reference, the force fi, acting on i and then one measures, after removing the point j from the system, the new force acting on the i-th point, obtaining the result fi(j); then one sets

fj→i= fi− fi(j). (1.3.7)

The Third Principle of Mechanics arises from the experimental observation that fj→i=−fi→j, that fj→iis parallel to Pj− Pi, that the total force acting on a singe point mass is the sum of the forces exerted on it by the other system points (in the sense of vectors addition) if observed in an inertial frame of reference, and, finally, that fj→i depends only upon the positions and velocities of the points involved and, possibly, on time.

Physics often places still more requirements and restrictions upon the laws of force which can be used to give a more detailed specification of a mechani-cal system model. However, they do not have a general character comparable to the three principles but, rather, are statements explaining which laws of force are to be considered a good model under given circumstances. For in-stance, two point masses “without structure” (this is, again, an empirical notion which we refrain from elucidating) attract each other with a force of intensity mm′/kr2, where r is the distance between the points, m and mare their masses, and k is a universal constant. If the structure of the two points can be summarized by saying that they have an “electric charge e” (a new em-pirical notion), the mutual force will be the vector sum of the above-described gravitational force and of a repulsive force with intensity k′e2/r2, where kis another universal constant.

The principles of mechanics already place enough restrictions upon the na-ture of the forces admissible in mechanical problems: therefore it is convenient and interesting to examine their implications before passing to the analysis of special models obtained by concretely specifying the “force laws”, i.e., the functions giving the forces in terms of the points positions and velocities and of time.

It should be stressed, and this is a general comment on the mathemati-cal models for physimathemati-cal phenomena, that the mathematimathemati-cal model is always “poorer” than the physical reality that it tries to imitate. For instance in the above mathematical model for mechanics, the first principle loses its meaning. Another example, implicit in the above discussion, is the following.

To give an operational meaning to the notions of position, speed, force, etc., it must be possible to repeat “identical” experiments several times (e.g., see the position measurement in§1.1 by repeating the measurement operations. However, time inexorably flows away, and this is impossible. Physically, this difficulty is avoided by the “principle of homogeneity of space-time” which

(21)

says that experiments starting at any time in any space location will yield the same results if the points involved are in the same relative positions and situations.

In the mathematical model for mechanics just described, the necessity of understanding the above problems does not arise, nor do many other similar problems which the reader will easily think of.

Usually it is possible to complicate the models in order to imbue them with any given number of physical facts: but an analysis of this type of questions would lead us beyond the scope of this book.

In any case, a decision is always needed on where to put a stop to the process of model improvement, which would otherwise hopelessly continue ad infinitum. We must recall that we have the more down-to-earth, and more interesting, problem of obtaining some concrete prediction algorithms for our observations of nature.

1.4 General Thoughts on Models

In this book more abstract schematization processes concerning empirically observed phenomena will be met (e.g., when we discuss the notion of an “observable” or of a “vibrating string”). In such cases, however, the details of the construction of the mathematical model will not be repeated: a very common practice based on the idea that the very words used to designate well-defined mathematical objects will implicitly define the model.

It is such a practice, or better, its imperfect understanding, which some-times causes misunderstandings between physicists and mathematicians and provokes allegations of non-rigorous use of mathematics.

It is important to realize that when the physicist speaks in mathematical terms he is by no means attributing to them the same rigid meaning that a mathematician would assume for them. Rather he is using this language to help himself in the formulation of a model which, once well defined, he shall rigorously treat (since he believes, or at least hopes, that the book of nature is written in mathematical characters).

Possibly logically non rigorous steps or apparently wild mathematical ap-proximations in a physicist’s argument should always be interpreted as further complications or, better, refinements of the model that the physicist is trying to build.

In the hectic development of research, a physicist often modifies a model while using it, or he modifies the mathematical meaning of the objects and entities which belong to the model without changing their names (otherwise, a dictionary would not suffice). He does this because his main interest is in the construction of models and only secondarily in its mathematical theory, often considered trivial for his needs.

To avoid excessively pedantic discussions, we shall adhere, in the following, to the well-established practice of avoiding the physical analysis necessary to

(22)

1.4 Thoughts 9

the construction of a model and shall leave it to the reader to imagine such an analysis via the suggestive names used for the various mathematical entities (with the exception of a few important cases). In any case, this book is devoted to the mathematical, rather than physical aspects, of mechanical problems.

Bibliographical Comment. It is very useful to study at least the defi-nition and the laws of motion in the Philosophiae Naturalis Principia Mathe-matica by I. Newton, [37], to understand exactly the Newtonian formulation of mechanics and its modernity. To avoid “reading too much”, i.e., to avoid interpreting these immortal pages in too modern a way, it is a good idea to read the paper Essays on the history of mechanics by C. Truesdell, pp. 85-137 ([48]). The reading of the first two chapters of the work by E. Mach, [31],) will be a very useful and stimulating complement to the first three chapters of this book.

(23)
(24)

2

Qualitative Aspects of One-Dimensional

Motion

2.1 Energy Conservation

Consider a point mass, with mass m, on the lineR and subject to a force law depending uniquely on its position. Therefore, a force law ξ → f(ξ) is, given ξ∈ R, which we shall suppose to be of class C, associating with every point ξ on the lineR the component f(ξ) of the force acting on the point when it happens to occupy the position ξ.

A “motion” of the point mass, observed as t varies in an interval I, is a function t→ x(t), t ∈ I, of class C(I) such that

m ¨x(t) = f (x(t)), ∀ t ∈ I (2.1.1)

The “energy conservation theorem” follows by multiplying Eq. (2.1.1), side by side, by ˙x(t):

m ˙x ¨x = ˙x f (x), (2.1.2)

omitting, as will often be done, the explicit mention of the t-dependence. Then, defining the functions,

η→ T (η)def= 1 2m η 2, ξ → V (ξ)def= − Z ξ f (ξ′) dξ′, (2.1.3) it is d dtT ( ˙x) = m ˙x ¨x, d dtV (x) =−f(x) ˙x (2.1.4)

(25)

so that Eq. (2.1.2) becomes d

dt(T ( ˙x) + V (x)) = 0 (2.1.5)

This implies a constant E can be associated with every motion t → x(t), t∈ I, depending on the motion under consideration and such that

T ( ˙x(t)) + V (x(t)) = E, ∀ t ∈ I. (2.1.6) The expressions T ( ˙x) and V (x) are respectively called the “kinetic energy” and the “potential energy” and Eq. (2.1.6) has to be read as follows: “in every motion developing under the action of a force with potential energy V , the sum of the kinetic energy and potential energy is a constant”. This constant is given the name “total energy” of the considered motion. The “qualitative theory” of Eq. (2.1.1) is concerned with the analysis of the properties of the motion verifying Eq. (2.1.1), which are valid independently of the choice of f , at least for vast classes of functions f . The energy conservation is a first example of a qualitative property.

Observations. The energy conservation goes back at least to Huygens; after-wards, it was used by J. and D. Bernoulli together with the law of conservation of linear momentum (Descartes) (see [48], p. 105 and following).

Eq. (2.1.6) implies an expression for the velocity: ˙x(t) =± m2(E− V (x(t)))12, t

∈ I (2.1.7)

This relation, which will be used and discussed in§2.6, allows the reduc-tion of the determinareduc-tion of the evolureduc-tion law t → x(t), t ∈ I, “time law”, to an area-computation problem for a planar figure, “quadrature”. In fact, supposing ˙x > 0, it yields: t = Z x(t) x(0) dξ q 2 m(E− V (ξ)) dξ (2.1.8) when I⊃ [0, t].

Hence, the area under the graph of the curve with equation ξ→ T (ξ) = (2

m(E− V (ξ)))−

1

2 above the interval [x(0), x(t)] is the time that the point

needs to reach x(t), starting from x(0) at time 0 with positive speed and energy E, at least for small t (i.e., as long as ˙x > 0).

Newton “reduced to quadratures” the simplest problems of motion without explicitly using energy conservation ([37], for instance Book I, Propositions XXXIX, XLI, LIII, LVI, etc.).

(26)

2.2 Uniqueness 13

2.2 General Properties of Motion. Uniqueness

In the preceding§2.1, a motion developing, under the action of a force f, in a time interval I was supposed to be given. We can ask which further properties of a particular motion allow us to select it from among all motions which, in the same time interval I, take place under the action of the same force.

One can even preliminarily ask whether, given an interval I, there exist any motions, i.e., C∞ solutions of Eq. (2.2.1) thought of as an equation for t→ x(t), t ∈ I.

In view of the importance of such questions, before proceeding in the analysis of Eq. (2.1.1), some attention will be devoted to the general problem of the existence, uniqueness, and regularity of the solutions of differential equations in Rd.

Eq. (2.1.1), thought of as a “second-order” differential equation inR1, is equivalent to a “first-order” equation inR2: it suffices to write it as

˙x(t) = y(t), ˙y(t) = f (x(t)), (2.2.1) where Eq. (2.2.1) is an equation for the unknown C∞function t→ (x(t), y(t)) defined on I and with values inR2.

More generally, consider an arbitrary “s-th order” differential equation in Rd, s = 0, 1, . . ., like dsx(t) dts = f ( ds−1x(t) dts−1 , . . . , dx(t) dt , x(t), t), (2.2.2)

with t∈ I, where f is an Rd-valued Cfunction defined onRd× Rd× R and t → x(t) is an unknown Rd-valued Cfunction on I. The latter equation may be thought of as a first-order equation inRd by setting

dx(t) dt = y1, dy(t)1 dt (t) = y2, . . . dy(t)s−2 dt = ys−1, dy(t)s−1 dt = f (ys−1(t), . . . y1(t), x(t), t) (2.2.3) and then considering Eq. (2.2.3) as an equation for the C∞ function t (x(t), y1(t), . . . .ys−1(t)) defined on the interval I and with values in Rd × . . .× Rd =Rd s.

Eq. (2.2.2) is the most general differential equation that will be met in this book. By virtue of the preceding remark, it will then suffice, for our purposes, to study first-order differential equations inRd having the form

˙x(t) = F(x(t), t), t∈ I, (2.2.4)

It will be convenient to introduce a precise convention about what a dif-ferential equation is or about what one of its solutions is.

(27)

1 Definition. Given anRd-valued function F∈ C(Rd× R), the expression (2.2.4), denoted, for short, ˙x = F(x, t), will be called a “differential equation on Rd associated with F”.

A “C(k) solution”, k > 1, of Eq. (2.2.4) on the interval I, closed or open or semi open, will be a C(k) function which turns Eq. (2.2.4) into an identity when substituted into it.1 A “solution” of Eq. (2.2.4) for t

∈ I is a C∞ solution. The solutions of Eq. (2.2.4) will often be called “motions”.

Let us first examine the uniqueness problem for the solutions of Eq. (2.2.4). 1 Proposition. Let (ξ, t)→ F(ξ, t) be an Rd-valued Cfunction on Rdx

R. Given a > 0, b > 0, t0 ∈ R, let t → x(t) be a C(1) solution of Eq. (2.2.4) on J = [t0− a, t0+ b]:

(i) the function t→ x(t) is in C(J);

(ii) if t→ y(t) is another solution of Eq. (2.2.4) on J and if y(t0) = x(t0), then x(t) = y(t),∀ t ∈ J.

Observations.

(1) This proposition applied to Eq. (2.2.2) via Eq. (2.2.3) tells us that two C(s) solutions of an s-th order differential equation in

Rd for t

∈ J coincide if and only if at time t0∈ J (“initial time”) they have the same first (s − 1) derivatives (“equal initial data”). When Eq. (2.2.2) is the equation governing a physical motion in Rd, it is s = 2; this means that the motion is uniquely determined, if existing at all, by its initial position x(t0) and by its initial velocity ˙x(t0), i.e., as one says, by its initial “act of motion” ˙x(t0).

(2) It would appear that it might be interesting or important to know if, by specifying properties of the solutions of Eq. (2.2.2) other than the just-mentioned initial data at some initial time, the solution verifying such prop-erties is uniquely determined 2, if existing at all. The uniqueness criterion that we chose above for illustration purposes, Proposition 1, has been se-lected only because it quickly leads to a simple answer and because it is one of the uniqueness criteria which are most useful in many applications. (3) From the proof it will appear that if F had been only supposed to be of class C(k), k≥ 1, then uniqueness would have followed in an equal way. The regularity of t→ x(t), t ∈ J, could also be deduced in this case, but one would only obtain that t→ x(t) is a C(k+1) function.

Proof. By integrating both sides of Eq. (2.2.4) and by setting x0= x(t0) = y(t0), we get: x(t) = x0+ Z t t0 F(x(τ ), τ ) dτ, t∈ J, (2.2.5) 1

We shall see that every C(k)solution, k > 1, is automatically a Csolution, if F∈ C. 2For instance, we can ask the following question. Consider Eq. (2.2.2) with s = 2 and

lei t1, t2be two times and x1, x2∈ Rdbe two positions. Is the motion [solution of Eq.

(2.2.2)] leading from x1 to x2 as time elapses from t1 to t2 (assuming that one such

motion, at least, exists) unique? We shall see that the answer to this question will, in general, be no.

(28)

2.2 Uniqueness 15

and, similarly, since also t→ y(t) is a solution of Eq. (2.2.4): y(t) = x0+ Z t t0 F(y(τ ), τ ) dτ, t∈ J. (2.2.6) Hence, x(t)− y(t) = Z t t0 (F(x(τ ), τ )− F(y(τ), τ)) dτ. (2.2.7) To prove (ii) the procedure that will be followed is very interesting since it obviously goes beyond the particular result that we wish to obtain.

Informally, the argument is the following: the difference |x(t) − y(t)| is, by Eq. (2.2.7), about |t − t0| |F(x(t), t) − F(y(t), t)|, if t ∼ t0; however, the increment |F(x(t), t) − F(y(t), t)| is proportional, by Lagrange’s theorem, to the increment of the argument of F, i.e., to C|x(t) − y(t)|, where C is an estimate of the first derivatives of F. Hence, Eq. (2.2.7) implies that |x(t) − y(t)| and C|t − t0| |x(t) − y(t)| are about equal if t ∼ t0, and this, in turn, implies that |x(t) − y(t)| = 0 for t dose to t0 because for t ∼ t0, one has C|t − t0| < 1.

To estimate the integrand of Eq. (2.2.7) let S ⊂ Rd be a sphere with so large a radius that it contains all the values x(τ ), y(τ ),∀ τ ∈ J, and let

MS = max ξinS,τ ∈J d X i,j=1 ∂F(i) ∂ξj (2.2.8)

where F(i)(ξ, t) is the i-th component of the vector F(ξ, t) = (F(1)(ξ, t), . . . , F(d)(ξ, t))

∈ Rd. Then, from Taylor’s formula:

|F(x(τ), τ) − F(y(τ), τ)| ≤ MS|x(τ) − y(τ)|. (2.2.9) Inserting this inequality into Eq. (2.2.7), yields

|x(t) − y(t)| ≤ MS Z t

t0

|x(τ) − y(τ)| dτ (2.2.10)

Let M (t) = maxt0≤τ ≤t|x(τ) − y(τ)|, t ∈ [t0.t0+ b]; then Eq. (2.2.10) implies

|x(t) − y(t)| ≤ MSM (t)|t − t0|, ∀ t ∈ [t0, t0+ b].

Since M (t) is monotonic nondecreasing and since this inequality holds for all t∈ [t0, t0+ b], one easily finds that

M (t)≤ MS|t − t0| M(t), ∀ t ∈ [t0, t0+ b] (2.2.11) which implies M (t) = 0 for|t − t0| < MS−1, t∈ [t0, t0+ b].

Hence, x(t0+MS−1) = y(t0+MS−1), if t0+MS−1< t0+b, and the argument can be repeated, replacing t0 by t0+ MS−1, to show that M (t) = 0 for t ∈ [t0, t0+ 2MS−1] if t0+ 2MS−1< t0+ b, etc., so that M (t) = 0 for t∈ [t0, t0+ b]. For t∈ [t0− a, t0], one proceeds likewise.3

3 Alternatively, Eq. (2.2.10) could be iterated n times to yield, if µ = max

|x(τ) − y(τ)|, τ∈ [t0− a, t0+ b]:

(29)

To check (i), i.e., that t → x(t) is a C∞ function on J, remark that if t → x(t) is a C(1)(J) function, then Eq. (2.2.4) implies that t → ˙x(t) is in C(1)(J), being a composition of a C∞ function with a C(1) function; furthermore, by differentiating Eq. (2.2.4):

¨ x(t) = d X i=1 ∂F ∂ξi (x(t), t)· ˙x(i)+∂F ∂t(x(t), t) (2.2.12) which, in turn, implies that t→ ¨x(t) is a C(1)function by the same argument as above. Then, by differentiating Eq. (2.2.12), one finds that x. . .(t) is a C(1)

function on J, etc. mbe

2.2.1 Problems for §2.2

1. If t → x(t), t ≥ 0, solves ˙x = f (x) and x(0) = x(T ) for some T > 0, then x(t) = x(t + T ),∀ t > 0; assume f ∈ C∞(Rd). Would this also be true if f∈ C1(Rd)? (Hint: Use

uniqueness).

2.The property of the preceding problem is not valid when the differential equation right-hand side is explicitly time dependent (i.e., ˙x = f (x, t), and ∂f /∂t6= 0, the “non autonomous case”). Find an example.

3.Let f (x, t) be such that f (ξ, t) = f (ξ, t + T ) for some T > 0 and for all ξ∈ Rd. Suppose

that t → x(t) is a solution of ˙x = f (x, t) such that for some integer m > 0, one has x(0) = x(mT ), then x(t)≡ x(t + mT ), ∀ t ≥ 0. (Hint: Use uniqueness.)

4. Consider the equation ˙x(t) = ℓ(t) x(t) with ℓ ∈ C∞(R). Show that if t → x(t) and

t→ y(t) are two solutions for t ∈ J and if x(t) 6≡ 0, there exists a constant A such that y(t)≡ Ax(t), ∀ t ∈ J.

5. If the function ℓ of the Problem 4 is periodic with period T > 0 and t→ x(t) 6≡ 0, is one of its solutions then also t → x(t + T ) is a solution. Hence, ∃λ 6= 0 such that x(t + T ) = λx(t). Show that λ > 0. (Hint: Otherwise either λ = 0 and x(T ) = 0, hence x(t)≡ 0 (by uniqueness on [0, +∞)), or λ < 0 and there would be t ∈ (0.T ] where x(t) = 0: hence, again, x(t) = 0 by uniqueness.)

6.The most general solution t→ y(t), t ∈ R+, of the equation in Problem 4, with ℓ periodic

with period T has the form y(t) = Aλt/Tz(t), where z∈ C(R+) is T -periodic.

7.∗Consider the equation ˙x = L(t)x inRd, where t→ L(t), t ∈ R, is a d× d-matrix valued

C∞ function. Consider d solutions x(1), . . . , x(d) for t ∈ I = [a, b] and call them

“inde-pendent” if∃ t0∈ I such that the d vectors x(1)(t0), . . . , x(d)(t0) are linearly independent.

Show that, if t∈ I, then also x(1)(t), . . . , x(d)(t) are linearly independent whenever they

are such for t = t0 and, furthermore, any solution t→ y(t), t ∈ I, can be represented as

y(t) =Pdj=1Ajx(j)(t),∀ t ∈ I. (Hint: If for t = t, the d vectors were not independent,

|x(t) − y(t)| < MSn Z [t0,t] dτ1 Z [t0,τ1] dτ3. . . Z [t0,τν−1] dτn|x(τn)− y(τn)| ≤ Mn Sµ Z [t0,t] dτ1 Z [t0,τ1] dτ3. . . Z [t0,τν−1] dτn = MSnµ|t − t 0|n n! ≤ M n Sµ (a + b)n n! so that x(t)− y(t) ≡ 0 since n is arbitrary and it can be let to +∞.

(30)

2.2 Uniqueness 17

one could find constants A, . . . , Ad, not all equal to zero, such thatPdj=1Ajx(j)(t) = 0;

hence, by linearity and uniqueness, Pdj=1Ajx(j)(t) = 0,∀ t ∈ I which contradicts the

independence for t = t0.)

8. Show that Problem 7 implies that, given d solutions t→ x(1)(t), . . . , x(d)(t), t∈ I, to

˙x = L(t)x, the matrix W (t) (“Wronskian matrix” of x(1), . . . , x(d)) defined by

Wij(t) = x(i)j (t), i, j = 1, 2, . . . , d, t∈ I

has a determinant w(t) non vanishing for t∈ I if and only if ∃t0∈ I such that w(t0)6= 0.

(Hint. By linear algebra, this is just another way of phrasing Problem 7: d vectors are linearly independent if and only if the “determinant of their components” is not zero.) 9.Using the determinant differentiation rule, by rows, show that

d dtw(t)≡ d dtdet W (t) = Xd i=1 ℓij(t)  w(t); hence, ifPdi=1ℓij(t) = ℓ(t), one has w(t) = w(t0) e

R

t t0ℓ(τ )dτ.

10.In the context of Problem 8, suppose that the matrix function t → L(t), t ∈ R, is periodic with period T > 0, i.e., t→ ℓij(t), i, j = 1, . . . , d are T−periodic functions. Let

x(1), . . . , x(d)be d linearly independent solutions for t > 0. Then there exist d2 constants

A(i)j , i, j = 1, . . . , d, such that

x(i)(t + T ) =

d

X

j=1

A(i)j x(j)(t), t≥ 0. Show that det W (T )/ det W (0) = w(T )/w(0) = det A6= 0.

11.Suppose that the matrix A is similar, via a real nonsingular matrix S, to a real diagonal matrix Λ, Λij= λiδij, i, j = 1, . . . , d: SAS−1= Λ. In the context of Problem 10, define

y(i)(t) =

d

X

j=1

Sijx(j)(t).

Show that y(1), . . . .y(d)are linearly independent solutions, λ

1, . . . , λd6= 0, and

y(i)(t + T ) = λiy(i)(t), t≥ 0

12.Suppose that A is a matrix similar to a diagonal matrix Λ via a complex nonsingu-lar matrix S. Show that y(1), . . . , y(d), defined as in the preceding problem, are complex

solutions of ˙x = L(t)x and that y(i)(t + T ) = λ

iy(i)(t),∀ t ≥ 0. (For applications, recall

that from linear algebra (see Appendix E), a sufficient condition for the similarity between A and a diagonal matrix Λij = λiδij is that the roots λ1, . . . , λdof the secular equation

det(A− λ) = 0 are pairwise different.)

13.Given the assumptions of Problems 10,11 and supposing λ1, . . . , λd> 0, show that the

most general solution to ˙x = L(t)x has the form x(t) =

d

X

j=1

αjλt/Tj z(j)(t)

where the functions z(1), . . . , z(d)are d Cfunctions periodic with period T , and α1, . . . , αd

are arbitrary constants. (Hint: Let z(i)(t) = λ−t/T i y(i)(t).)

(31)

14.Suppose that for every nonzero complex number λ, there exists a C∞ function t

γ(t), t∈ R, such that γ(t + t′) = γ(t)γ(t), γ(0) = 1, γ(T ) = λ−1, γ(t)6= 0 ∀ t ∈ R; then

the conclusions of Problem 13 would hold, replacing λ−t/T by γ(t), without the assumption

λj> 0, j = 11, . . . , d, under the only assumption det A6= 0. See also the following problem.

15.Let λ∈ C, λ−1 = ̺ (cos θ + i sin θ, ̺ > 0, θ∈ [0, 2π]. Define γ(t) = ̺t/T(cos t Tθ +

i sinTtθ). Show that γ(0) = 1, γ(t)γ(t + t′) = γ(t + t), γ(T ) = λ−1, γ(t)6= 0, ∀ t ∈ R (e.g.,

(−1)t/T = cos t

Tπ + i sin t Tπ).

Observations to Problems 8-15.

We shall see that there always exist d linearly independent solutions to ˙x = L(t)x. However, the existence of S is a restrictive condition. When such an S does not exist, it is possible to show that the most general solution to ˙x = L(t)x, with L periodic with period T > 0 and C∞, can be written in the form

x(t) = p X j=1 δ(j)−1X k=0 αjkλt/Tj tkz(j)(t),

where Ppj=1δ(j) = d, and δ(j), λj are suitably chosen, and t→ z(j)(t), t≥ 0, are C∞

functions periodic with period T and possibly complex valued (when λj are not positive

and λt/Tj is interpreted as explained in Problem 15), and αjk are arbitrary constants (see

[38], for instance, Vol. 1, pp. 63-68, ).

16.Consider a differential equation ¨x + a(t) ˙x + b(t)x = 0, t ∈ R, a, b ∈ C∞(R). After

reducing it to a first-order system of two differential equations inR2, interpret the results

of Problems 7-15 in terms of its solutions. Show first that the matrix W (t) associated with this system is expressed in terms of two of its solutions t→ x(1)(t) and t→ x(2)(t) as

W (t) =

x(1)(t) ˙x(1)(t)

x(2)(t) ˙x(2)(t)



and ˙w(t) = a(t)w(t).

17.* Extend Problem 16 to the case of the sth-order differential equation inR: dsx dts + s−1 X j=0 aj(t)d jx dtj, t∈ R.

2.3 General Properties of Motion. Existence

An existence problem for the solutions of Eq. (2.2.4), hence of Eq. (2.2.2), naturally associated with the uniqueness property given in Proposition 1, §2.2, is solved by the following proposition:

2 Proposition. Let F be anRd-valued function in C(Rd

×R). Let x0∈ Rd and t0∈ R. Let S(ξ0, ̺) be the closed ball inRd with center ξ0 and radius ̺. Let θ > 0. There exists T̺,θ > 0 and a solution of Eq. (2.2.4), i.e., ˙x = F(x, t), defined for t∈ [t0− T̺,θ, t0+ T̺,θ] and of class C∞ such that:

(32)

2.3 Existence 19

Furthermore, if one defines: M̺,ξ0,t0,θ def = max ξ∈S(ξ0,̺) t∈[t0−θ,t0+θ] |F(ξ, t)| ≡ M (2.3.2)

one can choose

T̺,θ= ̺

̺ + θM θ. (2.3.3)

Observations.

(1) By Proposition 1, §(2.2), it is enough to show the existence of a C(1) solution verifying Eq. (2.3.1).

(2) The proof that follows is “constructive” in the sense that it provides a sequence t→ x(n)(t), t∈ [t

0− T̺,θ, t0+ T̺,θ], of functions approximating (as n → ∞) the solution and, at the same time, it provides an estimate of the approximation error defined as max|x(t) − x(n)(t)|, where the maximum is taken on the interval [t0− T̺,θ, t0+ T̺,θ].

(3) It is often useful, in applications, not to follow the solution scheme pro-posed by the following proof of Proposition 2. It might, in fact, be more convenient to use ad hoc procedures based on the particular features of the F under analysis in a concrete case. Usually, with such procedures one finds much better error estimates than the ones following from general methods, where one cannot take into account some special properties of the equations (e.g., symmetry properties, Hamiltonian form, etc.).

(4) To understand informally the bound on the magnitude of the interval of existence consider first that, during the proof, it appears necessary to have an a priori control of how far x(t) can travel away from the initial position ξ0. The continuity of F guarantees the boundedness of the maximum of|F(ξ, t)|, for, say, ξ ∈ S(ξ0, ̺), t ∈ [t0 − θ, t0+ θ]. It follows that during the whole time interval [t0− T̺,θ, t0+ T̺,θ], the point x(t) stays inside S(ξ0, ̺) because ˙x(t) = F(x(t), t) and the right-hand side of this relation does not exceed M , Eq. (2.3.2): notice, in fact, that T̺,θ has been chosen, just to achieve this effect, smaller than both θ and ̺M−1 (i.e., T̺,θ = (θ−1+ ̺−1M )−1 so that M T̺,θ < ̺).

(5) The interval [t0− T̺,θ, t0+ T̺,θ] is certainly not optimal, at least because the choice of the set S(ξ0, ̺)× [t0− θ, t0+ θ], where the maximum of |F| is considered, was arbitrary. A better existence interval could be obtained using this arbitrariness and optimizing the result over the possible sets on which one takes the maximum. Also, once the existence of a solution verifying Proposi-tion 2 has been established, one could apply ProposiProposi-tion 2 and ProposiProposi-tion 1 to the equation with initial datum x(t0+T̺,θ) at the initial time t0+T̺,θ, thus continuing it beyond T̺,θ. However one cannot hope, in general, for an infinite existence interval containingR+: this can be seen through counterexamples. The simplest among them is provided by the equation ˙x = x2, x(0) = 1, in

(33)

Proof. Rather than studying C(1)solutions of ˙x = F(x, t) verifying the initial conditions (2.3.1), look forRd-valued C(0)([t

0− T̺,θ, t0+ T̺,θ]) solutions of the equation: x(t) = ξ0+ Z t t0 F(x(τ ), τ ) dτ. (2.3.4) Every C(0)([t

0− T̺,θ, t0+ T̺,θ]) function verifying Eq. (2.3.4) is a C(1) solution to the original equation also verifying Eq. (2.3.1), and vice versa. For t ∈ [t0− T̺,θ, t0+ T̺,θ] define the sequence of Rd-valued functions t→ x(n)(t), n = 0, . . ., through the following recursive scheme:

x(0)(t) = ξ0, x(1)(t) = ξ0+ Z t t0 F(x(0)(τ ), τ ) dτ, . . . x(n)(t) = ξ0+ Z t t0 F(x(n−1)(τ ), τ ) dτ, (2.3.5)

and remark that each such function is in C∞(R) and it s natural to try taking the limit as n→ +∞. The existence, uniformly in t ∈ [t0− T̺,θ, t0+ T̺,θ], of

lim

n→∞x

(n)(t) = x(t) (2.3.6)

should imply that the limit function will also be continuous. Existence and uniformity of the limit is obtained by rewriting it as

x(0)(t) + ∞ X k=1

(x(k)(t)− x(k−1)(t)) (2.3.7)

and deducing that if

µk = max t∈[t0−T̺,θ,t0+T̺,θ]|x (k)(t) − x(k+1)(t)|, then (2.3.8) ∞ X k=0 µk < +∞ (2.3.9)

This will mean that the series of Eq. (2.3.7) is uniformly convergent for t [t0− T̺,θ, t0+ T̺,θ]: hence, the same will hold for the limit of Eq. (2.3.6).

To estimate µk we can refer to Eq. (2.3.5) to obtain for k = 2, 3, . . .,

x(k)(t)− x(k−1)(t) =Z t t0

F(x(k−1)(τ ), τ )− F(x(k−2)(τ ), τ ) (2.3.10) Through Lagrange’s theorem in the form

(34)

2.3 Existence 21 |F(ξ, τ) − F(η, τ)| ≤ L |ξ − η|, ∀ ξ, Bh ∈ S(ξ0, ̺), ∀ τ ∈ [t0− T̺,θ, t0+ T̺,θ] (2.3.11) where L = max ξ∈S(ξ0,̺) t∈[t0−T̺,θ,t0+T̺,θ] d X i,j=1 ∂F(i) ∂ξj (ξ, t) (2.3.12)

Eqs. (2.3.10) and (2.3.11) imply:

|x(k)(t)

− x(k−1)(t)| ≤ LZ [t0,t]

|x(k−1)(τ )− x(k−2)(τ )| dτ (2.3.13) ∀ k = 2, 3, . . . provided we preliminarily check that for all k = 0, 1, . . ., the functions t− → x(k)(t), t∈ [t

0− T̺,θ, t0+ T̺,θ], take their values in S(ξ0, ̺). This last property is proved inductively starting from Eq. (2.3.5): keeping in mind the choice of T̺,θ (chosen, as essentially stated in observation (4), just in such a way to make this property true) suppose, inductively, that |x(h)(t)

− ξ0| ≤ ̺, ∀ h = 0, . . . , k − 1; it is a property which holds for k = 1. To check that|x(k)(t)

− ξ0| ≤ ̺ remark that Eqs. (2.3.5) and (2.3.3) give |x(k)(t) − ξ0| ≤ Z [t0,t] dτ|F(x(k−1)(τ ), τ )| ≤ M ̺,ξ0,θ|t − t0| < ̺ (2.3.14)

Eq. (2.3.13), follows because Eq. (2.3.14) with k = 1 yields for t ∈ [t0 − T̺,θ, t0+ T̺,θ], |x(k)(t)− x(k−1)(t)| ≤ Lk−1 Z [t0,t] dτ1 Z [t0,τ1] dτ2. . . × Z [t0,τk−2] dτk−1|x(1)(τk−2)− ξ0| ≤ Lk−1Tk−1 ̺,θ (k− 1)! ̺ (2.3.15)

since T̺,θ ≥ |t − t0|. Eq. (2.3.15) shows the convergence of the series of Eq. (2.3.9) and, therefore, the limit of Eq. (2.3.6) exists uniformly for t ∈ [t0− T̺,θ, t0+ T̺,θ] and defines a function t→ x(t) on this interval with values in S(ξ0, ̺). It satisfies Eq. (2.3.4) as it is seen by taking the n→ ∞ limit in Eq. (2.3.5) and by using the uniformity of the limit of Eq. (2.3.6) to exchange the

integration with the limit. mbe

2.3.1 Problems

1.Give a lower estimate for the magnitude of T̺,θ, the amplitude of the existence interval

as in Proposition 2, for the following second-order equations, assuming x(0) = 0, ˙x(0) = 1 or x(0) = 1, ˙x(0) = 0 as initial data at t0= 0:

(35)

¨

x = x, x = x + x¨ 3, ¨x = x− ˙x + x3, ¨x =− ˙x2, ¨x ==− sin x.

Also estimate sup̺,θT̺,θ from below. (Hint: Reduce the equation to first order and then

apply Proposition l.)

2.Solve the equation ¨x = x with initial datum x(0) = 1, ˙x(0) = 0.

3.Solve the equations ˙x =−x2, ˙x = cos x, ˙x = (cos x)2with initial datum x(0) = 1.

4.Solve the equation ˙x = x + y, ˙y =−x + 2y with initial datum x(0) = 0, y(0) = 1. 5.Using the “quadrature method”, solve the equation ¨x = 4(x3− x), x(0) = 0, ˙x =2 (see

§2.1, final comment).

6.As in Problem 5 for ˙x =−(4x3+ 6x2− 2), x(0) = 0, ˙x(0) =2.

7.Find two linearly independent solutions for the equation in Problem 4. 8.* Compute w(t) for the equation in Problem 4 (see Problem 8,§(2.2).

9.* Let t→ L(t) be a d × d-matrix-valued C∞ function on R. Show that the equation

˙x(t) = L(t)x(t) admits d linearly independent solutions defined for|t| ≤ T with T small enough. (Hint: Let x(i)be the solution with initial data x(i)

j (0) = δi,j, i, j = 1, . . . d. Then

evaluate an existence interval for such initial data.)

10.* Compute T1,1 for the equation in Problem 9 when |t0| < σ and ξ0 is arbitrary,

ξ0 = x(t0); for the symbols, see Proposition 1. Show that |ξ0| T1,1 can be taken to be

independent of t0and ξ0at a given σ > 0. Deduce from this that every solution to ˙x = L(t)x

can be extended to a solution defined for t∈ R.

11.Let L be a d× d matrix and consider the equation ˙x = Lx in Rd. Suppose that L has

d pairwise distinct real eigenvalues (see Appendix E for the eigenvalue notion) λ1, . . . , λd.

Let v1), . . . , v(d)be the respective real linearly independent eigenvectors (see Appendix E).

Show that the functions t→ eλitv(i)are d linearly independent solutions. Show that any

solution t→ x(t) has the form x(t) =

d

X

j=1

αjeλjtv(j), with (α1, . . . , αd)∈ Rd.

2.4 General Properties of Motion. Regularity.

In proving Proposition 2 it was found that C(1) solutions of ˙x = F(x, t), F C∞(Rd× R), are necessarily Csolutions. This is the simplest regularity property shown by the solutions of such differential equations. Other regularity properties of the solutions will be now analyzed.

In applications it often happens that the right-hand side of Eq. (2.2.4) depends on parameters α ∈ Rm and that, furthermore, it is important to know how the solutions change as the initial data ξ0 and the parameters α vary in Rd and

Rm, respectively. A first answer to this question is provided by the following proposition.

3 Proposition. Let ξ, t, α → F(ξ, t, α) be a C∞(Rd

× R × Rm) function taking its values in Rd, and consider the equation

(36)

2.4 Regularity Regularity 23

x(t) = ξ0+ Z t

t0

F(x(τ ), τ, α0) dτ (2.4.1) as an equation for the continuous function t→ x(t) parameterized by ξ0, t0, α0 ∈ Rd

× R × Rm. Given ̺, θ, a > 0 and (ξ, t, α) ∈ Rd

× R × Rm, there exists T > 0 such that:

(i) Eq. (2.4.1) admits a solution for every (ξ0, t0, α0) close enough to (ξ, t, α) such that|ξ −ξ0| <̺2, |t−t0| < θ2,|α −α0| < a. Such solution will be denoted t→ St(ξ0; t0, α0) and it is defined for t∈ [t0− T, t0+ T ].

(ii) The function St(ξ0; t0, α0), defined for |ξ − ξ0| <

̺

2, |t − t0| < θ

2, |α − α0| < a, |t − t0| ≤ T (2.4.2) takes its values inside the ball S(ξ; ̺) with center ξ and radius ̺ and it is a C∞ function of its arguments.

(iii) The value T can be taken as:

T = ̺

2(̺ + θ max|F(ξ, t, α)|)θ (2.4.3)

where the maximum is considered on the set|ξ−ξ| < ̺2,|t−t| <θ

2,|α−α| < a. Observations.

(1) Eq. (2.4.1) is equivalent to

˙x(t) = F(x(t), t, α0), x(t0) = ξ0 (2.4.4) and, therefore, the above proposition provides a regularity theorem for the solutions of Eq. (2.4.4) as functions of the initial data, of the initial time, of time itself, and of the parameters α on which F may possibly depend. The set (2.4.2) and the key estimate (2.4.3) should not be taken too seriously as they are not optimal: they merely show an example of the type of concreteness that can be attained in the formulation of a regularity criterion (see, also, observation 4, p. 19).

(2) Let β = (β1, . . . , βd+m+2)≡ ((ξ0)1, . . . , (ξ)d, (α0)1, . . . , (α0)m, t, t0) and x(t) = (x1(t), . . . , xd(t)) = St(ξ0; t0, α0)

≡ (St(ξ0; t0, α0)1, . . . , (St(ξ0; t0, α0)d)

(2.4.5) Formal differentiation of Eq. (2.4.4) with respect to βi, i = 1, 2, . . . , m + d, gives d dt ∂x(t) ∂βi = d X h=1 ∂F ∂ξh (x(t), t, α0) ∂xh(t) ∂βi + d X h=1 ∂F ∂αk (x(t), t, α0) ∂αk ∂βi (2.4.6)

Riferimenti

Documenti correlati

Indeed, the extent of ionization depends on the compactness of the three-dimensional structure: while folded proteins are characterized by narrow CSDs (a few peaks)

Urakawa: Rigidity of transversally biharmonic maps between foliated Riemannian manifolds, to appear in Hokkaido Math.. [36] Ye-Lin Ou and Liang Tang: The generalized Chen’s

Ciascuna allerta è stata associata con una riduzione minima relativa nell’uso predetto di antipsicotici atipici: 5% per il primo warning, 4.9% per il secondo e 3.2% per il terzo

One compound can have a higher affinity for the stationary phase with a particular mobile phase and will present a higher retention time than another metabolite with a lower

We employ a set of recent, theoretically motivated fits to nonperturbative unquenched gluon propagators to check on how far double gluon exchange can be used to describe the soft

capitalism that seem to highlight the real historical evolution of capitalism: classical capitalism, market-oriented corporate capitalism, bank-oriented corporate

NOTE: In the solution of a given exercise you must (briefly) explain the line of your argument and show the main points of your calculations1. Solutions without adequate

Description: the space including the object is partitioned in cubic cells; each cell contains references to “primitives” (i.e.. Primitives are