Classical mathematical analysis involves an intriguing interplay between finite and infinite collections and between discrete and continuous structures. What makes the interplay intriguing is the emergence from the most elementary considerations of results that are paradoxical, or, at least, utterly counterintuitive. What, for example is the sum of 1 + 2 + 3 + … + n + …?
The sum?
Surely this sequence of numbers is simply getting bigger and bigger; and beyond this, it is not converging to anything.
Not so. The sum is –1/12.
Niels Abel called divergent series the Devil’s invention.1 Having made outstanding contributions to the subject as the Devil’s plaything, he knew what he was talking about.
Infinite Sequences
Mathematical analysis begins by considering infinite sequences of real numbers and by defining their limit. For the moment, I will consider the real numbers and only the real numbers. An infinite sequence is an arbitrary function whose domain coincides with the set ℕ = {0, 1, 2, …, n, …} of natural numbers, or with some of its infinite subsets, and whose range is contained in the real line R. Thus {an:n∈N} or more explicitly, if less compactly
a0,a1,a2,…,an,….
The terms an of such sequences are often generated by a fixed rule, or by applying a specific algorithm to each n.
Limits are next. A real number a is a limit of a sequence {an:n∈N} if, for any ε>0, there exists a natural number n0=n0(ε) such that |an-a|<ε whenever n>n0. If this is the case, the sequence converges (or tends) to a as n tends to infinity: a=lim{an:n∈N}. It is easy to show that a is unique.2
The definition of a limit having been given—by Augustin-Louis Cauchy, as it happens—mathematicians wished naturally to know whether and under what conditions such limits existed. The classical criterion—by Cauchy again, as it happens again—affirms that {an:n∈N} is convergent if, and only if, for any ε>0 there exists a natural number n0=n0(ε) such that |an-am|<ε whenever n>n0 and m>n0.3
Among the glories of mathematical analysis, no one beyond Cauchy has ever found these ideas easy, and no one beyond Leonhard Euler has ever found them obvious.
The Cauchy criterion reflects the completeness property of the real numbers, from which necessary and sufficient condition for the existence of a limit follow almost at once. Many other conditions for convergence are more convenient in practice. If a sequence of real numbers is monotone and bounded, then it necessarily has a limit.
If two sequences {an:n∈N} and {bn:n∈N} converge to a and b respectively, then, for any real r and t, the sequence {ran+tbn:n∈N} converges to ra+tb.
Suppose that, for two sequences {an:n∈N} and {bn:n∈N}, two natural numbers m and n can be found such that am+i=bn+i whenever i∈N. Then {an:n∈N} converges if and only if {bn:n∈N} converges and, in this case, so does the equality
lim{an:n∈N}=lim{bn:n∈N}
The convergence of a sequence of real numbers depends only on its tail.
Example 1.
The sequence 1,21/2,31/3,…,n1/n,… converges and its limit is equal to 1.
Example 2.
The sequence {an:n∈N}, where an=(1+1/n)n for n>0, converges to Napier’s constant, e = 2.71828… .
Example 3.
The sequence {an:n∈N}, where an=1+1/2+…+1/n-ln(n) for n>0, converges to Euler’s constant, C = 0.57721… .
If there exists no limit of a sequence {an:n∈N}, then the sequence is divergent. In some sense, most real sequences are divergent.
Example 4.
Any unbounded sequence is divergent; in particular, the unbounded sequence {bn:n∈N}, where bn=1+1/2+…+1/n for n>0.
Example 5.
The sequence {an:n∈N}, where an=1 for all even n, and an=-1 for all odd natural n, is divergent.
Example 6.
The sequence {an:n∈N}, where an=(n!)1/n for n>0, is unbounded, and thus divergent.
Sometimes it is convenient to treat certain divergent sequences as if they were converging to infinity. A sequence {an:n∈N} tends to +∞ if, for every r>0, there exists an n0=n0(r) such that an>r whenever n>n0. A sequence {bn:n∈N} tends to –∞ if, for every r<0, there exists an n0=n0(r) such that bn<r whenever n>n0.
Given a sequence {an:n∈N}, one can associate another sequence {sn:n∈N} to it, where sn=Σ{ak:k≤n} denotes the sum of the first n+1 terms of {an:n∈N}. Observe that the initial sequence {an:n∈N} is completely determined by {sn:n∈N}, because an=sn-sn-1, a point valid for all n∈N. The pair ({an:n∈N},{sn:n∈N}) is called an infinite series, a series for short.4 With far less by way of notational clutter, mathematicians write
a0+a1+a2+…+an+… .
A series a0+a1+a2+…+an+… is convergent if the corresponding sequence {sn:n∈N} has a finite limit s, the sum of the series; otherwise, it is divergent.
Cauchy’s convergence criterion now follows: an infinite series has a sum if and only if, for any ε>0, there exists a n0=n0(ε) such that |an+1+an+2+…+an+m|<ε whenever n>n0 and m∈N.
The convergence of a0+a1+a2+…+an+… implies that
lim{an:n∈N}=0.
In the field, where time is short, simpler tests are used: D’Alembert’s test, the nth root test, Leibniz’s test, Raabe’s test, Kummer’s test, Bertrand’s test, Gauss’s test, the integral test, Abel’s test, Dirichlet’s test, Ermakov’s test.5
There are sufficiently many tests to hold an endless series of undergraduate examinations.
Example 7.
The series 1/12+1/22+1/32+…+1/n2+… converges, because all of its terms 1/n2, where n>1, are less than 1/(n(n-1))=1/(n-1)-1/n, and the series
1+1/2+1/6+1/12+…+1/(n(n-1))+…
converges to 2. Here the so-called comparison test does its work. A more delicate calculation shows that the sum of 1/12+1/22+1/32+…+1/n2+… is equal to π2/6.6 This is a result well hidden from anyone’s intuitions.
As one might expect, series come in special types. There are geometric series, harmonic series, hyper-geometric series, alternating series, and many more besides. In many cases, a series of the form
a0(x)+a1(x)+a2(x)+…+an(x)+… ,
is up for grabs, where an(x)(n∈N) are real-valued functions of a real variable x. For each value r of x, one has the usual series
a0(r)+a1(r)+a2(r)+…+an(r)+… .
Did I mention power series, Taylor series, Laurent series, Dirichlet series, trigonometric series, Fourier series, and the ever-popular binomial series?
Example 8.
The geometric series 1+q1+q2+…+qn+… , where |q|<1.
Example 9.
Grandi’s series: 1-1+1-1+…+(-1)n-1+… and, more generally, for any strictly positive integer k, the series 1-k1+k2-k3+…+(-1)n-1kn-1+… .
Example 10.
The series 0!-1!+2!-3!+…+(-1)nn!+… .
Example 11.
Dirichlet’s series:
f(x)=a1/1x+a2/2x+a3/3x+…+an/nx+… ,
where x∈R, and {an:n∈N} is a sequence of real numbers. In particular, the generalized harmonic series, which is a part of Riemann’s zeta fuction, appears almost everywhere:
ζ(x)=1/1x+1/2x+1/3x+…+1/nx+… ,
and the generalized alternating harmonic series, which is a part of Dirichlet’s eta function
η(x)=1/1x-1/2x+1/3x-…+(-1)n-1/nx+… ,
is obtained in this way. The series 1/1x+1/2x+1/3x+…+1/nx+… converges for all real x>1, and the series 1/1x-1/2x+1/3x-…+(-1)n-1/nx+… converges for all real x>0.
Example 12.
A series a0+a1+a2+…+an+… is called absolutely convergent if the series |a0|+|a1|+|a2|+…+|an|+… converges. Every absolutely convergent series converges.
The converse is not true, alas. The alternating harmonic series
1-1/2+1/3-1/4+…+(-1)n-1(1/n)+… ,
converges and its sum is equal to ln(2),7 but the same series is not absolutely convergent, as Example 4 might indicate.
A convergent series a0+a1+a2+…+an+… is conditionally convergent if its associated series |a0|+|a1|+|a2|+…+|an|+… diverges.
If a series converges conditionally, then for any real r there exists a permutation of its terms such that the new series converges to r. This is Riemann’s classical theorem, which admits a nontrivial generalization to a series of vectors in a finite-dimensional Euclidean space.
Strangely Divergent Series
Strangely divergent series such as 1+2+3+4+… are straightforward inasmuch as they are divergent; they are strange inasmuch as they are not. This is a claim that would seem to tiptoe to the very edge of paradox. Nonetheless, Euler dealt systematically with strangely divergent series; he was able to manipulate them successfully. After many clever manipulations, Euler thought to claim that any divergent series must have a certain sum. Euler’s position was criticized by the fastidious Jean D’Alembert, the equally fastidious Abel, and by other mathematicians, all of them indifferent to the possibility that Euler might have been right and they wrong. Only much later was some rational fragment of Euler’s idea transformed into a rich theory of the summability of divergent series.
Many divergent series; and so many sums; many sums, and so many summability methods: Abel’s summation method, Borel’s summation method, Cesaro’s summation method, Euler’s summation method, Holder’s summation method, Kummer’s summation method, Lambert’s summation method, Lindelof’s summation method, Poisson’s summation method, Voronoy’s summation method, and many others.8
The simplest method of summation is named after Ernesto Cesaro. If a series a0+a1+…+an+… is given, then it may sometimes happen—not always, not in every case—that the corresponding sequence of arithmetical means of its partial sums
s0,(s0+s1)/2,(s0+s1+s2)/3,… ,
(s0+s1+…+sn)/(n+1),… ,
converges to a certain real number. This is its Cesaro sum.
The notion of a Cesaro sum is compatible with the usual notion of the sum of a convergent series. Indeed, according to one of Cauchy’s classical theorems, if a sequence of reals has a limit, then the sequence of the arithmetical means of these reals also converges and has the same limit. Simple counterexamples show that the converse is not true.
It should also be noticed that the Cesaro’s summability method inspired tauberian results in the general theory of series. Those results allow mathematicians to establish the convergence of a given series a0+a1+…+an+… , if a0+a1+…+an+… is summable by some method and satisfies certain additional constraints.9
After loitering in the Devil’s toolshed, Abel suggested a technique now known as Abel’s summation method, or, sometimes, the Abel-Poisson summation method. The Devil, it is well known, has work for idle hands.
Suppose that a series a0+a1+…+an+… is given and, for a variable x ranging over the open interval (0,1), consider the associated power series
a0+a1x1+a2x2+…+anxn+… .
It may happen that this series converges for each r in (0,1), and therefore determines a concrete function f(x) on the interval. It may happen that there exists a limit of f(x) as x tends to 1. Such is the series’ Abel sum. Cesaro’s method is compatible with the usual notion of the sum of a convergent series. So is Abel’s method. If a series a0+a1+…+an+… converges to s, then, by virtue of Abel’s classical theorem,10 the Abel sum of a0+a1+…+an+… is also equal to s. The converse fails.
The Devil’s checks are never blank.
Example 13.
For any strictly positive integer k, the Abel method implies that the divergent series
1-k1+k2-k3+…+(-1)n-1kn-1+… ,
sums to 1/(k+1), and, in particular, the Grandi’s series
1-1+1-…+(-1)n-1+…
sums to 1/2. The Grandi’s series is also summable by Cesaro’s method and its Cesaro’s sum is again equal to 1/2. This is not simply a happy coincidence. Abel’s method is substantially stronger than Cesaro’s. On the other hand, for a natural number k strictly greater than 1, the series
1-k1+k2-k3+…+(-1)n-1kn-1+… ,
is not Cesaro summable.
There are many other examples of series that are summable by Abel’s method and not by Cesaro’s method.
Example 14.
Consider the series 1-2+3-4+…+(-1)n-1n+… which is trivially divergent and not Cesaro summable. Abel’s method works, but Cesaro’s method lapses. Indeed, if 0<x<1, then the series 1-2x1+3x2-4x3+… converges, and its ordinary sum is equal to 1/(x+1)2. If x=1, Abel’s method promptly yields the equality 1-2+3-4+(-1)n-1n+…=1/4.
Abel’s method is very helpful in dealing with the products of infinite series. What follows represents one formula, but two series:
a0+a1+…+an+… ,
b0+b1+…+bn+… .
Their product c0+c1+…+cn+… is defined by the equality
cn=a0bn+a1bn-1+a2bn-2+…+anb0 (n∈N)
The product of two conditionally convergent series is not, in general, convergent. But if both series a0+a1+…+an+… and b0+b1+…+bn+… are Abel summable, then their product c0+c1+…+cn+… is also Abel summable, and equal to the product of the Abel sums of a0+a1+…+an+… and b0+b1+…+bn+… .
This is a wonderfully elegant result, and, perhaps, the simplest result accessible to intuition that suggests that the techniques contrived by definition have a rich internal structure.
Example 15.
Using the fact that the sum of products is the product of sums—above, just above—there is, at once, the equality
(1-1+1-1+…)(1-1+1-1+…)
=1-2+3-4+… .
Example 16.
Emil Borel created, or discovered, several important summation methods. If
a0+a1+…+an+… ,
is a series trotting off into the unknown, then the function
f(x)=e-x(Σ{snxn/n!:n∈N}) ,
makes sense—perfect sense, in fact—applied to the set of real numbers x, for which this function is well defined. It may happen that f(x) has a finite limit y as x tends to +∞. If so, y is called Borel’s sum of the series. If a series a0+a1+…+an+… is convergent, then it is Borel summable, too, and its usual sum coincides with its Borel sum.
The proof is easy.
What follows involves the same Borel, but a different method: Borel integral summation.
Example 17.
The series 0!-1!+2!-3!+…+(-1)nn!+… has as its Borel sum
∞∫0e-t1+tdt.11
Example 18.
Consider the partial Riemann zeta function ζ(x) and the partial Dirichlet’s eta function η(x) for all real numbers x strictly greater than 1. The convergence of
1/1x+1/2x+1/3x+…+1/nx+… ,
1/1x-1/2x+1/3x-…+(-1)n-1/nx+… ,
readily implies:
η(x)=(1-2(1-x))ζ(x).
Even better, one can consider 1/1x-1/2x+1/3x-…+(-1)n-1/nx+… for complex numbers z. Replacing x with z, one obtains 1/1z-1/2z+1/3z-…+(-1)n-1/nz+…, which converges to a complex value whenever the real part of z is strictly greater than 1. That η(x) admits an analytic continuation on the whole complex plane C is a provable fact of life. On top of this, there is
η(z)=(1-2(1-z))ζ(z),
where ζ(z) is an analytic function on the open set C\{1}, extending ζ(x). By remembering that η(-1)=1/4, one is finally able to discern that, against every expectation,
1+2+3+…+n+…=ζ(-1)=-1/12.
This is an extraordinary result in mathematics, but it is also an extraordinary result in mathematics that has nontrivial applications in contemporary theoretical physics.
Did the inspired Euler, blind since middle age, who saw so much, see this?
The Theory of Infinite Summation
Suppose that a sequence {gn:n∈N} of functions is given on some subset D of R such that there exists an accumulation point d of D. Suppose also that, for each n∈N, there exists a limit of gn(t) as t∈D tends to d, and that this limit is equal to 1.
Consider a series a0+a1+…+an+… and let the function series
g(t)=a0g0(t)+a1g1(t)+…+angn(t)+… (t∈D)
be its natural associate.
It may happen that the function g(t) is well-defined on some neighbourhood of the point d and, in addition, that there exists a limit s=s(a0+a1+…+an+…) of g(t) as t∈D tends to d. The series S=a0+a1+…+an+… is summable, and its generalized sum is equal to s.
Example 19.
Assume that a sequence of functions {gn:n∈N} satisfies these two conditions:
- all functions from {gn:n∈N} are uniformly bounded; and
- for each t∈D, the sequence {gn(t):n∈N} is monotone.
If S converges to s, then S has s as its sum.
Many classical summation methods correspond to the appropriate choice of a sequence of functions satisfying 1) and 2). For Cesaro’s summation method, the functions gn(n∈N) can be defined on N in the following way:
gn(m)=(m+1-n)/(m+1) if n≤m,
and gn(m)=0 if n>m
For Abel’s summation method, the functions gn(n∈N) can be defined on the open interval (0,1)
gn(t)=tn (0<t<1)
Taking the positive ray (0,+∞) of R as D, and defining the functions gn(n∈N) by the formula
gn(t)=1-(∑{tk/k!:k<n})/et (t>0)
one gets a certain version of Borel’s summation method.
There are other general schemes.12 Not everything is smooth sailing. Two different schemes may prove inconsistent in the sense that, for some concrete series, they yield two distinct generalized sums.
Example 20.
Quite often a summability scheme is formulated in terms of a real-valued matrix T with countably many rows and columns. T-summability methods transform one infinite sequence into another in the expectation, or hope, that the transformation will yield a sequence better behaved so far as convergence goes. The celebrated Silverman–Toeplitz theorem establishes the necessary and sufficient conditions under which the class of all convergent sequences is transformed by T into itself.13
Among the many interesting and important statements about matrix summability, I wish to mention one result.14 For a given matrix T and a series a0+a1+…+an+… , consider all those rearrangements of the terms of a0+a1+…+an+… that yield T-summable series, and denote by the symbol S(a0+a1+…+an+… , T) the set of all T-sums obtained in this way. It turns out that, changing a0+a1+…+an+… and T in all possible ways, the family of all sets S(a0+a1+…+an+… , T) yields the family of all analytic, or Suslin, subsets of R. This family is much wider than the family of all Borel subsets of R.15
Various summability schemes lead naturally to a large family of point sets in R, and are connected with more or less delicate facts of classical descriptive set theory. Connections of this sort even touch upon the problem of the Lebesgue measurability of all subsets of R.
From a more general point of view, any summability method may be described as a functional f defined on some vector subspace L = dom(f) of the space RN of all infinite real sequences. The elements of L are of the form {sn:n∈N}, where sn(n∈N) are partial finite sums of a series. As a rule, summability methods satisfy both linearity and regularity conditions. Linearity means that f is a linear functional on L; regularity, that f extends the standard linear functional limit defined on the space c of all convergent sequences in RN.
Sometimes, a third condition is imposed, a certain analogue of translation invariance. If s is a sum of a series a0+a1+…+an+… and r is a sum of a1+…+an+… , then s=a0+r.
Example 21.
No summability method satisfying the translation condition can be applied to the series 1+1+1+…+1+… . The proof is obvious: s=1+(1+1+1+…+1+…)=1+s, which implies that 0=1.
Now, if a sequence {tn:n∈N} does not belong to L and r is an arbitrary real number, the functional f can be extended to a linear functional g defined on a larger vector subspace K of RN such that
{tn:n∈N}∈K, g({tn:n∈N})=r.
This statement admits a constructive proof, one that does not rely on the axiom of choice.
Euler Vindicated
Euler’s wonderful intuition now stands vindicated:
For any concrete divergent series, there exists a linear and regular summability method that assigns a real number to the series as its sum.
A reservation follows at once. There exists no linear summability method satisfying the translation condition that assigns a finite sum to 1+2+3+…+n+… .
Suppose that
1+2+3+…+n+…=s,
where s is a real number. Then:
s-1=2+3+…+n+… , and
-1=(s-1)-s=(2-1)+(3-2)+…+(n+1-n)+…
=1+1+1+…+1+… ,
which is impossible.
If one wishes to assign certain generalized sums to all divergent series all at once, then one must appeal to strong forms of the axiom of choice. A standard application of Zorn’s lemma, for example, shows that that there exists a linear regular summation method whose domain is the whole of RN. This result is useless in practice because of its extremely non-constructive character. On the other hand, it follows from a fundamental result by Robert Solovay that, under countable forms of the axiom of choice, it is logically consistent to assume that the domain of any linear regular summability method is a first category subspace of RN.16 Since RN is a Polish topological space, RN is of second category on itself. Consequently, the domain of any constructively defined linear and regular summability scheme is a very small part of RN. Within the same weak fragment of set theory, one may assume that the class of all linear regular summation methods, equipped with its natural partial ordering, does not possess maximal elements.
Using uncountable forms of the axiom of choice, mathematicians were able to construct some remarkable linear functionals on various vector subspaces of RN. Among them the best known is the Banach functional limit, or Lim, defined on the space of all bounded real sequences, Lim is linear, extends the standard functional limit, and is translation invariant.17 But the existence of such functionals always implies the existence of ultimately pathological subsets of R, namely, sets non-measurable in the Lebesgue sense or sets not having the so-called Baire property.18
Example 22.
Consider the following function series FS:
cos(t)+cos(2t)+cos(4t)+…+cos(2nt)+… ,
where a variable t ranges over R. Assume that for each t from a Lebesgue measurable subset of R with strictly positive measure, that FS sums to f(t) by some linear, regular summation scheme satisfying the translation condition. It follows that f is non-measurable in the Lebesgue sense. Such is Kolmogorov’s theorem.19
Solovay’s profound result shows that there is no hope of effectively constructing a non-Lebesgue measurable subset of R or,20 equivalently, a non-Lebesgue measurable function acting from R into R.21