Dual to the problem addressed in the previous post is that of when the result of the fundamental theorem of calculus holds true, namely that . It turns out the condition of absolute continuity defined at the end of the last post is sufficient. First, define the variation of a complex-valued function over a partition to be . It is easy to see that variation increases in the “fineness” of the partition; we say that a function is of bounded variation if the supremum of the variation over all partitions, the total variation , is finite. This will be the case, roughly speaking, for functions that do not oscillate too widely or too frequently. We can see that real, monotonic, bounded functions, as well as differentiable and bounded functions, are of bounded variation.
The first result of this post will show some of the motivation for studying these functions.
Result 1: Bounded variation implies differentiability almost everywhere.
Proof: To prove this, let’s first narrow our focus solely to increasing functions. We can do this because of the following characterization of functions of bounded variation: they are precisely the functions that are differences of two increasing bounded functions. One direction is obvious, that the difference of two increasing bounded functions is of bounded variation. To prove the other direction, define positive and negative variation and over the interval and , where the sums are taken over all positive and negative differences, respectively. We immediately have that and . Then to prove the other direction of our claim, simply take the functions to be and .
Next, we’ll prove the lemma that forms the crux for our proof of almost-everywhere differentiability.
Riesz’ Lemma: For a real, continuous function , let be the set of points such that there exists to the right of some such that . is the union of open intervals so that on each interval, takes the same value at both endpoints except in the case of the interval starting at for which .
Proof: The proof is evocative and beautiful: think of the function as a series of rolling hills and imagine a sun shining to the right of the function, The shadows on the hills are precisely the intervals composing , and each contiguous shadow certainly starts where it ends except for the first one! The other way to prove this is just to apply the intermediate value theorem repeatedly.
Returning to the proof, define the coarse differential quantity and the four Dini numbers , , , to be the limit superior of from the right and left, and the limit inferior of from the right and left, respectively. Because we have that , it suffices to prove that is finite and bounded above by .
It is time to use our lemma: for a given , let be the set of for which . We want to show that the measure of shrinks arbitrarily as becomes large, which would imply that almost everywhere. Apply Riesz’ lemma to the function and note that is part of the union of open intervals described in the lemma: if a point satisfies , then for some and thus there exists to the right of some such that . So the measure of is bounded by where so that . Thus , and because is increasing, this is bounded above by , proving as desired that becomes arbitrarily small as increases.
We use the same approach to prove that almost everywhere: define a set that can be used with Riesz’ lemma and prove it’s of measure zero. For fixed , let be the set where and . Because this is for arbitrary , we just need to prove . To the contrary, assume that . Then there is some between and such that . is the union of disjoint open intervals , so pick one; we’ll use Riesz’ lemma twice. Basically, in each , we’re going to construct a set which is small enough relative to (in fact, by a factor of ) but still contains in . Then , a contradiction stemming from the strict inequality which arises from our false assumption that .
Because we are approaching from the left in the case of and our lemma considers high values lying on the right, we reflect through the origin by considering and obtain a union of open intervals, and reflecting these back through the origin, we get some such that and thus . For each of these subintervals , further apply the Riesz lemma using to get another layer of subintervals so that . Taking the union of all these subintervals, we get a set whose measure we can deduce, by the fact that is increasing, to be bounded above by , as desired.
Now that we have differentiability almost everywhere, we can approximate by convergents and then invoke Fatou’s lemma on these convergents to get that for increasing and continuous, . By our characterization of functions of bounded variation, this inequality extends to those functions as well.
But it turns out we can do no better than this inequality without narrowing our focus further. The Cantor-Lebesgue function is one such counterexample; the basic premise is to construct a series of continuous “stairstep functions” where the steps come at the endpoints in the Cantor set convergents. In this case the derivative is almost everywhere zero, but . Specifically, define the convergents of the Cantor-Lebesgue function such that , on the complement of , and is linear on . An image I like to use to visualize this is that we’re climbing from height 0 to height 1 in such a way that even when we make it to 1 at the end, our vertical progress has been so slow that it might seem like we’d traveled nowhere.
What it takes to get the fundamental theorem of calculus to hold is the stronger condition of absolute continuity. Recall that the generalized definition is that for any there exists a such that implies . In the one-dimensional case, a function is absolutely continuous if for every there exists a such that (i.e. \ for all disjoint intervals such that their combined length is less than .
It is easy to see that absolute continuity implies uniform continuity and, more importantly, bounded variation and thus differentiability almost everywhere. We will prove that if the derivative of an absolutely continuous function is zero almost everywhere, the function is constant. This gives our desired theorem, one direction of which was proven in the last post:
Result 2: If is absolutely continuous, then . On the other hand, if is integrable, then is absolutely continuous and .
Proof: In the former direction, the equation to be proven makes sense because the integral of the derivative, which for convenience let’s denote by , by our bound from Fatou’s lemma, is finite and functions of bounded variation are differentiable almost everywhere. But then we can use the first main result of our last post: local integrability implies that the limit of almost everywhere, so in other words, almost everywhere, so is constant by what we will prove later. So .
In the other direction, we already know this by the Lebesgue differentiation theorem of the last post.
To conclude this post, we prove that zero derivative almost everywhere implies constant.
Proof: It suffices to prove that at the endpoints of the interval , takes on the same value, because then we can restrict to any subinterval we wish. Let be the set of for which ; by hypothesis it has measure . By definition, we can find for every and every a neighborhood of width around such that . This is an example of a Vitali covering, a covering by balls for every point in the set and every radius , there is a ball containing that point and of measure less than . It turns out that, roughly speaking, for any Vitali covering we can find a disjoint sub-collection of balls whose combined measure is arbitrarily close to that of the set being covered, i.e. for any there are finitely many disjoint such that .
If we can prove this, then we can find a disjoint sub-collection of intervals such that and such that which can be made arbitrarily. Because we’re able to shrink the complement as we wish by our Vitali result, by absolute continuity the same sum of over all the intervals making up the complement can be made arbitrarily small, so we are done.
So it remains to show the result for Vitali coverings. In fact we can make use of our covering argument from the previous post: given a finite collection of balls, we can get within a factor of using a finite sub-collection. The argument is then simply: i) pick a subset which si compact and has at least measure , ii) cover it by finitely many balls and invoke the result to find a sub-collection which is at least , iii) if this is at least , we’re done, otherwise repeat on minus the sub-collection of the current balls’ closures and repeat for all balls in the original Vitali covering which do not intersect any of the .