
STAT 804: Lecture 19 Notes

Processes with Periodic Components

Some of the series we have looked at have had clear annual cycles, returning to high levels in the same month every year. In our analysis of such processes we have tried to model the mean $\mu_t$ as a periodic function, $\mu_{t+12} = \mu_t$. Sometimes we have fitted specific periodic functions to $\mu_t$, writing, for example, $\mu_t = \mu + \alpha\cos(2\pi t/12) + \beta\sin(2\pi t/12)$.

Another process we studied, that of sunspot numbers, also seems to show a clearly periodic component, though now the frequency or period of the oscillation is not so obvious. In this section of the course we investigate the notion of decomposing a general stationary time series into simple periodic components. We will take these components to be cosines and sines. We will be focussing on problems in which the period is not prespecified, that is, problems more like the sunspot data than the annual cycle examples.

For a statistician, the simplest description of what we will do is to say that we will examine the correlation between our series and sines and cosines of various periods. We will use these correlations in several ways:

Periodic Functions

A periodic function f on the real line has the property that f(t+d)=f(t) for some d>0 and all t. The smallest such d is the period of f. The frequency of f, in cycles per time unit, is 1/d. The most famous periodic functions are the trigonometric functions $\sin(\omega t)$ and its relatives. This function has period $2\pi/\omega$ and frequency $\omega/(2\pi)$ cycles per time unit. Often, for trigonometric functions, it is convenient to refer to $\omega$ as the frequency; the units now are radians per time unit.

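The achievement of Fourier was to recognize that essentially any function f with period 1 can be represented as a sum of functions $\cos(2\pi k t)$ or $\sin(2\pi k t)$. The tactic is to suppose that

$$f(t) = a_0 + \sum_{k=1}^\infty \left\{ a_k \cos(2\pi k t) + b_k \sin(2\pi k t) \right\} \qquad (1)$$

To discover the values of the coefficients we make use of the orthogonality properties, valid for integers $j, k \ge 1$:

$$\int_0^1 \cos(2\pi j t)\cos(2\pi k t)\,dt = \begin{cases} 1/2 & j = k \\ 0 & j \ne k \end{cases}$$

$$\int_0^1 \sin(2\pi j t)\sin(2\pi k t)\,dt = \begin{cases} 1/2 & j = k \\ 0 & j \ne k \end{cases}$$

and

$$\int_0^1 \sin(2\pi j t)\cos(2\pi k t)\,dt = 0.$$

Now multiply f(t) by say $\sin(2\pi k t)$ and integrate from 0 to 1. Expanding the integral using the supposed expression of f as a sum gives us

$$\int_0^1 f(t)\sin(2\pi k t)\,dt = \frac{b_k}{2}.$$

Similarly $\int_0^1 f(t)\cos(2\pi k t)\,dt = a_k/2$ for $k \ge 1$, while $\int_0^1 f(t)\,dt = a_0$.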

Mathematically the fact that we can derive a formula for the coefficients is far from proving that the resulting sum actually represents f; the key missing piece of the proof is that any function whose Fourier coefficients are all 0 is essentially the 0 function.
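As a rough numerical illustration of the coefficient formulas, the sketch below approximates the integrals by Riemann sums. It assumes the expansion is written with a constant term a_0 plus cosine coefficients a_k and sine coefficients b_k; the function `example_f`, the helper name `fourier_coefficients` and the grid size `n` are hypothetical choices made only for this example.

```python
import math

def fourier_coefficients(f, k, n=2000):
    """Approximate (a_k, b_k) for a period-1 function f by Riemann sums on [0, 1)."""
    ts = [m / n for m in range(n)]
    if k == 0:
        # a_0 is the plain average of f over one period; there is no b_0.
        return sum(f(t) for t in ts) / n, 0.0
    a_k = 2 * sum(f(t) * math.cos(2 * math.pi * k * t) for t in ts) / n
    b_k = 2 * sum(f(t) * math.sin(2 * math.pi * k * t) for t in ts) / n
    return a_k, b_k

# Hypothetical test function built so that a_0 = 1, a_2 = 3, b_5 = -0.5
# and every other coefficient is 0.
def example_f(t):
    return 1 + 3 * math.cos(4 * math.pi * t) - 0.5 * math.sin(10 * math.pi * t)

for k in (0, 2, 5, 7):
    print(k, fourier_coefficients(example_f, k))
```

The recovered values should agree with the coefficients used to build `example_f` up to rounding error, which is only a sanity check on the formulas and not a proof that the series represents f.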

Correlation between functions

The integrals in the previous section can be thought of as analogous to covariances and variances. For instance a Riemann sum for

$$\int_0^1 \sin(2\pi j t)\sin(2\pi k t)\,dt$$

is

$$\frac{1}{T}\sum_{t=0}^{T-1} \sin(2\pi j t/T)\sin(2\pi k t/T)$$

which is an average product. In fact it is possible to show that

$$\frac{1}{T}\sum_{t=0}^{T-1} \sin(2\pi k t/T) = 0$$

so that the average product is just a "sample" covariance. It is also possible to evaluate the average product exactly to see that, for integers $1 \le j, k < T/2$,

$$\frac{1}{T}\sum_{t=0}^{T-1} \sin(2\pi j t/T)\sin(2\pi k t/T) = \begin{cases} 1/2 & j = k \\ 0 & j \ne k \end{cases}$$

exactly. When j=k this becomes a variance, equal to 1/2, so that the correlation is just the covariance times 2, which is 0 in any case when $j \ne k$.

Interpreting all the integrals above, then, as covariances we see that all the sines are uncorrelated with each other and with all the cosines and all the cosines are uncorrelated with each other.
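The discrete version of these statements is easy to check numerically. The following is a minimal sketch, assuming the sinusoids are sampled at t = 0, ..., T-1 with integer frequencies between 1 and T/2; the value T = 24 and the helper names are arbitrary choices for the illustration.

```python
import math

T = 24  # hypothetical series length

def sine(j):
    return [math.sin(2 * math.pi * j * t / T) for t in range(T)]

def cosine(j):
    return [math.cos(2 * math.pi * j * t / T) for t in range(T)]

def average_product(u, v):
    """Average of the elementwise products: a sample covariance when the means are 0."""
    return sum(a * b for a, b in zip(u, v)) / len(u)

def correlation(u, v):
    """Sample correlation for series whose sample means are 0."""
    return average_product(u, v) / math.sqrt(average_product(u, u) * average_product(v, v))

print(sum(sine(3)) / T)                      # sample mean of a sine
print(average_product(sine(3), sine(3)))     # the "variance"
print(average_product(sine(3), sine(5)))     # two different sines
print(average_product(sine(3), cosine(3)))   # sine and cosine, same frequency
```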

Notice particularly that the sine with frequency j and the cosine with frequency j are uncorrelated. This has an important implication for looking for components at frequency j cycles per time unit in a time series: if we want a certain frequency we have to consider both the cosine and the sine at that frequency. An alternative summary of what we need is to consider the trigonometric identity

$$\sin(\omega t + \phi) = \cos(\phi)\sin(\omega t) + \sin(\phi)\cos(\omega t).$$

When we look for a component with frequency $\omega$ we will allow ourselves to adjust the number $\phi$, called the phase, in order to maximize the correlation with our data. This is equivalent to adjusting the coefficients $\cos(\phi)$ and $\sin(\phi)$ to maximize a correlation with the right hand side of the trigonometric identity.
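To see why both the sine and the cosine are needed, here is a small sketch that recovers the amplitude and phase of a pure sinusoid from its average products with the sine and cosine at the same frequency. It assumes a noise-free series of the form R sin(omega t + phi) observed at a frequency of the form 2 pi j / T; the numbers `true_amplitude` and `true_phase` are made up for the example.

```python
import math

T, j = 50, 4                      # hypothetical length and cycles per T points
omega = 2 * math.pi * j / T
true_amplitude, true_phase = 2.0, 0.7

y = [true_amplitude * math.sin(omega * t + true_phase) for t in range(T)]

# Twice the average product with each basis function. The factor 2 undoes
# the "variance" 1/2 of a sampled sine or cosine.
coef_sin = 2 * sum(y_t * math.sin(omega * t) for t, y_t in enumerate(y)) / T
coef_cos = 2 * sum(y_t * math.cos(omega * t) for t, y_t in enumerate(y)) / T

# coef_sin estimates R cos(phi) and coef_cos estimates R sin(phi).
amplitude = math.hypot(coef_sin, coef_cos)
phase = math.atan2(coef_cos, coef_sin)
print(amplitude, phase)
```

Using only the sine would give a coefficient of R cos(phi), which can be near 0 even when a strong component at that frequency is present.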

Complex Exponentials

Many of the identities in this subject are more easily derived using complex variables. In particular, the identity

$$e^{i\theta} = \cos\theta + i\sin\theta$$

where $i = \sqrt{-1}$ permits any series in sines and cosines to be rewritten in terms of exponentials. We can then often use tricks involving geometric sums to simplify our algebra.

For instance we can write

$$\cos\theta = \frac{e^{i\theta} + e^{-i\theta}}{2}$$

and

$$\sin\theta = \frac{e^{i\theta} - e^{-i\theta}}{2i}.$$

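These permit us to rewrite the expansion (1) in the form

$$f(t) = \sum_{k=-\infty}^{\infty} c_k e^{2\pi i k t}$$

where, writing $a_k$ and $b_k$ for the cosine and sine coefficients in (1), $c_k = (a_k - i b_k)/2$ for k>0, $c_0 = a_0$ and $c_k = (a_{-k} + i b_{-k})/2$ for k<0. In fact

$$c_k = \int_0^1 f(t) e^{-2\pi i k t}\,dt.$$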
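The relation between the complex and the real coefficients can be checked on a simple case. This sketch assumes the convention that the complex coefficient is the integral of f(t) exp(-2 pi i k t) over one period, approximated here by a Riemann sum; `example_f` and `complex_coefficient` are hypothetical names introduced for the illustration.

```python
import cmath
import math

def complex_coefficient(f, k, n=2000):
    """Riemann-sum approximation to the integral of f(t) exp(-2 pi i k t) over [0, 1)."""
    return sum(f(m / n) * cmath.exp(-2j * math.pi * k * m / n) for m in range(n)) / n

# Hypothetical function with a_0 = 1, a_2 = 3, b_5 = -0.5 and all other coefficients 0.
def example_f(t):
    return 1 + 3 * math.cos(4 * math.pi * t) - 0.5 * math.sin(10 * math.pi * t)

for k in (-5, -2, 0, 2, 5):
    print(k, complex_coefficient(example_f, k))
```

For this function the coefficient at k = 5 should be close to (a_5 - i b_5)/2 = 0.25i and the one at k = -5 close to its complex conjugate, as happens for any real-valued f.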

Fourier transforms

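For functions which are not periodic we can proceed by a further approximation. Suppose f is defined on the real line and fix a large value of T. Define

$$g(t) = f(T(t - 1/2)), \qquad 0 \le t \le 1.$$

Then g is defined on [0,1] and

$$g(t) = \sum_{k=-\infty}^{\infty} e^{2\pi i k t} \int_0^1 g(s) e^{-2\pi i k s}\,ds$$

according to (1) above. Re-express the conclusion in terms of f to get, for $-T/2 \le x \le T/2$,

$$f(x) = \sum_{k=-\infty}^{\infty} e^{2\pi i k (x/T + 1/2)}\, \frac{1}{T}\int_{-T/2}^{T/2} f(y) e^{-2\pi i k (y/T + 1/2)}\,dy$$

which simplifies to

$$f(x) = \frac{1}{T}\sum_{k=-\infty}^{\infty} e^{2\pi i k x/T} \int_{-T/2}^{T/2} f(y) e^{-2\pi i k y/T}\,dy.$$

You should recognize this sum, whose terms are evaluated at the frequencies $\omega_k = 2\pi k/T$ spaced $2\pi/T$ apart, as a Riemann sum for the integral

$$\frac{1}{2\pi}\int_{-\infty}^{\infty} e^{i\omega x} \int_{-T/2}^{T/2} f(y) e^{-i\omega y}\,dy\,d\omega$$

which then converges as $T \to \infty$ to the expression

$$f(x) = \frac{1}{2\pi}\int_{-\infty}^{\infty} e^{i\omega x} \int_{-\infty}^{\infty} f(y) e^{-i\omega y}\,dy\,d\omega.$$

The function

$$\hat f(\omega) = \int_{-\infty}^{\infty} f(y) e^{-i\omega y}\,dy$$

is called the Fourier transform of f and we have derived a Fourier inversion formula. [WARNING: no proofs here! This integral will exist, for example, for f which are integrable over the whole real line.] This inversion formula expresses the function f as a linear combination of sines and cosines, though there are infinitely many frequencies involved.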
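A numerical check of the inversion formula may help fix ideas. The sketch below assumes the convention in which the transform integrates f(y) exp(-i omega y) and the inversion carries the factor 1/(2 pi); other texts place the constants differently. The Gaussian `f`, the truncation to [-10, 10] and the grid size are arbitrary choices for the illustration, and `riemann`, `f_hat` and `invert` are hypothetical helper names.

```python
import cmath
import math

def riemann(g, lo=-10.0, hi=10.0, n=400):
    """Midpoint Riemann sum of g over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(g(lo + (m + 0.5) * h) for m in range(n))

def f(x):
    # Example function: decays fast enough that truncating to [-10, 10] is harmless.
    return math.exp(-x * x / 2)

def f_hat(omega):
    """Fourier transform of f under the assumed convention."""
    return riemann(lambda y: f(y) * cmath.exp(-1j * omega * y))

def invert(x):
    """Inversion formula: (1 / 2 pi) times the integral of f_hat(omega) exp(i omega x)."""
    return riemann(lambda w: f_hat(w) * cmath.exp(1j * w * x)) / (2 * math.pi)

print(f_hat(1.0), math.sqrt(2 * math.pi) * math.exp(-0.5))
print(invert(0.3), f(0.3))
```

For this f the transform is known in closed form, sqrt(2 pi) exp(-omega^2 / 2), which gives an independent check on `f_hat` as well as on the inversion.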

Transforms of Stochastic Processes

We now seek to apply these ideas with the function f being our stochastic process X. We have several difficulties:

The discrete nature of X leads us to the study of a discrete approximation to the integral:

$$\frac{1}{T}\sum_{t=0}^{T-1} X_t e^{-i\omega t}.$$

This object has real part

$$\frac{1}{T}\sum_{t=0}^{T-1} X_t \cos(\omega t)$$

and imaginary part

$$-\frac{1}{T}\sum_{t=0}^{T-1} X_t \sin(\omega t)$$

so that, apart from the means not being 0, we are studying the sample covariances of X with the cosine and sine at frequency $\omega$. We now study the statistical properties of these objects and then try to interpret them.

Suppose that X is a mean 0 stationary time series with autocovariance function C. We define the discrete Fourier transform of X as

$$\hat X(\omega) = \frac{1}{\sqrt{T}}\sum_{t=0}^{T-1} X_t e^{-i\omega t}.$$

Our choice to divide by the square root of T is motivated by the recognition that the sum of T terms typically has a standard deviation on the order of $\sqrt{T}$, leading us to expect that $\hat X(\omega)$ will have a standard deviation which has a reasonable limit as $T \to \infty$.
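A direct implementation makes the definition concrete. This is a minimal sketch, assuming the transform sums X_t exp(-i omega t) over t = 0, ..., T-1 and divides by the square root of T (the sign in the exponent is a convention); the series `x` is made-up data and `dft` is a hypothetical helper name.

```python
import cmath
import math

def dft(x, omega):
    """Sum of x[t] * exp(-i omega t) over t = 0..T-1, divided by sqrt(T)."""
    T = len(x)
    return sum(x_t * cmath.exp(-1j * omega * t) for t, x_t in enumerate(x)) / math.sqrt(T)

x = [1.0, -0.5, 2.0, 0.3, -1.2, 0.8, 0.0, -0.4]   # hypothetical short series
T = len(x)

# Evaluate the squared modulus at the frequencies 2 pi k / T, k = 0..T-1.
fourier_frequencies = [2 * math.pi * k / T for k in range(T)]
power = [abs(dft(x, w)) ** 2 for w in fourier_frequencies]

print(sum(power), sum(v * v for v in x))
```

With this scaling the squared moduli at the T frequencies 2 pi k / T add up to the sum of squares of the series, one sign that dividing by the square root of T puts the transform on a sensible scale.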

We begin by computing moments of $\hat X(\omega)$. Since $\hat X(\omega)$ is complex valued we have to think about what these moments should be. One way to think about this is to view $\hat X(\omega)$ as a vector with two components, the real and imaginary parts. This would give $\hat X(\omega)$ a mean and a 2 by 2 variance covariance matrix. Also of interest however will be the expected modulus squared of $\hat X(\omega)$, namely

$$E\left[|\hat X(\omega)|^2\right] = E\left[\hat X(\omega)\overline{\hat X(\omega)}\right]$$

where $\bar z$ is the complex conjugate of z. (If z=x+iy with x and y real then $\bar z = x - iy$.)

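Since the Xs have mean 0 we see that

$$E\left[\hat X(\omega)\right] = \frac{1}{\sqrt{T}}\sum_{t=0}^{T-1} E(X_t) e^{-i\omega t} = 0$$

(you should note that the expected value of a complex valued random variable is computed by finding the expected value of the real and imaginary parts). Then

$$E\left[|\hat X(\omega)|^2\right] = \frac{1}{T}\sum_{s=0}^{T-1}\sum_{t=0}^{T-1} E(X_s X_t) e^{-i\omega (s-t)}.$$

The expected values are just C(s-t). We can gather together all the terms involving C(0), all those involving C(1) and so on to find

$$E\left[|\hat X(\omega)|^2\right] = \frac{1}{T}\sum_{k=-(T-1)}^{T-1} (T - |k|)\, C(k) e^{-i\omega k}$$

which simplifies to

$$E\left[|\hat X(\omega)|^2\right] = \sum_{k=-(T-1)}^{T-1} \left(1 - \frac{|k|}{T}\right) C(k) e^{-i\omega k}.$$

As $T \to \infty$ the coefficient of each C(k) converges to 1 and we see (using C(k)=C(-k))

$$\lim_{T\to\infty} E\left[|\hat X(\omega)|^2\right] = \sum_{k=-\infty}^{\infty} C(k) e^{-i\omega k} = C(0) + 2\sum_{k=1}^{\infty} C(k)\cos(\omega k).$$

The right hand side of this expression is defined to be the spectral density, or power spectrum, of X:

$$f_X(\omega) = \sum_{k=-\infty}^{\infty} C(k) e^{-i\omega k}.$$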
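The convergence can be watched in a case where everything is available in closed form. The sketch below uses a first order moving average, X_t = e_t + theta e_{t-1} with unit variance white noise e, as a hypothetical example: its autocovariance is 1 + theta^2 at lag 0, theta at lags +1 and -1, and 0 elsewhere. The finite-T formula assumes the transform is scaled by one over the square root of T, and `expected_power` and `spectral_density` are names invented for the illustration.

```python
import cmath
import math

theta = 0.5  # hypothetical MA(1) coefficient

def C(k):
    """Autocovariance of X_t = e_t + theta * e_{t-1} with Var(e_t) = 1."""
    k = abs(k)
    if k == 0:
        return 1 + theta ** 2
    if k == 1:
        return theta
    return 0.0

def expected_power(T, omega):
    """Sum over |k| < T of (1 - |k|/T) C(k) exp(-i omega k): the finite-T expected squared modulus."""
    total = sum((1 - abs(k) / T) * C(k) * cmath.exp(-1j * omega * k)
                for k in range(-(T - 1), T))
    return total.real  # imaginary parts cancel because C(k) = C(-k)

def spectral_density(omega):
    """Limit as T grows: C(0) + 2 C(1) cos(omega), since C(k) = 0 for |k| > 1."""
    return C(0) + 2 * C(1) * math.cos(omega)

for T in (10, 100, 1000):
    print(T, expected_power(T, 1.0), spectral_density(1.0))
```

In this example the gap between the finite-T value and the limit is 2 theta cos(omega) / T, so it shrinks at rate 1/T.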

There are a number of ways to look at spectral densities and the discrete Fourier transform:





Richard Lockhart
Mon Nov 3 11:39:19 PST 1997