Module A · Foundations and Honesty - Chapter 06

Correlation Is Not Cointegration

The lesson almost everyone gets wrong, computed live on NSE data: the most correlated pairs need not be cointegrated, and a less correlated pair can be. Spurious regression and why correlation traders blow up.

STAT · NSE

What you'll learn

· Correlation vs cointegration
· Most correlated is not cointegrated
· The corr-vs-coint scatter
· Spurious regression
· A Monte Carlo of nonsense
· Window sensitivity

Every blown-up pairs book starts the same way. Someone runs a correlation screen, finds two names that move together like twins, shorts the expensive one against the cheap one, and waits for the gap to close. It does not. The gap widens. They average down. It widens more. Eventually the desk pulls the position, and the post-mortem reads the same every time: the pair decorrelated. It never did. The correlation was real the whole way down. It was simply never the thing that makes a spread come back. This is the hinge of the entire course. If you keep one idea from it, keep this: correlation is not cointegration, and confusing the two is how statistical-arbitrage traders die.

The mistake almost everyone makes

Correlation answers a short-term question: did these two names move in the same direction on the same days? It is measured on returns (the daily percentage changes), it always lands between minus one and plus one, and it is genuinely useful for one thing only - building a hedge that does not jump around day to day. What it cannot tell you is whether the gap between two prices will ever come back. Two stocks can post a +0.8 return correlation for a decade while their price ratio drifts steadily from 1.0 to 3.0 and never returns. Every daily wiggle agrees, yet the long-run levels walk away from each other forever.

Cointegration answers the question you actually care about: is there a specific combination of these two prices that stays tied to an average? It is measured on price levels (log levels, to be exact). A "yes" means the spread - what's left after you subtract one stock (scaled by a hedge ratio) from the other - is stationary: it wanders away from an average and keeps getting pulled back. That average is the only thing a pairs trade leans on. No stationary spread, no reversion, no trade - however gorgeous the correlation looks on the screen.

These are not two flavours of the same statistic. They are different questions about different objects: one about how returns move together in the short run, the other about whether price levels stay tied over the long run. Mix them up - measure correlation and assume you have found a reverting spread - and you have made the single most expensive error in the field.
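A small simulation makes the gap between the two questions concrete. This is a sketch on synthetic prices, not NSE data: the volatilities, the drift, the day count and the seed are all assumptions, chosen so the return correlation lands near 0.8 while the price ratio trends from about 1 toward 3.

```python
import numpy as np

rng = np.random.default_rng(7)          # arbitrary seed (assumption)
n_days = 2500                           # roughly ten years of sessions (assumption)

# Both stocks share one daily market shock; each adds its own smaller shock.
common = rng.normal(0.0, 0.010, n_days)
idio_a = rng.normal(0.0, 0.005, n_days)
idio_b = rng.normal(0.0, 0.005, n_days)

# Stock A gets a little extra daily drift, so A/B trends from 1.0 toward 3.0.
extra_drift = np.log(3.0) / n_days
ret_a = common + idio_a + extra_drift   # daily log returns of A
ret_b = common + idio_b                 # daily log returns of B

return_corr = np.corrcoef(ret_a, ret_b)[0, 1]
log_ratio = np.cumsum(ret_a - ret_b)    # log(A/B) over time
ratio = np.exp(log_ratio)

print(f"return correlation: {return_corr:.2f}")
print(f"price ratio, first day -> last day: {ratio[0]:.2f} -> {ratio[-1]:.2f}")
```

The constant drift never touches the correlation, because correlation subtracts each series' average return before comparing them. That is exactly why a returns-based statistic cannot see the levels walking apart.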

Two definitions, kept honest

The left panel is a trap; the right panel is a trade. Same "they move together" - opposite outcome.

The picture above is the whole chapter in one frame. On the left, two prices that are highly correlated day to day but whose gap has no anchor: it widens and keeps widening. On the right, two prices that still wander like random walks, but with a spring between them - every time the gap stretches, something pulls it back to the average. Only the right-hand gap is tradable. Correlation cannot tell these two pictures apart, because the daily co-movement looks identical in both. Cointegration is the test that can, because it asks the one question that separates them: is the spread stationary?
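The two panels can be imitated with synthetic data. In the sketch below both pairs receive the same daily gap shocks; the only difference is whether the gap is pulled back toward zero. The pull strength of 0.98, the volatilities and the seed are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(11)             # arbitrary seed (assumption)
n = 2500
x = np.cumsum(rng.normal(0.0, 0.010, n))    # shared random-walk log price
shocks = rng.normal(0.0, 0.005, n)          # identical gap shocks for both cases

# Left panel: the gap accumulates every shock, so it is itself a random walk.
gap_drift = np.cumsum(shocks)

# Right panel: each day the gap is pulled back toward zero (an AR(1) "spring").
pull = 0.98                                 # persistence, assumed
gap_tether = np.zeros(n)
for t in range(1, n):
    gap_tether[t] = pull * gap_tether[t - 1] + shocks[t]

y_drift = x + gap_drift
y_tether = x + gap_tether

def return_corr(a, b):
    """Pearson correlation of daily changes in two log-price series."""
    return np.corrcoef(np.diff(a), np.diff(b))[0, 1]

print("return corr, no anchor:", round(return_corr(x, y_drift), 2))
print("return corr, tethered :", round(return_corr(x, y_tether), 2))
print("gap std, no anchor    :", round(gap_drift.std(), 3))
print("gap std, tethered     :", round(gap_tether.std(), 3))
```

The two return correlations should come out nearly identical, while the spread of the gap differs sharply. Daily co-movement carries almost no information about the spring.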

Key idea

Correlation gives you a comfortable hedge, but it is not a reversion signal. The reversion signal is cointegration - a stationary spread with an average it returns to. A correlation screen finds pairs that trended together. Only a cointegration test finds pairs whose gap comes back. Build your screen on the wrong one and you will keep picking pairs with no anchor.

The headline table: where the ranking breaks down

Enough theory. Here are both statistics computed directly on real NSE names over the window 2016-01-01 to 2026-06-25 - roughly 2,470 trading days per pair, each tested on its own overlapping history. Correlation is measured on log returns. Cointegration is the Engle-Granger p-value on log price levels. A small p-value here means cointegrated, because the test starts from the assumption of no cointegration and a small p is the evidence against it. The table is sorted by correlation, so watch the verdict column as your eye travels down.

Pair	Sector	Return corr	Coint p	Verdict
TATASTEEL / JSWSTEEL	same	0.76	0.204	no - drifts
TCS / INFY	same	0.67	0.062	borderline
HDFCBANK / ICICIBANK	same	0.62	0.337	no - drifts
KOTAKBANK / HDFCBANK	same	0.60	0.001	cointegrated
HCLTECH / WIPRO	same	0.57	0.301	no - drifts
MARUTI / M&M	same	0.53	0.000	cointegrated
MARUTI / ICICIBANK	cross	0.46	0.277	no - drifts
TCS / HDFCBANK	cross	0.26	0.388	no - drifts
INFY / TATASTEEL	cross	0.26	0.842	no - drifts
WIPRO / MARUTI	cross	0.24	0.791	no - drifts

Read the top row and the bottom of the same-sector block together, because that single comparison is the entire lesson. The most correlated pair in the table, TATASTEEL / JSWSTEEL at +0.76, is not cointegrated - its Engle-Granger p-value is 0.204, a flat "no, this spread drifts." Meanwhile MARUTI / M&M, with a much weaker +0.53 correlation, is firmly cointegrated with a p-value that rounds to 0.000 (that is, below 0.0005), and KOTAKBANK / HDFCBANK joins it at p = 0.001. A trader who ranked these pairs by correlation and traded the top of the list would have picked the steel pair - the one with no anchor - and skipped the two pairs that actually mean-revert. The correlation ranking and the cointegration ranking are simply not the same list.
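For readers who want to see the two statistics side by side in code, here is a minimal sketch. It is not the chapter's script: the Engle-Granger step is a simplified version with no lag terms, it returns a t-statistic to compare against an approximate 5% critical value of -3.34 (MacKinnon's tabulated value for two series with a constant) instead of a p-value, and the input is synthetic. A library routine such as `coint` in statsmodels handles lag selection and p-values properly.

```python
import numpy as np

def return_corr(log_a, log_b):
    """Pearson correlation of daily log returns."""
    return np.corrcoef(np.diff(log_a), np.diff(log_b))[0, 1]

def engle_granger_t(log_y, log_x):
    """Simplified Engle-Granger statistic (no lag terms, an assumption).

    Step 1: OLS of log_y on log_x with an intercept; keep the residual.
    Step 2: Dickey-Fuller regression d(resid)_t = rho * resid_{t-1} + error,
            and return the t-statistic on rho.
    """
    design = np.column_stack([np.ones_like(log_x), log_x])
    coef, *_ = np.linalg.lstsq(design, log_y, rcond=None)
    resid = log_y - design @ coef

    lagged, change = resid[:-1], np.diff(resid)
    rho = (lagged @ change) / (lagged @ lagged)
    err = change - rho * lagged
    se = np.sqrt((err @ err) / (len(change) - 1) / (lagged @ lagged))
    return rho / se

CRIT_5PCT = -3.34   # approximate 5% critical value, two series, with constant

# Synthetic check: y is tied to x by a stationary AR(1) gap (parameters assumed).
rng = np.random.default_rng(3)
x = np.cumsum(rng.normal(0.0, 0.01, 2000))
gap = np.zeros(2000)
for t in range(1, 2000):
    gap[t] = 0.9 * gap[t - 1] + rng.normal(0.0, 0.005)
y = 0.5 + x + gap

t_stat = engle_granger_t(y, x)
print("return corr:", round(return_corr(x, y), 2))
print("EG t-stat  :", round(t_stat, 2), "-> cointegrated at 5%?", t_stat < CRIT_5PCT)
```

Note the asymmetry in the inputs: correlation consumes differences of the log prices, while the cointegration statistic consumes the log levels themselves.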

In this window only 2 of the 6 same-sector pairs are cointegrated at the 5% level, and 0 of the 4 deliberately cross-sector "fake" pairs are. That is the one comforting result: pairs with no economic reason to be tied together correctly fail the test. Now watch the difference on real prices - the most-correlated-but-not-cointegrated pair against a genuine one.

EX 1 · Chart · Drift apart vs snap back: the spread that has no anchor and the one that does · STAT · ch06/01_see_it_the_most_correlated_but_not_coint.py

In the top row, both pairs are rebased to 100 and both look like textbook "they move together." The difference shows up in the bottom row - the spread drawn as a z-score, which is just how many standard deviations the spread sits from its own average right now. (The spread itself is the leftover, or residual, from an OLS regression - the standard line of best fit - of one leg on the other.) TATASTEEL / JSWSTEEL wanders out to an extreme and stays there: there is nothing to trade, no level it returns to. MARUTI / M&M keeps crossing back through zero. That crossing - and only that crossing - is the edge.
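The spread and its z-score described above can be sketched in a few lines. The function name and the synthetic inputs are illustrative assumptions; the mean and standard deviation here are taken over the full sample, which is fine for a chart but looks ahead if used as a live signal.

```python
import numpy as np

def spread_zscore(log_y, log_x):
    """Z-score of the OLS residual of log_y on log_x (full-sample mean and std)."""
    hedge_ratio, intercept = np.polyfit(log_x, log_y, 1)   # slope first, then intercept
    spread = log_y - (intercept + hedge_ratio * log_x)     # the OLS residual
    return (spread - spread.mean()) / spread.std()

# Tiny synthetic example (placeholder numbers, not NSE data).
rng = np.random.default_rng(0)
log_x = np.cumsum(rng.normal(0.0, 0.01, 500))
log_y = 0.2 + 0.8 * log_x + rng.normal(0.0, 0.02, 500)

z = spread_zscore(log_y, log_x)
print("latest z-score:", round(z[-1], 2))
```

By construction the z-score series has mean 0 and standard deviation 1 over the sample, so it always looks bounded on a chart. Whether it keeps crossing zero is the part that matters.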

Correlation tells you almost nothing about cointegration

One pair could be a fluke, so widen the net. Below are the same two statistics over a 13-name set spanning IT, Banks, Metals and Auto: return correlation on the left, cointegration p-value on the right.

EX 2 · Chart · Correlation clusters by sector; cointegration does not · STAT · ch06/02_two_heatmaps_side_by_side_over_a_curated.py

The correlation map on the left lights up in clean sector blocks - the IT names move together, the banks move together, the metals move together, exactly as economic intuition says they should. The cointegration map on the right is patchy and stubbornly refuses to fill in those same blocks. The bright (correlated) cells and the bright (cointegrated) cells do not line up. If correlation predicted cointegration, the two pictures would look the same. They look nothing alike, because they measure genuinely different things.

To turn "they look different" into a number, plot every same-sector pair in the universe - correlation on the x-axis, cointegration p-value on the y-axis. If correlation predicted cointegration, the cloud of points would slope down to the right: higher correlation, lower p. It does not slope at all.

EX 3 · Chart · The scatter that kills the myth: Spearman = -0.04 · STAT · ch06/03_the_scatter_that_kills_the_myth_correlat.py

Across 75 same-sector pairs, only 11 are cointegrated at 5%, and the Spearman rank correlation between return correlation and the cointegration p-value is -0.04 (p = 0.75). Spearman just measures whether one ranking tracks the other; at -0.04 it is indistinguishable from zero. Let that land. Knowing how correlated two stocks are tells you essentially nothing about whether their spread mean-reverts. The most popular screen in pairs trading - sort by correlation, trade the top - has almost zero predictive power for the only property that makes a pair tradable.
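To make the Spearman statistic tangible, the sketch below applies it to the six same-sector rows of the headline table. Six points are far too few to conclude anything, and this is not the 75-pair calculation behind the -0.04; the helper assumes no tied values.

```python
import numpy as np

def spearman(a, b):
    """Spearman rank correlation: Pearson correlation of the ranks (no ties assumed)."""
    rank_a = np.argsort(np.argsort(a))     # 0-based rank of each element
    rank_b = np.argsort(np.argsort(b))
    return np.corrcoef(rank_a, rank_b)[0, 1]

# Same-sector rows of the headline table, in table order.
return_corr = np.array([0.76, 0.67, 0.62, 0.60, 0.57, 0.53])
coint_p = np.array([0.204, 0.062, 0.337, 0.001, 0.301, 0.000])

rho = spearman(return_corr, coint_p)
print(round(rho, 2))   # 0.37
```

If higher correlation implied stronger cointegration, this number would be clearly negative. On these six rows it is positive, which is the opposite of what a correlation screen assumes.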

Tip

Use correlation for what it is good at and nothing more. It is a fine secondary filter once you already have a cointegrated spread - a higher-correlation hedge moves around less day to day and is more comfortable to hold. But it must never be your primary screen. Screen on cointegration first. Then rank the survivors on correlation, half-life (how long a deviation takes to shrink by half) and cost.
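Half-life can be estimated in several ways. One common sketch, assuming the spread behaves like a simple AR(1) process, fits today's spread on yesterday's and converts the persistence coefficient into a number of periods. The function name is hypothetical and the check uses noise-free data.

```python
import numpy as np

def half_life(spread):
    """Half-life in periods from an AR(1) fit: s_t = a + phi * s_{t-1} + noise."""
    phi, _intercept = np.polyfit(spread[:-1], spread[1:], 1)
    if not 0.0 < phi < 1.0:
        return float("inf")        # no measurable pull back toward the mean
    return -np.log(2.0) / np.log(phi)

# Noise-free check: a deviation that halves every period has a half-life of one period.
decay = 0.5 ** np.arange(12)
print(half_life(decay))
```

On real spreads the fitted coefficient is noisy, so a half-life estimate deserves the same window-sensitivity scepticism as the cointegration verdict itself.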

Why correlation lures traders to their death

There is a reason correlation and regression are so seductive on price data. It is worth seeing the machinery directly, because it is the engine under every spurious "pair." (Spurious just means a relationship that looks real but is not.)

The spurious-regression trap: trending data manufactures significance out of pure noise.

Take two completely independent random walks - two prices where each next value is just the last value plus random noise, with no link whatsoever between them - and regress one on the other. Standard statistics says you should find a "significant" relationship about 5% of the time. On trending, non-stationary data you do not. You find it almost always. Here is the Monte Carlo - a large repeated simulation - over 3,000 such pairs.

EX 4 · Spurious regression: 90% of unrelated random walks look "significant" · STAT · ch06/04_spurious_regression_monte_carlo_regress_.py

Out of 3,000 pairs of independent random walks, the regression slope looks "statistically significant" (|t| > 1.96) in 90.1% of cases - against the ~5% a correct test would deliver - with a median |t-statistic| of 10.0, and the R-squared exceeds 0.5 in 16.6% of runs on pure noise. This is the Granger-Newbold (1974) spurious-regression result, and it is the deep reason correlation is so dangerous on prices. Ordinary least squares assumes stationary, well-behaved inputs. Feed it two trending series and it hands you a confident, completely fictitious relationship. A correlation or regression screen run on raw price levels is, quite literally, a machine for inventing pairs that were never tied together.
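A compact version of this experiment is sketched below. It is not the chapter's script: the walk length of 500 steps and the seed are assumptions (the text states only the pair count), so the exact percentages will differ from those quoted. The second line of output runs the same regression on the steps instead of the levels, as a control.

```python
import numpy as np

def slope_t_stats(x, y):
    """OLS t-statistic of the slope of y on x (with intercept), one per row."""
    xc = x - x.mean(axis=1, keepdims=True)
    yc = y - y.mean(axis=1, keepdims=True)
    sxx = (xc * xc).sum(axis=1)
    beta = (xc * yc).sum(axis=1) / sxx
    resid = yc - beta[:, None] * xc
    sigma2 = (resid * resid).sum(axis=1) / (x.shape[1] - 2)
    return beta / np.sqrt(sigma2 / sxx)

rng = np.random.default_rng(42)     # arbitrary seed (assumption)
n_pairs, n_obs = 3000, 500          # n_obs is an assumption

# Independent noise, cumulated into independent random walks.
steps_x = rng.normal(size=(n_pairs, n_obs))
steps_y = rng.normal(size=(n_pairs, n_obs))
walk_x, walk_y = steps_x.cumsum(axis=1), steps_y.cumsum(axis=1)

sig_levels = np.mean(np.abs(slope_t_stats(walk_x, walk_y)) > 1.96)
sig_steps = np.mean(np.abs(slope_t_stats(steps_x, steps_y)) > 1.96)

print(f"'significant' on levels: {sig_levels:.1%}")
print(f"'significant' on steps : {sig_steps:.1%}")
```

The control is the useful part: the same unrelated data regressed as steps should reject at roughly the nominal 5%, so the inflation comes from the trending levels and not from the regression code.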

Heads up

A high t-statistic or a high R-squared on price levels is not evidence of a relationship. On trending data it is the default, even between things that have nothing to do with each other. Never trust a regression on non-stationary series. The right test for this question checks whether the residual spread is stationary: cointegration. That is why the field runs cointegration tests instead of correlations on price levels. But cointegration is not a lie detector either - it is still sample-dependent, and regime changes, short samples, multiple testing and structural breaks can all fool it, so any positive must be confirmed out of sample.

The intuition, in plain words

Strip away the econometrics and it comes down to three points.

Correlation measures whether two names move together on the same days. It is a short-horizon statistic built on returns. It can be coincidence, it drifts over time, and two strongly correlated prices can separate and never come back together.
Cointegration measures whether a specific combination of the two prices stays tied to an average over the long run - whether the spread is stationary. That average is the magnet a pairs trade leans on. No stationary spread, no reversion, no trade.
Why the difference is lethal. A correlation-only screen picks pairs that trended together, which on non-stationary prices is mostly luck - the 90% from the histogram above. You enter expecting reversion, the spread keeps widening, and you average down into a position with no anchor. That is the classic blow-up, and it traces straight back to confusing these two ideas.

Check yourself

1. Two stocks have a daily-return correlation of 0.9. Does that make their spread tradable?

No. Correlation says they tend to move on the same days. It says nothing about whether the gap between their prices stays bounded. A high-correlation pair can still have a spread that drifts off and never comes back.

2. What is a spurious regression, and why is it so common on price levels?

It is a regression that reports a strong, "significant" relationship between two series that are actually unrelated. Ordinary least squares assumes stationary inputs; feed it two trending random walks and it finds a confident, fictitious link - here in about 90% of runs on pure noise.

3. What does a cointegration test check that a correlation does not?

Whether the spread - the residual after hedging one stock against the other - is stationary, i.e. whether the gap actually comes home. That is the property a pairs trade needs, and correlation is neither necessary nor sufficient for it: the headline table has a highly correlated pair that fails the test and a less correlated pair that passes.

Where this breaks

The honest caveats matter as much as the result, because every number above is provisional.

Sample-window sensitivity is the big one. Every verdict in this chapter holds only for this window. Split each pair's history in half, re-run the identical test, and labels flip. MARUTI / M&M goes from p = 0.012 in the first half to p = 0.096 in the second - cointegrated at the 5% level, then no longer. INFY / HCLTECH swings from p = 0.007 to p = 0.861, a complete reversal. Cointegration is not a permanent stamp on a pair; it is a statement about one particular stretch of data.

EX 5 · Chart · Verdicts move with the window: the same test, flipped · STAT · ch06/05_verdicts_move_with_the_window_split_each.py
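The split-half check is easy to sketch. The example below builds a synthetic pair whose tether snaps at the midpoint, then runs a simplified Engle-Granger statistic (no lag terms, an assumption) on each half. Compare each value against the approximate 5% critical value of -3.34; all parameters and names are illustrative.

```python
import numpy as np

def eg_t(log_y, log_x):
    """Simplified Engle-Granger t-statistic (OLS residual, Dickey-Fuller, no lags)."""
    slope, intercept = np.polyfit(log_x, log_y, 1)
    resid = log_y - (intercept + slope * log_x)
    lagged, change = resid[:-1], np.diff(resid)
    rho = (lagged @ change) / (lagged @ lagged)
    err = change - rho * lagged
    return rho / np.sqrt((err @ err) / (len(change) - 1) / (lagged @ lagged))

def split_half(log_y, log_x):
    """Run the same test separately on the first and second half of the sample."""
    mid = len(log_y) // 2
    return eg_t(log_y[:mid], log_x[:mid]), eg_t(log_y[mid:], log_x[mid:])

# Synthetic pair whose tether snaps halfway (all parameters assumed).
rng = np.random.default_rng(5)
n = 2000
x = np.cumsum(rng.normal(0.0, 0.01, n))
gap = np.zeros(n)
for t in range(1, n):
    pull = 0.9 if t < n // 2 else 1.0      # 1.0 means no pull: the gap random-walks
    gap[t] = pull * gap[t - 1] + rng.normal(0.0, 0.005)
y = x + gap

first, second = split_half(y, x)
print("first half  t:", round(first, 2))
print("second half t:", round(second, 2))
```

Real pairs rarely break this cleanly, but the mechanics are the same: one function, two windows, and potentially two different verdicts.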

Multiple testing (a preview of what's next). We scanned 75 same-sector pairs and 11 came back "cointegrated." When you run that many tests at p < 0.05, roughly 4 of the 75 would be expected to pass by chance alone even if no pair were truly cointegrated - a false-discovery trap from the same family as the spurious-regression Monte Carlo. A low p-value pulled from a big scan is a hypothesis, not a discovery. The next chapter puts numbers on how many spurious "cointegrated" pairs to expect from pure noise, and how the Bonferroni and FDR corrections claw it back.
The Engle-Granger test is itself fragile. It assumes a single, fixed hedge ratio (how many units of one leg you trade against the other) over the whole window, it is sensitive to which leg you put on the left-hand side of the regression, and it loses power against structural breaks. Later chapters replace the fixed ratio with one that is first diagnosed from the residuals and then updated as new prices arrive.
A tether is still not an edge. Even a genuinely, stably cointegrated pair only becomes tradable after you account for costs, borrow and short availability, half-life, and out-of-sample survival. Correlation versus cointegration is the entry exam, not the finish line.
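The arithmetic behind the multiple-testing caveat fits in a few lines. This is back-of-envelope only: the second figure assumes the 75 tests are independent, which is a simplification, because pairs that share a stock are not.

```python
n_tests = 75      # same-sector pairs scanned in this chapter
alpha = 0.05      # significance level used for the verdicts

# Expected number of passes if NO pair were truly cointegrated.
expected_false_positives = n_tests * alpha          # 3.75

# Chance of at least one false pass, assuming independent tests
# (a simplification: pairs that share a stock are not independent).
prob_at_least_one = 1.0 - (1.0 - alpha) ** n_tests

print(f"expected false positives: {expected_false_positives:.2f}")
print(f"P(at least one false pass): {prob_at_least_one:.3f}")
```

Set against the 11 pairs that passed, an expected count near 4 under pure noise is a reminder that some of those 11 may be chance passes.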

Real-life implementation note. Everything here uses real adjusted equity closing prices to learn the statistics. Trading a cointegrated spread means shorting one leg, and in Indian markets that needs a real vehicle - intraday square-off, borrowed stock, or a stock-futures proxy. Each choice changes costs, margin, borrow availability, taxes and slippage. A statistical tether existing is not the same as a tradable edge existing. The gap between the two is the rest of this course.