Two systems land on my desk with identical Sharpe ratios of 1.40. One of them spends 18 months grinding out a quiet equity curve with shallow 4% pullbacks. The other had a 17% drawdown in March 2020 that took 9 months to recover, and a 12% drawdown in late 2022 it never fully recovered before its next loss. On Sharpe, they tie. On the lived experience of trading them, they are nothing alike.
The sortino ratio is built to settle exactly that disagreement. It keeps the Sharpe shape, excess return divided by a volatility measure, but it strips the denominator down to the part that actually hurts: downside deviation. Upside swings are no longer treated as risk. What the sortino ratio tells you is closer to the question a trader is really asking, which is “how much pain does this strategy cost me per unit of return.”
What the Sharpe ratio actually measures
The Sharpe ratio takes a strategy’s excess return over a risk-free rate and divides it by the standard deviation of its returns. Both gains and losses sit in that standard deviation. A strategy that returns +8% in one month and -2% in the next contributes the same amount of “risk” from the +8% month as from a +8% month in a strategy with a -8% month sitting next to it.
That mathematical symmetry is the source of every quiet complaint about Sharpe. A momentum book that catches a +14% month in a runaway tech name gets punished in the denominator the same way a trend system that takes a -14% gap-down loss gets punished. The Sharpe ratio cannot tell those two events apart. I find that more than an academic problem when I am comparing screeners, and our AlphaSharpe screening note walks through how the Sharpe rank shuffles when you re-rank by sortino instead.
Why upside variance is not a loss
If a trader runs a strategy for a year and one month delivers a +22% return, that month does not need to be penalised. Capital is not destroyed by a good month. The trader did not have to sit through psychological pressure to hold the position, and the equity curve did not retrace. The only reason a +22% month is “noise” to the Sharpe ratio is that the formula was written symmetrically.
What the Sharpe ratio does NOT say is that upside variance is bad for the trader. It says upside variance is bad for the ratio. Those are different sentences. A common misread is to look at a high-variance momentum strategy with a Sharpe of 0.9 and call it “too volatile,” when in fact every one of its volatile months may have been a gain. Strip the upside out of the denominator and the same equity curve can produce a sortino above 2.
How the sortino ratio is calculated
The calculation is a small surgical change to Sharpe. The numerator stays the same: the strategy’s average excess return over some minimum acceptable return, often written MAR (the risk-free rate, zero, or a hurdle rate, depending on the trader’s choice). The denominator is replaced.
Instead of the standard deviation of all returns, the sortino uses the downside deviation. Concretely: take every return that fell below the MAR, square the shortfall, average those squared shortfalls (dividing by the total number of periods, not just the losing periods, which is the convention used by the original Sortino-Price papers), then take the square root.
The formula collapses to: sortino = (mean return – MAR) / downside_deviation. The upside months contribute zero to the denominator. A trader who runs the same monthly returns through this calculation will always get a sortino that is greater than or equal to the Sharpe, because the denominator can only shrink or match.
A worked example with two equal-Sharpe strategies
I will use a deliberately small dataset to make the arithmetic visible. Strategy A returns, in twelve monthly percentages: +3, +2, +4, -3, +2, -2, +3, +1, +2, -1, +4, +5. Strategy B returns: +6, +5, +1, -4, +7, -5, +1, +8, -2, -1, +2, +4. Both average +1.67% per month. The standard deviation of both rounds to 3.0%. With a 0% MAR, Sharpe is roughly 0.56 for each.
Now the downside numbers. Strategy A has three negative months: -3, -2, -1. Squared shortfalls are 9, 4, 1, summing to 14. Divide by 12 periods, take the square root, and downside deviation is about 1.08%. Sortino A is 1.67 / 1.08, which is 1.55.
Strategy B has four negative months: -4, -5, -2, -1. Squared shortfalls are 16, 25, 4, 1, summing to 46. Divide by 12, take the square root, and downside deviation is about 1.96%. Sortino B is 1.67 / 1.96, which is 0.85.
Same Sharpe. Sortino A is almost twice Sortino B. The trader who runs Strategy B is taking deeper losses to produce the same average, and the sortino ratio surfaces exactly that. I keep these two columns side by side whenever I am comparing strategies, and our equity-curve-trading guide shows how the same divergence shows up visually on the curve itself.
Picking the MAR (minimum acceptable return)
The MAR is where most of the disagreement between practitioners shows up. Set it to 0% and the sortino measures “how often, and how badly, the strategy lost capital.” Set it to the risk-free rate (call it 0.4% per month for a roughly 5% annualised T-bill) and the sortino measures “how often the strategy underperformed cash.” Set it to a hurdle rate like 1% per month and the sortino measures “how often the strategy missed its own internal target.”
I generally prefer MAR = 0% for technical strategies because the question I am answering is whether the system loses money, not whether it beats Treasuries. For a fund that has to clear a benchmark, the MAR moves to the benchmark’s return. The number itself is less important than picking one and holding it across every strategy you compare. Comparing a sortino calculated against 0% MAR to a sortino calculated against a 5% hurdle is a category error, not a comparison.
Where the sortino ratio misleads you
The sortino ratio does NOT measure tail risk. A strategy with a long run of small losses and one catastrophic -40% month can still produce a respectable sortino, because the downside deviation averages the squared shortfalls across all periods. A single fat-tail event gets smoothed into a number. The Taleb critique of these ratios is exactly that: any single-number summary of a return distribution misses the shape of the tail.
It also rewards strategies that quietly add leverage. If a trader doubles position size for a year, the negative months get bigger and the positive months get bigger in lock-step. Sortino can hold steady or even improve, because the mean return scales linearly while the downside deviation scales linearly too. The ratio looks clean. The capital at risk has just doubled.
A third misread is treating the sortino as a “loss” metric. It is a return-per-unit-of-downside-deviation metric, not a maximum drawdown. A strategy with a healthy sortino can still take a 25% intra-period drawdown the trader has to sit through, because drawdown depends on the sequence of losses and the sortino only sees their squared average. I cover the sequence question in our hard drawdown stops note.
How I use the sortino against Sharpe in practice
The single most useful pattern I have found is to run both ratios side by side and look at the gap. A strategy with a Sharpe of 1.2 and a sortino of 1.4 is roughly symmetric: its volatility is genuinely two-sided. A strategy with a Sharpe of 1.2 and a sortino of 2.8 is heavily right-skewed: most of the variance is on the upside, and the trader is being unfairly punished by the Sharpe denominator.
The reverse is the warning sign. A Sharpe of 1.2 paired with a sortino of 0.9 is unusual and bad. It says the strategy’s downside deviation is higher than its full-period standard deviation, which can only happen when the losing months are deeper than the gaining months and the calculation is being dominated by a small cluster of large losses. I treat that pattern as a flag to look at the drawdown chart directly and not to trust the Sharpe rank.
I also keep a sortino-to-Sharpe ratio (sortino divided by Sharpe) as a quick skew read. Above 1.5, the strategy is asymmetric to the upside. Below 1.0, it is asymmetric to the downside. Between 1.0 and 1.5, the two ratios largely agree.
What the sortino ratio does not tell you
The sortino ratio does NOT tell you about trade frequency. A strategy with two trades a year and a strategy with two hundred trades a year can produce identical sortinos. The trader has to know whether the number is built on a sample large enough to be stable.
It does NOT tell you about correlation to the rest of the book. A 1.8 sortino on a single strategy is meaningless if that strategy is 0.9 correlated to another strategy already in the portfolio. The combined book sortino is the only number that pays the bills.
It does NOT replace the equity curve. The number is a summary statistic. The curve is the actual record. Two strategies with the same sortino can have visibly different curves, one with the losses clustered and one with them spread, and the trader has to look at the curve before sizing. Paul Tudor Jones built a career on watching the actual record and not the ratio, and I think that order of operations is still right.
Compare the ratios, then look at the actual losses
The sortino ratio fixes a real defect in the Sharpe ratio. It stops punishing upside variance, surfaces strategies that look equivalent on Sharpe but lose money differently, and gives you a cleaner read on whether the volatility you see is the volatility you fear. None of that makes it a sufficient statistic. Use it as the second number in a pair, not as a single verdict, and always look at the drawdown sequence after the ratio.
Learn the pattern. Ride the trend. Keep the gains.
Educational content only. Not investment advice. Trading involves risk. You are responsible for your decisions.
