With respect to being number one, the definitive review of the field is provided by Langville and Meyer (2012) in Who's #1?: The Science of Rating and Ranking. And, in terms of estimating true talent levels, Adam Dorhauer delivers two excellent articles worthy of publication, Elo vs. Regression to the Mean: A Theoretical Comparison and Regression with Changing Talent Levels: The Effects of Variance.
Thursday, June 29, 2017
True Talent Levels
With respect to being number one, the definitive review of the field is provided by Langville and Meyer (2012) in Who's #1?: The Science of Rating and Ranking. And, in terms of estimating true talent levels, Adam Dorhauer delivers two excellent articles worthy of publication, Elo vs. Regression to the Mean: A Theoretical Comparison and Regression with Changing Talent Levels: The Effects of Variance.
Monday, March 13, 2017
Cheltenham 2017: Supreme Novices Hurdle Handicapping
- Supreme Novices Hurdle is similar to Kentucky Derby - young horses, many attempting graded stakes, championship race for first-time with little form in book.
- Eliminate non-contenders and whatever remains, no matter how improbable...
- Avoid horses with pedigree mismatch to former winners [
Melon,Crack Mome]. - Avoid horses from small fields [
Labaik], [Elgin]. - Late speed important [
Magna Cartor,Glaring,Capitol Force]. - Poor Cheltenham Form [
River Wylde]. - Poor FPR [
Pingshou,].High Bridge - Minimum price 10/1 [
Ballyandy,Bunk Off Early].
Note, given the limited exposure of all the runners, we are not saying that those we have eliminated are not going to win - simply that they did not meet our criteria for live longshots to run in the money. The key takeaway, as always, is using a process of elimination not selection for identifying contenders.
Footnote: In their respective next outings, Cilaos Emery won G1 (Punchestown) and Beyond Conceit finished second in G1 (Aintree).
Friday, February 03, 2017
Adaptive Boosting
Friday, December 23, 2016
Biased Coin (Haghani & Dewey, 2016)
- Financial: Though the median final bankroll of $10,504 is derived in the footnotes, there is not sufficient attention drawn to it in the paper itself. Time-Value automatically generates this value whereas Expected-Value generates the wholly unrealistic $3,220,637.
- Psychological: The fallacy of “Playing With House Money” – “…you are offered a stake of $25 to take out your laptop to bet on the flip of a coin for thirty minutes.” What would have happened if the subjects had to pay $25 to play instead of being given it for free?
No less a luminary in both the financial and gambling worlds than Ed Thorp says: “This is a great experiment for many reasons. It ought to become part of the basic education of anyone interested in finance or gambling.”
Monday, November 07, 2016
Evens-Equivalent Trades
Sport
|
Bank
|
Trades
|
Avg.
Odds |
Avg.
Win Rate |
Avg.
Stake |
Evens
Odds |
Evens
Win Rate |
Evens
Stake |
Expect.
Value |
Std.
Dev. |
EV.SD
|
Edge
|
Rsk
Of Ruin |
MLB
|
5,000
|
500
|
2.10
|
51.00%
|
250.00
|
2.00
|
53.37%
|
263.05
|
17.75
|
262.45
|
219
|
7.10%
|
7.11%
|
H-R
|
2,500
|
1,000
|
3.50
|
31.00%
|
100.00
|
2.00
|
52.62%
|
162.10
|
8.50
|
161.87
|
363
|
8.50%
|
16.53%
|
In the classic treatment of ruin, there is a working assumption of even-money trades to make the calculations tractable. To that end, we must first transform our real-world trades into their even-money equivalents with the same edge and volatility, see Krigman (1999). Despite having a smaller edge and a larger stake, you have a lower probability of depletion than your brother principally because you are risking a lower percentage of your bankroll per trade. Ideally, your RoR should be below 5% and to achieve this you both would have to either increase your bankroll or decrease your stake, as follows. [(MLB: 5%, 218.20 or 5730); (H-R: 5%, 54.97 or 4,546)].
Note that Edge's impact only equates with that of Volatility after 219 trades for you and 363 trades for your brother. And it takes a minimum of 806 trades for you and 1336 trades for your brother before you can be at least 95% confident that the combined effects of positive edge and mixed-bag volatility work in your favor to guarantee positive bankroll growth. In other words, despite having potentially successful trading strategies, you both will be well into your second season of handicapping before you can be sure of beginning to reap the benefits!
Saturday, October 01, 2016
Juvenile Finish Position Ratings
- First, calculate "horses beaten" (n-f) and "horses beaten by" (f-1) from finishing positions (f) and number of runners (n) for each race in a horse's past performances.
- Then, sum across all races for wins (w=Σ(n-f)) and losses (l=Σ(f-1)) respectively.
- Next, calculate a horse's posterior probability m=(w+α)/((w+α)+(l+β)). Prior wins (α) and losses (β) are derived from two full seasons of 2yo races and are equivalent to a horse finishing fourth of seven runners in a virtual race. Note that a first-time starter would automatically have a posterior probability of 0.50=(0+3)/((0+3)+(0+3)).
- Finally, convert probabilities to performance ratings (min:112=8-00, max:126=9-00) using the following formula: r=((a+(b-a))*((m-x)/(y-x))), where a=out.min, b=out.max, x=in.min, and y=in.max of all runners in the current race.
In summary, this finishing position rating system (fpr) does not take into account the strength of opposition, beaten lengths, weight carried, or finishing times; however, when it is based on a whole season of results, the fpr ranks correlate approximately 0.87 with the equivalent ranks from an Elo rating system.
As luck would have it, Sunday's Prix Marcel Boussac (Fillies' Group 1) - France's top 2yo fillies race - had a 0.92 correlation between fpr ranks and finishing positions, with the 10/1 winner (Wuheida) top-rated! Not scientific, nevertheless I get to keep the winnings.
Thursday, September 01, 2016
Speed-Stamina Fingerprints
In a similar vein, we can generate unique speed-stamina, power-law fingerprints for racecourses (and horses) based on the best times for various distances. The simplest interpretation of these racecourse fingerprints is to confirm our expectations of the demands imposed by similar distances for different course configurations (Epsom is faster than either Ascot or Newmarket (lower y-intercept); Ascot and Newmarket have similar stamina profiles (same slopes)). Another possible insight might be how these course fingerprints reflect the potential impact on horses with different pace profiles (early speed at Epsom). A further analysis might be on how to better baseline and equate speed figures at different racecourses (use standard course with own speed-stamina equation). Finally, more controversially, using speed-stamina fingerprints for classic-generation (3yos) horses to match with course fingerprints in the lead-up to Group 1 contests (Epsom Derby) or for comparing performances from different classic generations (Frankel vs Sea The Stars).
Wednesday, August 03, 2016
In sports markets, the probabilities implied by the prices on offer are a proxy for the wisdom of the crowd for that particular event. By adapting Shannon's Entropy formula, we can generate our own "Wisdom Of Crowd Market Index" (WCMI) to represent this information on a scale from 0 - 1.
First, calculate implied probabilities of prices: x = 1.00 / d
(where d = decimal odds).
Next, calculate log probabilities: y = log(x, n)
(where n = number of runners).
Then, multiply probabilities by log probabilities: z = x * y.
Finally, sum products and subtract from one for final index:
wcmi = 1 - (-sum(z)).


Note that the index is at a minimum (0.00) when the market is completely uninformed about the outcome (all prices are the same) and at a maximum (1.00) when the market has closed (no price for winner and maximum price for all others). In the realistic exchange market above (snapshot of prices taken one minute before going in-play), the crowd is relatively uninformed (0.03) about the likely outcome and presents an excellent opportunity for the informed sports trader. Personally, I do not trade in any market with an index above 0.13 (approx).
It is very gratifying to note that FlatStats are now (29-Dec-2017) using our WCMI as a guide to those markets in which the crowd is less well-informed!
Sunday, July 03, 2016
Fano Plane Transylvania Lottery And Trifectas
From a handicapping perspective, it would be interesting to check for a subset of seven runner races for which these particular (or equivalent) bet combinations would prove to be a successful trifecta strategy. At the very least, you would have bragging rights for always getting at least two selections correct on one or more tickets! More generally, the field of Combinatorial Design may hold additional insights for other exotic bets.
Sunday, June 05, 2016
Proebstings Paradox - Price Is Right If Marked-To-Market
The difficulty arises because the sequence of bets appears to cost more in total bankroll percentage than the Kelly Criterion would recommend as a standalone bet at the highest odds in the sequence.
In the Todd Proebsting example, the sports trader initially bets 25% of his bankroll on a 2/1 selection with a 50% win probability. Some time later, he is offered 5/1 on the same selection and calculates his Kelly stake at 22.5% leading to a total wager of (250 + 225) = 475. The problem with this result is that the Kelly stake for a single bet at 5/1 (assuming 50% win probability) is only 40% of bankroll (400) - leading to the theoretical possibility of ruin from betting at successively more attractive odds on the same selection in a single market.
To resolve this paradox, both Ed Thorp and Aaron Brown recommend that the sports trader should "mark to market" his bankroll after the initial 2/1 bet (reducing it by 12.5% from 1000 to 875) and use this updated position to calculate the 5/1 stake. In fact, to stay within the upper limit (400) defined by the standalone 5/1 bet, the trader also needs to reduce his estimated win probability to 42.5%! Using the above example, this would lead a sports trader to bet at most 146.25 = (16.71% * 875) at 5/1 giving a total wager of (250 + 146.25) = 396.25, which is roughly equivalent to the Kelly standalone stake (400) at 5/1 but with a lower upside!
Sunday, April 03, 2016
Thinking Fast And Slow
As luck would have it, just before placing the wager, you decide to look at the connections of your selections for confirmation of the wisdom of your program choices. To your horror, you notice that a relatively unexposed colt trained by a promising, young handler with a 7/1 offering is excluded. You make an "executive decision" to overrule the program selections and immediately place a double-unit wager on the unexposed colt only to watch the program top selection romp home at 7/2.
Recognize the scenario or some variant thereof? Daniel Kahneman, in his excellent book Thinking Fast And Slow, would tell you that System 2 (careful analysis) has just been trumped by System 1 (good story)!
The key lesson here is not to berate yourself for being human but to set in place measures to protect yourself from this kind of ambush. Focus your expert handicapping skills into determining the key factors not fully reflected in the starting prices and then use some level of automation to rate and rank your selections. Avoid looking for a good, coherent story and no fine-tuning!
Tuesday, March 15, 2016
Dosage Late-Speed And Novice Hurdler Championship Race
- Dosage Comparison With Former Winners;
- Past Performance Indicators Of Late Speed; and
- Live Longshot Prices.
Note that I am not claiming any great insights and will be pleasantly surprised with a positive result.
Monday, February 29, 2016
Handicapping: Benford's Law, Shannon Entropy, And Twenty Questions

This toy example is not as unrealistic as you might expect at first glance. Look at the very good approximation by Benford's Law of Starting Price position for roughly 20,000 GB flat races 2004-2013 inclusive.

Then, in simplest terms, the inherent uncertainty of the Benford Law Stakes race outcome is best represented by Shannon's Entropy: H(x) = -SUM((x)*log(x)) = 2.87, which number also suggests (under optimal conditions) the minimum number of yes/no questions (i.e. 3) the handicapper should ask himself to identify a potential winner. Taking our lead from Shannon-Fano Coding, we should iteratively divide the entrants into two approximately equal groups of win probabilities (i.e. 50%) and use Pairwise Comparison to eliminate the non-contenders using at most five questions.
Once again, this restriction is not as unrealistic as it might first appear. Slovic And Corrigan (1973), in a study of expert handicappers, found that with only five items of information the handicappers' confidence was well calibrated with their accuracy but that they became overconfident as additional information was received. This finding was confirmed in a follow-up study by Tsai et al (2008).
Wednesday, November 11, 2015
Time-Value Vs Expected-Value Or Likely Poverty Vs Average Riches


Monday, September 21, 2015
Ensemble Averages Vs Time Averages
Evaluating Gambles Using Dynamics outlines his idea of time averaging in contrast to ensemble averaging in a discussion of the St Petersburg Paradox. Note the parallels with volatility drag, median outcomes, and expected values!
Sunday, August 16, 2015
Doubling Rate Entropy And Kullback-Leibler Divergence
Long-Term Profit = Clairvoyance – Race Uncertainty – Market Divergence.
My experience is that most handicappers focus too much on trying to become clairvoyant and not enough on selecting open races and factoring in the “wisdom of crowds”.
Friday, June 05, 2015
Euro-Style Handicapping
As a long-distance, handicapper of Euro races, I look for events in England, France, and Ireland (Grade 1 Horse-Racing Countries) with a high degree of chaos (I have entropy scores for all race-types). From this initial list, I have selected seven race-types that I focus on exclusively. Because of the high level of uncertainty in these races, the market (wisdom of crowds) is a less successful predictor. With a Bayesian-based, Elo-Class algorithm, I generate my own performance figures. Using the performance figures for all entrants in a chosen race, I run a Monte-Carlo simulation of 1000 races that automatically generates a realistic odds-line as final output. Critically, as long as there is at least one overlay (almost always) in the chosen race, I finally run a Haigh-like, Kelly-variant algorithm that selects the final contenders. (As I have stated elsewhere, this list may include both overlays and underlays).
Monday, March 09, 2015
Cheltenham Supreme Novices Hurdle (G1)
Though not my discipline (National Hunt Racing), Cheltenham's four-day graded stakes meeting is an exception. Effectively, this is the Breeders Cup of jumps racing. Lacking the in-depth expertise required to handicap this form of thoroughbred racing, I seek to focus on novice races where historical knowledge is as informative as current form. To that end, the Supreme Novices Hurdle has many similarities to the Kentucky Derby - young horses, many attempting a graded stakes, championship race for the first-time with little form in the book. Using dosage analysis of the "in-the-money" finishers over the last ten years and ranking the current field against this metric to identify "live" outsiders, shortlisted two interesting contenders - Shaneshill (14/1) and Tell Us More (25/1). Though not necessarily the most likely winners, these two selections are the best matched to previous contenders on dosage and, therefore, worthy of punting in both the win and show markets!
Friday, October 24, 2014
Analytical (Kelly) And Numerical (Solver) Solutions
Many sports trading problems yield to both an analytical and a numerical solution.
In the above example, the numerical solution (using Solver in Excel) to minimizing the difference between expected value and volatility drag over a sequence of similar bets equals the analytical solution (using Kelly) for the same sequence!
Thursday, October 16, 2014
Equivalent Single Bet
With multiple bets (illustration only) in a single win market, what is the equivalent single bet that best summarizes the overall position?
As the worst win-loss outcomes are to either win only the minimum profit or lose the total stake, then the most informative summary position is a combination of both scenarios.