12.10.2024|Dave White
This paper introduces distribution markets, a new kind of prediction market for events whose outcomes aren't just "yes" or "no" but could be any number.
Instead of betting on a particular outcome or range, traders can express how likely they think each different possibility is across the whole infinite range of outcomes.
Take the question "When will GPT-5 be released?" In a traditional prediction market, traders might have to choose between preset options like "Q2 2025" or "2026."
Prediction competitions like Metaculus instead let traders express their predictions as a curves showing exactly how likely they think each possible date is. However, Metaculus is a competition, not a market, and has no way for participants to trade on their beliefs and put skin in the game by trading on their beliefs.
Like Metaculus, Distribution Markets allow participants to come to consensus on the full probability distribution over all outcomes, but like traditional prediction markets they allow traders to profit by moving the shared view in the right direction.
The underlying mechanism is a constant function AMM over functions, with the invariant (analogous to Uniswap's xy=k) being a constant
Prediction markets have entered popular consciousness in the wake of the 2024 US Presidential elections, but the technology is likely still in its infancy. Further development could be of benefit both to developers and to the public at large.
More specifically, today's prediction markets generally allow participants to express probability distributions over discrete outcomes, but many questions of relevance to the real world involve continuous outcomes.
It's true that a perp market could elicit the expected value of a continuous variable from the market, but sometimes we would like to know more -- for example, do we know for sure a given project will take 10 years exactly, or could it perhaps be anywhere between 2 and 20?
This section is provided as an aid to understanding the continuous case, which is the main contribution of this paper.
We already have many options for prediction market AMMs in the discrete case, where the outcome is one of a finite set of options. So, the discrete case distribution market mechanism presented in this section isn't particularly useful.
However, the math in the discrete case mirrors the continuous case exactly, so this section may be helpful as an aid to intuition.
Consider some event with
We create outcome tokens
We can at any time mint or redeem a full set of these tokens for $1, because at expiry exactly one token will be worth $1 and the rest worth $0.
We’re going to create our AMM as a standard Constant Function Market Maker (CFMM), of which the most famous example is probably Uniswap, which has constant function
We initialize the AMM with
We denote the AMM’s vector of holdings
so that the
We define our AMM’s constant function to be
Since
which means the AMM's holdings are a a translated hypersphere with radius
Note that the minimum coordinate of this translated hypersphere along any dimension is
Say the true probability distribution of
If we assume the market is efficient, arbitrageurs will act to maximize the expected value of their own holdings
Since traders can’t affect
By the Cauchy-Schwarz inequality, the vector that maximizes this dot product given the fixed norm must be linearly dependent with
This means we must have
In other words,
By our definition above, the AMM’s holdings
which has the nice side effect that we can read the market’s estimated distribution directly from the AMM’s reserves.
In this way, the distribution market is an example of a market scoring rule.
The continuous case is the main contribution of this paper, because it unlocks a new behavior: prediction-market-like trading over continuous probability distributions.
The mechanism follows nearly identical logic to the discrete case, but with some additional constraints to ensure the market stays solvent.
This is a general construction, but by specializing the types of distribution allowed — to, say, uniform or Gaussian distributions — the resulting AMM can be made computationally efficient to run on, for example, Ethereum mainnet. We discuss this in more detail below.
Consider some event with outcomes over a continuous space, say
We can imagine creating one outcome token
The simplest way to express holdings of these tokens is as functions
Formally, all positions in the context of continuous prediction markets are functions. Depending on context, it may be most helpful to think of these functions either as infinite collections of outcome tokens, as curves, or just as abstract members of function-space.
Consider the constant function
As in the discrete case, we will initialize our AMM with a fixed amount of dollars,
We denote the AMM’s holding outcome function
The AMM can then sell up to
If traders have, in aggregate, bought outcome function f(x) from the AMM…
...the AMM’s holdings h(x) are b-f(x).
We restrict
Just as in the discrete case, we will choose a constant
Note that we’ve limited the
At its simplest, the AMM starts by holding some function
The gray curve is the starting f(x), the position initially held in aggregate by all traders and a scalar multiple of the market's starting estimate of the true distribution. The blue curve is g(x), the scakar multiple of the distribution the trader is moving the market to that leads to the appropriate l_2 norm. The green and red curve is g(x)-f(x), representing the trader's position after the trade. They will make money if the outcome is in the green region and lose money in the red region. We can see they have shifted the mean down slightly and increased the variance, so that they in general make money if the outcome is outside the peaked area of the original f(x).
More formally, say the true probability distribution of the outcome in question is described by a probability density function
where the last equality comes from our definition of inner product on this space.
If the market is efficient, arbitrageurs will act to minimize the AMM’s expected value. In other words, they are solving the optimization problem
Since the trader can’t affect
For a moment, assume the AMM has effectively infinite backing, so that the second constraint doesn’t matter and we simply have
Then, just as in the discrete case, the Cauchy-Schwarz inequality tells us that the vector that maximizes this dot product given a fixed norm must be linearly dependent with
In other words,
This means that the AMM’s holdings
and we can again read traders' aggregate estimated distribution directly from the AMM’s reserves.
If the true distribution p(x) looks like this…
In an efficient market, traders’ holdings f(x) will, in aggregate, be shaped proportionally…
…and the AMM’s holdings h(x) will be the mirror image
We assumed above for convenience that the AMM’s backing
When there are backing constraints, we have two options:
The first, and simplest, is to simply not permit traders to move
Alternately, we can simply enforce the constraint that
for whatever
The AMM allows permissionless adding of liquidity, with liquidity providers (LPs) receiving fungible LP shares, just like in Uniswap V2.
Imagine for a moment we had a uniswap V2 pool containing 10,000 USDC and 1 ETH, with 10,000 LP shares outstanding. If a market participant wanted to add liquidity to this pool, they would have to add tokens that are a scalar multiple of the AMM's position, and would get a proportional share of the pool in return. So, for example, if a new liquidity provider were to double the liquidity in the pool, they could add 10,000 USDC and 1 ETH, and would receive 10,000 LP shares.
Liquidity provision works just the same for the distribution AMM. A prospective liquidity provider needs to add assets proportional to the AMM's current position, and receives LP shares in return.
The AMM's position is
In order to create the position
The initial source of collateral to the AMM is the the first LP. If they initialize the pool with backing
Similarly, when a new LP adds liquidity, they are adding
Finally, let's assume the system is fully collateralized when a trade takes place, and that the AMM's position and the position of all traders sums to
However, in order to say the trader holds
The normal distribution is in some ways the canonical example of a continuous probability distribution.
In this section, we discuss how we can use distribution markets to create a prediction market over normal outcomes in an efficient manner onchain.
The Normal distribution with mean
with
where the latter equality comes from the closed form solution to the Gaussian Integral.
We can see that the
However, distributions with lower variance have higher
and therefore
In other words, the more peaked a trader's proposed distribution is, the less total probability mass the market will be willing to sell to them. Again, this might help the market avoid getting wiped out by traders with inside information about a specific outcome.
Remember we have
Since we can never have
so that
As discussed in the general continuous case section above, we can simply restrict traders in the AMM from choosing standard deviations less than this.
Alternately, we could allow traders to trade a capped Gaussian with any standard deviation they like, so that we would have
We can cap the trader's payout to b...
...so that the AMM's payout is 0 at minimum.
For the remainder of this paper, however, we'll just assume we're enforcing the lower bound on
As discussed above in the collateralization section for the general continuous case, traders need to collateralize their trades with
Unfortunately, there is no apparent closed-form solution to
Note that we could have a single distribution AMM capable of trading multiple distributions -- as long as the
This is relatively straightforward in the case of Normal -> Uniform and Uniform -> Normal distributions. We leave the calculations as an exercise to the reader.
We hope Distribution Markets help to spark ideas for builders and researchers on the cutting edge of information finance.
If that's you, we'd love to hear from you.
Dan Robinson, Yang You, Achal Srinivasan, Bhargav Annem, 5/9, Sofiane Larbi, Ciamac Moallemi, Tom Dean, andnasnd, 0xTomoyo, Pia Park, Qiaochu Yuan, Connor Lurring, Grant Stenger, Santiago Lisa
Copyright © 2024 Paradigm Operations LP All rights reserved. “Paradigm” is a trademark, and the triangular mobius symbol is a registered trademark of Paradigm Operations LP