Worldly success, measured by the accumulation of money, is no doubt a very
dazzling thing; and all men are naturally more or less the admirers of worldly
success.
We see the pattern looks similar (and if one is a young woman the the probability of winning Miss America in one's state is better than being the richest person in that same state).
For example, we see with the gold theoretical model line,
the same unmistakable path of how there are many states (particularly the less
populated ones) which fail to get a high number of Miss America winners in the
pageant’s history. Among the five most winning states, include the very-well populated states of: New York, Illinois, and Ohio. And we can see the flip-side of understanding how 2 of the most populated states not pulling their weight recently. Neither California nor Texas has won in more than 30 years- if we assumed just a 1/50 (2%) probability for each state winning per year, then this probability of winning at least once over this time is nearly 70%.
Returning to the illustration, these gold dashed lines are similar to the grey curved ranges in the top illustration above. And instead in the probability model chart further above, for the richest person per state, we notice what may be oddities, colored in blue on the scatter plot. As noted earlier, we show the more populated states tend to have that slim chance of seeing even more extreme outcomes.
While the Scottish reformist,
Samuel Smiles, authored this self-help quote in the mid-19th
century, the subject of the overemphasis society places on having excess money
has been around for millenniums. Jesus often
refers to it in the Bible. To this day,
we are uniquely interested in who makes up the extraordinarily wealthy in our society. Now, we’re not just referring to the popular
expression “top 1%”, but rather “the top 1% of the top 1%” (or the top 0.01%). A century ago, an American would need a
wealth of a million dollars to achieve this status. While inflation has grown by a multiple of mid-single
digits since then, the wealth threshold required to be in the top 0.01% has grown by a multiple of a couple hundred.
Of course this
narrow sliver of the top 1% still accounts for tens of thousands of people, but to put
further perspective on this, for an American to be included in the even more
exclusive Forbes 400, a wealth of nearly two billion dollars is needed. Looking at this list of Americans, we see (or
want to assume that there are) clusters of people residing in geographic
locations where it is more expensive and more exciting to live. For example, southern New York and northern California. And quite humbly, some of these billionaires take the time to read this statistics web log.
Though people
have their own geographic preferences as well; some of the country's uber-wealthy prefer to live in the
serenity of less-trafficked locations.
Some of these states are those that Americans only see by air; they are
sometimes referred to in a derogatory sense as flyover states.
What if we wanted to sharpen our saw by only focusing our attention on the richest person in each geography, curious (as Samuel Smiles suggested we might) about who makes up the well-to-do and wealthiest in each of the 50 states? This article goes about this analysis. Creating a mathematical model to learn
from the aristocrats, and provide insights for those who want to enjoy the
same monetary comfort of the top-end of uber-wealth. We see here that the more populated
states have typically higher average wealth, though there is a probability
advantage we mistakenly discount in less populated states to being able to be become
the very wealthy.
Take a look at
this recent list,
which identifies the richest person in each state. Sure it’s fun to see familiar names, and
to hypothesize about what patterns might exist in the data. It also seems intuitive that larger
populations would have a greater opportunity for the richest person to be further
along the extreme end of the wealth distribution. But how do we mathematically prove that?
Probability models can help identify how this works. We start this article with some mathematical theory, in how we can represent the very tail-end of a probability distribution. This is a unique portion of the distribution that we rarely need to model other than to explore the freakish curiosity of some event, such as modeling the wealth of a state’s richest person.
Probability models can help identify how this works. We start this article with some mathematical theory, in how we can represent the very tail-end of a probability distribution. This is a unique portion of the distribution that we rarely need to model other than to explore the freakish curiosity of some event, such as modeling the wealth of a state’s richest person.
Naturally we know that in each state, all else equal, there
will be a mound-shaped probability distribution of wealth among people. Bounded by zero on the low-end, and perhaps right
skewed to a limit on the high-end. Also,
more populated states would have –minimally- their typical value a little
higher versus the typical value from less populated states. Maybe the entire distribution fits something
similar to a lognormal model, as does many economic time series data. Except we know there are upper limits
on how much wealth one can accrue in a limited lifetime. And with a large enough standard deviation
away from the middle of the distribution, the distribution would have a wider
than normal tail distribution, in order to better allow for more extreme outcomes.
These are just some initial starting points to think broadly about the overall parameters that shape the basic distribution. Also note that the excess kurtosis (degree of fat-tails) associated with a lognormal distribution can be shown through complex moment functions:
These are just some initial starting points to think broadly about the overall parameters that shape the basic distribution. Also note that the excess kurtosis (degree of fat-tails) associated with a lognormal distribution can be shown through complex moment functions:
Focusing our attention now on just the extreme portion of the
distribution that we are interested in, we also know that we might be better
served there by using a more exotic distribution. Such as a power function or a
Pareto or a variation of the Weibull, etc.
Not all of these would have easily identifiable excess kurtosis. And it turns out that it doesn’t matter the actual distribution upfront here.
Because given that there are millions of people in each state (and tens of
millions in larger states), our only focus is to assess the patterns we observe
at the very tip-end of the distribution.
See the illustration below, as well as in other applications we can
linearly approximate (see here
for example).
Take a moment to appreciate the range of combinations
(curves shown in grey) that can be valid for this
tail distribution we are modeling. The
illustration on the right, which we will ultimately not need, shows what we call
the conditional or expected distribution at the extreme tails (so that the probability model sums to 100% or 1). We don’t need it in this case since we are
only focused on modeling the probability statistics with the maximum value
only.
Let’s also take a chance to note some of required characteristics, which can help narrow the types of extreme value models we can use. Of course we would want the slope of the
distribution function to continue to be negative as we move towards the infinite
end of the tail distribution. Concurrently
we would want the second derivative (or the change in the slope itself) to be
positive, so that the curve takes the convex form that we generally see in the
illustration above.
This latter point is important to be able to isolate a
non-parametric curve that best fits the tail end of the distribution. It is also worth solving before we move
ahead, via the probability calculus below, that for the popular normal
distribution that implies our tail event is within the other 16% (per tail)
probability of occurrence. This of course means a greater than one standard deviation distance away from the average, which we
clearly will satisfy for these extremely wealthy individuals. Note that even for the normal distribution,
the tails of which follow an exponential path as can be seen in the f(x)
formula below. And rules of thumbs still
generally dominate the academic literature in working with the expected maximum
value depending on sample size.
Here is the calculus formulae to understand when the second
derivative [f’’(x)] turns positive.
For simplicity we assume the distribution is symmetrically centered
about zero, and we could translate to another value for expanded analysis, if we
needed.
So for the last equation above, an
equation representing the second derivative of variable x, it can only be zero
when x=σ, or equal to a standard deviation that cuts off 16% of the distribution in each tail. Since a lognormal distribution –related to a
normal random variable x- is equal to ex, we can combine that with our undertanding of the expanding scale of the lognormal distribution at the upper end of it (e.g., the distance from the first quartile to the median is less then the distance from the third quartile to the median). So this means
that the positive convexity of a lognormal distribution is attained at just less than the mean-adjusted value of eσ.
Now using the probability mathematics just developed, let’s now
look at our theoretical probability distribution associated with the highest
wealth in each state. Note that on
the vertical axis we are capturing one minus the cumulative probability
distribution functions (cdf), and not the probability distribution function (pdf)
itself. Each actual state value is in red
(except for two in blue). And the pdf that is shown in the illustration
above maps directly to these inverse cdfs in the chart below, and hence they
are unnecessary for our proof of what is the model of the maximum wealth event
along the probability distribution.
Again we amalgamated our 50 richest list, by taking the
richest person in each state. This
technique does not mean that everyone would be a billionaire though, and 10 of these
richest people on the list are not. However, we
can see something on the far right of the illustration, at the greatest wealth
of about 80 billion dollars. Bill Gates is
not only the richest person for this one data point represnting Washington state, but he
is therefore also richer than the richest people in all the other states. The maximum of the maximums means he is also the
richest person in the United States.
This logic is also how the Miss America Pageant competition works. In fact we will strengthen our case here, by showing
below the past nearly century-worth of state winners of the Miss America scholarship
competition. Albeit this is a subjective competition, it is a clear data set with a rich history.
Returning to the illustration, these gold dashed lines are similar to the grey curved ranges in the top illustration above. And instead in the probability model chart further above, for the richest person per state, we notice what may be oddities, colored in blue on the scatter plot. As noted earlier, we show the more populated states tend to have that slim chance of seeing even more extreme outcomes.
What’s important here to understand is we are able to
properly articulate the maximum expected wealth that one would normally see,
based solely as a function of the state’s population size. This is of statistical importance and we see
this against the richest people per state on the chart below. People can -of course- relocate once they
achieve their wealth, though for many of the important examples in this list
this didn’t happen and also the business domicile address was used to add
additional stability to the results. We
might have expected for example, that many billionaires would relocate to
Hawaii or Florida, but that wasn’t much true. We
should note that most of the people on this list (35 of 50) were
first-generation entrepreneurs who started enterprises in those states in order
to create their wealth. America's most rich people have comfortably held their relative ranks over many years and have expanded their interests in life to beyond wealth creation. And as with any
aristocracy, in the list we see familiar names; people who have been glamourized corporate
magnates and leading thought leaders. This is likely true throughout the list
and not just at the top of the chart below.
Let’s look now at the pattern in the actual top wealth, which
similar to the probability model further above is shown in red.
There are a couple noticeable trend clusters in relating the state’s
population size to the wealth of the richest person in that state. The core trend moves along (though less than)
the theoretical probability model computed and shown in gold. There is also
a smaller cluster that runs parallel to the theoretical model (though at higher
values). And we highlight a couple
interesting individual state people in blue.
We are able to measure the fit of the actual wealth
to the theoretical wealth of the richest person per state, based solely on the
opportunities presented by the state’s population size. And from this exercise we see that half
of actual wealth for these richest can be explained by the state’s population
size. People such as Larry Ellison and
Michael Dell were able to set-up their technology businesses anywhere, and the
warmer climates that they enjoy should have been reflected in a higher than
theoretical level of wealth. But instead
it is less. So there is something here
to our mathematical connection between maximum wealth, and the size of the
underlying population. Though we see some finite upper bounds in the most populated
and well desired states.
We also see that there are a number of “anomalies” as well. The most striking are Warren Buffett (consistently among the country's top few in wealth) and
David Koch, revealed in the chart above though their blue
data. Of what can’t be explained by the
state’s population, a third is attributed to the hyper success of these two
powerful individuals. They are also clustered
together in two small neighboring states in the middle of the agricultural Heartland. The flyovers, which probably shouldn’t be
flyovers, of Nebraska and Kansas. This
is a reflection that these less populated states, away from the mega crowds of New
York and California, have a greater than theoretically expected wealth residing
in their richest people. For these guys specifically,
they are worth seven times more than would be expected just on population size
alone (and there wealth rivals those from the most populated states, but with a
lot less competition to get there).
These blue data are not outliers, but a passive
reminder teaching us something important: that extreme wealth is sometimes where one doesn’t
look.
Also, we also wouldn’t expect in those states to see others
of similar high wealth along that most extreme level of the distribution. We use the reference of ultra-high net worth
(UHNW) for those individuals with a net worth in an excess of 30 million
dollars. There are hundreds of UHNW
people in Nebraska and Kansas, though judging based soley on the extreme wealth of Buffett
and Koch using the left chart below, we’d expect many thousands assuming their
extreme wealth implied a proportional wealth level across their domiciled
states. Instead, as we see on the right
chart below, the two blue states have UHNW
levels instead much more proportional to their state’s population. In other words, outside of the most extreme
wealth found in each of the less populated states, the rest of the population’s
wealth very quickly falls back into the trend line.
For completeness sake, we recast the right chart above, through
the lens of the familiar bar chart style.
We can enjoy the patterns in the number of actual versus theoretical
UHNW per state. This includes what we
see in Nebraska and Kansas, where those states no longer stand out as much (in
fact few states actually stand out anymore using this metric). What we see suggesting a definite small wealth
enhancement chance for those living in the most populated states, but that’s it.
There are two countertrends in this data, from the hedge fund capital of
Connecticut (notice the chief of Bridgewater Associates doing well on the chart further above), a small state just north east of New York, and from Wyoming where
the UHNW data has been biased by a large influx of wealthy people migrating
there for particular state tax breaks. The richest person in Wyoming is not biased for this reason however since the source of business wealth was used more accurately in that source survey.
There are proud film song words: “if I can make it there, I'm gonna make it anywhere”.
Obviously this idea is in reference to the city of New York, with an island jewel at the center and a house for half of the state’s population.
While this is a feel-good theme if you live there, the comments are not
necessarily proven. Though New York is,
by all accounts, a cut-throat corporate jungle making it a competitive training
ground for anyone. If that’s not clear enough,
spending time there will sap your money quickly and prove it. The state’s recent, infamous gubernatorial
candidate reminds us of this situation with a political party name that speaks
for itself. The Rent is Too Damn High. Still, a disproportionate number of people (including very bright and inevitably successful people) from around the world flock towards New York, and also spots such as San Jose and San
Francisco, for a shot at making it very rich and spectacularly famous.
But if the thought of “making it” is simply to do well relative to
your most successful societal peers, then one would be probabilistically better off working in
a less populated state. This was the
logic Bill Gates himself used to start Microsoft, away from the action (and drama)
of New York.
The most famous physicist of the 20th century
quipped with disdain the value society places on pursuing excess money, for
any sake including money itself. Albert Einstein offered
this rhetorical question below.
The example of great
and pure characters is the only thing that can produce fine ideas and noble
deeds … Can anyone imagine Moses, Jesus, or Gandhi armed with the money-bags of
Carnegie?
Sympathetically the fascination about the lives, of the richest
among us, isn’t usually the most important or productive thing to focus on. But we can learn a little about the science of extreme
statistics, and what it says about our ability to simply employ in our life both reasonable chance
and success.






No comments:
Post a Comment