scroll
One Million Digits of π

π behaves like a fair ten-sided die.

A million digits, ten outcomes, two chi-squared tests, and one famous misbehavior at decimal place 762.

edt_01 · The hook

01π is suspiciously well-behaved

Across the first 1,000,001 digits of π, every digit 0–9 appears between 99,548 and 100,359 times. The expected count is 100,000.1. A chi-squared test against the uniform distribution returns p = 0.7874 — meaning a fair ten-sided die would have produced a less even result about 79% of the time. The most-frequent digit is 5; the least-frequent is 6; the gap between them is 811.

1,000,001 decimal digits in this dataset (the leading 3 plus the first million decimals).
p = 0.7874 Chi-squared test against uniform digits (df = 9). A fair die would do worse 79% of the time.
p = 0.6172 Same test on the 100-cell length-2 transition matrix (df = 99). Pairwise independence stands.

These are the digits of a number we know exactly. π is provably irrational (Lambert, 1768) and provably transcendental (Lindemann, 1882). Its expansion is fixed for all eternity, computable to the 100-trillionth place if you have time and CPUs. And yet the first million of those digits look exactly like the output of a perfectly fair random number generator.

That tension — total determinism, total apparent randomness — is what mathematicians call the normality conjecture: the unproved claim that every digit, every pair, every triple, every length-k pattern in π appears with frequency 10k in the limit. No one has proven π is normal. No one has proven any naturally occurring constant is normal in any base. We just keep counting digits and finding nothing.

Per-digit frequency with 95% binomial confidence intervals. Y-axis zoomed to ±0.15pp around 10%. Every CI overlaps the dashed reference line.
edt_02 · Self-transitions

02Does a digit avoid itself?

One way to look for hidden structure is to ask: given that we just saw digit i, how often is the next digit also i? If π behaves like fair dice, this “self-transition rate” should be 10% for every digit. The largest is 5 → 5 at 10.195%; the smallest is 7 → 7 at 9.821%. Every diagonal entry sits within ±0.20 percentage points of the 10% target — the binomial 95% CI half-width for ~100k trials is ±0.19pp.

P(ii) for each digit. Bars centered at 10%; the visible range is [9.7%, 10.3%]. No digit prefers or avoids itself meaningfully.

The original TidyTuesday hero figure shows exactly this: ten gauge dials, one per digit, all hovering around 10%. Stare at it as long as you like — there's no sticky digit, no avoidant digit, no detectable autocorrelation at lag 1.

The original TidyTuesday hero figure: ten gauge charts showing self-transition rates for each digit 0-9.
Reference: Manasseh Oduor's original TidyTuesday submission (2026 week 12).
edt_03 · The 10×10 matrix

03100 numbers, all the same

Push the test one step further. Compute every conditional probability P(next = j | current = i) — a 10×10 matrix of 100 numbers. The full chi-squared on length-2 strings: 94.21 with 99 degrees of freedom, p = 0.617. Critical value at α = 0.05 is 123.23. Once again: the data fails to reject independence with room to spare.

The single most-frequent two-digit transition is "94" with 10,239 occurrences (1.0239%). The least-frequent is "12" at 9,721 (0.9721%). The full range across all 100 transitions is just 518 counts. As a heatmap, the matrix is visually a flat sheet of pale color. That's the result.

Conditional probability P(next = j | current = i) in percent. Diverging color centered at 10.00%; the most extreme deviation is ~0.25pp. Hover for exact values.

Most quantitative findings are interesting because they are uneven. This one is interesting because it isn't. We tested for hidden structure twice and found two beautifully featureless surfaces.

edt_04 · The turn

04The Feynman point — π's one famous misbehavior

Now look at runs — consecutive identical digits. The first three-in-a-row in π is "111" at decimal place 153. The first four-in-a-row is "9999". The first five-in-a-row is "99999". The first six-in-a-row is "999999". All three records arrive in one place — decimal place 762 — and the digit setting them is 9. This is the famous Feynman point.

decimal place 762 Where six 9's first appear in a row — the Feynman point.
A library page with one row of digits zoomed in showing the sequence 999999 highlighted.
π's printed digits, with the six-9 cluster catching the lamp light.

Six 9's appearing this early would happen for a uniformly random sequence about 0.08% of the time. The earliest written reference to the joke is Douglas Hofstadter's Metamagical Themas (1985); the popular attribution to Richard Feynman has never been confirmed in his memoirs or by his biographer.

The Feynman point is also the only place in the first million digits where π looks weird. Digit 3 catches up later — seven 3's in a row starting at decimal 710,100 — but by then the dataset has long since proven the rule that π is, in every other respect, extremely well-behaved.

Longest run of each digit in the first million decimals, with where it starts. Digit 9 owns the famous Feynman-point run; digit 3 owns the longest run overall.
edt_05 · The absence

05The substring that isn't here

If π contains every possible string somewhere — and most mathematicians believe it does — what about "0123456789"? In the first million digits the answer is: it's not there. Neither is "9876543210", "123456", "123456789", or even "000000". But we do find "42" at decimal 92, "666" at 2,440, "0000" at 13,390, "1234" at 13,807, the leading digits of e (271828) at 33,789, and π reading itself ("314159") at 176,451.

The textbook answer for "0123456789" is decimal place 17,387,594,880 — about 17.4 billion in. That's roughly 17,400 times deeper than this dataset. So the absence isn't a failure of π; it's a feature of one million — long enough for cute coincidences, far too short for orderly long strings.

Where in π?

This searches a 5,000-decimal window of π (inlined for speed). The famous-string table below is the authoritative answer for the full million.
StringWhat it isFirst decimal place
42the answer to everything92
666"the beast"2,440
999999the Feynman point762
1234four ascending13,807
0000four zeros13,390
271828first 6 digits of e33,789
314159π reading itself176,451
888888six 8s222,299
000000six 0snot in first 1M
123456six ascendingnot in first 1M
9876543210ten descendingnot in first 1M
123456789nine ascendingnot in first 1M
0123456789ten ascendingfirst appears at digit 17,387,594,880 (≈17.4 billion)

The "your birthday is in π" parlor trick works ~63% of the time at this depth: 631,548 of all 1,000,000 possible 6-digit strings actually appear in the first million decimals. Length-5 coverage is 99.99% (8 missing out of 100,000). Length-7 coverage drops to 9.5%. The boundary at length 6 is exactly where probability theory predicts.

Fraction of all length-k strings present somewhere in the first million decimals. Observed values track expected probabilities almost exactly.
edt_06 · Convergence

06How fast does π converge?

At n = 10 digits, three of the ten possible digits haven't appeared at all and three others appear twice their share. By n = 100 the spread is ±4 percentage points. By n = 10,000 it's about ±0.5pp. By n = 1,000,000 every digit's running rate sits within ±0.05pp of 10%. This is exactly the 1/√n behavior the Central Limit Theorem promises.

Running per-digit percentage at log-spaced sample sizes n = 10 → 1,000,000. Ten thin lines collapsing onto the 10% horizontal.

Define "settled" as the smallest n at which a digit's running percentage stays inside [9.9%, 10.1%] forever. Digit 0 settles first, at n = 25,991. The slowest is digit 7, at n = 368,515. By the time we hit 369k digits, all ten have permanently joined the ±0.1pp club.

A 10-second ambient marimba bed — pi's digits as a calm, uniformly distributed soundscape. Optional.
edt_07 · The close

07What a million digits can and cannot tell us

NASA's JPL navigates spacecraft using 15 digits of π. Forty digits would compute the circumference of the observable universe to within the width of a hydrogen atom. We have a million.

A 1970s mainframe printer extruding a long strip of paper densely covered in computed decimal digits.
Decimal digits of π, computed and printed — a million is far more than anyone needs, and not nearly enough to settle the open question.

What that million can do: confirm uniformity at p ≈ 0.79, confirm pairwise independence at p ≈ 0.62, surface every length-5 substring (99.99% coverage), find the Feynman point at decimal 762, and watch each digit's running frequency converge to 10% on a 1/√n schedule. What it cannot do: prove π is normal, find runs of seven or more identical digits other than the one stray "3333333", or fit "0123456789".

π is a number we know exactly. Its digits behave as if we don't. That's the whole story — and the unsolved problem at the center of it is older than any of us, and probably will outlive most of what we'll ever write.
Content
Assets