arXiv blog

Benford's Law And A Theory of Everything

A new relationship between Benford's Law and the statistics of fundamental physics may hint at a deeper theory of everything

kfc 05/07/2010

  • 57 Comments

In 1938, the physicist Frank Benford made an extraordinary discovery about numbers. He found that in many lists of numbers drawn from real data, the leading digit is far more likely to be a 1 than a 9. In fact, the distribution of first digits follows a logarithmic law. So the first digit is likely to be 1 about 30 per cent of time while the number 9 appears only five per cent of the time.

That's an unsettling and counterintuitive discovery. Why aren't numbers evenly distributed in such lists? One answer is that if numbers have this type of distribution then it must be scale invariant. So switching a data set measured in inches to one measured in centimetres should not change the distribution. If that's the case, then the only form such a distribution can take is logarithmic.

But while this is a powerful argument, it does nothing to explan the existence of the distribution in the first place.

Then there is the fact that Benford Law seems to apply only to certain types of data. Physicists have found that it crops up in an amazing variety of data sets. Here are just a few: the areas of lakes, the lengths of rivers, the physical constants, stock market indices, file sizes in a personal computer and so on.

However, there are many data sets that do not follow Benford's law, such as lottery and telephone numbers.

What's the difference between these data sets that makes Benford's law apply or not? It's hard to escape the feeling that something deeper must be going on.

Today, Lijing Shao and Bo-Qiang Ma at Peking University in China provide a new insight into the nature of Benford's law. They examine how Benford's law applies to three kinds of statistical distributions widely used in physics.

These are: the Boltzmann-Gibbs distribution which is a probability measure used to describe the distribution of the states of a system; the Fermi-Dirac distribution which is a measure of the energies of single particles that obey the Pauli exclusion principle (ie fermions); and finally the Bose-Einstein distribution, a measure of the energies of single particles that do not obey the Pauli exclusion principle (ie bosons).

Lijing and Bo-Qiang say that the Boltzmann-Gibbs and Fermi-Dirac distributions distributions both fluctuate in a periodic manner around the Benford distribution with respect to the temperature of the system. The Bose Einstein distribution, on the other hand, conforms to benford's Law exactly whatever the temperature is.

What to make of this discovery? Lijing and Bo-Qiang say that logarithmic distributions are a general feature of statistical physics and so "might be a more fundamental principle behind the complexity of the nature".

That's an intriguing idea. Could it be that Benford's law hints at some kind underlying theory that governs the nature of many physical systems? Perhaps.

But what then of data sets that do not conform to Benford's law? Any decent explanation will need to explain why some data sets follow the law and others don't and it seems that Lijing and Bo-Qiang are as far as ever from this.

Ref: arxiv.org/abs/1005.0660: The Significant Digit Law In Statistical Physics

TRSF: Read the Best New Science Fiction inspired by today’s emerging technologies.

Print

Close Comments

To comment, please sign in or register

Forgot my password

lumidek2

13 Comments

  • 647 Days Ago
  • 05/07/2010

Silly

It's completely silly to assign this law with some mystery. Every good high-school student should be able to derive Benford's law.

Any quantity whose value may a priori be very different - differing by orders of magnitude - inevitably has a distribution that is quasi-uniform on the logarithmic scale.

It's pretty much equally likely for such a "universal" quantity to be between 0.1 and 1, as between 1 and 10. The multiplicative factors are just a matter of convention. That's surely the case e.g. for the price of stocks where the price of the "minimum, one stock" always depends on coincidences.

So because log(x) is distributed along a distribution, and the distribution extends by several decades, and is inevitably quasi-continuous or smooth inside, it follows that the probabilities for the first digits are fixed by the logarithmic law.

On the log axis, the period between 1 and 10 is copied many times. Log((N+1)/N) / Log(10/1) is the fraction covered by the first digit's being N. It's repeated everywhere, so it's clear that e.g. the probability of the first digit's being 4 is log(5/4) / log(10/1).

The most likely first digit is clearly 1, with probability log(2/1)/log(10/1). Intuitively, you may imagine that the numbers 12-19 are "almost as likely as "2-9" because they're just "shifted by ten" and there's not much difference between the behavior of "5" and "15" at the beginning. So "1" is more likely at the beginning than e.g. 2.

Benford's law proved above gives the exact result for the probability. Because the proof only uses elementary mathematics, we can't extract anything that would go beyond elementary mathematics out of it.

Reply

syb

32 Comments

  • 647 Days Ago
  • 05/07/2010

Re: Silly

I'm no mathmatician so help me understand this.  I interpret your explanation to mean it's a basic property of logarithmic numbers that this ditribution falls out, so there is nothing of interest here to discover and no underlying mystery.  But if that were true, it should work with any data set, not just some datasets.  It seems that if this were universal then it wouldn't require further examination, but it is selective, indicating there is some other element at work here.

Reply

johnswentworth

1 Comment

  • 647 Days Ago
  • 05/07/2010

Re: Silly

Benford's law only applies to distributions over countably infinite sets, where we cannot construct a uniform distribution. So for telephone numbers, for instance, we have finitely many possibilities so we can easily choose 10 digits uniformly. But for almost all useful distributions over the integers (which implicitly includes distributions over the reals with finite uncertainty), Benford's law applies.

Reply

socrates8

3 Comments

  • 646 Days Ago
  • 05/08/2010

real numbers

...i thought real numbers were not countably infinite?

Reply

Lanny Budd

1 Comment

  • 644 Days Ago
  • 05/10/2010

Re: Silly

I thought that the question was that if Benford's Law applies to countably infinite sets and does not apply to the reverse, then does the fit of a set of data to the Law describe the data as countably infinite or not?

If so, then what does that say about the distributions in the article? And why should countability and noncountability vary with temperature?

Reply

mcg11

1 Comment

  • 644 Days Ago
  • 05/10/2010

Re: Silly

One property that is different for EB and FD statistics is the concept of distinguishable vs indistinguishable electrons.  This might be the underlying difference of the data sets.  Possibly similar to countable or uncountable data sets or selection with or without replacement? 

Reply

shazam

1 Comment

  • 645 Days Ago
  • 05/09/2010

Re: Silly

lumidek2 say - "It's pretty much equally likely for such a "universal" quantity to be between 0.1 and 1, as between 1 and 10." Well, that's pretty much the question - by assuming that "universal quantities" are exactly those that fulfill this scale invariance, then the question is: why do so many phenomenon give rise to these sorts of (in your parlance) universal quantities? The article was pretty clear on the point. But if you simply take the widely occurring existence of scale invariant quantities for granted, then the nature of the question is not going to be understood.
Another way of looking at it: how/why do scale invariant quantities arise from physical laws?

Reply

Advertisement

dpfitz

1 Comment

  • 584 Days Ago
  • 07/09/2010

Are all the fundamental forces nothing but simple PHASE relationships?

Benford may have been on to something but I think G. Lisi and S. Wolfram are too.

Stephen Wolfram's "A New kind of Science" gives us these three (3) important facts:

1. Math can only explain simple things.
2. We need a model to explain complicated things.
3. But - a simple model can explain complicated things.

I ask myself this: Is G.Lisi's model similar to Dr. Milo Wolff's spherical, standing wave model?

While Wolff's model is only of the electron, Lisi's E8 model may be a similar model of a more complete quark-electron-gravitational setup.

If so then you must ask:

Are all the fundamental forces nothing but simple PHASE relationships?

"in phase attract"

Type those three words - above - into Google (Include quote marks) to learn not only the basics of electricity but how all the fundamental forces work.

Or click the following link that will give you the same page in Google. http://www.google.com/search?hl=en&source=hp&q=%22in+phase+attract%22&btnG=Google+Search

Not only do all electric motors obey these phase laws but this entire universe does as well.

(Click link below.)
http://www.amperefitz.com/in.phase.attract.htm

Reply

syb

32 Comments

  • 647 Days Ago
  • 05/07/2010

base 10?

what happens to the distribution of digits if you work in some system other than base ten?  If you were to look at the area of lakes dataset and convert them to hexadecimal, for instance, does this same trend exist?  Since I'm not a mathmatician, does this have any significance either way??

Reply

rsanchez1

213 Comments

  • 647 Days Ago
  • 05/07/2010

Re: base 10?

According to Wikipedia, it still holds regardless of which base is used, except that (of course) the probability distribution changes, since there are either more or less digits in different bases.

Reply

GadgetMan0101

1 Comment

  • 647 Days Ago
  • 05/07/2010

Human Nature

I'm not a mathematician, but if the sample data used to test Benford's Law are from different types of measurements, as this article suggests, then I would hypothesis that the reason for Benford's results lies in the method of how man defines a specific unit of measure.

We observe some natural event and attempt to quantify it. We identify a pattern and try to break it down into its smallest identifiably unique element and use this as our unit of measurement. So any measurements taken are more likely to be closer to whole units than not.

Combine this with using a base 10 numeric system and I would imagine that you've now stacked the deck in such a way that makes Benford's observations possible.

In this case I think Benford's Law provides more insight into how man perceives and quantifies the environment around him more than it describes nature itself.

Reply

syb

32 Comments

  • 647 Days Ago
  • 05/07/2010

Re: Human Nature

That idea may be the crux of this argument.  Is Benford's law a statistical artifact that comes from the methods we use and/or something intrinsic to math (a long way to say coincidence), or is there an underlying phenomenon in nature that we are seeing emerge when we look at the trends in these numbers.

Reply

minorgod

1 Comment

  • 647 Days Ago
  • 05/07/2010

Re: Human Nature

I was having the same thoughts. Don't suppose you've been reading Stephen Wolfram's "New Kind of Science"? Wolfram goes into extensive detail on methods of human perception and why we are particularly good at recognizing certain types of patterns.

Reply

mmv

1 Comment

  • 349 Days Ago
  • 03/01/2011

Re: Human Nature

Limits of a Boolean matrix become symetrical

Reply

Advertisement

mplockwood

1 Comment

  • 647 Days Ago
  • 05/07/2010

a plain language explanation

I can think of a common sense way of thinking about Benford's Law and why it would be true for some data sets. With lottery numbers, each number is chosen randomly so it makes sense that there would be no special distribution of first digits.

However, whenever your set of numbers measures something which has limits on its size (like the lengths of rivers) then the numbers wouldn't be random. At some size, larger numbers would become less likely, and as the numbers reach each new power of ten (10 to 100 to 1000, etc.) the chance of finding a number with a larger digit in the first position decreases at a greater rate. So this should be true regardless of the base number system.

Now, this is still interesting and may have significant implications for the quantum systems being studied. But the origin of Benford's Law isn't too mysterious, is it? Counterintuitive, maybe...

Reply

kyrgyz

1 Comment

  • 645 Days Ago
  • 05/09/2010

Re: a plain language explanation

Absolutely agree with your plain explanation. There is no sensational in this article. For example I read that students with names starting with first letters of of alphabet are more successful because they are always on top of alphabetically sorted student lists, and that's for more often tested by teachers.
Or another example is I would encounter number 5 much more often than number 1000000000.

Reply

iamnotbatman

1 Comment

  • 415 Days Ago
  • 12/25/2010

Re: a plain language explanation

I think Benford's law would say that 1 is more likely than 5, and 1.0x10^N is more likely than 5.0x10^N, etc, not that 5 is less likely than 1.0x10^N...

Reply

  • 647 Days Ago
  • 05/07/2010

Distribution of Digits

Can you help me understand? I would like to know what the distribution of digits other than 1,9. Thanks. Jim

Reply

neurator

1 Comment

  • 647 Days Ago
  • 05/07/2010

Benford's applies only to sequential, additive systems

It seems plain to me that Benford's only applies in systems that experience natural growth or loss.  Quantities of stocks go up and down.  Rivers grow or recede over time.  Computer file sizes accumulate depending on how much physical data exists. 

In other words, Benford's appears in systems where quantity is relevant.  Telephone numbers are chosen semi-randomly; the assignments are not doled out sequentially.  Lottery digits are (supposedly) completely random.  They are non-cumulative.

Perhaps someone could reveal a random data set where Benford's law applies, but I've never seen one.

Reply

socrates8

3 Comments

  • 646 Days Ago
  • 05/08/2010

Re: Benford's applies only to sequential, additive systems

so how would this apply to the digits of pi?

Reply

jbexpm

4 Comments

  • 415 Days Ago
  • 12/25/2010

Re: Benford's applies only to sequential, additive systems

See comment about PI… I've added further down comment page…

Reply

Advertisement

Michaelk

1 Comment

  • 647 Days Ago
  • 05/07/2010

Some numbers not numbers

Phone numbers and lottery number are not numbers, at least from a software/data type perspective.
One does not add multiply divide or subtract these 'numbers'. They are immutable identifying strings. Even though they contains digits, they are no more subject to arithmetic than the word 'arithmetic'.

Reply

fastartcee

3 Comments

  • 647 Days Ago
  • 05/07/2010

Not a mystery at all

Take the length of rivers as an example. Almost all lists include only rivers longer than 1000 miles. The size of continents constrains the length of rivers, so there are many between 1000 and 1999 miles in length, but none between 9000 and 9999 miles. So any list will have the most rivers in the category starting with a '1'.

I think the same rough analysis applies to many other lists  ...population of countries, for example. There would be more countries with populations between 100 million and 199 million than there would be between 900 million and 999 million.

So what's the big mystery?

Reply

syb

32 Comments

  • 647 Days Ago
  • 05/07/2010

Re: Not a mystery at all

I don't think it's that simple.  I think the most intriguing dataset that Benford's Law applies to is physical constants.  Physical constants are not numbers that are made up or assigned, they are measurements of particular physical properties that govern how the world works.  If you have a dataset of measured numbers that describe aspects of physical reality, like pi (the relationship of a circle's diameter to it's circumference) or the speed of light, and then you see a trend in their distribution, I think it's fair to wonder if there is some underlying reason for why these numbers are not randomly distributed.  To me it's mildly mind boggling to think that there may be some concept out there that explains why these numbers aren't randomly distributed, which may be an insight into a fundamental rule of how the universe works.

Reply

socrates8

3 Comments

  • 646 Days Ago
  • 05/08/2010

Re: Not a mystery at all

a circle is not an aspect of physical reality...show me a circle ...anyone...anyone?  Speed of light i agree is an aspect of physical reality..

Reply

jbexpm

4 Comments

  • 415 Days Ago
  • 12/25/2010

Re: Not a mystery at all- pi() ????

I'm not sure I appreciate what you are saying about PI.  I just found a website that claims to have the first 1 million digit of PI posted.  So I cut and pasted this number into my word doc, and found a script on line that counts the frequency of 0-9 occurring in that sequence…  The following are the results…
0-99959
1-99578
2-100026
3-100230
4-100230 (wow same as 3!)
5-100359
6- 99548
7-99800
8-99985
9-100106
First observation… all of these number are arguably evenly distributed.  But I decided to use excells trend line and found the a 6th power curve fits pretty well… you can do this yourself pretty quickly.  What is amazing to me and completely unexpected is that the curve seems to have a similar pattern whether you plot the frequencies of the first 11,001 ore the first 1,000,001 digits of pie… well. They aren't a perfect match, but certainly not logarithmic…  too bad I can't post a picture to show you… but it's easy enough to do with Matlab or excell.

Reply

DBCooper

4 Comments

  • 647 Days Ago
  • 05/07/2010

Re: Not a mystery at all

Try expressing the river lengths in centimeters. The distribution of digits will remain the same.

Reply

mkeithc

1 Comment

  • 647 Days Ago
  • 05/07/2010

Slide Rule

If you have an old slide rule around you could show the relationship quickly.  Natural growth would give values in fixed increments across the slide rule, and the number of increments that would exist between 1 and 2 is much greater than those between 9 and 10.
The same can be shown with log paper.
It's an interesting observation.  This must mean that most data we are collecting comes from sources that exhibit natural growth.  I understand that salaries are that way.  The rich get richer quicker.

Reply

Advertisement

SmokeyVW

1 Comment

  • 647 Days Ago
  • 05/07/2010

consider base 2

Q: What happens if you convert all your numbers into base 2?

A: 100% of the numbers will start with digit '1'.

Reply

xanthorp

2 Comments

  • 646 Days Ago
  • 05/08/2010

Re: consider base 2

All of the phone numbers I dial start with the numeral one.

Reply

moopoom

4 Comments

  • 647 Days Ago
  • 05/07/2010

This is not as mysterious as the author makes it seem.  The first post was spot on. 

Several people have asked when does Benford's law apply.  The answer is whenever you have a logarithmic distribution.  It doesn't have to do with our base 10 number system, or the choice of units -- if we had a base 5 number system and miles instead of kilometers, a similar law would still apply (although the probabilities would be different).

Well, it does have *something* to do with our choice of units, but more with our perception of the universe.  Clearly, if we were to pick logarithmic or exponential units in cases where Benford's law applies it would alter the distribution.  We already do this sometimes -- when measuring sound intensities, one could either use units of sound intensity, or the log of the intensity, which we refer to as the "decibel".  Decibels don't add in the same way that the intensity adds, so in some sense it's silly to use them, but they do more accurately describe the way the human ear responds to sound.

Another example is size or distance -- if you can start with a distance and double its size without doing something to the distribution, you have a logarithmic distribution.  Any random list of distances will do -- for example, distances to stars: http://en.wikipedia.org/wiki/List_of_stars_nearest_to_the_Earth
or the value of large purchases (yes, it has been used to find financial fraud).

When does something *not* scale universally?  Well, think about the distances between billiard balls on a table.  In this case, the distribution of distances between billiard balls are truncated at the low end, since the balls have finite sizes and cannot be closer together than the diameter of the balls themselves.  The distribution is also truncated at the high end, since the balls cannot be further apart than the length of the table.  So in this case there is no universal scaling factor, because doubling all the distances in the system doubles the size of the table and the size of the ball.  But if you only consider distances that are significantly larger than the size of the balls and significantly smaller than the size of the table, within that regime there may be universal scaling and it may be possible to construct a distribution where Benford's law applies.

In other words, Benford's law applies when you are far away from the granularity or size of the system.

Reply

drclue

1 Comment

  • 647 Days Ago
  • 05/07/2010

Obviously 1 is not the loneliest number

In data , we as humans for various purposes
elect to group things in scales of measurement
that often induce the appearance of 1's.

Would you want to buy 231 cubic inches of milk
or 1 gallon?

Are we selling you a 1 ton truck or a
2000 pound truck.

Is that 1 barrel of oil or 55 gallons.

Did I walk 1 mile for a camel or 5280 feet

Depending on the method, I either have 4 "1"s
or none at all.

The frequency of "1" would seem to be at least as much a product of convenience as anything else.

When something progresses beyond "1", we often have a habit of making new scales to bring the
quantity back to "1"

1 pair
1 dozen
1 case
1 truckload
1 battalion
1 set

So really , the presence of "1" is more
a product of the way the human mind works
than some big mysterious force.

Reply

jasonfromseattle

2 Comments

  • 647 Days Ago
  • 05/07/2010

Re: Obviously 1 is not the loneliest number

The logarithmic distribution is a result of scale invariance. Changing the units does not matter.

Plus Bendford's Law is the distribution of the first digit, so while we invent units to have values around 1 for convenience, that does not explain why we also simultaneously have more values around 10, 100, 1000 or 1 billion than 90, 900, 9000 or 9 billion.

Reply

nimrod

1 Comment

  • 647 Days Ago
  • 05/07/2010

This some smelly bullsh.t

Logarithmic distribution isn't mysterious, nor is its fit to integrals of some distributions with exponentials in the pdf. There is nothing fundamental going on and it's a crap piece of paper. Just because it's typed up in TeX and pretty doesn't mean sh.t

Reply

tjmerritt

1 Comment

  • 647 Days Ago
  • 05/07/2010

Distribution in base two

Benford's law is more obvious in base two where the probability is 100% that the leading digit  in a logarithmic distribution is a one.  This is commonly called the hidden bit in binary floating point systems, since there is no need to represent it.

The more interesting question is why some physical systems have distributions that are more logarithmic than others.

Reply

Advertisement

riemannzeta

10 Comments

  • 646 Days Ago
  • 05/08/2010

Finite size effect?

Seems to me that there might be some mathematical artifacts that result from the reduction of a continuous distribution (the natural phenomenon) into discrete measurements. Can Benford's Law be generalized to base e? Not that I can see. But it is applied to physical phenomena related to exponential growth.

Is there a Benford's Law for complex numbers? I would guess that proving that one way or the other would give an answer as to what's really behind the numbers.

Reply

riemannzeta

10 Comments

  • 646 Days Ago
  • 05/08/2010

Fractal dimension

To make the same point more precise -- don't all of the quantities whose measurements obey Benford's Law have non-integer dimension?

Reply

riemannzeta

10 Comments

  • 646 Days Ago
  • 05/08/2010

Actually...

This explanation looks pretty complete to me:

http://www.dspguide.com/ch34/7.htm

Saying Benford's Law applies to a given set of measurements tells you the measurements are smoothly distributed over the unit of the logarithmic scale in question.

Reply

xanthorp

2 Comments

  • 646 Days Ago
  • 05/08/2010

It's not some great mystery

The reason phone numbers and lotteries don't show the pattern is because phone numbers and lotteries aren't 'counts' or 'tallies'  .  Lengths of buildings areas and rivers are counts or tallies and will show Benford's Law. 

Think about how you count.  When you need a new digit it rolls over to what?  The numeral one.  Not nine.  Of course when counting it will be used more frequently.  Even wikipedia has a great explanation of the law.  That DSP Guide article is great BTW. 

Reply

  • 645 Days Ago
  • 05/09/2010

Benford's Law is Merely a Consequence of Number Representation

Lots of good posts, but they only dance around the real reason. Our numbers increment additively, rather than by ratio.

If you start with zero things, the first few added is a big deal. After awhile, each additional added thing is less important. Ask any kid. The first ice cream cone is much more desired than the 9th. And it is as forlorn as the log entries for 9.xxx.

Ever notice how many physical formulas are full of terms multiplied together? So when physical events happen, in a collection of events, each one is related to the others by ratios, not simple increments. As was mentioned earlier, constructed numbers, like phone or lottery aren't subject to the law.

Reply

apollard

1 Comment

  • 645 Days Ago
  • 05/09/2010

Look at your slide rule

For those of us that went to school using a slide rule, it is obvious that any result of a calculation is most likely to fall between 1 and 2 on the slide rule. This is even true when you multiply (or divide or square) lottery numbers.

Reply

dlittman

1 Comment

  • 645 Days Ago
  • 05/09/2010

More small things than large

Sort of interesting, but not interesting like Zipf's law.

Benford's law reinforces - but does not explain - the common observation that there are more "small" things than "big" ones -  exoskeletons;  rock volumes; mountain heights. And portion sizes in expensive restaurants.

Lots of reasons for it but, while nature abhors a vacuum, it also does not like too much stuff in one place.

Reply

Advertisement

Newcomb

1 Comment

  • 645 Days Ago
  • 05/09/2010

Re: Silly

Dear Silly, are you sure that Benford's law is so easily proven? Benford's law is indeed equivalent to equidistribution of logarithms, as can be seen, for example, in Persi Diaconis' paper "The Distribution of Leading Digits and Uniform Distribution Mod 1" (note that this is probably beyond the typical high school student's level as well). But even given this characterization, how does one PROVE this distribution? For example, if you take a specific arithmetic function, it is easy to observe computationally the distribution but can be very difficult to prove the long term behaviour in the limit.

As an exercise, consider the partition function p(n), which counts the number of ways to "split" an integer n into smaller integers. E.g. 10=1+9=2+2+6 are two different partitions. See wikipedia for more details. Can you prove that p(n) is Benford for every base?

Reply

mlrogers

1 Comment

  • 644 Days Ago
  • 05/10/2010

Ergodic scaling hypothesis

I agree with those who feel that there is nothing "mysterious" about this, but I also understand why people are unimpressed by the typical non-explanation of appealing to an assumed uniform distribution over a log scale.  If one *assumes* such a distribution, then, of course, Benford's law appears obvious by way of a simple mental picture.  So some people think anyone who isn't happy with that explanation simply doesn't get it.  But they miss the question that puzzles many people on this point: Other probability distributions can and often are perfectly valid descriptions of many other physical phenomena.  So why is the scale-free distribution, which gives rise to Benford's law, common to so many natural and social phenomena?

As is often the case, simply posing the question clearly goes a long way toward answering it. Scale-free distributions are ubiquitous in modern physics, from renormalization of Yang-Mills theories to the theory of phase transitions (where it applies to fluctuations) to turbulence (eddy sizes) and is very common in areas relating to "complexity" theory, such as the theory of "self-organized criticality". If you can conceptually bring out the intuition that guides their use in these areas you can remove the sense of mystery that surrounds the commonality of scale-invariant distributions.

What all these theories have in common is:
(1) Conservation Laws: They are theories of  systems governed by conserved (or approximately conserved) quantities with dynamics described by conservative dynamic or transport equations which
(2) Have some scale-invariant region in their effective phase space - some effective "intertial range" or several decade range in some of the conserved quantities, over which neither boundary conditions nor constitutive relations nor transport coefficients lead to any significant breaking of the underlying scale-invariance of the dynamic (or tranport) equations. And finally,
(3) Some underlying randomness is present which tends to some sort of ergodicity - uniformity of probability in the underlying phase space.

I offer no proof here, but it seems plausible to hypothesize that under conditions (1)-(3) - perhaps in some more precise form or with some stronger conditions - the probability distribution of the fluctuations of at least some of the conserved quantities (over the "intertial range") tends to a scale-invariant distribution over a long-time. 

This is just a starting point (it's not sufficiently precise as stated here).  But it would be fun to think about the different systems where Benford's law applies to see what the analogous conserved quantities and scale-invariant effective dynamics might be involved.  Talk amongst yourselves :)

Reply

CharlieW

3 Comments

  • 644 Days Ago
  • 05/10/2010

Can someone explain mystery to a layman?

Well, I’m not a mathematician at all and I think I’m just restating the first post above. But this one seems so infuriatingly obvious to me that I feel like I’m missing something:

If you pick a random number from 1 to 20, you have a 55% chance of pulling a 1 at the first digit and 5% chance of a 9. For 1 to 50, that becomes 22% and 2%, respectively. The ONLY possible lists for which the digits 1 to 9 are tied are 1 to 9, 1 to 99, 1 to 999, and so on, where each digit has a 1/9 chance of being the leading digit (ignoring 0 for the moment). This seems like a fairly specific kind of list. For any other size list, 1 always beats 9 as a leading digit. Since all lists have between 1 and "some finite number" of things (0 things aren't really a list), then this pattern should hold up for anything that could reasonably be called a "count of things".

As stated above, phone numbers and such aren’t really counts or measures of things but are really ID tags (they also have the property that they are filling a pattern that looks something like 000000 to 9999999). And, of course, human marketers have an inordinate preference for 9s. Also, you have an arbitrary "0." used to prefix any number between 0 and 1 but never for numbers greater than or equal to 1. But ignoring the 0 prefix, numbers that are really "ID tags", or numbers selected by humans, any set of actual counts, measures or constants from the real world should follow this pattern.

If I’ve discovered something mathematically profound or devised a novel proof that mathematicians have been seeking for 100 years, please give me a prize. Otherwise, can someone explain the mystery to me? The original post and comments supporting a "deep mystery" seem like they are trying to bamboozle me with mathematical Laws and such that I don’t know.

Reply

moopoom

4 Comments

  • 644 Days Ago
  • 05/10/2010

Re: Can someone explain mystery to a layman?

CharlieW, your thinking is correct, but your understanding of the "law" is not.

It is saying that even if you have a distribution of numbers from 1 to 99, that the probability of starting with a 1 will be greater than a 2 and greater than a 3...  in other words, that many physical constants tend to be logarithmically distributed, rather than linearly.

It is saying that there will be more things between 1 and 2 than between 2 and 3.  So if you look at a list of distances, most of them will start with 1.XXXX rather than 2.XXXX, and it doesn't depend on the units you use or the base of your number system (just because of the way the mathematics work out).

Reply

moopoom

4 Comments

  • 644 Days Ago
  • 05/10/2010

Re: Can someone explain mystery to a layman?

As mentioned earlier, a good place to look is wikipedia: http://en.wikipedia.org/wiki/Benford's_law

Reply

CharlieW

3 Comments

  • 644 Days Ago
  • 05/10/2010

Re: Can someone explain mystery to a layman?

Well, the wiki gives an explanation similar to mine. To restate in more technical terms, if I throw a dart at logarithmic paper, I have a 30% chance of hitting a 1. The Wiki says nothing special about constants. And I don't understand the comment above: constant aren't picked from a distribution of 1 to 99. They are real numbers that could be basically any value (changing the units). In other words, throw a dart at some logarithmic paper. I still don't see any mystery here.

Reply

luddite

407 Comments

  • 644 Days Ago
  • 05/10/2010

Re: Can someone explain mystery to a layman?

We all know that prime numbers rule,
And that quantum mechanics are nobodys fool,
But when a problem just can't be resolved,
Because it doesn't conform to any metaphysical laws,,
Then the whole math thing is not very cool.

Reply

Advertisement

moopoom

4 Comments

  • 642 Days Ago
  • 05/12/2010

Re: Can someone explain mystery to a layman?

In reply to CharlieW

You're absolutely right, it's like throwing a dart at logarithmic paper -- so no surprise that things are very likely to start with 1, right?  So I think the interesting question is then, "Why should constants be spaced logarithmically, instead of linearly?"  This is the (perhaps) surprising thing that the theorem points out.

But I'm not completely sure that we're understanding each other.  Hope it was at least fun to think about :)

Reply

greenjohn

1 Comment

  • 643 Days Ago
  • 05/11/2010

Re: Can someone explain mystery to a layman?

   "Let the numbers speak for themselves" is what Napier said after publication of the first table of logarithms.

   I can suggest that you download current values of the constants:
http://physics.nist.gov/cuu/Constants/Table/allascii.txt ,

express each numerical value as a common log, and arrange them all in absolute value order so as to have your own Gunter Table that "...speaks for itself".

   Be warned, however, of the temptations that follow.

   You might decide such a collection so valuable that it should be kept, augmented with every result of every calculation you ever perform, a self ordering archive for reference and comparison.  You might be tempted to scale some several different logarithmic number lines with some  of the most important constants, you might even be tempted to orient them orthoganally from a common origin, projecting in multidimensional maps the laws of physics and systems of units.

   That's when you join the controversy over alternatives to the scientific notation form of numerical expression, tailor made for use with obsolete slide rules and logarithm tables, that's when you join the controversy over alternatives to the least squares method of fundamental constant adjustment, like the Birge-Bond Diagram and the Isometric Consistency Chart, that's when you start finding values of constants that cannot possibly be correct, that's when you become a physics outlaw.

   Take my word for it.

Reply

Phineas

127 Comments

  • 637 Days Ago
  • 05/17/2010

Madoff Applied Benford's Law

Ted Hill devised ways of finding tax cheats using Benford's law. Statistical analasis of made-up numbers show up as fake. Bernie Madoff used made up numbers that stood up to this scrutiny

http://paul.kedrosky.com/archives/2008/12/19/bernie_vs_benfo.html

Reply

matzmunt

1 Comment

  • 618 Days Ago
  • 06/05/2010

Re: Madoff Applied Benford's Law

OK. This theory is the biggest crock! Below is an explanation that everyone should understand.

Above an example of nearest star distances was given. It just so happens that most of our nearest stars fall into the 10-16 light year distance band. Try expressing these distances as light minutes! all of a sudden the number 5 and 6 become the most common leading digits. There is nothing to this. If we changed all our units of measurement in physics then many constants would change also, obviously not ratio constants like Pi but the speed of light for example could be expressed with a new distance definition so the leading digit was 9! In statistics, even if we have a p-value of 0.000000000001 we still can't say with 100% certainty that we can reject the null hypothesis. The observed frequency of leading ones is just that: a rare probability in real life much like the same person being struck by lightning 3 times, not likely, but it happened and it's all down to the way we have defined our scales and measurement. :-)

Reply

trans

43 Comments

  • 415 Days Ago
  • 12/25/2010

Anthropic principle

I'm sorry to have to inform you that the Anthropic principle supersedes Benford's Law.

Congratulations, you just discovered man likes to structure his measures to favor numbers starting with 1.

For another glaring example, economic measures tend to end in 5 and 9.

This has nothing to do with the fundamental laws of the universe.

Reply

jbexpm

4 Comments

  • 414 Days Ago
  • 12/26/2010

Re: Anthropic principle

Funny… I crunched frequency of 0-9 for  PI to 11,001 and to 1,000,001 places. Because someone above suggested there was a pattern there… I can't say there is strong pattern… other than they are pretty evenly distributed.  Or seem that way, see my earlier posts.  Then I took the table at http://physics.nist.gov/cuu/Constants/Table/allascii.txt
and looked at frequency of numbers in that table.  By moving over, one digit at a time, and then plotting frequency of 1-9. (decimal places were replaced by number to right of decimal)  I can't say I saw any log pattern there… it was also remarkably evenly distributed.  This supports your Anthropic Trump… But what I am very puzzled by is how after almost 163 days of this URL being inactive, you post one message in the last 24 hours, and then Google Fast Flip "Science and Technology" identifies this URL again as 'newsworthy' enough to include it in the 'fast flip'.  That is truly amazing!
JB

Reply

heldervelez

5 Comments

  • 412 Days Ago
  • 12/28/2010

not a mystery

The first comment is on spot.
To the more demanding ones I suggest this document:
Explaining Benford's Law
www.dspguide.com/CH34.pdf

where the so called 'mysteries' are explained in detail.

Reply

jbexpm

4 Comments

  • 395 Days Ago
  • 01/14/2011

Re: not a mystery

When I post a reply to a comment do you get an email to indicate a reply to your comment?

Reply

Bio

The Physics arXiv Blog produces daily coverage of the best new ideas from an online forum called the Physics arXiv on which scientists post early versions of their latest ideas. Contact me at KentuckyFC @ arxivblog.com

Follow The Physics arXiv Blog on Twitter

Subscribe to the arXiv blog RSS Feed

Advertisement
Advertisement

Facebook

Advertisement