1 Introduction
Inequality is an important, ubiquitous feature of human society. What is the origin of economic inequality? Is it inevitable or avoidable? Are there common mathematical patterns of inequality? Is quantitative description of such patterns possible? These questions have puzzled scientists for centuries. Lately, in the last 20 years or so, the subject has moved to the center of public discourse.
Statistical physics and kinetic theory deal with big statistical ensembles and, as such, are branches of the applied theory of probabilities. The econophysics movement, emerging about 25 years ago
[1, 2], utilizes mathematical methods developed in these disciplines for probabilistic description of social ensembles. Several papers in the late 1990s – early 2000s applied these ideas to economic inequality [3, 4, 5, 6]. Since then, the volume of such literature has grown enormously [7, 8, 9, 10]. Importantly, the econophysics approach has attracted attention of economists and social scientists [11, 12] and is now debated within these disciplines [13], as well as in popular media [14, 15]. Physicsinspired approaches to economic inequality are briefly reviewed in Sec. 2 of this paper, particularly for the benefit of nonphysicists.Then, in Sec. 3, these ideas are applied to a quantitative analysis of the income distribution data in the United States for 1983–2018. We use the publicly available data from Publication 1304 by the Statistics of Income research division of the Internal Revenue Service (IRS), the United States tax agency [16]. The earliest year covered by Publication 1304 is 1983, and the latest year currently available is 2018, while the earlier data are available in Publication 79. Our quantitative study of such a wide historical range reveals salient trends of inequality evolution in the United States. The paper ends with Conclusions in Sec. 4.
2 Physicsinspired approaches to economic inequality
This Section briefly outlines mathematical approaches to economic inequality introduced in the early econophysics papers [3, 4, 5, 6] and subsequently followed by numerous further extensions. It is not intended as a comprehensive literature survey, but rather as a quick summary of the ideas relevant to our quantitative analysis of income distribution data presented in Sec. 3.
2.1 The Boltzmann kinetic equation
One approach is based on the Boltzmann kinetic equation. First, let us consider an ideal gas, where molecules have kinetic energies and experiencing pairwise collisions. The total energy of two molecules remains constant before and after collision
(1) 
while the energy is transferred from the first to the second molecule.
By analogy, now let us consider a monetary transaction between two economic agents, whose money balances are and [4]. Upon transaction, the first agent transfers some amount of money to the second agent in payment for goods or services. Then the money balances of the agents become and , so the sum is conserved
(2) 
The conservation of money expressed by Eq. (2) is nothing but the law of accounting [17]. From the accounting perspective, all monetary transactions are transfers of digits from one account to another, so the decrease in one account is equal to the increase in another. The flow of monetary digits, i.e., the abstract bits of information, between different accounts constitutes the informational layer of the economy [17]. It is coupled to another, physical layer of the economy, which represents counterflow of physical goods and services in transactions between agents. In contrast to the monetary digital layer, the items in the physical layer are not conserved, because they can be produced, consumed, and destroyed. However, production of physical items, by itself, does not change the number of digital monetary units in the system, because they belong to different layers and are governed by different rules.
For a statistical ensemble of many interacting economic agents, we introduce the probability distribution
of money among the agents. Its evolution as a function of time , as a result of monetary transactions, can be described by the Boltzmann kinetic equation [18](3)  
Here is the probability of transferring money from an agent with money to an agent with money per unit time. This probability, multiplied by the occupation numbers and , gives the rate of transitions from the state to the state . The first term in Eq. (3) gives the depopulation rate of the state , whereas the second term represents the reversed process, where the occupation number increases.
In order to make further progress, different papers make various assumptions about the structure of the transition probabilities . For example, Ref. [5] introduced a model of saving propensity, where the agents save some fraction of their money balances before engaging in monetary transactions described by Eq. (2). Whether such specific model assumptions are realistic remains an open question.
Many papers, particularly in this volume, apply Eq. (3) not to money , but to assets [3] or wealth . Some papers seem to use and synonymously, in which case there is no difference. But more generally, wealth is a sum of the money balance and the monetary value of tangible property, e.g., stocks , where is the stock price. Then, wealth is generally not conserved, because it may increase or decrease due to a change in price , while an agent is not conducting any transactions at all. For example, during the early COVID crisis in March 2020, many agents stayed passive and performed no transactions, but trillions of dollars of their wealth first disappeared and then reappeared due to changes of stock prices. In contrast, money balances were largely unaffected by such wild swings.
2.2 The equilibrium BoltzmannGibbs distribution
Interestingly, a stationary solution of Eq. (3) can be found without detailed knowledge of the transition probabilities , if they satisfy the timereversal symmetry
(4) 
The condition (4) means that the probabilities of transactions in direct and reversed directions are equal. The fundamental dynamical equations in physics do satisfy the timereversal symmetry. In contrast, there is no such general requirement in economics, e.g., the model [5] with saving propensity does not have the timereversal symmetry. Nevertheless, such symmetry may be approximately applicable in some cases.
In Eq. (3), the probability distribution is stationary, , when the rates of direct and reversed transitions are equal, so that the two terms in the curly brackets cancel each other. This is called the principle of detailed balance. If the timereversal condition (4) is satisfied, then the transition probabilities cancel out, and the detailed balance condition becomes
(5) 
In the notation of Eq. (2), Eq. (5) has the form and can summarized as follows. The probability is multiplicative, whereas the energy or the money appearing in the argument of is additive, due to conservation laws (1) or (2). A general solution of Eq. (5) is given by the exponential BoltzmannGibbs distribution [4]
(6) 
Here the coefficient is obtained from the normalization condition , whereas the money temperature is equal to the average money balance of an agent.
We assume that the boundary condition is imposed, so that debt, which can be interpreted as a negative money balance, is not permitted. Thus, represents the lowest possible balance of an agent, analogous to the groundstate energy in physics. Permitting debt, i.e., negative , makes the problem more complicated and potentially unstable [17], unless a limit on debt is imposed [4, 19]
The BoltzmannGibbs distribution of energy , analogous to Eq. (6), plays a fundamental role in the equilibrium statistical physics. In textbooks on this subject [20], it is usually derived using other methods, without reference to the dynamical kinetic equation (3). In particular, Eq. (6) can be derived geometrically from the ergodicity hypothesis [21, 22], which postulates that all microscopic configurations of the system consistent with macroscopic constraints are equally probable.
Alternatively, the BoltzmannGibbs distribution (6) can be obtained by maximizing the entropy of the system with a constraint on the total money in the system using the method of Lagrange multipliers. The entropy is the logarithm of the multiplicity , which is the number of microscopic realizations of a given distribution obtained by combinatorial permutations of the agents. The principle of entropy maximization follows from the Htheorem proved by Boltzmann [18]. The theorem shows that, for the probability distribution obeying the Boltzmann kinetic equation (3), the entropy monotonously increases in time, until it reaches its maximal value at the equilibrium BoltzmannGibbs distribution (6).
2.3 The FokkerPlanck equation
When the change is small for an agent with money balance , the integrodifferential Boltzmann kinetic equation (3) can be reduced the partialdifferential FokkerPlanck equation [18], also known as the diffusion equation or the Kolmogorov forward equation,
(7) 
The coefficients and
, known as the drift and diffusion, are the first and second moments of the random money balance changes
per time increment .Assuming the boundary condition , a stable stationary solution of Eq. (7) exists only for the drift in negative direction, , so that the coefficient is positive. The stationary solution of Eq. (7) is then obtained by setting the probability flux, which is the expression in the square brackets, to zero:
(8) 
where the coefficient follows from the normalization condition .
In the simplest case where the coefficients and are constants independent of , the stationary distribution (8) is exponential , thus reproducing the BoltzmannGibbs formula (6). Here the money temperature is expressed in terms of and similarly to the Einstein relation between temperature, diffusion, and mobility [18]. A wellknown example in physics is the barometric distribution of gas density versus the height in the presence of gravity, where is the mass of a molecule, is the gravitational acceleration, and is the gravitational energy of a molecule. The competition between downward drift due to gravity and spreading around due to diffusion produces the stationary exponential distribution . It represents a statistical equilibrium, in contrast to a mechanical one where two forces acting on an object balance each other (like supply and demand in economics).
2.4 Applications to income distribution
The Boltzmann kinetic equation (3) is a twobody equation, in the sense that it involves the probability distributions and of two interacting agents, so it is a nonlinear equation. In comparison, the FokkerPlanck equation (7) is a onebody equation, because it only involves for one agent, so this equation is linear and sometimes called the master equation. Coupling between agents is implicitly represented in Eq. (7) by the first and second moments of the fluctuations of the variable for a given agent, due to its interaction with the “environment” consisting of the other agents. Unlike the twobody approach, the onebody description does not invoke a conservation law explicitly. Thus, Eq. (7) can be generally applied to all kinds of stochastic variables, which is widely done in the literature. For example, the famous BlackScholes equation for random fluctuations of stock prices is based on a version of Eq. (7). A particular version of Eq. (7) was proposed in Ref. [6] for the distribution of wealth .
In the rest of the paper, we switch to the probability distribution of income , instead of the money balance . The income of an agent is the influx of money per unit time, typically per year, so it is analogous to the power in physics, if money is analogous to the energy. Following this definition, the income is nonnegative, so it satisfies the boundary condition . The main reason for this change of focus is that empirical data for income distribution are available from government agencies, whereas empirical data for are difficult to obtain. Income inequality is widely discussed in the literature and is an important subject by itself.
Let us treat the income of an agent as a stochastic variable experiencing random fluctuations over a time increment . Then the timedependent probability distribution of income obeys the FokkerPlanck equation (7) with the variable is replaced by the variable . We focus on the stationary distribution (8), where, again, is replaced by . To make further process, we need to make some assumptions about the drift and diffusion coefficients and . Empirical analysis of income distribution shows that it has a welldefined twoclass structure [23, 24]. The lower class is described by the exponential BoltzmannGibbs function, which was uncovered in Ref. [25] and then confirmed for the middle part of income distributions in many countries [26]. In contrast, the upper class is described by the Pareto power law [27]. The twoclass structure of income distribution can be rationalized on the basis of a kinetic approach as follows.
For the lower class, where income comes from wages and salaries, it is reasonable to assume that income changes do not depend on income itself, i.e., is independent of . Such a stochastic process is called additive [24]. Then the coefficients in Eq. (7) are some constants and independent of , and the stationary distribution of income (8) is exponential, analogous to the BoltzmannGibbs distribution (6)
(9) 
Here the parameter is the income temperature, which could be denoted as , but we drop the subscript to shorten notation. It is equal to the mean income of the lower class: .
In contrast, the upperclass income comes from bonuses, investments, and capital gains, which are calculated as percentages. Therefore, for the upper class, it is reasonable to expect that income changes are proportional to income itself, i.e., . This is known as the proportionality principle of Gibrat [28], and such a stochastic process is called multiplicative [24]. Then and , and Eq. (8) gives a powerlaw distribution
(10) 
The multiplicative hypothesis for the upperclass income was quantitatively verified in Ref. [29] for Japan, where tax identification data are officially published for the top taxpayers.
In practice, the additive and multiplicative processes may coexist. For example, an employee may receive a costofliving raise calculated in percentages (the multiplicative process) and a merit raise calculated in dollars (the additive process). Assuming that these processes are uncorrelated, we find that and , where . Substituting these expressions into Eq. (8), we find [7, 30]
(11) 
The distribution (11
) interpolates between the exponential law for low
and the power law for high , because either the additive or the multiplicative process dominates in the corresponding limit. A crossover between the two regimes takes place at , where the additive and multiplicative contributions to are equal. The distribution (11) has three parameters: the temperature , the Pareto exponent , and the crossover income . It is a minimal model that captures the salient features of the twoclass income distribution [7, 30]. The label distinguishes the exponent in the interpolated formula (11) from the exponent in the simple power law (10) for the purpose of data fitting, as discussed in Sec. 33.3. A formula similar to (11) was also derived by Fiaschi and Marsili [31] for a microeconomic model effectively described by Eq. (7). But Eq. (11) was published for the first time in 1895 in this journal by the statistician Karl Pearson [32] and is known as the Pearson Type IV distribution.3 Data analysis of income distribution in the USA in 1983–2018
3.1 The twoclass fit of the cumulative distribution function
We use the variable for annual income, which is called the adjusted gross income in the IRS Publication 1304 [16]. Income distribution data are reported in this publication at fixed income levels. At the high end, these levels are $100k, $200k, $500k, and $1M for the years earlier than 2000. Here k denotes thousand dollars, and M million dollars. Starting from 2000, the data for the income levels of $1.5M, $2M, $5M, and $10M are also reported. At the low end below $100k, the data are reported at many income levels.
Given these fixed income bins set by IRS, we introduce the (complementary) cumulative distribution function
(CDF), obtained by integration of the probability density function
(PDF). The cumulative function is the fraction of tax returns reporting adjusted gross income above in the IRS data. In our paper, we treat the fraction of tax returns as a proxy for the fraction of population with income above . This correspondence is only approximate for several reasons. First, some tax returns are filed jointly by married couples, i.e., one tax return per two people. The fraction of joint tax returns has dropped from about 1/2 in 1983 to about 1/3 in 2018. Second, people with income below a certain threshold are not required to file tax returns, but many of them do because of refunds from federal tax withholding and tax credit incentives. Clearly, the correspondence between the fractions of population and tax returns is not perfect, but it is the best we can do, given the publicly available IRS data.The cumulative distribution is given by the exponential and powerlaw functions for the corresponding probability densities (9) and (10):
(12) 
We perform a piecewise fit of the empirical data to at the low end and to at the high end of income distribution. The IRS publication reports many data points at the low end, which makes it easier to determine the income temperature from the fits. In contrast, only few data points are available at the high end, so the Pareto exponent is obtained with lower accuracy.
Having determined the temperature , we visualize the fits in Fig. 1 using the rescaled income as the dimensionless coordinate on the horizontal axis. Then the CDF curves for different years collapse on a single curve at the low end of the distribution. The inset in Fig. 1 shows the data in the loglinear scale, with the rescaled income in linear scale on the horizontal axis and CDF in logarithmic scale on the vertical axis. The collapse of the data points on a straight line in the loglinear scale indicates that is, indeed, well described by the exponential function. For clarity, the data for 1983–1999 and 2000–2018 are separated in the inset. The data points for the early 1980s deviate slightly below the exponential line, indicating lower inequality in those years, but this deviation subsequently goes away. The upper boundary of the exponential fit gradually shrinks from in the late 1980s to by 2000, with data points at the higher incomes deviating upwards for 2000–2018 in the inset.
The main panel in Fig. 1 shows the same data in loglog scale, where both axes for and CDF are logarithmic. At the lower end, the data points collapse on the black curve representing the exponential distribution. At the high end, the data points fall on straight lines, representing the power law in loglog coordinates. The slopes of these lines give the Pareto exponent . We observe a noticeable decrease in the Pareto slope from 1983 to 1999, which signals fattening of the upper tail, but then a narrower variation of the slope in 2000–2018. While the main panel in Fig. 1 presents clear databased evidence for the twoclass society, it also demonstrates the absence of the “middle class”. There is no objective, commonly agreed upon definition of the “middle class” concept in the economic literature, where different authors make up their own subjective definitions of the “middle class”.
As an alternative to the piecewise fit, in Fig. 2 we show fits of the CDF data for the last ten years 2009–2018 to the continuous interpolated function (11) in loglog scale. We first determine by fitting the lowend data as described above. Then we determine the Pareto exponent and the crossover income level by fitting to Eq. (11). Because of the different fitting procedure, the value of the Pareto exponent obtained from the interpolated fit is slightly different from the value obtained from the piecewise fit.
The crossover point between the exponential and powerlaw distributions is indicated in Fig. 2 by the red star. An alternative crossover point is obtained as an intersection (indicated by the small black circle) of the piecewise exponential and powerlaw fits shown by black curves for 2018. The values of and are generally different, but rather close. We perform the piecewise fit and determine for all available years. In contrast, the interpolated fit is performed only for the last ten years 2009–2018, where it works well. For the earlier years, it does not work well, because the crossover between the exponential and powerlaw distributions is less smooth and more cuspy, as seen in Fig. 1. Moreover, the lower part deviates downward from the exponential fit in the early 1980s.
G  G  

$41k  $57k  $78k  $167k  $185k  1.42  1.25  4%  34%  58%  22%  60%  82% 
The twoclass decomposition allows us to characterization the empirical income distribution by a small set of fitting parameters. Their values are shown in Table 1 for the latest year 2018 and discussed in subsequent sections, where we visualize the historical evolution of these parameters.
3.2 Divergence of the mean and the median incomes
The dotted curves in the top panel of Fig. 3 shows the historical evolution of the income temperature , as well as the mean and median incomes. All of them increase in time in nominal dollars, but some of that increase is due to inflation. We define the inflation parameter as the ratio CPI(year)/CPI(1983) of the Consumer Price Index (CPI) for a given year to the CPI in 1983 [33]. The solid curves in the top panel of Fig. 3 show the parameters and taken in 1983 and multiplied by the inflation parameter for the subsequent years. The solid curves are close to the dotted curves for and , so the increase in nominal dollars of the income temperature and the median income is primarily due to inflation.
The bottom panel of Fig. 3 shows the parameters , , and divided by the inflation parameter. Again, we observe that the parameters and characterizing the lower class remain approximately constant over 36 years, when adjusted for inflation. For the exponential distribution in Eq. (12), these two parameters are simply proportional: .
In contrast, the overall mean income increases sharply relative to inflation. The divergence between the income temperature , which is the mean income of the lower class, and overall mean income , which includes the upper class, points to a significant increase of inequality in the last 36 years. As the bottom panel of Fig. 3 shows, these two mean incomes were approximately equal in 1983, but have diverged since then. Thus, the growth of inequality is primarily driven the increase of the upperclass income relative to the lower class, while the lowerclass income remains approximately constant when adjusted for inflation.
3.3 The Pareto exponent and the boundary between two classes
The top two curves in Fig. 4 show the historical evolution of the Pareto exponents , obtained by the piecewise fits for all years, and , obtained by the interpolated fits for the last 10 years using Eq. (11). The values of and are slightly different because they are obtained by different fitting procedures, but they exhibit similar behavior. We observe a significant decrease of the exponent from 1983 to 2000, which indicates fattening of the tail. But after 2000, the exponent exhibits a “bumpy plateau”, where it decreases during financial bubbles and increases during crashes, but stays approximately constant overall. These observations are qualitatively consistent with the main panel in Fig. 1, as well as Fig. 3, and indicate that the sharp expansion of the upper class happened in the 1980s and 1990s, but have slowed down or saturated after 2000.
The two curves at the bottom of Fig. 4 show the historical evolution of the crossover incomes and separating the lower and upper classes, as illustrated in Fig. 2. The former is obtained from the piecewise fits for all years, while the latter from the interpolated fits for the last 10 years, and both are normalized to the income temperature . The ratio has decreased from about 5 in 1983 to slightly above 3 in 2018. A particularly sharp drop is visible around 1986, coinciding with the Tax Reform Act of the Reagan administration. The decreasing value of the ratio indicates that the exponential part of the distribution is shrinking, whereas the powerlaw part is expanding, which is consistent with the inset in Fig. 1.
3.4 The income, population, and tax fractions of the upper class
Given the boundary between the two classes, we can also characterize the upper class by the fractions of its population , income , and tax paid relative to the total population:
(13) 
whereas is defined similarly to but for the tax paid rather than income. The historical evolution of these fractions is shown in the top panel of Fig. 5. All of these fractions increase in time, indicating expansion of the upper class. In particular, the lower curve in the top panel shows that the upperclass population share has increased from less than 1% in 1983 (when almost the whole income distribution was close to exponential) to slightly greater than 4% in 2018. The increase in the upperclass population fraction is closely related to the decrease in the crossover income ratio separating the two classes, as shown by the bottom curves in Fig. 4.
The income fraction of the upper class exhibits the “bumpy plateau” pattern in the top panel of Fig. 5 for the last 20 years, similarly to the exponent in Fig. 4 and the mean income in Fig. 3. We observe sharp peaks of the upperclass income share around 2000 and 2017 coinciding with the stock market and real estate bubbles, followed by crashes of these bubbles, and then further increase by 2012, under the quantitive easing policy of the Federal Reserve Bank. Clearly, the upperclass income share is closely related to financial bubbles.
The income fraction shown in Fig. 5 is deduced directly from the IRS data, given the income threshold . As a consistency check, we compare it with the theoretically expected fraction for the powerlaw distribution (10). The expected mean income of the upper class and the income fraction relative to the total population are
(14) 
The upperclass income nominally diverges, , when . When extrapolated from the decreasing trend in 1980s and 1990s in Fig. 4, the exponent could have reached 1 by 2018, but saturated after 2000 at the “bumpy plateau” of bubbles and crashed, thus avoiding the singularity.
The expected income fraction calculated from Eq. (14) with the parameters from the piecewise fits is shown in the top panel of Fig. 5 together with . We see that and are very close for most years, indicating that the power law (10) accounts for all of the upperclass income. For the recent years, we notice that , suggesting that the actual upperclass income may be slightly higher relative to the powerlaw extrapolation. The difference between and could indicate the additional income of oligarchy on top of the power law, as proposed for the wealth distribution in Ref. [34]. However, the difference in Fig. 5 is too small to be taken as evidence for oligarchy in income distribution, given the uncertainty of the fitting parameters.
The ratios of the fractions are shown in the bottom panel of Fig. 5. The middle curve shows the ratio of the income and population fractions. This ratio stays approximately constant and even slightly decreases over time. Comparison with Eq. (14) for suggests that the decrease in is compensated by the increase in , as shown in Fig. 4.
Another indicator is the share of taxes paid by the upper class, relative to its income share . The lower curve in the bottom panel of Fig. 5 shows a noticeable decrease in after the tax reform of 1986, but then it stays approximately constant . Relatedly, the top panel shows that, after 2000, the income share oscillates roughly between 20% and 30%, whereas the tax share between 40% and 60%. Notably, the upper class paid close to 60% of taxes in 2018. The top curve in the bottom panel shows the ratio of tax to population fractions for the upper class. It exhibits a sharp drop around 1986 and then a gradual decrease in the last 10 years.
3.5 Lorenz curves and Gini coefficient
Income distribution is commonly characterized by the Lorenz curve in the economic literature [35]. The Lorenz curve is defined parametrically in terms of the two coordinates and
(15) 
Here is the fraction of the population with incomes below , and is the total income of this population, as a fraction of the total income in the system. When the parameter changes from 0 to , the variables and change from 0 to 1, thus producing the Lorenz graph in the plane. Its advantage is that all available data are represented within a finite area in the plane, whereas the upper tail at is inevitably truncated in Figs. 1 and 2.
It was derived in Ref. [25] that the Lorenz curve for the exponential distribution (9) is
(16) 
When the upper class is present, the Lorenz curve for the lower class is obtained [24] by multiplying Eq. (16) by the factor :
(17) 
The factor is the fraction of total income contained in the extrapolated exponential distribution, whereas is the additional fraction of income in the upper class. (The subscript alludes to the Lorenz curve.)
The red squares and blue circles in Fig. 6 show the IRS data points for the Lorenz curves of income distribution in 1983 and 2018. The straight diagonal line represents the reference case of perfect equality, so inequality is higher when the data points are further away from it. We observe that the Lorenz curve for 1983 is very close to the solid black curve representing Eq. (16) for the exponential distribution. Moreover, the data points in the middle are slightly above the solid black curve, indicating lower inequality than in the exponential distribution. These conclusions are consistent with the CDF graphs in Fig. 1 and the observation that for 1983 in Fig. 3.
In contrast, the Lorenz curve for 2018 is shifted down, signaling an increase in inequality. Most of its data points, except at the high end, are well fitted by the rescaled exponential formula (17) shown by the dashed black curve. This indicates that income distribution within the lower class remains exponential, whereas the increase in inequality comes from the excess share of the upperclass income, determined by the intersection of the fit (17) with the vertical axis.
Also shown in Fig. 6 are the population and income fractions of the upper class. Both parameters and represent an income share of the upper class, but they are defined differently. In Fig. 6, the parameter is the income share of the top 4% of population, whose income distribution follows a power law. In contrast, is the extra income of the upper class relative the exponential distribution extrapolated to . Thus, the parameter measures a deviation of the actual income distribution from the purely exponential one.
To characterize inequality by a single number, the economic literature [35] uses the Gini coefficient , defined as twice the area between the diagonal line and the Lorenz curve. It was shown that for the exponential distribution [25], and
(18) 
when taking into account the fraction of the upperclass income on top of the exponential distribution [24]. The values of deduced from the IRS data are shown by the open circles in the inset of Fig. 6, where the crosses show the values calculated by Eq. (18) starting from 1988. The agreement between the empirical values of and the formula (18) in Fig. 6 demonstrates that the increase in income inequality starting from the late 1980s comes completely from the growth of the upper income share relative to the lower class, whereas income inequality within the lower class itself remains constant and exponentially distributed.
Besides the overall increase since 1983, the inset in Fig. 6 exhibits local maxima corresponding to bubbles in financial markets. The Gini coefficient peaks around 1988 during the bubble in Savings and Loan, around 2000 in the stock market, around 2007 in subprime mortgages, and past 2012 under quantitative easing. Thus we confirm that the overall income inequality, as measured by the Gini coefficient, peaks during bubbles in financial markets. This behavior corresponds to the “bumpy plateau” already discussed for Figs. 3, 4, and 5.
The IRS publication [16] also provides information on the distribution of taxes paid by the population. In Fig. 7 we show the Lorenz curves for tax revenue in 1983 and 2018, along with the Gini coefficient for taxes in the inset. Because of progressive taxation, where the recipients of higher income are subject to higher tax rates, the inequality of tax distribution in Fig. 7 is higher than in Fig. 6 for income distribution, as also exemplified by the higher tax Gini coefficient. We observe that the tax revenue became highly concentrated in the upper class by 2018, because of the relative decrease in the lowerclass income due to the overall increase in income inequality. As shown in Fig. 7, the upper class, consisting of only about of population, contributed the fraction , greater than a half, of the total tax revenue, as also shown by the top curve in Fig. 5.
To amplify these observations, we show the income and tax shares of the bottom 50% and the top 1% of population in Fig. 8, as derived from the Lorenz curves for income and taxes. From 1983 to 2018, the income share of the lower half of the population has decreased from about 18% to 12% and the tax share from about 7% to 3%. Thus, in 2018, the total tax contribution of the lower half of the population is almost negligible, because their total income share is so low. In contrast, the income share of the top 1% of population has doubled from about 10% to 20%, and their tax share has also doubled from about 20% to 40%. Thus, the majority of tax revenue now comes from the top few percent of population, where most of income is concentrated.
A proposal for increasing taxes on the top 1% of population and incomes above approximately $500k is currently debated by the Democratic Party of the United States [36]. The quantitative characterization of the twoclass structure of income distribution presented in our paper could provide useful input for the formulation of a tax policy. In particular, it would be reasonable to align tax brackets with the boundary between the two classes.
4 Conclusions
In this paper, income distribution data for the USA in 1983–2018 is analyzed in terms of a twoclass decomposition, where the lower class is described by an exponential function, and the upper class by a power law, as inspired by statistical physics. The quantitative characterization of the two classes presented here may be useful for the formulation of a tax policy. We observe a sharp increase of income inequality from the early 1980s to the late 1990s, but slower increase or saturation after 2000. The inequality pattern after 2000 exhibits a “bumpy plateau” pattern, with ups and downs related to financial bubbles and crashes, but an approximately constant overall level.
The twoclass analysis shows that the growth of inequality originates primarily from the increasing share of income ( or ) in the upper class, relative to the lower class. At the same time, relative inequality within the lower class, encompassing more than 90% of population, remains constant and exponentially distributed. Moreover, the mean income of the lower class and the median income remain approximately constant over 36 years when adjusted for inflation, whereas the overall mean income increases relative to inflation. The growing divergence between the mean and median incomes is another manifestation of increasing inequality.
Not only does the upperclass income share increase, but the fraction of population belonging to the upper class increases too. In a relative sense, the upperclass population expands (while still remaining a small fraction of only about 4% of the total population), while the lowerclass population shrinks (while still remaining the overwhelming majority). This happens because the ratio of the crossover income separating the two classes to the mean income of the lower class has significantly decreased from 1983 to 2018. Thus, the relative income level threshold for getting into the upper class has decreased.
These empirical observations prompt a descriptive interpretation. We speculate that the significant increase in the upperclass population fraction , as well as their income fraction , is due to digitization of the economy in the last 40 years. There was a rapid proliferation of personal computers during the 1980s, followed by the spread of the Internet and the World Wide Web in the 1990s, and then by ubiquitous personal mobile devices and a shift to cloud computing at the present time. This transformation enabled the creation and relatively easy scalingup of digital platforms for highly nonlocal business operations. In the past, many business were local. For example, each town had a few local taxi companies, several local book stores, video rental stores, etc. Now they are largely displaced by a small number of national and global network platforms, such as Uber and Lyft for riding, Amazon for books initially and then for all kinds of goods, Netflix for DVD rentals and video streaming, etc. The founders and owners of such network platforms become superrich, because these platforms serve a huge number of customers, in contrast to the oldfashioned local businesses. Thus we speculate that the growth of the upper class is due to the ongoing transformation of business network topology from local clusters to highly connected superclusters and global hubs. In contrast to the onetoone interaction exemplified by Eq. (2), networked businesses are characterized by the onetomany interaction topology, resulting in the power law distribution for the upper class.
Insert details of how to access any supporting data here.
DL performed data analysis and produced the databased figures under the guidance of VY, then VY wrote the manuscript. All authors proofread and approved the manuscript.
The authors declare that they have no competing interests.
References
 [1] H. E. Stanley et al., “Anomalous fluctuations in the dynamics of complex systems: from DNA and physiology to econophysics”, Physica A 224 (1996) 302, https://doi.org/10.1016/03784371(95)004092
 [2] B. K. Chakrabarti (2005) “EconophysKolkata: a short story”, in Econophysics of Wealth Distributions, edited by A. Chatterjee, S. Yarlagadda, and B. K. Chakrabarti (Springer, Milan, 2005), pp 225–228
 [3] S. Ispolatov, P. L. Krapivsky, and S. Redner “Wealth distributions in asset exchange models”, Eur. Phys. J. B 2 (1998) 267, https://doi.org/10.1007/s100510050249
 [4] A. A. Drăgulescu and V. M. Yakovenko, “Statistical mechanics of money”, Eur. Phys. J. B 17 (2000) 723, https://doi.org/10.1007/s100510070114
 [5] A. Chakraborti and B. K. Chakrabarti, “Statistical mechanics of money: how saving propensity affects its distribution”, Eur. Phys. J. B 17 (2000) 167, https://doi.org/10.1007/s100510070173
 [6] Bouchaud JP, Mézard M, “Wealth condensation in a simple model of economy”, Physica A 282 (2000) 536, https://doi.org/10.1016/S03784371(00)002053
 [7] V. M. Yakovenko and J. B. Rosser Jr., “Colloquium: Statistical mechanics of money, wealth, and income,” Rev. Mod. Phys. 81 (2009) 1703, https://doi.org/10.1103/RevModPhys.81.1703
 [8] Cockshott, W. P., A. F. Cottrell, G. J. Michaelson, I. P. Wright, and V. M. Yakovenko, 2009, Classical Econophysics (Routledge, Oxford) ISBN 9780203877548
 [9] B. K. Chakrabarti, A. Chakraborti, S. R. Chakravarty, and A. Chatterjee, Econophysics of Income and Wealth Distributions (Cambridge University Press, 2013) ISBN 9781107013445
 [10] Marcelo Byrro Ribeiro, Income Distribution Dynamics of Economic Systems: An Econophysics Approach (Cambridge University Press, 2020) ISBN 9781107092532
 [11] J. B. Rosser Jr., “Entropy and econophysics”, Eur. Phys. J. Spec. Top. 225, 3091–3104 (2016), http://dx.doi.org/10.1140/epjst/e201660166y
 [12] Anwar Shaikh “Income Distribution, Econophysics and Piketty”, Review of Political Economy 29 (2017) 18–29, https://doi.org/10.1080/09538259.2016.1205295
 [13] Ellis Scharfenaker, “Statistical equilibrium methods in analytical political economy”, Journal of Economic Surveys, early view (2020), https://doi.org/10.1111/joes.12403.
 [14] Adrian Cho, “Physicists say it’s simple”, Science 344 (2014) 828, https://doi.org/10.1126/science.344.6186.828
 [15] Nathanial Gronewold, “Anthill Economics: Animal Ecosystems and the Human Economy”, Ch. 3 “Entropy and Inequality” (Prometheus Books, 2021) ISBN 9781633886537
 [16] Internal Revenue Service (IRS), Statistics of Income (SOI) research division, “Individual Income Tax Returns", Publication 1304, https://www.irs.gov/statistics/soitaxstatsindividualincometaxreturnspublication1304completereport
 [17] V. M. Yakovenko, “Monetary economics from econophysics perspective”, Eur. Phys. J. Spec. Top. 225, 3313–3335 (2016), http://dx.doi.org/10.1140/epjst/e2016602133.
 [18] E. M. Lifshitz and L. P. Pitaevskii, Physical Kinetics (ButterworthHeinemann, 1981)
 [19] Xi, N., Ding, N., and Wang, Y. “How required reserve ratio affects distribution and velocity of money,” Physica A 357 (2005) 543, https://doi.org/10.1016/j.physa.2005.04.014
 [20] L. D. Landau and E. M. Lifshitz, Statistical Physics (ButterworthHeinemann, 1980)
 [21] Ricardo LópezRuiz, Jaime Sañudo, and Xavier Calbet, “A geometrical derivation of the Boltzmann factor”, American Journal of Physics 76, 780 (2008), https://doi.org/10.1119/1.2907776
 [22] Biró TS, Ván P, Barnaföldi GG, Ürmössy K., “Statistical Power Law due to Reservoir Fluctuations and the Universal Thermostat Independence Principle”, Entropy 16 (2014) 6497, https://doi.org/10.3390/e16126497
 [23] A. A. Drăgulescu and V. M. Yakovenko, “Exponential and powerlaw probability distributions of wealth and income in the United Kingdom and the United States”, Physica A 299 (2001) 213, https://doi.org/10.1016/S03784371(01)002989
 [24] Silva A C and Yakovenko V M “Temporal evolution of the ‘thermal’ and ‘superthermal’ income classes in the USA during 19832001”, Europhys. Lett. 69, 304 (2005), https://doi.org/10.1209/epl/i2004103303
 [25] A. A. Drăgulescu and V. M. Yakovenko, “Evidence for the exponential distribution of income in the USA”, Eur. Phys. J. B 20 (2001) 585, https://doi.org/10.1007/PL00011112
 [26] Yong Tao, Xiangjun Wu, Tao Zhou, Weibo Yan, Yanyuxiang Huang, Han Yu, Benedict Mondal, and Victor M. Yakovenko, “Exponential structure of income inequality: evidence from 67 countries”, Journal of Economic Interaction and Coordination 14, 345 (2019), https://doi.org/10.1007/s1140301702116
 [27] Pareto, V., 1897, Cours d’Économie Politique (L’Université de Lausanne)
 [28] Gibrat R 1931 Les Inégalités Economiques (Sirely, Paris)
 [29] Fujiwara Y, Souma W, Aoyama H, Kaizoji T and Aoki M “Growth and fluctuations of personal income” Physica A 321 (2003) 598, https://doi.org/10.1016/S03784371(02)016631
 [30] A. Banerjee and V. M. Yakovenko, “Universal patterns of inequality”, New J. Phys. 12 (2010) 075032, https://doi.org/10.1088/13672630/12/7/075032
 [31] D. Fiaschi and M. Marsili, “Distribution of wealth and incomplete markets: Theory and empirical evidence”, Journal of Economic Behavior and Organization 81 (2012) 243, https://doi.org/10.1016/j.jebo.2011.10.015

[32]
Karl Pearson, “Contribution to the mathematical theory of evolution II: Skew variation in homogeneous material”, Phil. Trans. Royal Soc. London A
186, 343 (1895)  [33] We used the option “CPI for All Urban Consumers” at https://data.bls.gov/cgibin/surveymost?bls

[34]
Bruce M. Boghosian, Adrian DevittLee, Merek Johnson, Jie Li, Jeremy A. Marcq, Hongyan Wang, “Oligarchy as a phase transition: The effect of wealthattained advantage in a Fokker–Planck description of asset exchange”, Physica A
476 (2017) 15, https://doi.org/10.1016/j.physa.2017.01.071  [35] Kakwani N 1980 Income Inequality and Poverty (Oxford University Press, Oxford)
 [36] Jeff Stein, “With big tax push, Democrats aim to tackle enormous gains of top 1 percent”, Washington Post, 13 September 2021, https://www.washingtonpost.com/uspolicy/2021/09/13/democratstaxbidenbudget/