We use 2 bottom-up statistical methods to estimate the high wealth income tax gap – the 'extreme value theorem' regression model for individuals and the multi-stage 'logistic linear regressions' model for companies. We step through the following methods and results and combine them in Table 1.
Calculation – high wealth individuals
There are 4 steps in the extreme value theorem to estimate the high wealth individuals tax gap.
Step 1: Identify the extreme population
Amendments for high wealth individual taxpayers follow a power law distribution, with the majority of total tax amendments in value terms represented by a small number of amended income tax returns.
We rank the amendments in descending order and identify the point where the cumulative sum of positive amendments is equal to or less than the total negative amendments.
We remove all these small amendments, which have no impact on the net value of total amendments. The remaining amendments are referred to as the 'extreme values'.
We calculate the number of extreme values as a ratio of all amendments to be used for extrapolation purposes in step 2.
Step 2: Estimate the unreported tax amount
We transform the amendment data of the extreme population to estimate a linear relationship between the value and rank of the amendments using a regression approach.
To estimate the unreported tax amount, we then extrapolate the relationship to the number of taxpayers expected to contribute to the extreme values in the wider population.
Step 3: Apply a non-detection uplift factor
We need to account for imperfections in the process that could lead to the final gap estimate not reflecting the true tax gap. To account for non-detection, we apply an uplift factor to the unreported tax amount in step 2.
Step 4: Consolidate the gap estimates
We calculate the gross gap by adding the:
- unreported amounts from step 2
- non-detection uplift from step 3
- non-pursuable debt.
We calculate the net gap by subtracting the total amendment amount from the gross gap. Then we add the net gap to the expected collections to estimate the total theoretical liability.
Summary of the estimation process – high wealth individuals
Table 2 shows the:
- individuals population count at step 1
- dollar values at steps 2 to 4.6
- percentage figure for the gross and net gaps at steps 4.7 and 4.8.
Step | Description | 2016–17 | 2017–18 | 2018–19 | 2019–20* | 2020–21* | 2021–22* |
---|---|---|---|---|---|---|---|
1 | Total population (count) | 7,745 | 9,886 | 11,383 | 13,595 | 14,809 | 19,403 |
2 | Total expected amendments ($m) | 358 | 360 | 361 | 362 | 363 | 364 |
3 | Non-detection ($m) | 215 | 234 | 237 | 251 | 258 | 300 |
4.1 | Non-pursuable debt ($m) | 1 | 0 | 0 | 0 | 0 | 0 |
4.2 | Gross gap ($m) | 575 | 594 | 599 | 614 | 621 | 664 |
4.3 | Amendments ($m) | 57 | 255 | 267 | 193 | 193 | 193 |
4.4 | Net gap ($m) | 518 | 339 | 331 | 421 | 428 | 471 |
4.5 | Expected collections ($m) | 3,540 | 5,357 | 5,454 | 6,098 | 7,703 | 11,045 |
4.6 | Total theoretical liability ($m) | 4,059 | 5,696 | 5,786 | 6,518 | 8,131 | 11,516 |
4.7 | Gross gap (%) | 14.2% | 10.4% | 10.3% | 9.4% | 7.6% | 5.8% |
4.8 | Net gap (%) | 12.8% | 6.0% | 5.7% | 6.5% | 5.3% | 4.1% |
*Projected years
Calculation – high wealth companies
The following 5 step bottom-up regression is applied to estimate the High Wealth-linked companies tax gap.
Step 1: Establish a logistic regression trend
We analyse the income tax return data of companies that have been subject to amendment activities and adjust the data to account for selection bias. We identify the relevant characteristics of companies in general that would contribute to the prediction of whether a company has a tax gap.
Based on these characteristics, we assign each company a unique probability of having a tax gap. We then model each company to be compliant or non-compliant through a Monte Carlo simulation.
Step 2: Establish a linear regression trend
We analyse tax return data of known non-compliant companies to identify characteristics of companies that would contribute to the prediction of the tax gap size. We also apply weights to account for selection bias. Then we apply linear regression to each company to estimate the potential size of the gap.
The key difference between steps 1 and 2 is that step 1 calculates the likelihood of a company having a tax gap while step 2 calculates the size of each company's potential tax gap.
Step 3: Combine the results from the 2 regressions
We calculate the estimated unreported tax amount for each simulation by adding the step 2 non-compliance amount to the predicted non-compliance companies in step 1. We estimate total unreported tax (including amendments) by taking an average of the results from 20,000 simulations.
Step 4: Apply a non-detection uplift factor
We uplift the estimates preceding this step to account for non-compliance that isn't detected. This ensures that the final estimate is not understated.
Step 5: Consolidate the tax gap estimates
We calculate the gross gap by adding up the:
- unreported amounts from step 3
- non-detection uplift from step 4
- non-pursuable debt.
We calculate the net gap by subtracting the total amendment amount from the gross gap. We then add the net gap to the expected collections to estimate the total theoretical liability.
Summary of the estimation process – high wealth companies
Table 3 shows the:
- dollar values in millions at steps 1 to 5.6
- company population count at step 5.7
- percentage figures for the gross and net gaps at step 5.8 and 5.9.
Step | Description | 2016–17 | 2017–18 | 2018–19 | 2019–20* | 2020–21* | 2021–22* |
---|---|---|---|---|---|---|---|
1 | Total population (count) | 14,689 | 19,383 | 21,321 | 23,388 | 26,777 | 31,763 |
1–3 | Unreported tax including amendments ($m) | 216 | 326 | 328 | 345 | 434 | 485 |
4 | Non-detection ($m) | 138 | 210 | 212 | 227 | 282 | 318 |
5.1 | Non-pursuable debt ($m) | 5 | 18 | 5 | 5 | 5 | 5 |
5.2 | Gross gap ($m) | 359 | 555 | 545 | 577 | 722 | 808 |
5.3 | Amendments ($m) | 48 | 33 | 45 | 42 | 42 | 42 |
5.4 | Net gap ($m) | 310 | 523 | 500 | 535 | 680 | 766 |
5.5 | Expected collections ($m) | 3,618 | 5,203 | 5,136 | 5,287 | 6,663 | 7,864 |
5.6 | Total theoretical liability ($m) | 3,929 | 5,726 | 5,637 | 5,822 | 7,342 | 8,630 |
5.7 | Gross gap (%) | 9.1% | 9.7% | 9.7% | 9.9% | 9.8% | 9.4% |
5.8 | Net gap (%) | 7.9% | 9.1% | 8.9% | 9.2% | 9.3% | 8.9% |
*Projected years
Find out more about our overall research methodology, data sources and analysis for creating our tax gap estimates.
Limitations
The following caveats and limitations apply when interpreting this tax gap estimate:
- There is a considerable delay between an income year and the completion of our compliance activities for that year. This means gap estimates are subject to revisions for a considerable period. Amendment results for companies and individuals are projected for 2019–20 to 2021–22. They are expected to be subject to revisions overcoming years.
- Provisions are made for non-pursuable debt for all years, excluding 2017–18.
- There is no independent data source that can provide a credible or reliable macroeconomics-driven estimate (unlike indirect taxes).
- The true extent of non-detection is unknown and extremely challenging to measure. There is no international proxy we can apply to the individuals or companies in this population.
Updates and revisions to previous estimates
Each year we refresh our estimates in line with the annual report. Changes from previously published estimates occur for a variety of reasons, including:
- improvements in methodology
- revisions to data
- additional information becoming available.
Figure 2: Current and previous net high wealth income tax gap estimates, 2012–13 to 2021–22
This data is presented in Table 4 as a percentage.
Table 4: Current and previous net high wealth income tax gap estimates, 2012–13 to 2021–22
Year | 2012–13 | 2013–14 | 2014–15 | 2015–16 | 2016–17 | 2017–18 | 2018–19 | 2019–20 | 2020–21 | 2021–22 |
---|---|---|---|---|---|---|---|---|---|---|
2024 program | n/a | n/a | n/a | n/a | 10.4% | 7.5% | 7.3% | 7.7% | 7.2% | 6.1% |
2023 program | n/a | n/a | n/a | 9.0% | 10.1% | 7.0% | 8.2% | 7.5% | 7.1% | n/a |
2022 program | n/a | n/a | 6.4% | 7.4% | 7.8% | 6.8% | 7.0% | 6.7% | n/a | n/a |
2021 program | n/a | 6.6% | 6.5% | 6.6% | 6.8% | 7.1% | 6.9% | n/a | n/a | n/a |
2020 program | 6.5% | 6.9% | 8.2% | 6.9% | 7.1% | 7.4% | n/a | n/a | n/a | n/a |
2019 program | 6.6% | 7.7% | 7.1% | 7.3% | 7.7% | n/a | n/a | n/a | n/a | n/a |