Methodology

In 2023, we changed our methodology to use both logistic and linear regressions in a bottom-up, multi-stage regression model to estimate the fuel tax credits gap.

Step 1: Estimate probability of entity being selected

We identify characteristics of FTC claimants to help us predict the probability of them being selected for compliance activity.

We then estimate the probability of selection using a logistic regression. This step determines the sample weights for correcting selection bias in the operational data.
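
As an illustration only, step 1 can be sketched in a few lines of Python. The description above does not say how the weights are derived from the selection probabilities. This sketch assumes inverse-probability weighting (weight = 1 / probability of selection), which is a common choice but not confirmed here. The single characteristic `claim_size`, the data and the function names are all hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, steps=5000, lr=0.1):
    """Fit P(y = 1 | x) = sigmoid(a + b * x) by gradient ascent on the mean log-likelihood."""
    a = b = 0.0
    n = len(xs)
    for _ in range(steps):
        grad_a = sum(y - sigmoid(a + b * x) for x, y in zip(xs, ys))
        grad_b = sum((y - sigmoid(a + b * x)) * x for x, y in zip(xs, ys))
        a += lr * grad_a / n
        b += lr * grad_b / n
    return a, b

# Hypothetical data: one standardised characteristic per claimant, and a flag
# that is 1 if the claimant was selected for compliance activity.
claim_size = [-1.5, -1.0, -0.5, 0.0, 0.0, 0.5, 1.0, 1.0, 1.5, 2.0]
selected = [0, 0, 0, 0, 1, 0, 1, 0, 1, 1]

a, b = fit_logistic(claim_size, selected)
p_select = [sigmoid(a + b * x) for x in claim_size]

# ASSUMPTION: inverse-probability weights, computed for selected claimants only.
# A claimant that was unlikely to be selected stands in for many similar ones.
weights = [1.0 / p for p, s in zip(p_select, selected) if s == 1]
```

Under this assumption, a selected claimant with a low selection probability receives a large weight, which is what lets the later regressions speak for the whole population rather than only the audited part.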

Step 2: Estimate probability of entity incorrectly claiming

We analyse the business activity statement (BAS) data of fuel tax credits (FTC) claimants that had an interaction with the ATO, as well as those that did not.

We apply a logistic regression to estimate the probability of an FTC claimant being found to have overclaimed and another to estimate the probability of an FTC claimant being found to have underclaimed. To adjust for selection bias in the operational data, we apply the sample weights calculated in step 1 to the observations.
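
To see why the weights matter, consider the simplest possible case: a logistic regression with an intercept and no other characteristics. Its fitted probability equals the weighted share of overclaims among the audited entities. The sketch below uses made-up data and assumes inverse-probability weights; the actual model uses claimant characteristics that are not listed here.

```python
# Hypothetical audited entities: (1 if found to have overclaimed, selection probability).
# High-risk entities are both more likely to be selected and more likely to overclaim.
audited = [
    (1, 0.8), (1, 0.8), (1, 0.5), (0, 0.5),
    (0, 0.1), (0, 0.1), (1, 0.8), (0, 0.1),
]

# Unweighted rate: treats the audited entities as if they were a random sample.
naive_rate = sum(y for y, _ in audited) / len(audited)

# ASSUMPTION: inverse-probability weights from the selection model.
weights = [1.0 / p for _, p in audited]

# Weighted rate: the fitted probability of an intercept-only weighted logistic regression.
weighted_rate = sum(w * y for w, (y, _) in zip(weights, audited)) / sum(weights)

print(f"naive: {naive_rate:.3f}, weighted: {weighted_rate:.3f}")
```

In this example the weighted rate is well below the naive rate, because the entities most likely to be selected are over-represented among those found to have overclaimed.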

Step 3: Estimate the average incorrectly claimed amount

We analyse the BAS data of FTC claimants who had an interaction with the ATO to identify characteristics that help predict the size of overclaims and underclaims. We apply a Poisson Pseudo Maximum Likelihood (PPML) regression to estimate the overclaim amount for each entity predicted to have overclaimed, and another to estimate the underclaim amount for each entity predicted to have underclaimed. Because a large share of amended BAS return a null result, the amended amounts are not normally distributed. A PPML regression fits such data better and incorporates the null results of claimants who were audited but had claimed correctly.

The results are again weighted to adjust for selection bias. The PPML regressions are then applied to each entity in the population to estimate the potential size of overclaims and underclaims.

The key difference between steps 2 and 3 is that step 2 estimates the likelihood of an entity overclaiming or underclaiming, while step 3 estimates the size of each entity's potential overclaim or underclaim amount.
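
The PPML regression in step 3 can be sketched as follows. This is a minimal illustration with one hypothetical characteristic and made-up amounts, not the model actually used. It fits E[amount | x] = exp(a + b * x) by weighted Newton-Raphson and shows that zero amounts (null results) enter the fit directly, with no need to take logarithms of the outcome.

```python
import math

def fit_ppml(xs, ys, ws, iters=50):
    """Weighted Poisson pseudo-maximum-likelihood fit of E[y | x] = exp(a + b * x).

    y can be any non-negative number, including zero. Uses Newton-Raphson.
    """
    mean_y = sum(w * y for w, y in zip(ws, ys)) / sum(ws)
    a, b = math.log(mean_y), 0.0  # start from the weighted mean
    for _ in range(iters):
        g_a = g_b = h_aa = h_ab = h_bb = 0.0
        for x, y, w in zip(xs, ys, ws):
            mu = math.exp(a + b * x)
            g_a += w * (y - mu)          # score for the intercept
            g_b += w * (y - mu) * x      # score for the slope
            h_aa += w * mu               # entries of the negative Hessian
            h_ab += w * mu * x
            h_bb += w * mu * x * x
        det = h_aa * h_bb - h_ab * h_ab
        a += (h_bb * g_a - h_ab * g_b) / det
        b += (h_aa * g_b - h_ab * g_a) / det
    return a, b

# Hypothetical audited claimants: a characteristic, the adjustment found
# (many are zero because the claim was correct), and a sample weight.
xs = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5]
ys = [0.0, 0.0, 1.0, 0.0, 3.0, 2.0, 0.0, 9.0]
ws = [4.0, 4.0, 2.0, 2.0, 1.5, 1.5, 1.0, 1.0]

a, b = fit_ppml(xs, ys, ws)
predicted = [math.exp(a + b * x) for x in xs]  # always strictly positive
```

Because the model includes an intercept, the weighted sum of predicted amounts matches the weighted sum of observed amounts at the solution, zeros included.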

Step 4: Apply regression results to each entity to estimate overclaims

We combine the regression results from steps 2 and 3 to estimate overclaims. We estimate overclaims by taking the average of the results from 1,000 simulations. This amount includes amendment results.

Step 5: Apply regression results to each entity to estimate underclaims

We combine the regression results from steps 2 and 3 to estimate underclaims. We estimate underclaims by taking the average of the results from 1,000 simulations. This amount includes amendment results.
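
The description of steps 4 and 5 does not specify how the 1,000 simulations are drawn. One plausible reading, assumed in the sketch below, is that each simulation draws whether each entity misclaims using its step 2 probability, adds its step 3 amount if it does, and the totals are then averaged. All probabilities, amounts and names are hypothetical.

```python
import random

def simulate_total(probs, amounts, n_sims=1000, seed=0):
    """Average, over n_sims simulations, of the total misclaimed amount.

    ASSUMPTION: in each simulation an entity misclaims with its predicted
    probability and, if it does, contributes its predicted amount.
    """
    rng = random.Random(seed)
    grand_total = 0.0
    for _ in range(n_sims):
        grand_total += sum(amt for p, amt in zip(probs, amounts) if rng.random() < p)
    return grand_total / n_sims

# Hypothetical predictions for four entities.
p_over = [0.05, 0.20, 0.10, 0.50]
amt_over = [1000.0, 250.0, 4000.0, 100.0]
p_under = [0.02, 0.05, 0.01, 0.10]
amt_under = [500.0, 200.0, 1000.0, 50.0]

overclaims = simulate_total(p_over, amt_over, seed=1)     # step 4
underclaims = simulate_total(p_under, amt_under, seed=2)  # step 5
net_incorrect = overclaims - underclaims                  # step 6
```

Under this reading, the simulated average should sit close to the sum of probability times amount across entities (550 for overclaims and 35 for underclaims in this example), with the simulations also giving a sense of the spread around it.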

Step 6: Estimate net incorrect claims

We subtract total underclaims (step 5) from total overclaims (step 4) to arrive at net incorrect claims.

Step 7: Estimate for non-detection

We multiply the amount from step 6 by the uplift factor to estimate the additional amount attributable to non-detection.

Step 8: Estimate for non-pursuable debt

We add non-pursuable debt to account for overclaims that are not repaid. We estimate this amount for the current year based on long-term trends.

Step 9: Estimate the gross gap, net gap and theoretical fuel tax credit

We add the amounts from steps 6, 7 and 8 to estimate the gross gap. We then subtract amendments from the gross gap to estimate the net gap.

Then we estimate the theoretical fuel tax credit by subtracting the gross gap from the voluntary credits. We derive the gap percentages by dividing the gap amounts by the theoretical credit.

Summary of the estimation process

Table 2 shows a summary of each step of the estimation process and the results for each year.

Table 2: Summary of estimation process for the fuel tax credits gap
Step	Description	2016–17	2017–18	2018–19	2019–20	2020–21	2021–22
1 to 6	Estimate net incorrect claims ($m)	91.3	99.9	86.1	85.0	127.1	211.0
7	Add estimate for non-detection ($m)	22.8	25.0	21.5	21.3	31.8	52.7
8	Add claims made incorrectly and not paid back ($m)	7.4	7.4	7.4	7.4	7.4	7.4
9.1	Gross gap ($m)	121.5	132.2	115.0	113.7	166.2	271.1
9.2	Amendments ($m)	11.1	35.1	10.3	5.8	-0.1	95.8
9.3	Net gap ($m)	110.3	97.2	104.7	107.9	166.3	175.3
9.4	Total credit ($m)	6,284	6,881	7,195	7,417	7,544	6,984
9.5	Theoretical credit ($m)	6,163	6,749	7,080	7,303	7,378	6,713
9.6	Gross gap (%)	2.0%	2.0%	1.6%	1.6%	2.3%	4.0%
9.7	Net gap (%)	1.8%	1.4%	1.5%	1.5%	2.3%	2.6%
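
As a worked check, the arithmetic of steps 6 to 9 can be reproduced from the 2016–17 column of Table 2. The variable names are illustrative. Small differences from the published figures (for example, a net gap of about 110.4 here against 110.3 in the table) presumably arise because the published inputs are rounded.

```python
# Inputs from the 2016-17 column of Table 2 ($m).
net_incorrect = 91.3    # steps 1 to 6
non_detection = 22.8    # step 7
non_pursuable = 7.4     # step 8
amendments = 11.1       # step 9.2
total_credit = 6284.0   # step 9.4

gross_gap = net_incorrect + non_detection + non_pursuable   # about 121.5
net_gap = gross_gap - amendments                            # about 110.4
theoretical_credit = total_credit - gross_gap               # about 6,162.5

gross_gap_pct = 100 * gross_gap / theoretical_credit        # rounds to 2.0
net_gap_pct = 100 * net_gap / theoretical_credit            # rounds to 1.8

# Ratio of the step 7 amount to the step 6 amount implied by the table.
implied_uplift = non_detection / net_incorrect              # close to 0.25

print(f"gross gap {gross_gap:.1f}, net gap {net_gap:.1f}")
print(f"gross gap {gross_gap_pct:.1f}%, net gap {net_gap_pct:.1f}%")
```

The same ratio of roughly one quarter between the step 7 and step 6 amounts holds in every year shown in the table.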

Find out more about our overall research methodology, data sources and analysis for creating our tax gap estimates.

Limitations

The gap estimates are subject to the following limitations.

There is a considerable delay between the end of a financial year and the completion of our compliance activities relating to that year. This means gap estimates may be subject to revisions for several years.

The lower coverage levels of compliance activity have resulted in us adopting a pooled regression approach. We draw all data across 2016–17 to 2021–22 together to estimate the coefficients for all years. This implicitly assumes the relationships between variables do not change much over that period. While deriving standalone regression results for each income year would be ideal, it is not feasible due to limited data across the FTC population.

The extent of non-detection is unknown and extremely challenging to measure.

This estimate does not include the population that may be entitled to fuel tax credits but has not registered for them.

This population does not include a small number of taxpayers who claimed credits for domestic electricity generation.

Non-pursuable debt is provisional and based on an average of historical amounts.

Updates and revisions to previous estimates

Each year we refresh our estimates in line with our annual report. Changes from previously published estimates occur for a variety of reasons, including:

improvements in methodology
revisions to data
additional information becoming available.

For the 2023 Annual Report, the most current data available has allowed us to produce gap estimates for 2020–21 and 2021–22.

Figure 2 shows the net gap from our current estimate, which uses the regression approach, compared to our previously published estimates.

Figure 2: Current and previous net fuel tax credit gap estimates, 2011–12 to 2021–22.

The data used in Figure 2 is presented in Table 3 below.

Table 3: Current and previous fuel tax credits net gap estimates, 2011–12 to 2021–22

Year published	2011–12	2012–13	2013–14	2014–15	2015–16	2016–17	2017–18	2018–19	2019–20	2020–21	2021–22
2023	n/a	n/a	n/a	n/a	n/a	1.8%	1.4%	1.5%	1.5%	2.3%	2.6%
2022	n/a	n/a	n/a	−0.3%	−0.6%	0.7%	1.9%	0.5%	0.2%	n/a	n/a
2021	n/a	n/a	n/a	−0.1%	−0.2%	−0.1%	0.0%	−0.1%	n/a	n/a	n/a
2020	n/a	n/a	n/a	−0.1%	−0.2%	−0.1%	−0.1%	−0.1%	n/a	n/a	n/a
2019	n/a	−0.2%	−0.1%	0.0%	−0.1%	−0.1%	−0.1%	n/a	n/a	n/a	n/a
2018	0.7%	0.7%	0.8%	−0.4%	−0.2%	−0.3%	n/a	n/a	n/a	n/a	n/a
2017	0.7%	0.7%	0.7%	−0.5%	−0.3%	n/a	n/a	n/a	n/a	n/a	n/a
2016	0.8%	0.6%	0.6%	−0.4%	n/a	n/a	n/a	n/a	n/a	n/a	n/a
2015	1.5%	1.5%	1.5%	n/a	n/a	n/a	n/a	n/a	n/a	n/a	n/a