Skip to main content

Tax Gap: IRS Should Take Steps to Ensure Continued Improvement in Estimates

GAO-24-106449 Published: May 06, 2024. Publicly Released: Jun 05, 2024.
Jump To:

Fast Facts

Taxpayers voluntarily paid 85% of taxes owed on time in 2014-2016. The other 15% of taxes owed—$496 billion—is called the tax gap. Most of this tax gap is a result of taxpayers underreporting the amount of taxes they owe.

To estimate part of the tax gap, the IRS audits a random sample of tax returns based on certain characteristics—such as whether the taxpayer is self-employed. The agency has recently piloted a new AI method to select taxpayers for audits. However, it hasn't documented key components and technical specifications of this AI.

We recommended that it do so to ensure that the IRS is being consistent and transparent with its AI use.

Estimated Average Annual Gross Tax Gap, Tax Years 2014-2016

Pie chart showing 80% of the tax gap is from underreporting of tax liabilities on timely filed tax returns.

Skip to Highlights

Highlights

What GAO Found

Taxpayers voluntarily and timely paid about 85 percent of the taxes they should have paid for tax years 2014-2016, according to the Internal Revenue Service's (IRS) latest tax gap estimate. IRS most recently projected the tax gap will grow to $688 billion for tax year 2021. However, when measured relative to the overall economy, the tax gap has remained relatively stable.

The Internal Revenue Service's Annual Average Tax Gap Estimate for Tax Years 2014-2016

The Internal Revenue Service's Annual Average Tax Gap Estimate for Tax Years 2014-2016

In developing the estimates, IRS applies a statistical technique to National Research Program (NRP) audit data to account for noncompliance that was not detected by examiners in audits. Applying this technique nearly doubles the individual underreporting tax gap estimate, increasing the estimate from $145 billion to $278 billion. IRS has not conducted analysis to understand the causes of this estimate of undetected noncompliance. By doing so, IRS would be better positioned to help improve the reliability of and confidence in the adjustment and potentially examiners' detection of noncompliance.

IRS is piloting a new process for sampling tax returns for NRP's audits. The new process uses artificial intelligence (AI) to improve the efficiency and selection of audit cases to help identify noncompliance. However, IRS has not completed its documentation of several elements of its AI sample selection models, such as key components and technical specifications. Completing documentation would help IRS retain organizational knowledge, ensure the models are implemented consistently, and make the process more transparent to future users.

IRS's Strategic Operating Plan (SOP) describes how IRS intends to spend the tens of billions of dollars it received under the Inflation Reduction Act. However, the SOP is not clearly linked to tax gap data. For example, the plan does not address sole proprietor noncompliance, which is one of the largest areas of tax noncompliance. Furthermore, IRS did not address a prior GAO recommendation to link NRP data to its compliance efforts. As IRS develops implementation plans for the SOP initiatives and projects, it has an opportunity to further integrate a significant source of evidence to refine its compliance strategies, thereby addressing the recommendation. Linking compliance strategies with data would help provide assurance that IRS is effectively allocating enforcement resources.

Why GAO Did This Study

The tax gap—the difference between the tax amounts voluntarily paid by individuals and businesses and the tax amounts that are owed—has been a persistent problem for decades.

The tax gap is an aggregate estimate of individual income, corporation income, employment, and estate taxes. IRS estimates are based on three types of noncompliance: (1) underreporting of tax liabilities on timely filed tax returns; (2) underpayment of taxes due from timely filed returns; and (3) nonfiling, when a taxpayer fails to file a required tax return altogether or on time.

GAO was asked to review IRS's most recent tax gap estimates for tax years 2014-2016. This report (1) describes IRS's estimates and changes from prior estimates; (2) assesses IRS's changes to the sample design of NRP; and (3) assesses how the estimates help IRS and other stakeholders gain insights into noncompliance. GAO reviewed IRS's tax gap data and reports and interviewed IRS officials.


Recommendations

GAO is making six recommendations to IRS, including that IRS conduct additional analyses to understand the root causes of undetected noncompliance and complete its documentation related to the pilot process for sampling returns.

IRS agreed with all six of GAO's recommendations and described steps it plans to take in response to each recommendation.

Recommendations for Executive Action

Agency Affected Recommendation Status
Internal Revenue Service The Commissioner of Internal Revenue should ensure officials in the Research, Applied Analytics and Statistics Division work with officials in the Small Business/Self-Employed and Large Business and International Divisions to develop and report on analyses that could provide stakeholders with more confidence in the methods used to estimate undetected noncompliance, and information on its root causes. (Recommendation 1)
Open
IRS agreed with this recommendation. In July 2025, IRS reported that it is taking steps to implement this recommendation, including working with an external group to develop a new methodology for estimating undetected compliance. IRS also reported that it is coordinating with its SB/SE and LB&I business units about the interim results of the methodology research. IRS also reported that the research was shared with the Tax Gap Estimation Technical Expert Panel. According to IRS, next steps include further developing the methodology and using it in the tax gap publication due to be issued in summer 2026. IRS completing its efforts will help provide a better understanding of the detection controlled estimation methodology, which could potentially improve the quality of audits.
Internal Revenue Service The Commissioner of Internal Revenue should complete documentation on key components of the NRP AI models, such as which model options were considered and chosen for implementation, and how IRS determined risk levels and time frames for risk level updates. (Recommendation 2)
Closed – Implemented
Between October 2024 and August 2025, in response to the recommendation, IRS took several steps. IRS documented what modeling options were considered and implemented for the NRP AI models. IRS also documented how it determined the risk levels used in the pilot process.IRS officials said timeframes are not the appropriate benchmark to use for updating risk levels. Instead, they said they plan to update the models when new data are available and show a need for updates. In a memo documenting the annual NRP sampling approach, IRS officials documented information on risk levels. The memo linked to documentation on how IRS determined risk levels. It also included a discussion of risk level updates and noted that IRS should continually evaluate risk levels and consider if an alternative approach is warranted. These additions addressed the intent of our recommendation. By completing this documentation, IRS increased transparency around its models and will have better assurance that the models will be implemented and updated consistently.
Internal Revenue Service The Commissioner of Internal Revenue should complete documentation for the NRP AI models on the technical specifications needed to run the model, such as how the data are to be split or divided. (Recommendation 3)
Closed – Implemented
In January 2025, in response to the recommendation, IRS took several steps. IRS added links in its technical documentation to data files that list the variables used to train and run the models. IRS also added documentation about how the data are split when running the models. Finally, IRS finalized its technical documentation, providing assurance to future readers that it is an accurate representation of the NRP AI process. By completing this documentation, IRS increased transparency around its models and will have better assurance that the models will be implemented consistently.
Internal Revenue Service The Commissioner of Internal Revenue should ensure IRS has complete documentation on how to update the NRP sample selection process in response to changes in the operating environment, including the performance of its AI models and risk level analysis. (Recommendation 4)
Closed – Implemented
Between October 2024 and August 2025, in response to the recommendation, IRS took several steps. IRS updated its original documentation of the redesigned NRP process to include factors to consider when retraining and updating the models, including risk levels. IRS officials also finalized a memo documenting the annual NRP sampling approach. The memo described updates IRS made in response to the changing operating environment and updates IRS considered but did not make based on the data. For example, IRS updated variables the models used in response to changes in tax law. The memo stated future updates to the sampling process should involve similar considerations. Finally, the memo stated that model performance should be assessed annually to see how the models may have been affected by changes to the operating environment. With this additional documentation, IRS increased transparency around its models and will have better assurance the model updates will consistently include relevant factors.
Internal Revenue Service The Commissioner of Internal Revenue should complete documentation on processes for assessing the results of AI model updates, including validating the models. (Recommendation 5)
Open – Partially Addressed
IRS agreed with this recommendation and has taken steps to partially implement it. A key component of assessing AI systems is documenting limitations. In August 2025, IRS officials finalized a memo documenting the annual NRP sampling approach, which includes a section on limitations. The section describes certain types of tax returns for which the model would not be suitable but notes that the modeling framework could be repurposed for those populations. IRS also updated its original documentation for the AI models to state officials were developing a process for assessing and validating them. In May 2025, IRS officials said the development of that process was on hold due to competing priorities. Completing documentation of this process would fully address this recommendation.
Internal Revenue Service The Commissioner of Internal Revenue should develop and document a plan for evaluating the redesigned sample selection process, including factors IRS will consider in determining whether or how to continue using it. (Recommendation 6)
Open – Partially Addressed
IRS agreed with this recommendation. In October 2024, IRS documented a plan to evaluate the redesigned NRP sample selection process. The plan said the evaluation will be holistic and will not be decided based on a single decision factor. It listed factors that would be considered, including the number of audits resulting in no change to taxes due and the magnitude of tax change for audits resulting in a change to taxes due. It also said the evaluation would include comparative metrics, such as the difference between the actual and predicted number of audits with no tax change for each sampling approach. The plan stated that IRS was in the process of developing another approach that would contribute to the overall evaluation. Completing documentation of that approach will fully address our recommendation.

Full Report

GAO Contacts

Media Inquiries

Sarah Kaczmarek
Managing Director
Office of Public Affairs

Public Inquiries

Topics

Artificial intelligenceCompliance oversightEarned income tax creditRegulatory noncomplianceSmall businessTax administrationTax gapTax returnsTaxpayersTaxes