Data

Introduction

The National Treasury Secure Data Facility (NT-SDF), established through the collaboration between the United Nations University World Institute for Development Economics Research (UNU-WIDER), the South African Revenue Service (SARS) and National Treasury (NT), provides researchers with partially anonymised tax data for research. South Africa is one of only a few countries globally to grant access to this type of data for research purposes. This is a unique opportunity as access to national administrative data for researchers outside of senior officials is rarely granted. The administrative data plays a central part in the SA-TIED programme and aims to inform policy formulation.

Data access

Located on the 20th floor of the NT building in Pretoria, the NT-SDF has a number of terminals available to researchers. Researchers with successful applications in response to calls for proposals, listed under the opportunities tab, can secure access the data to conduct their research. 

Apart from responding to the request for proposals, researchers can send proposals (max five pages) to the National Treasury (ntsdf@treasury.gov.za) to request access to the data. Proposals should outline the research question, the relevant literature, the proposed methodology, exact data requirements, and the anticipated time needed in the data lab. In addition, the researcher should ensure that their proposal illustrates their familiarity with papers on the tax data relevant to their proposal. This will ensure they understand the data limitations and place their proposed contribution into the tax data literature.

Data description

The de-identified tax data available at the NT-SDF includes our core datasets such as the Corporate Income Tax (CIT), Employee Income Payroll certificates (IRP5), Personal Income Tax (PIT), Customs, and Value Added Tax (VAT) datasets. These datasets are linked to specific SARS forms that generate the data. Below is a comprehensive list of the datasets which are available.

  • Common Reporting Standards
  • Corporate Income Tax panel (IT14 and ITR14 forms)
  • Customs (SAD500 and CD1 forms)
  •   Employer Reconciliation Declaration (EMP 501 form)
  •  Employee Income Payroll certificates (IRP5 form)
  • Excise Duties
  •  Individual income tax return (ITR12 form)
  • Labour Brokers (IRP30A form)
  • Monthly Employer Declaration (EMP201 form)
  • SARS Treasury Matched Firm (STMF) Panel (merged CIT, Employee Income Payroll certificates, Customs and VAT)
  • SARS Treasury Matched Individual Panel (merged Employee Income Payroll certificates and Individual income tax return)
  •  Value-Added Tax (VAT201 form)
  • Meta and dummy data for selected datasets are available at: https://www.treasury.gov.za/research/NT-SDF/Available%20Datasets/Default.aspx.

Data Security

The data are partially anonymised by masking identifying information (i.e., tax reference number, PAYE reference number, etc.). The aim here is to prevent anyone granted access to the tax data at NT-SDF from directly or indirectly identifying companies or individuals described by the data. The NT-SDF also offers additional security by requiring a non-disclosure agreement (NDA) and the Oath of secrecy before accessing the data. Furthermore, NT-SDF ensures that researcher output exported from the data facility does not contain potentially identifying information. Please refer to the “Standard Operating Procedure at the NT-SDF” document under the Data-Resources tab for more information on what information can be exported from the data lab.

Research

If you are planning on responding to an SA-TIED call for research proposals, please check the papers at the bottom of this page for completed, current, and ongoing research to avoid the duplication of existing research. Final papers resulting from the analysis of these data will be made available on our research page alongside all other SA-TIED final papers.

Some independent research studies have also been undertaken by staff at the National Treasury. An overview of this research can be accessed here.

Please feel free to send an email to: ntsdf@treasury.gov.za with additional questions regarding the data.

Working paper
Chandré Jacobs, Amina Ebrahim, Murray Leibbrandt, Jukka Pirttilä, and Marlies Piek
We use individual-level tax administrative data to estimate personal income inequality among the tax-compliant population in South Africa over the period 2011–21. Our results indicate that inequality of this population rose slightly over the period...
SEPTEMBER 2024
Labour market and inequality
Working paper
Giacomo Brusco, Marlies Piek, and Tejaswi Velayudhan
Refunds are an essential feature of well-functioning VAT systems and take up a sizeable portion of government spending. In South Africa, refunds amount to 50 per cent of gross VAT collection, representing a substantial transfer...
SEPTEMBER 2024
Public revenue
Blog
Christopher Axelson, Jukka Pirttilä, Antonia Hohmann, Roxanne Raabe, Nadine Riedel
South Africa faces a dual challenge of limited fiscal space—the need to raise sufficient revenues to finance public services—and high levels of inequality. One possible solution is to raise tax rates on the rich. If...
August 2024
Public revenue
SA-TIED
Abstract Despite widespread and growing adoption of industrial policy across the world, sound empirical evidence is still limited and typically skewed towards high-income settings. I propose to fill this gap by examining the automotive industry...
Blog
Aimable Nsabimana and Abena Larbi-Odam
Sub-Saharan Africa has abundant natural resources and a substantial market, with an estimated population of 1.2 billion. The population is projected to grow by nearly 80% and reach almost 2 billion people by 2043. This...
February 2024
News
SA-TIED
To support evidence-based policymaking across southern and east Africa, UNU-WIDER, in collaboration with the National Treasury of South Africa, the Spatial Economic Activity Data South Africa (SEADsa), and the SA-TIED programme, successfully organized a capacity...
December 2023
Working paper
Ada Jansen, Winile Ngobeni, and Wynnona Steyn
South Africa has a progressive broad-based personal income tax system with relatively few tax expenditures. The two most important are the medical contribution plus additional tax credits for medical expenses, and the deductions allowed for...
November 2023
Public revenue
Working paper
Joshua Budlender and Ihsaan Bassier
Faced with more favourable demand conditions, many firms raise wages. However, we show that firms with labour market power, lower productivity, and binding wage floors will absorb these positive revenue productivity shocks as excess profits...
November 2023
Enterprise development
Working paper
Shakeba Foster
This paper investigates the role of firm characteristics in driving wage inequality and firm wage premia in the South African labour market. The Abowd, Kramarz, and Margolis (AKM) and Kline, Saggio, and Sølvsten (KSS) regression-based...
October 2023
Labour market and inequality
Working paper
Michelle Pleace, Matthew Clance and Nicky Nicholls
The attainment of financial independence by women holds significant importance for women’s empowerment and has implications for the increasing prevalence of female-headed households in the country. Previous estimates of the gender wage gap in South...
October 2023
Labour market and inequality
News
SA-TIED
SA-TIED marked a significant milestone on 13 October 2023, by hosting an inaugural National Treasury Secure Data Facility (NTSDF) data lab roadshow at the University of Pretoria . The event featured a presentation by the...
October 2023
Enterprise development
Public revenue
Labour market and inequality
Journal article
Marlies Piek, Dieter von Fintel, Johann Kirsten
Marlies Piek , Dieter von Fintel , Johann Kirsten This paper is the first to provide estimates of how minimum wages affect worker flows and employment growth rates in an employment scarce developing country context...
SEPTEMBER 2023
Abstract We incorporate a model of frictions into the bunching-based elasticity estimator to rationalize diffuse bunching around kinks and mass above notches in empirical distributions. Model agents draw a sparse set of opportunities from a...
Working paper
Andrew R. Donaldson
Andrew R. Donaldson March 2023 Tax administration statistics now provide considerably more complete and reliable measures of South African personal income and its distribution than the available household or other survey sources. However, there are...
March 2023
Labour market and inequality