Taxation statistics - Aggregated individual tax return sample files

Australian Taxation Office


The de-identified data from the 2013-14 individual 2% sample file has been aggregated to the following levels: sex; age (5-year ranges); occupation (1-digit level); partner status; location (SA4 region name); lodgment channel (agent or self-preparer); and private health insurance (PHI) indicator.
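As a rough illustration of what aggregating unit records to these levels involves, the sketch below groups a few made-up records by a subset of the dimensions and produces a count and a total for each combination. The field names (`sex`, `age_range`, `sa4_name`, `taxable_income`), the region labels and all values are assumptions for illustration only; they are not the column names or contents of the published file.

```python
from collections import defaultdict

# Hypothetical unit records; field names and values are illustrative only.
records = [
    {"sex": "F", "age_range": "30-34", "sa4_name": "Region A", "taxable_income": 52000},
    {"sex": "F", "age_range": "30-34", "sa4_name": "Region A", "taxable_income": 61000},
    {"sex": "M", "age_range": "45-49", "sa4_name": "Region B", "taxable_income": 78000},
]

# A subset of the aggregation levels, using assumed field names.
GROUP_KEYS = ("sex", "age_range", "sa4_name")


def aggregate(rows, keys, value_field):
    """Return {group: {"count": n, "total": sum}} for each distinct key combination."""
    groups = defaultdict(lambda: {"count": 0, "total": 0})
    for row in rows:
        group = tuple(row[k] for k in keys)
        groups[group]["count"] += 1
        groups[group]["total"] += row[value_field]
    return dict(groups)


summary = aggregate(records, GROUP_KEYS, "taxable_income")
for group, stats in summary.items():
    print(group, stats)
```

Adding further dimensions, such as occupation or lodgment channel, would only require extending `GROUP_KEYS`, provided the records carry those fields.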

Data from the ABS Census (2011) and ABS SEIFA was then added at the SA4 region level, as either summary variables or ranked variables.
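Attaching region-level variables to aggregated rows amounts to a lookup on the SA4 region name. The following is a minimal sketch under stated assumptions: `sa4_name`, `seifa_rank` and the region labels are hypothetical, and the values are invented rather than taken from the Census or SEIFA.

```python
# Hypothetical aggregated rows and region-level attributes; all names and values are made up.
aggregated = [
    {"sa4_name": "Region A", "sex": "F", "count": 2},
    {"sa4_name": "Region B", "sex": "M", "count": 1},
    {"sa4_name": "Region C", "sex": "F", "count": 4},
]
region_attributes = {
    "Region A": {"seifa_rank": 1},
    "Region B": {"seifa_rank": 2},
}


def attach_region_attributes(rows, attributes, key="sa4_name"):
    """Left-join region attributes onto each row; unmatched rows are kept unchanged."""
    enriched = []
    for row in rows:
        extra = attributes.get(row[key], {})
        enriched.append({**row, **extra})
    return enriched


enriched = attach_region_attributes(aggregated, region_attributes)
for row in enriched:
    print(row)
```

Rows whose region has no matching attributes are kept rather than dropped, so gaps in the region-level data stay visible instead of silently shrinking the dataset.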

This dataset has been created in preparation for GovHack 2016.


Anthony Nolan

Innovation Officer / Data Scientist / Intelligence Analyst – Australian Taxation Office
