Balancing the value of integrated data and privacy in today's data economy - Talei Parker, ABS University Partnership Manager
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Balancing the value of integrated data and privacy in today’s data economy Talei Parker, ABS University Partnership Manager
ABS DataLab Access is only to approved researchers and ABS or government staff Files remain in the secure ABS environment Analytical information does not include name and address All analytical outputs are checked by ABS staff 3
Data access and release Five Safes Framework Safe people Is the researcher authorised to access and use the data appropriately? Safe projects Is the data to be used for an appropriate purpose? Does the access environment prevent unauthorised Safe settings use? Safe data Has appropriate and sufficient protection been applied to the data? Safe output Are the statistical results non-disclosive? 4
Overview of MADIP The Multi-Agency Data Integration Project (MADIP) is a secure, person based research data asset combining information on health, education, government payments, income and taxation, employment, and population demographics to create a comprehensive picture of Australia over time.. MADIP • MADIP Uses • Nationally important datasets • Partner Agencies • Australian Bureau of Statistics • Answer policy questions • Australian Taxation Office • Program evaluation • Department of Health • Empirical research on socio-economic issues • Department of Social Services • No identification of individuals • Department of Education, Skills and • Used only for statistical and research Employment purposes (never for compliance) • Services Australia MADIP data is securely held by the ABS – access is only made available to approved researchers for approved purposes.
Analytical content and Enduring data in MADIP What are the outcomes for job seekers who cease participating in employment services programs in remote Australia? What is the impact of Paid Parental leave on income shifting and the health and health care use outcomes of Australian women? What are the impacts of international Registries Skilled migration on Australia? Australian Death Data Migration Census Longitudinal Points Traveller Dataset data Census of Population and Housing Client Which Australians are information What factors prevent most at risk in Australians from heatwave completing tertiary Social Security FAMILIES & education and conditions? and Related HOUSEHOLDS Visa information & transitioning to MIGRANTS Information citizenship grants employment? National Personal Income Tax INCOME & MADIP Health TAXATION 2006 to 2018/19 Survey Australian What risk and protective factors Apprenticeships EDUCATION Pharmaceutical What is the impact of mental influence student learning Incentives Programs & Training HEALTH Benefits Schedule disorders on students? trajectories through school? Contracts Higher Education Medicare Information Enrolments Management Database System What factors are important to Australian Medicare consider when targeting What are the benefits of Centralised educational attainment? Early Development Survey of Benefits interventions in relation to Disability, Register opioid misuse? Census Schedule of Medical Ageing, and Practitioners Carers Provider Directory What are the health and social What are the differences in health care outcomes of individuals that usage of people from culturally and have co-morbid mental and linguistically diverse backgrounds? physical health issues?
What research questions can MADIP answer? What is the impact of Paid Parental leave What are the benefits of educational on income shifting and the health and attainment? health care use outcomes of Australian women? What risk and protective factors influence student learning trajectories What are the impacts of international through school? migration on Australia? What are the outcomes for job seekers who What factors are important to consider cease participating in employment services when targeting interventions in relation programs in remote Australia? to opioid misuse? What are the differences in health care What factors prevent Australians from usage of people from culturally and completing tertiary education and linguistically diverse backgrounds? transitioning to employment?
Case Study: Helping vulnerable Australians survive heatwaves • National mortality increases by 2% during heatwaves • Large rural towns have the highest elevated risk for heatwave deaths See MADIP case studies and visit the MADIP Research Projects for the full list of projects
BLADE Overview
Business Longitudinal Data Analysis Environment The Business Longitudinal Analysis Data Environment (BLADE) is an economic data tool combining tax, trade, and intellectual property data with ABS data to provide a better understanding of the Australian economy and business performance over time… BLADE Uses • BLADE Partnerships • Nationally important datasets • • Insights on business performance • Australian Bureau of Statistics (ABS) and dynamics • Australian Taxation Office (ATO) • Answer policy questions • Department of Industry, Science, Energy and Resources (DISER) • Program evaluation • IP Australia • Treasury • No identification of individuals • Department of Agriculture, Water • Not used for compliance (only for and the Environment (DAWE) statistical and research purposes) • Department of Home Affairs (DHA) BLADE data is securely held by the ABS – access is only made available to approved researchers for approved purposes.
Business Longitudinal Analysis Data Environment How does the use of digital technologies vary by sector and by business size? To what extent is there an What is the relationship between energy productivity gap? management capability and firm Business Private Characteristics performance? Non-Profit Survey: Expenditure on Why has the number of Management ABR Research and Energy, Water Development Capabilities Indicative entrepreneurs declined? and Module Items Environment Intellectual Survey Property How does trademark use impact the export Business Longitudinal behaviour and performance of Australian Expenditure on Research Data Research and ABS SURVEYS ABR businesses? How do the effects of collaborative R&D Development accumulate in the years following R&D IP Merchandise activity? Survey of Research BLADE Imports Data and Experimental TRADE 2001-02 to Development, Government 2018-19 What is the impact of Austrade Merchandise Exports Data programs on trade and Do manufacturing firms in Economic investment? Australia have (or develop) a Activity INCOME AND Survey productivity advantage? TAX Business Locations Business (BL) Characteristics Business Survey Business Payment Activity Income Tax Summaries Statement (BIT) (BAS) What are the characteristics of businesses entering What are the pathways for (PAYG) the tourism industry compared with those leaving farmers experiencing prolonged the industry? drought? What are the social benefits of Government investment in private R&D by different types of R&D activity?
BLADE research Understanding entrepreneurship dynamics in Australia Using BLADE microdata to create a better picture of the drivers behind firms’ productivity and performance BACKGROUND DATA SOLUTION • Recent data indicate that: • Enables longitudinal study of business performance, innovation, - Australia’s productivity performance has weakened job creation, competitiveness and productivity - Entrepreneurship rates are declining • Firm-level microdata in BLADE presents increased opportunity to • We need a better understanding of the key drivers of use empirical economic research to improve policies productivity and their impact on industry and firm • Increases the evidence base on productivity-related dynamics, performance by analysing the drivers of the decline in entrepreneurship OUTCOMES • More effective and better targeted policy design, encouraging a productive and efficient Australian economy • Practical advice to businesses to develop strategies and practices for growth
BLADE Data Modules BLADE Taxation Modules ABS Survey Data Modules Other Modules BCS ABSBR EAS MCM EXPORTS BIT BERD IPLORD BAS PNPERD GOVER D IMPORTS PAYG Ag BL EEH • All datasets can be linked by a common key • Full data item list available upon request 14 25/11/2020
Enhancing BLADE • ATO datasets are being updated with 19-20 financial year data. Estimated timeframe is April/May 2021. • Improved methodology – BLADE logic is being redesigned to facilitate different levels of output (ABN level, different linking methods) • Quarterly data – Changes in method now allow for quarterly updates for some data inputs, this will be phased-in during 2021 • Work is underway to utilise new data sources such as: i. Single Touch Payroll (at the ABN level) ii. Indicative data for agricultural businesses iii. Survey of Employee Earnings & Hours A range of other Federal and State agencies are currently being engaged for inclusion of additional data into the BLADE asset
Integrated Person and Business Data • ABS has enabled the integration of some BLADE datasets with information about employer characteristics to employee data within the Multi-Agency Data Integration Project (MADIP) asset over time • Business and person-level data are brought together via the ABN of a person's employer, where this is recorded on a MADIP dataset • BLADE data linked with MADIP is limited to BLADE Core and the Business Characteristics Survey (BSC) with limitations on the use of BCS and Business Income Tax to the creation of flags • Standard governance arrangements apply and costs are subject to project technical assessment by ABS
Keeping data safe We have many protections in place to keep your data safe, including: Separation Data Data Safe access: Data Legislation security principle minimisation storage the Five Safes Functional separation 17
Legislation The Census and Statistics Act 1905 • Data collection, use, and disclosure is authorised by The Privacy Act law Information cannot be released in a manner likely to 1988 • There are protections for data against unauthorised enable the identification of individuals and access and disclosure, or loss. organisations • ABS and seconded officers are legally bound to uphold the confidentiality of MADIP information. • The MADIP is collected under this Act, which requires • Each agency is authorised by law to collect personal the ABS to publish and disseminate compilations and Other agencies’ information as part of its core functions, to share that analyses of statistical information and to maintain legislation information with the ABS for MADIP in order to use it the confidentiality of information collected for policy analysis, research, and statistical purposes. When passed, this Act Sharing Data will: and Release Act • Promote better sharing of public sector data Future Data • Strong security arrangements for all IT systems: https://www.pmc.gov.au/public-data/data- Australian • Build trust in use of public data Sharing and Australian Government Information Security Manual sharing-and-release-reforms Government • Dial up or down appropriate safeguards Release Act • Strict control of access to premises: Commonwealth standards • Maintain integrity of the data system and (DS&R Act) Protective Security Manual • Establish institutional arrangements. 18
The separation principle SEPARATE SEPARATE SEPARATE TRANSFER STORAGE ACCESS No-one can access both personal identifiers and Receipt of analytical information and Personal identifiers are stored analytical information at the same time. personal identifiers is in separate files securely and separately from the at separate times. analytical information. Each person working on the project is assigned a role, and is only able to access the information necessary to perform that role. Analytical information Such as: Personal identifiers • Occupation Such as: • Income • Name • Health services use • Address • Types of Government • Date of birth payments 19
Functional separation A person working on a project can only hold one role at a time. This means that personal information and analytical information cannot be accessed at the same time, and no person can ever see all of your information together at any point in the process. Librarian Linker Assembler Analyst The Librarian prepares The Linker links The Assembler creates The Analyst analyses information for linkage information together files for analysis linked information Only accesses data without direct identifiers 20
MADIP data linkage, assembly and storage Separate datasets are able to be brought together for specific projects Source datasets (analytical data) Person Linkage Spine Project analytical datasets Linkage results SEPARATE STORAGE Mechanism for Assembly of analytical Access to analytical data in of analytical data linkage data ABS DataLab 21
MADIP data security Clear accountabilities Secured internet and risk management gateway processes Data supply via Regular independent Assessed under the secure transfer reviews of security Information Security methods (e.g. arrangements and an Registered Assessors Secure Deposit ongoing program of Program (IRAP) Box) Logging and monitoring of security audits access and use of information IT security arrangements Information is only that conform with combined in a secure government standards for environment within the information security (the ABS, by a dedicated team Australian Government Staff security checks and restricted Information Security access to data Manual) 22
Data Availability and Transparency Act
Permitted Purposes 24 25/11/2020
Layers of Defence 25 25/11/2020
Questions?
You can also read