Data Methodology

SA Data Hub is committed to transparency about how data is sourced, processed, and presented. This page documents every data source, update schedule, indicator definition, and known limitation.

Core Principles

Primary Sources Only

All data originates from official government statistical agencies or internationally recognised institutions. Figures are never taken from data aggregators or news sources.

No Editorial Adjustment

Data is presented as released. We do not seasonally adjust, smooth, or normalise figures beyond what is in the official release.

Limitations Disclosed

Every dataset page documents known caveats, sampling limitations, and revision risks. Provisional data is clearly labelled.

Data Sources

Statistics South Africa (Stats SA)

www.statssa.gov.za
Quarterly / Annual | 1994 – present
Unemployment (QLFS), Inflation (CPI / PPI), GDP (National Accounts), Census 2022, Population Estimates, Housing (GHS)

The primary national statistical office. All headline macroeconomic indicators originate here. Stats SA releases are the reference source for all economic data on this platform.

South African Reserve Bank (SARB)

www.resbank.co.za
Monthly / Quarterly | 1960 – present
Interest Rates (Repo / Prime), Government Debt, Balance of Payments

SARB's online statistical query tool provides downloadable time series for monetary policy variables. Data is authoritative for financial indicators.

Department of Basic Education (DBE)

www.education.gov.za
Annual | 2000 – present
Matric Pass Rates, School Enrollment, Provincial Education Statistics

DBE releases annual matric results in January for the previous year. Provincial breakdowns are available in the full NSC diagnostic report.

South African Police Service (SAPS)

www.saps.gov.za/services/crimestats.php
Annual (September) | 2000 – present
Crime Statistics, Contact Crime, Property Crime, Provincial Crime

SAPS releases annual crime statistics in September covering April–March financial years. Station-level data is available from the Crime Statistics portal.
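Because the crime figures follow April–March financial years rather than calendar years, it can help to see how a date maps to a reporting year. The sketch below is illustrative only: the function name and the "2023/24" label format are assumptions, not something SAPS or this platform prescribes.

```python
from datetime import date


def saps_financial_year(d: date) -> str:
    """Return the April-March reporting year containing d, e.g. '2023/24'."""
    # Reporting years start on 1 April, so January-March belong to the
    # year that began in the previous calendar year.
    start_year = d.year if d.month >= 4 else d.year - 1
    end_suffix = (start_year + 1) % 100
    return f"{start_year}/{end_suffix:02d}"


print(saps_financial_year(date(2024, 2, 15)))  # 2023/24
print(saps_financial_year(date(2024, 4, 1)))   # 2024/25
```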

World Bank Open Data

data.worldbank.org/country/ZA
Annual | 1960 – present
Long-run GDP, Poverty Headcount, Gini Coefficient, Internet Access, Labour Force Participation

Used for long-run time series and international comparisons. World Bank figures may lag Stats SA by 1–2 years but provide consistent methodology across decades.

Update Schedule

Dataset | Frequency | Approx. Publication Lag
CPI Inflation | Monthly | ~25 days
GDP Growth | Quarterly | ~60 days
Unemployment (QLFS) | Quarterly | ~6 weeks
Provincial Unemployment | Quarterly | ~6 weeks
Matric Pass Rates | Annual | ~2 weeks (Jan)
Crime Statistics | Annual | September
Population Estimates | Annual | Mid-year
Housing / GHS | Annual | ~12 months
Census Data | Decennial | ~2 years
Interest Rates (Repo) | MPC meetings (~bi-monthly) | Same day
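As a rough illustration of how the approximate lags in the schedule translate into expected release dates, the sketch below adds a lag to the end of a reference period. The dictionary and function names are hypothetical, "~6 weeks" is taken as 42 days, and the results are estimates only; actual release dates are set by the publishing agency.

```python
from datetime import date, timedelta

# Approximate lags in days, read off the schedule ("~6 weeks" taken as 42 days).
APPROX_LAG_DAYS = {
    "CPI Inflation": 25,
    "GDP Growth": 60,
    "Unemployment (QLFS)": 42,
}


def expected_release(dataset: str, reference_period_end: date) -> date:
    """Estimate a release date as the reference period end plus the approximate lag."""
    return reference_period_end + timedelta(days=APPROX_LAG_DAYS[dataset])


print(expected_release("CPI Inflation", date(2024, 1, 31)))        # 2024-02-25
print(expected_release("GDP Growth", date(2024, 3, 31)))           # 2024-05-30
print(expected_release("Unemployment (QLFS)", date(2024, 3, 31)))  # 2024-05-12
```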

Dataset Definitions & Limitations

Unemployment Rate

Quarterly (Feb, May, Aug, Nov) | Lag: ~6 weeks after reference quarter | Since 2008 Q1

Definition

The narrow (official) unemployment rate measures persons without work, available to work, and actively seeking work as a percentage of the labour force. The expanded rate includes discouraged work-seekers.
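The two rates can be illustrated with a short calculation. This is a minimal sketch based only on the definition above: it assumes the expanded rate adds discouraged work-seekers to both the unemployed count and the labour force, and the input figures are made up for illustration, not taken from any QLFS release.

```python
def narrow_rate(unemployed: float, employed: float) -> float:
    """Unemployed as a percentage of the labour force (employed + unemployed)."""
    labour_force = employed + unemployed
    return 100.0 * unemployed / labour_force


def expanded_rate(unemployed: float, employed: float, discouraged: float) -> float:
    """Narrow rate with discouraged work-seekers added.

    Assumption: discouraged work-seekers are added to both the numerator
    and the denominator.
    """
    expanded_unemployed = unemployed + discouraged
    expanded_labour_force = employed + unemployed + discouraged
    return 100.0 * expanded_unemployed / expanded_labour_force


# Made-up counts (any consistent unit, e.g. thousands of persons).
print(round(narrow_rate(200, 800), 1))        # 20.0
print(round(expanded_rate(200, 800, 50), 1))  # 23.8
```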

Known Limitations

  • Does not capture underemployment (part-time workers seeking full-time work).
  • Household survey — subject to sampling error (±0.5pp at national level).
  • Does not reflect quality or formality of employment.
  • COVID-19 disrupted Q2 2020 fieldwork; Q2 2020 data should be treated with caution.

Gross Domestic Product (GDP)

Quarterly | Lag: ~60 days after reference quarter | Since 2010 Q1

Definition

GDP measures the monetary value of all goods and services produced within South Africa's borders in a reference period, adjusted for inflation (real GDP) and seasonally adjusted.
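To make the inflation adjustment concrete, the sketch below deflates nominal GDP with a price deflator indexed to 100 in the base period and then computes a quarter-on-quarter growth rate. The function names and figures are illustrative assumptions; the sketch does not cover seasonal adjustment, which is part of the official release.

```python
def real_gdp(nominal: float, deflator: float) -> float:
    """Deflate nominal GDP using a price deflator (base period = 100)."""
    return nominal / deflator * 100.0


def growth_pct(current: float, previous: float) -> float:
    """Percentage change from the previous period to the current one."""
    return (current / previous - 1.0) * 100.0


# Made-up values for two consecutive quarters.
q1 = real_gdp(nominal=1100.0, deflator=110.0)  # 1000.0
q2 = real_gdp(nominal=1134.0, deflator=112.0)  # 1012.5
print(round(growth_pct(q2, q1), 2))            # 1.25
```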

Known Limitations

  • Preliminary estimates are subject to revision, sometimes materially, in subsequent releases.
  • Seasonal adjustment methodology may change, causing historical revisions.
  • The informal economy and subsistence agriculture are only partially captured.
  • Sectoral decompositions (agriculture, mining, manufacturing etc.) are released with a further delay.

Consumer Price Index (CPI)

Monthly (published ~3–4 weeks after reference month) | Lag: ~25 days | Since Jan 2008

Definition

CPI measures the rate of change in prices paid by urban households for a representative basket of goods and services. It is the primary measure of consumer price inflation in South Africa.
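A rate of change is derived from index levels by comparing a month with the same month one year earlier. The sketch below shows that calculation under the assumption that the input is a list of consecutive monthly index levels; the function names and values are hypothetical.

```python
def annual_inflation(index_now: float, index_year_ago: float) -> float:
    """Year-on-year percentage change between two index levels."""
    return (index_now / index_year_ago - 1.0) * 100.0


def yoy_series(monthly_index: list) -> list:
    """Year-on-year rates for a list of consecutive monthly index levels.

    The first 12 months have no year-earlier value, so the result is
    12 entries shorter than the input.
    """
    return [
        annual_inflation(monthly_index[i], monthly_index[i - 12])
        for i in range(12, len(monthly_index))
    ]


print(round(annual_inflation(105.3, 100.0), 1))  # 5.3
```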

Known Limitations

  • Based on urban households only — may not reflect rural price dynamics.
  • Basket weights are updated periodically (last major revision 2017); the basket may lag consumption pattern changes.
  • Core CPI (excluding food and energy) is not officially published by Stats SA but is estimated by analysts.
  • Individual component indices (food, transport, housing) may diverge significantly from headline.

Provincial Statistics

Quarterly (unemployment) / Annual (other indicators) | Lag: ~6 weeks for quarterly unemployment; housing and education are annual | Since: varies by indicator

Definition

Provincial data combines several sources: the Stats SA QLFS for unemployment, Census 2022 for population and housing access, and DBE's NSC results for education. GDP share is taken from the regional estimates in the national accounts.

Known Limitations

  • Provincial unemployment estimates have higher sampling error than national (±1–2pp).
  • Census data (2022) may not fully reflect subsequent migration patterns.
  • Provincial GDP estimates are released annually with a longer lag than quarterly national GDP.
  • Service delivery indicators (electricity, water) come from the General Household Survey, which is conducted annually but published with a 1-year lag.

Verification Process

All data is sourced directly from official statistical publications and cross-referenced with the original press releases before publication. Automated scripts fetch figures from Stats SA's official download pages; the raw values are validated against the previous period and flagged for manual review if they deviate by more than 3 standard deviations from the historical series.
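The outlier check described above could look something like the following. This is a sketch under stated assumptions, not the platform's actual script: it measures the deviation from the historical mean using the sample standard deviation, which the text does not specify, and the function name and sample values are hypothetical.

```python
from statistics import mean, stdev


def needs_manual_review(history: list, new_value: float, threshold: float = 3.0) -> bool:
    """Flag new_value if it lies more than `threshold` standard deviations
    from the mean of the historical series (assumed interpretation)."""
    if len(history) < 2:
        return True  # too little history to estimate spread, so always review
    mu = mean(history)
    sigma = stdev(history)  # sample standard deviation
    if sigma == 0:
        return new_value != mu  # flat history: any change is unusual
    return abs(new_value - mu) > threshold * sigma


history = [5.0, 5.2, 4.8, 5.1, 4.9]  # made-up values
print(needs_manual_review(history, 5.3))  # False
print(needs_manual_review(history, 7.0))  # True
```

Treating a very short history as "always review" is a deliberately cautious default; a different rule may suit series with only a few observations.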

For datasets without structured APIs (most Stats SA releases are PDF or Excel files), data is manually transcribed from the official statistical release tables. Transcribed values are verified against the headline figure in the press release and, where possible, against secondary coverage in Business Day or Bloomberg SA.

Historical revisions issued by Stats SA are incorporated in the next scheduled update cycle. We do not backdate individual data points between cycles unless the revision is material (defined as a change exceeding 1 percentage point for rates, or 0.5% for index levels).
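The materiality rule can be expressed as a small check. The thresholds come from the paragraph above; the function name and the "rate" / "index" labels are hypothetical, and the sample values are made up.

```python
def is_material_revision(old: float, new: float, kind: str) -> bool:
    """Apply the materiality thresholds: more than 1 percentage point for
    rates, more than 0.5% for index levels."""
    if kind == "rate":
        # Rates are compared in percentage points (absolute difference).
        return abs(new - old) > 1.0
    if kind == "index":
        # Index levels are compared as a relative change in percent.
        # Assumes old is non-zero, which holds for published index levels.
        return abs(new - old) / abs(old) * 100.0 > 0.5
    raise ValueError(f"unknown kind: {kind!r}")


print(is_material_revision(32.1, 33.4, "rate"))     # True  (1.3 pp)
print(is_material_revision(112.0, 112.4, "index"))  # False (about 0.36%)
```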

Transparency Notes

This is not an official Stats SA product. SA Data Hub is an independent platform. While we strive for accuracy, users requiring data for official, legal, or academic purposes should consult the primary sources directly.

Data may be provisional. Stats SA frequently issues preliminary estimates ahead of final figures. We label provisional data where known, but users should verify the status of individual data points on Stats SA's website.

Automation limitations. Automated update scripts rely on the structure of official publication pages remaining consistent. Structural changes to Stats SA's website may cause scripts to fail silently. All automation output is reviewed before deployment.
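One common way to reduce the risk of silent failures is to validate the shape of parsed data and raise an error when it does not match expectations. The sketch below illustrates the idea only; the column names, exception class, and function are hypothetical and do not describe the platform's actual scripts or any Stats SA file layout.

```python
EXPECTED_COLUMNS = ["Period", "Value"]  # hypothetical column names


class LayoutChangedError(Exception):
    """Raised when parsed data does not match the expected structure."""


def validate_rows(rows: list) -> list:
    """Return rows unchanged if they look structurally sound, else raise."""
    if not rows:
        raise LayoutChangedError("no rows parsed; the page layout may have changed")
    for i, row in enumerate(rows):
        missing = [col for col in EXPECTED_COLUMNS if col not in row]
        if missing:
            raise LayoutChangedError(f"row {i} is missing columns: {missing}")
        try:
            float(row["Value"])
        except (TypeError, ValueError):
            raise LayoutChangedError(f"row {i} has a non-numeric value: {row['Value']!r}")
    return rows


rows = [{"Period": "2024-01", "Value": "5.3"}]  # made-up sample
print(validate_rows(rows) == rows)  # True
```

Raising an exception turns a silent failure into a loud one, so a broken scrape stops the update instead of publishing empty or misaligned figures.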

No commercial affiliation. SA Data Hub has no commercial relationship with any of the data sources listed on this page and does not receive compensation for referencing them.