4 Metadata

Please note that NAPMD focusses on key air pollutants. Meteorological variables may not be complete. Data has undergone validation by state and territory agencies but may require further cleaning.

4.1 Spatial coverage

Station locations (see counts in Table 4.1):

Display marker coordinates have been modified to protect confidentiality and privacy of property owners. Precise locations are available from NAPMD.

Table 4.1: Number of stations (including both historical and current)
State Number of stations
ACT 3
NSW 124
NT 5
QLD 76
SA 17
TAS 35
VIC 33
WA 26

4.2 Temporal coverage

4.3 Variables

4.4 Date and time fields

For hourly observation data, the hour indicates the beginning of the averaging period (i.e. hour 0 marks the average between 00:00 and 01:00). The year, date and hour fields are in local time, without daylight savings. Timezones are as follows:

  • UTC+08:00: WA
  • UTC+09:30: NT, SA
  • UTC+10:00: ACT, NSW, QLD, VIC, TAS

All times are shown in UTC in the date_time_utc field, stored as a timestamp with time zone in PostgreSQL. Please note that some programs will automatically convert the UTC timezone to the local system timezone on retrieval from the database. Ensure that you are working in your desired time zone when manipulating this field.

4.5 Standardisation

4.5.1 Variable and units

All variable and unit names have been expressed in alphanumeric characters only with no whitespace. For some variables, reporting units differ across agencies. Conversion factors are shown in Table 4.2.

Table 4.2: Conversion factors used for standardisation.
from to conversion
ppb ppm ppm = 0.001 * ppb
pphm ppm ppm = 0.01 * pphm
10^-4_per_m Mm^-1 Mm^-1 = 100 * 10^-4_per_m
atm hPa hPa = 1013.25 * atm
mbar hPa hPa = mbar
km/hr m/s m/s = 1000/(60 * 60) * km/hr

4.5.2 Time basis (hour-ending/starting)

The National Air Pollution Monitor Database converts all hourly data to be in 0-23 hour-beginning format (i.e. 0 represents midnight to 1am on the date specified). For different states and territories the source data can vary.

State Data source Time basis
ACT All data 0-23 hour-ending
NSW All data 1-24 hour-ending
NT All data 1-24 hour-ending
Qld Open Portal downloads 0-23 hour-beginning
Historical database extracts 1-24 hour-ending
SA Non-historic data 0-23 hour-ending
Spreadsheets pre-2003 (gases) 0-23 hour-beginning
Spreadsheets pre-2005 (PM2.5) 0-23 hour-beginning
Tas N/A N/A
Vic All data 0-23 hour-beginning
WA All data 0-23 hour-ending

4.5.3 Station names

Station names have been generally cleaned by converting to lowercase with underscores, removing all non-alphanumeric characters and whitespace. Names may differ substantially from the original station name where: - two stations are co-located (e.g. one station is a regular air quality monitor, the second is part of a rural network or sensor network) - a station has moved locations (the agency regards the station is the same station but NAPMD will list the new location under a new station name, typically suffixed ’_post_YYYY_MM_DD’, with the date it moved)

Note that the station name within a state will be unique, however different states may use the same station name (e.g. Richmond in both NSW and Victoria).

Source identifiers are kept with the station names where possible so that they can be cross-referenced with the original data. Use the source identifier to find continuous data for stations which have moved locations over time.

4.5.4 Station locations

Please note that NAPMD station coordinates may differ from the source data coordinates. This may be for a variety of reasons:

  • The station has moved and the source data coordinates only show the most recent location
  • The station has been more accurately geocoded than the source data coordinates (typically with the help of Google Maps and Google Earth)
  • The coordinates are set to 0, 0 (hidden) in the source data and have been removed entirely in NAPMD