The 2018/2020 Combined IR served as the first large scale test of the tools that the IR programming team has been developed. We knew we were in for a big data challenge…but exactly how big was it?

Our query of EPA’s Water Quality Portal for the IR period of record pulled up 1,362,335 data records (that’s more records than there are people in the Salt Lake Valley) from 3,237 monitoring locations throughout the state. However, DWQ did not collect the data alone: we reviewed and assessed data records from 17 different organizations.


Assessed records

However, the 1.36 million records do not pass through the assessment without rigorous screening and checks. They are screened to include only assessed sites, detection limits, parameters, units, fractions, and sufficient associated metadata. These data are also aggregated to a representative value for comparison to water quality standards. In fact, the majority of the thousands of lines of code written to run the IR are used to screen and prep the core dataset for assessment.

Assessment data review process summary showing how the data records whittle down to those that meet the minimum data requirements.


2018/2020 IR results

The map below shows the draft assessment results from the Combined 2018/2020 Integrated Report. Assessment units are colored according to their overall assessment category. Click on assessment units of interest to learn more about the assessment, impaired parameters, and TMDL information. Use the layers widget in the upper left to turn on and off assessment unit categories. The reset view icon (with four arrows pointing inward) will reset the map to the entire state of Utah. Use the search bar to search by assessment unit ID or name.

Draft AU assessments from the Combined 2018/2020 Integrated Report


Assessed parameters

Nearly 4,000 different water quality parameters were filtered down to a core set of just over 100. From these data, we performed 61,388 unique assessments by site, use, parameter, and criterion.

Frequency of assessments associated with each parameter.

Frequency of assessments associated with each parameter.