2021 analytics release notes

Releases made in 2021.

July 1, 2021

  • Release Google Search Console data to CMS dashboards. (DP-18333)

  • Add internal documentation to ETL repo. (DP-22092, DP-22110)

  • May platform activity report. (DP-22222, DP-22388)

  • Implement cross-domain tracking between Mass.gov & vaxmillions.com. (DP-22360, DP-22364, DP-22365, DP-22366)

  • Feedback manager export now includes an organization column. (DP-20386)

June 24, 2021

  • Clean up repo post bigquery refactor. (DP-21998)

  • Add internal search data (from search.mass.gov) to CMS dashboards. (DP-22161, DP-22162)

  • Offboard Geoff. (DP-22304)

  • Report to Joe on analytics question authors have had during the past few months. (DP-22306)

June 17, 2021

  • Remove SSNs from Formstack. (DP-22238)

  • Hand off cross-domain data to DCR/EEA. (DP-22219)

  • Fixed bug that prevented you from exporting CSV data in the feedback manager (DP-22225)

  • Vaccine preregistration cleanup work/ (DP-22244, DP-22236)

June 10, 2021

  • Begin filtering additional social security formats out of formstack data. (DP-22157)

  • Add additional summary tables for archiving web analytics data. (DP-12389, DP-21902, DP-21992)

  • Combine all BigQuery API calls into one job. (DP-21726)

  • Add partitioning to analytics.pageviews. (DP-21718)

  • Release platform activity report. (DP-21985)

  • Documenting DCR xdomain analytics work. (DP-22073)

  • Update other tag manager containers for DCR xdomain pageview. (DP-22136)

  • Drop GA tables. (DP-22138)

  • Add validation errors tags for DCR xdomain tracking. (DP-22156)

  • Chatbot discovery. (DP-22185)

  • DCR xdomain QA. (DP-22205, DP-22207)

  • Document how to upgrade Superset. (DP-22091)

  • Add documentation for using Airflow. (DP-22093)

June 3, 2021

  • Make it possible to query BQ and XD data using Athena. (DP-21629)

  • Investigate if we filter out phone numbers in Formstack. (DP-21795)

  • Review massdcrcamping website. (DP-22069)

  • Create tags and triggers for DCR camping site. (DP-22071)

  • Create task path variable. (DP-22072)

  • Set up GA view for DCR. (DP-22074)

  • Set up Glue definitions in Terraform. (DP-22154)

  • Add link click coverage that doesn’t collect personal data to DCR camping tags. (DP-22178)

  • Merge dependabot update for Flask-Appbuilder. (DP-22202)

  • Plan data structure for search.mass.gov data. (DP-22106)

  • Explore search.mass.gov data. (DP-22108)

May 27, 2021

  • Filter query parameters out of Google Analytics “All Public Traffic” view. (DP-21224)

  • Add partitioning to analytics.entrance_groups. (DP-21743)

  • Fix bug that caused the “author” and “organization” fields to clear when you submitted a Feedback Manager query. (DP-21788)

  • Create basic variables for reserveamerica cross-domain tracking. (DP-22070)

  • Conduct informal interviews to better understand use case for search.mass.gov data. (DP-22107)

May 21, 2021

  • Filter query parameters out of Google Analytics “All Public Traffic” view. (DP-21224)

  • Add partitioning to analytics.entrance_groups. (DP-21743)

  • Fix bug that caused the “author” and “organization” fields to clear when you submitted a Feedback Manager query. (DP-21788)

  • Create basic variables for reserveamerica cross-domain tracking. (DP-22070)

  • Conduct informal interviews to better understand use case for search.mass.gov data. (DP-22107)

May 20, 2021

  • Modify API options to allow for search terms (DP-21656)

  • Run analytics schema backfills (DP-21381)

  • Add partitioning to previous page summary (DP-21741)

  • Add partitioning to next page summary (DP-21742)

  • Onboarding for Geoff St. Pierre (DP-21781)

  • Update ETL repo changelog (DP-21997)

  • Add documentation for Formstack DAG (DP-21413)

May 13, 2021

  • Work toward simplifying ETL processing. (DP-21701, DP-21747)

  • Lock down Superset permissions. (DP-22003)

  • Make sure develop environment database has all necessary data. (DP-21593)

  • Release March platform activity report. (DP-21605)

  • Update chatbot dashboard. (DP-21715)

  • Add partitioning to analytics.page_events. (DP-21717, DP-21971)

  • Automate creating a JIRA ticket when a job fails. (DP-21776, DP-21987)

  • Create eligibility form data set. (DP-21860)

  • Add vaccine options to vaccine locations map. (DP-21893)

  • Investigate and fix Drupal ETL failure. (DP-21959)

May 6, 2021

  • Work toward making it possible to search for labels in the feedback manager (DP-21649, DP-21650, DP-21805)

  • Adjust structure of data in S3 (DP-21714, DP-21787)

  • Add documentation for partitioning (DP-21738)

  • Fixed a bug in the Feedback Manager where the organization and author filters didn’t work if you filtered for more than 1 organization or author. (DP-21762)

  • Understand how alerts work in Airflow. (DP-21775)

  • Back Formstack cleanup date up to 5 weeks. (DP-21794)

April 29, 2021

  • Handoff PFML analytics materials. (DP-21412)

  • Adjust formatting of how labels come over from Drupal API. (DP-21662)

  • Eliminate redundant scripts from ETL processing. (DP-21699, DP-21700, DP-21708)

April 22, 2021

  • Propose new, slimmed-down architecture for ETL processing. (DP-21424)

  • Eliminate bq_ and xd_ page_trace. (DP-21668)

  • Update chatbot documentation. (DP-21693)

April 15, 2021

  • Report on traffic to children metric. (DP-20964)

  • Create February platform activity report. (DP-21128)

  • Explore partitioning bq_events. (DP-21429)

  • Test regex performance to see if we can offer a regex search for feedback manager. (DP-21654)

April 8, 2021

  • Fix bug where wrong organizations were being associated with some pages in the reporting schema. (DP-21240)

  • Various items for tagging vaccinesignup.mass.gov. (DP-21405)

  • Investigate and improve indexes throughout databases. (DP-21448)

  • Convert pageviews and scores view into materialized view so that Drupal API calls won’t take down reporting db. (DP-21563)

  • Drop bq_ and xd_ node. (DP-21579)

  • Automate Color and Curative codes dropoff. (DP-21590)

  • Cleanup after warehouse refactor. (DP-21601)

  • Update backpublish.py. (DP-21612)

April 1, 2021

  • Automate Twilio logs ingestion. (DP-21581)

  • Automate sending codes to Color and Curative. (DP-21469)

  • Add a date column to bq_node. (DP-21428)

  • Create documentation for chatbot dashboard. (DP-21399)

  • Create chatbot analytics dashboard. (DP-21296)

  • Use BETWEEN instead of DATE_TRUNC. (DP-20996)

  • Optimize ETL processing. (DP-20787, DP-20782, DP-20781, DP-20780)

March 25, 2021

  • Interview MassHealth authors about service family dashboard. (DP-21012)

  • Onboard Noah P. (DP-21426)

  • Fix backpublish script bug. (DP-21536)

March 18, 2021

  • Optimize several warehouse queries. (DP-20784, DP-20786)

  • Upgrade Postgres to latest stable version. (DP-20982, DP-21427)

  • Address RDS instances under heavy load. (DP-21280)

  • Add vaccinesignup.mass.gov to warehouse. (DP-21397)

  • Update vaccine dashboard. (DP-21430)

  • Add vaccine feedback form to ETL. (DP-21437)

  • Generate CSVs for contact lists. (/DP-21441)

March 11, 2021

  • Optimize several warehouse queries. (DP-20778, DP-20788)

  • Fix bug that caused nos per 1000 contributions on the service family dashboard to add up to more than 100% (DP-20931)

  • Document Formstack data flow (DP-21189)

  • Add items to the vaxfinder & vaccine content dashboard (DP-21211)

  • Investigate mysterious link clicks in vaccinesignup.mass.gov analytics. (DP-21270)

  • Update nos per 1000 on DUA dashboard. (DP-21311)

  • Add vaxfinder.mass.gov to cross-domain ETL. (DP-21312)

  • Update GTM and ETL for paidleave.mass.gov path changes. (DP-21323)

  • Use new terraform version for ETL databases. (DP-21329)

March 4, 2021

  • Move paidleave.mass.gov query params to a separate dimension. (DP-21049)

  • Factor fact_site_improve out of existence. (DP-20777)

  • Change date formatting on Superset charts for clarity. (DP-20925)

  • Create paidleave feedback scoring metric. (DP-20987)

  • Add Drupal tables to data dictionary. (DP-21217)

  • Formstack cleanup failing (429 error). (DP-21235)

  • Get travel form automated job running for yesterday. (DP-21267)

  • Get UI data for Karthik. (DP-21287)

  • Duplicate chatbot tags for cross-domain property. (DP-21297)

  • Webserver container hasn’t upgraded. (DP-21313)

  • To-dos for attestation data privacy. (DP-21314)

  • Make bq_node performant again. (DP-21316)

  • Add tracking for new promo page buttons. (DP-21340)

  • Dev reporting DB out of space. (DP-21351)

Feb. 25, 2021

  • Make development environment useful again. (DP-20770)

  • Prep for ESC meeting on Feb. 17. (DP-21117)

  • Add vaxfinder formstack form to ETL. (DP-21177)

  • Stop running fact_exits in warehouse schema. (DP-20774)

  • Stop running dim_source in warehouse schema. (DP-20775)

  • Upgrade Airflow to newest stable version. (DP-20981)

  • Refine COVID-19 vaccine promo page dashboard. (DP-21146)

  • Add chatbot tracking. (DP-21170)

  • QA data coming out of refactored warehouse. (DP-21181)

  • Siteimprove data is not accurate on service family dashboards. (DP-21195)

  • Update monthly_dev_cleanup script for new tables. (DP-21268)

Feb. 18, 2021

  • Warehouse.dim_source load failing because of junk data in ETL. (DP-21206)

  • Verify that query parameters are correctly parsed when aggregating paidleave.mass.gov sessions. (DP-21051)

  • Add data from attestation form to postgres and s3. (DP-21081)

  • Onboard Jane Lee. (DP-21110)

  • Make vaccine feedback available for xFact. (DP-21150)

  • Tag vaxfinder.mass.gov. (DP-21176)

  • Division by zero error in reporting ETL. (DP-21180)

  • Various improvements for production ETL environment. (DP-21182)

  • Remove misleading columns in analytics.daily_feedback_count. (DP-17283)

Feb. 11, 2021

  • Take fact_page and fact_daily_feedback offline. (DP-20767, DP-20776)

  • Report on our options for using AWS more efficiently. (DP-20863)

  • Create new creds for HED licensing personnel. (DP-20907)

  • Create January Platform Activity Report. (DP-20898, DP-20963)

  • Optimize use of dev environment Superset databases. (DP-20985)

  • Research and talk to DTA about service family data. (DP-21011)

  • Create reports for vaccine landing page. (DP-21065)

  • Document COVID-19 vaccine dashboard. (DP-21092)

  • Update vaccine query to include more pages (DP-21102)

Feb. 4, 2021

  • Analysis for ESC meeting presentation. (DP-21014)

  • Catchup DAG runs for January. (DP-21062)

  • Write script to generate First Responders map. (DP-20797)

  • Platform comms (DP-20921, DP-20920)

  • Generate accessible Excel file as alternative to map. (DP-21015)

  • Shutdown unused RDS instance. (DP-21019)

  • Investigate QA check error. (DP-21052)

  • “Refresh materialized views” task failed on 1/27 (DP-21053)

  • Tag Caspio interactions. (DP-21066)

  • Monthly_kpi_components table doesn’t have Feb. data prior to SiteImprove DAG running. (DP-21105)

Jan. 28, 2021

  • Refactor temp_analytics schema to skip fact_page_events. (DP-20766)

  • Top-trafficked service family research. (DP-20864)

  • Analytics ETL is failing dev environment. (DP-20999)

  • Add sensors for cross-domain data and siteimprove data. (DP-20452, DP-20806)

  • Update DUA feedback for DUA dashboard. (DP-20883)

  • QA rich text link tags post Drupal release. (DP-20897)

  • Upgrade Terraform to latest version. (DP-20901)

  • Shutdown DTA churn EC2 instance. (DP-20905)

  • Re-imagine how we backup data in AWS. (DP-20912)

  • Add page-by-page breakouts for broken links and grade level to pilot service family dashboard. (DP-20967)

  • Update data dictionary. (DP-20980)

  • Adjust colors on mass vaccine map. (DP-21016)

  • Fix vaccine report file name. (DP-20950)

Jan. 21, 2021

  • You can now see “Nos per 1000 unique pageviews” for your entire organization on the Organization Web Analytics dashboard. (DP-16202)

  • Formstack data missing for a few days this fall. (DP-20517)

  • Delete surplus data in FormstackUI Fraud form. (DP-20541)

  • Add an additional question to Formstack UI Fraud form. (DP-20800)

  • Optimize the data for Promotional page configured metrics. These should now load much faster. (DP-20838)

  • Provide PFML users with GA access & write up instructions for Any on cross-domain property access. (DP-20858)

  • Fix issue with Formstack API not adjusting for daylight savings. (DP-20865)

  • Superset legends cut off most labels. Unfortunately, there’s no easy fix, and we settled for rewriting many of the labels so that they were shorter. (DP-20903)

  • Modify “hide chart” script to work on Service Family dashboard. Users should now see only the KPIs that are relevant. (DP-20904)

Jan. 14, 2021

  • QA all paidleave.mass.gov dashboard flow scorecards. (DP-20789)

  • Add new paths to paidleave taskPath variable. (DP-20728)

  • Add child page path to service families lookup table. (DP-20807)

  • Rerun cross-domain ETL for Jan. 3. (DP-20857)

  • QA KPI charts on service family dashboard. (DP-20866)

  • Convert hours to milliseconds in ETL or in Superset for duration KPIs. (DP-20902)

  • Generate a data dictionary for ETL repo. (DP-14189)

  • Release prototype service family dashboard. (DP-19806)

  • Investigate remaining (not set) event labels in Google Analytics. (DP-20391)

  • Document DUA dashboard. (DP-20711)

  • Cleanup tableau EC2 instance on AWS. (DP-20762)

  • Automate vaccine user feedback query. (DP-20765)

  • Create report on plastform activity. (DP-20813)

  • Add page-by-page nos per 1000 contribution chart to service family dashboard. (DP-20827)

  • Figure out and fix whatever’s causing ETL health check errors. (DP-20872)

Jan. 6, 2021

  • Add KPI metrics reporting views to DB. (DP-20645)

  • Fix Formstack ETL failure for 12-29. (DP-20799)

  • Spotcheck GTM tags in different browsers. (DP-20314)

  • Courts/Bentley project time tracking ticket. (DP-20522)

  • Document paidleave.mass.gov dashboard. (DP-20700)

  • Update UI benefits calculator with new variable. (DP-20811)

  • Fix travel metrics DAG failure. (DP-20822)

  • Update PFML contribution calculator variable. (DP-20826)

Last updated