Add a definition for all snowplow events emitted by GitLab Frontend
Problem
There are events that are being send to our Snowplow collector but don't have an event definition. This makes these events essentially undocumented, which creates problems for event discovery as well as to estimate migration efforts and similar.
The "undocumented" events for the frontend can be found via (based on the assumption that those emit a page_url_path
):
SELECT event_action
FROM prod.common_mart.mart_behavior_structured_event
WHERE behavior_at > CURRENT_DATE - 7
AND event_action NOT IN (SELECT action FROM SREHM_PREP.PUBLIC.EVENT_DEFINITIONS)
AND page_url_path IS NOT NULL
AND app_id = 'gitlab'
GROUP BY 1
NOTE: SREHM_PREP.PUBLIC.EVENT_DEFINITIONS
is a manual upload of the CSV export from metrics.gitlab.com/events so it needs to be manually kept up-to-date otherwise the above query might include already defined events
Desired Outcome
All events emitted by the GitLab frontend are documented.
Potential Solution
TBD
Consider creating a Snowflake query to figure out the tiers
and identifiers
of an event
How to verify
Further actions needed
Edited by Sebastian Rehm