The organisation was capturing high-quality data across GA4, HubSpot, Google Ads, and Chargebee, but lacked the infrastructure to connect them.
The core of the design was a Relational Entity Model that defined how "Leads" in a CRM relate to "Invoices" in a billing system.
Key Design Element: A schema that mapped the "Golden Path" of a customer: WebEvent (Traffic) → Lead (MQL/SQL) → Opportunity (Sales) → Activation (Revenue)
| Layer | Components | Value |
|---|---|---|
| Ingestion (dlt) | APIs / CDC | Automated, reliable data movement. |
| Storage (BigQuery) | Raw Data / Cleaned Data | High-performance, scalable query engine. |
| Logic (dbt) | Models, Staging, Marts | Clean, version-controlled metrics. |
| Presentation | Looker Studio | Actionable insights for stakeholders. |
This allowed for the future design and implementation of multi-touch attribution, where a "first click" from a Google Ad could be programmatically linked to a plan activation six months later.
I evaluated two primary architectural paths to provide the organisation with a "Build vs. Buy" decision matrix:
Beyond just "moving data," the design included a potential operational framework:
The project did not move to the implementation phase, but the design provided the organisation with: