Job Description:
Roles and Responsibilities:
Technical Skills:
- 8–10+ years overall in data modeling/governance with at least 3+ years on GCP; strong SQL across analytical warehouses (BigQuery preferred).
- Proven delivery of conceptual/logical/physical models for analytics & data products; deep knowledge of dimensional modeling (Kimball) and familiarity with Data Vault 2.0.
- Hands‑on Collibra: configuring communities/domains, operating model, business glossary, policies, lineage ingestion, issue management, and stewardship workflows.
- Practical stewardship: defining CDEs, data quality rules/thresholds, certification criteria, and data access approvals.
- Experience collaborating with data engineers on Dataflow/Dataproc/Composer, and optimizing BigQuery for performance & cost.
- Excellent documentation, facilitation, and stakeholder management across business and technology.
Soft Skills: Excellent stakeholder management, facilitation with data owners/stewards, strong documentation and presentation skills
Key Responsiblities:
Enterprise Data Modeling
- Translate business concepts into conceptual, logical, and physical data models; define CDEs, semantic layers, and naming/standards.
- Build BigQuery schemas optimized with partitioning, clustering, and cost effective storage patterns; design star/snowflake models (Kimball) or Data Vault 2.0 where appropriate.
- Guide model versioning, peer reviews, and lifecycle management in Git based workflows.
GCP Data Engineering Collaboration
- Partner with data engineers on ingestion and transformation pipelines (Cloud Dataflow / Dataproc / Composer / Pub/Sub / Cloud Functions or dbt on BigQuery).
- Define performance & cost guardrails (slots vs. on demand, storage/compute separation, query choreography).
Data Governance & Stewardship
- Establish business glossary, data domains, ownership (Data Owners, Stewards, Custodians), and policies/standards aligned to DAMA practices.
- Run issue management, access governance, and data certification workflows; drive DQ rule definitions and monitor remediation.
Collibra Administration & Enablement
- Configure Collibra operating model (communities, domains, responsibilities), business glossary, and stewardship workflows (BPMN).
- Onboard technical metadata and lineage from GCP (e.g., BigQuery, Dataflow) via Collibra Connect/Edge; integrate with DQ/observability tools.
- Build Collibra dashboards and stewardship KPIs; coach squads on day to day usage.
Risk, Compliance & Privacy
- Embed data policies for PII and regulated datasets (e.g., GDPR, DPDP (India)) within models and governance assets; ensure auditability and access controls via IAM.
Stakeholder & Delivery
Facilitate model walkthroughs wi th business, architecture, and product teams; maintain backlogs; drive 30/60/90 day measurable outcomes.
Good to have :
- Collibra Certifications (e.g., Ranger, Data Steward, Governance) and GCP Professional Data Engineer.
- Data modeling tools (e.g., erwin, ER/Studio, SqlDBM, Hackolade).
- Experience with data quality/observability tooling (including Collibra DQ/Observability or equivalents) and Data Catalog/MDE integrations.
- Python/dbt for transformations; CI/CD for data (Git, Terraform for infra is a plus).
- Domain experience in Travel, Transportation & Hospitality (TTH) is advantageous