Data Engineer
Accenture
2025 – Present
- Designed and optimised Spark-based ETL pipelines to ingest, cleanse, and model high volume datasets.
- Built and tuned entity resolution rules, features, and thresholds to improve record matching accuracy.
- Modelled relationships across multi-source data to generate trusted golden profiles for downstream use.
- Improved data quality via deduplication, feature engineering, and match-performance tuning.
- Contributed to CI/CD workflows using GitHub Actions to automate builds and tests for reliable delivery.
- Contributed to CI/CD workflows using GitHub Actions to automate builds and tests for reliable delivery.
- Collaborated with engineers and analysts to translate requirements into scalable data products.