An automated script was created to regularly collect, clean, and organise campaign data from different sources into one place, with older data safely archived and the process running automatically on a set schedule.
- Integrating campaign data from multiple, disparate sources.
- Handling missing, inconsistent, or miscellaneous data values.
- Managing varying data formats, including inconsistent date formats and differing label names.
- Ensuring timely and automated data synchronisation without manual intervention.
- Archiving outdated data to maintain performance and relevance in analytics.
- Developed a Python-based solution to establish unified connections with all data sources.
- Implemented robust data transformation pipelines to standardise formats and handle missing or inconsistent values.
- Configured the Python script to run at regular intervals, enabling fully automated and timely synchronisation of campaign data.
- Archived older data periodically to optimise storage and maintain a clean dataset for analysis.
- Consolidated and stored the cleaned and structured data in Google BigQuery, enabling centralised access for analytics and reporting.