An ETL tool designed to process ~25GB of daily transaction data, applying predefined tagging rules. This fully automated data pipeline built with Google Cloud BigQuery and other GCP services replaced an a data processing program built with SAS (Statistical Analysis System) and Informatica.
Achieved: Transactions tagging and rollup script processing time reduced from 35 Mins to 5 Mins for each day's transaction data processing.
Platform: Google Cloud
Integrations: Google Cloud Storage, Datastore, AppEngine, BigQuery, Shell Scripts