Client's marketing team is looking for ways to gather data from various sources including all major social data platforms and import them into the client data lake for analytics.
The client selected KPI over multiple vendors through a rigorous RFP process. KPI's expertise in Big Data Ecosystem, Airflow, PySpark, and AWS was the key differentiator for the client in addition to KPI's blended shore model to minimize cost and risk for the client.
The KPI team delivered multiple pipelines to automate the ingestion of data from various sources including all major social media platforms like Apple, Google, FB, Twitter, etc. into Client Data Lake (AWS S3) on a daily, weekly, and monthly basis.
Data pipelines include fetching data from APIs, SPTP, S3 using Python and perform transformations, aggregations, and consolidations using PySpark to load into client's Data Lake.
Provided data to Gain Theory, a third-party AI/ML platform for marketing decisions which is critical for client’s business.
Madelyn T.
Social Media Manager
Justin M
Sr Manager Marketing Operations