Showing 4 Result(s)

Employee Churn Prediction Pipeline with BigQuery, PyCaret, and Looker Studio

This project demonstrates the creation of an employee churn prediction pipeline using Google BigQuery, PyCaret, and Looker Studio. The goal is to predict which employees might leave based on historical data and offer insights to boost retention. Technologies Used Key Features Link: Read the full article on Medium Code: https://github.com/shj37/Employee-Churn-Prediction-with-Looker-Studio-BigQuery-and-PyCaret

Automating Healthcare Data Pipelines with Airflow, BigQuery, and dbt

This project showcases a streamlined data pipeline built to handle healthcare data using modern data engineering tools. I used Apache Airflow (via Astronomer) to orchestrate workflows, Google BigQuery as a scalable data warehouse, and dbt for transformations and quality checks. Starting with synthetic data generation, the pipeline uploads data to Google Cloud Storage, creates external …

Building an End-to-End Data Pipeline for Healthcare Data with BigQuery, dbt, and GitHub CI/CD

This project was about building a straightforward, scalable data pipeline for healthcare data using modern tools and cloud tech. For a full breakdown of the process, challenges, and insights gained, check out my article on Medium: https://medium.com/@jushijun/building-an-end-to-end-data-pipeline-for-healthcare-data-with-bigquery-dbt-and-github-ci-cd-8e772b01e318 Github: https://github.com/shj37/dbt-redshift-aws-banking-data-warehouse GoalCreate a pipeline to ingest, transform, and deploy healthcare data for actionable insights. Tools Used How …