Showing 25 Result(s)

MLOps Pipeline with Jenkins, Docker, and AWS ECS

This project implements an automated MLOps pipeline for deploying a machine learning model, focusing on operational efficiency and security. It integrates Jenkins for CI/CD, Docker for containerization, and AWS ECS for scalable deployment, ensuring a repeatable and reliable workflow. Project Overview The pipeline automates the deployment of a Flask-based ML application, emphasizing Continuous Integration and …

Deploying a Machine Learning Model with Docker and Kubernetes on Google Cloud Platform

This project demonstrates the deployment of a machine learning model using Docker and Kubernetes on Google Cloud Platform (GCP), highlighting skills in MLOps, containerization, and cloud infrastructure management. The process integrates a pre-trained PyCaret model into a Flask application, containerized with Docker, and orchestrated with Kubernetes for scalability and reliability. Project Overview Key Steps Outcomes …

Real-Time Algorithmic Trading System with Apache Flink, Redpanda, and News Sentiment Analysis

This project is a hands-on implementation of a real-time algorithmic trading system that processes market data and news sentiment to make automated trading decisions. It integrates Apache Flink for stream processing, Redpanda for data streaming, and the Alpaca API for market data and trade execution, with sentiment analysis driving the strategy. Technologies Used How It …

Real-Time Anomaly Detection Pipeline for Stock Trading Data with Redpanda and Quix

This project is a real-time anomaly detection system for stock trading data, built with Redpanda and Quix. It processes streaming trade data and flags unusual patterns using both rule-based and machine learning techniques. Purpose The goal is to detect anomalies—like sudden price jumps or high-volume trades—as they happen, using a lightweight, practical setup. Technologies Used …

Real-Time Voting Analytics Dashboard with Kafka, Spark, PostgreSQL, and Streamlit

This project involves the development of a real-time analytics dashboard designed to process and visualize voting data for an election simulation involving three political parties: Party A, Party B, and Party C. The system integrates multiple technologies to handle data streaming, processing, and visualization, demonstrating proficiency in modern data engineering tools. For a detailed overview, …

Real-Time E-Commerce Data Pipeline with Kafka, Flink, PostgreSQL, and Elasticsearch

This project demonstrates the development of a real-time data pipeline for e-commerce sales analytics using Apache Kafka, Apache Flink, PostgreSQL, and Elasticsearch. The pipeline ingests high-throughput transaction data, processes it for real-time insights, and stores it for querying and visualization, addressing needs such as inventory optimization and fraud detection. For more details, refer to the …

Real-Time Data Streaming: Monitoring Database Changes with Postgres, Debezium, and Kafka

In today’s data-driven world, real-time monitoring of database changes is critical for applications like fraud detection and live analytics dashboards. This project demonstrates a robust real-time data pipeline using PostgreSQL, Debezium, and Kafka, orchestrated with Docker, to capture and stream database changes efficiently. The project includes: Features Technology Used Learn More Link: Read the full …

Employee Churn Prediction Pipeline with BigQuery, PyCaret, and Looker Studio

This project demonstrates the creation of an employee churn prediction pipeline using Google BigQuery, PyCaret, and Looker Studio. The goal is to predict which employees might leave based on historical data and offer insights to boost retention. Technologies Used Key Features Link: Read the full article on Medium Code: https://github.com/shj37/Employee-Churn-Prediction-with-Looker-Studio-BigQuery-and-PyCaret