Data Engineer @ Stanbic Bank Kenya

VictorSabare

Modernizing bank-grade pipelines that safeguard 0+ SLAs across Finance, Risk, and Operations

Transforming raw data into actionable insights through modern data engineering practices, cloud-native architectures, and real-time processing systems.

Connect with me:
data_pipeline.py
Lines: 0
Data Processed
0TB+
Projects Completed
0+
Uptime Achieved
0%
Records Processed
0
Scroll to explore

Proven Track Record

Delivering data solutions that drive business growth and innovation across industries

0TB+

Data Processed

Across multiple industries

0+

Projects Completed

Successful data solutions

0%

System Uptime

Reliable infrastructure

0

Records Processed

Daily processing capacity

0+

Technologies

Mastered and implemented

0%

Client Satisfaction

Based on project feedback

0+
Years Experience
0/7
System Monitoring
0 Clouds
Platform Expertise
0%
Project Success Rate
6 articles

These featured posts launch on Medium so you can read the full story where they were originally published.

Read on Medium

Fresh writing on data engineering, analytics, and architecture from my Medium publications.

Optimising Multi Cloud Data Pipelines (Without Losing Your Shit)
|1 min read
Medium

Optimising Multi Cloud Data Pipelines (Without Losing Your Shit)

Two years ago I moved a “simple” pipeline from one cloud to another for a project . By week two my Slack looked like a heart monitor… Continue reading on Towards Data Engineering »

aws
apache-iceberg
Read on Medium
Careers in Data Engineering 2025: New Roles and Skill Paths
|1 min read
Medium

Careers in Data Engineering 2025: New Roles and Skill Paths

“Do you see yourself as a guardian of data pipelines or the bridge between code and business strategy? In 2025, you can be both.” Continue reading on Data Engineer Things »

data-analytics
artificial-intelligence
Read on Medium
The Data Engineer’s Toolkit: 12 Free Monitoring Dashboards You Didn’t Know Existed
|1 min read
Medium

The Data Engineer’s Toolkit: 12 Free Monitoring Dashboards You Didn’t Know Existed

How I stopped firefighting and reclaimed my weekends, without spending a dime Continue reading on Data Engineer Things »

data-engineering
data-science
Read on Medium

A showcase of data engineering solutions and platforms I've architected and built.

Machine Learning
Completed

Detecting & Classifying Fraudulent Ethereum Accounts

Developed a machine-learning framework combining supervised and unsupervised methods to detect fraudulent Ethereum accounts with >85% accuracy and <5% false positives, deployed as an interactive Streamlit app.

Python
Scikit-learn
TensorFlow
+7
Data Engineering
Completed

Real-Time Analytics Platform

Built a comprehensive real-time analytics platform processing 10M+ events per day using Kafka, Spark Streaming, and ClickHouse for sub-second query performance.

Apache Kafka
Spark Streaming
ClickHouse
+5
Data Engineering
GitHub
7 4

Stock Price Prediction Spark Cassandra

This is a data pipeline for predicting stock prices using Apache Spark, Apache Cassandra, and machine learning techniques. It collects and preprocesses stock data from Alpha Vantage API, engineers features, trains models, and performs data analysis and predictions.

Python
Apache Spark
Cassandra
+2
Data Science
GitHub
5

Stock Price Data Analysis

This repository contains the code and analysis for my data analysis project on stock price analysis and forecasting for my Internal attachment at Jomo Kenyatta University of Agriculture and Technology. The project analyzes historical stock price data, visualizes trends, and develops a forecasting model using Python and data science techniques.

Jupyter Notebook
Data
Data Analysis
+2
Data Engineering
GitHub
3

Dag Pipeline With Dbt

The project focuses on the development and deployment of an ELT (Extract, Load, Transform) pipeline utilizing industry-standard tools such as dbt (data build tool), Snowflake, and Airflow. The pipeline is designed to handle the transformation and loading of data from source tables to final data marts, ensuring efficient data processing.

Python
Data Science
GitHub
3

Product Network Analysis Using R

This Shiny web application analyzes product transactions to discover frequently purchased product pairs and visualize the relationships between them. The app uses association rule mining (Apriori algorithm) to identify frequent itemsets, and it applies community detection to find clusters of related products.

R