PORTFOLIO / 2025

Joshua
Elkind

Software Engineer building intelligent systems at the intersection of machine learning, distributed systems, and scalable infrastructure.

Computer Science @ University of Waterloo
Available for work
FOCUS
AI/MLDistributed SystemsComputer ArchitectureDatabases
INCOMING
Software Engineering Intern
@ Microsoft
🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...🎉 1st Place Hack the North 20...
Hack the North 2025 Team Photo

LangSketch

View Repo
LET'S CONNECT
jdelkind@uwaterloo.ca
GitHubLinkedIn
Projects Below

Columnar Analytics Engine

View on GitHub

High-performance OLAP engine prototype in Rust demonstrating core principles of modern cloud data warehouses. Implements vectorized execution model processing batches of rows for CPU cache efficiency and SIMD optimization. Features columnar storage with block-level indexing, specialized encodings (Dictionary, RLE, Plain) achieving 37% performance improvement with compression, and full SQL-to-execution pipeline with logical and physical planning. Includes projection pruning, predicate pushdown optimization, and parallel execution using rayon for multi-core scaling across scanning and aggregation operations.

RustOLAPVectorized ExecutionQuery Optimization
Columnar Analytics Engine preview

Distributed Query Runtime

View on GitHub

Distributed, vectorized query execution runtime in C++20 implementing coordinator-worker architecture for high-performance analytical query processing. Features gRPC-based communication, Apache Arrow for columnar in-memory data representation, and fault-tolerant scheduling with automatic task rescheduling on worker failures. Implements pull-based data exchange model with backpressure, proactive disk spilling for memory management, and Parquet file scanning integration. Includes comprehensive test suite with 110+ tests covering planner, scheduler, and execution engine.

C++gRPCApache ArrowDistributed Systems
Distributed Query Runtime preview

Distributed KV Store

View on GitHub

High-performance distributed key-value store in Rust leveraging Raft consensus algorithm for strong consistency and fault tolerance. Implemented leader election, heartbeat replication, and quorum-based decision making across a cluster of nodes. Built modular architecture with separate storage engine (DashMap-based), consensus layer, and asynchronous networking layer using Tokio. Features atomic operations, randomized election timeouts, and binary serialization for efficient inter-node communication.

RustRaftDistributed SystemsTokio
Distributed KV Store preview

B++ Tree From Scratch

View on GitHub

High-performance B++ tree implementation in C++ designed for database indexing. Implemented disk-friendly B+ tree structure with cache-efficient memory layout, bidirectional linked leaf nodes for fast range queries, and recursive node splitting to maintain tree balance. Features logarithmic search complexity, dynamic height growth, and tree visualization for debugging internal structures.

C++Data StructuresDatabase IndexingSystems Programming
B++ Tree From Scratch preview

Rust Shell

View on GitHub

Full-featured Unix shell in Rust with process control, job management, pipelines, and redirection. Implemented safe, concurrent process handling leveraging Rust's ownership model. Integrated signal handling, environment variables, command history, and configuration support.

RustSystems ProgrammingUnixConcurrency
Rust Shell preview

LangSketch

View on GitHub

Cross-platform desktop application built with Electron that provides a visual interface for designing, orchestrating, and monitoring AI agent workflows. Features drag-and-drop canvas for building workflows, multi-LLM support with OpenAI and Deimos Router, real-time analytics dashboard with Chart.js, and Databricks Vector Search integration for document processing and retrieval.

LangGraphRAGFastAPIVector Search
LangSketch preview

CXC 2026 Hackathon Winner - Incentivized factory safety monitoring platform with real-time environmental data collection via Arduino sensors. Built LSTM autoencoder for anomaly detection and risk scoring, integrated Solana blockchain for automated reward distribution, and architected Snowflake data warehouse with Cortex ML for compliance analytics. Full-stack implementation with Express backend, React frontend, and FastAPI ML bridge.

PyTorchSolanaSnowflakeArduino
SafeShift preview

GeneRisk ML & Pipeline

View on GitHub

Pharmacogenomics risk assessment platform with ensemble ML models achieving 95.8% AUC. Built distributed ETL pipelines with Apache Spark and Airflow processing VCF datasets from AWS S3 to Snowflake. Integrated real-time clinical decision support via .NET Core API and React frontend.

XGBoostApache SparkAWSSnowflake
GeneRisk ML & Pipeline preview

STAN (Trading Algo Analytics)

View on GitHub

Full-stack algorithmic trading platform enabling users to run Python strategies on millions of rows of historical stock data. Architected secure cloud-native execution with AWS EKS Kubernetes Jobs and Docker. Integrated Apache Spark for large-scale backtesting and statistical analysis.

PythonKubernetesApache SparkDjango
STAN (Trading Algo Analytics) preview

Concurrent Ray Tracer

View on GitHub

High-performance ray tracer in Go leveraging goroutines and worker pools for near-linear CPU scaling. Implemented recursive reflections, soft shadows, volumetric effects, and procedural materials. Designed spatial acceleration structures (BVH, Octree) to maximize intersection throughput.

GoConcurrencyComputer GraphicsBVH
Concurrent Ray Tracer preview

Scalable full-stack tutoring platform on Azure with live video, real-time chat, and resource sharing. Built AI-powered features with LangChain, Chroma vector DB, and RAG for quiz generation and video transcription. Architected caching with Kafka, Elasticsearch, and Redis for performance at scale.

AngularRuby on RailsLangChainAzure
MyTutorAI preview

Golang Fractal Generator CLI

View on GitHub

High-performance fractal zoom animation engine in Go with near-linear CPU scaling. Developed dynamic formula parser for custom fractal formulas. Created intelligent zoom system with boundary detection and adaptive quality scoring for infinite zoom sequences.

GoConcurrencyComputer GraphicsCLI
Golang Fractal Generator CLI preview

CNN SVHN Classifier From Scratch

View on GitHub

Convolutional Neural Network built entirely from scratch using NumPy achieving 92.8% test accuracy on Street View House Numbers. Implemented custom 2D convolution, batch normalization, max pooling, and Adam optimizer. Optimized with He initialization and im2col technique for efficient training.

NumPyDeep LearningComputer VisionPython
CNN SVHN Classifier From Scratch preview

GAN Alphabet From Scratch

View on GitHub

Generative Adversarial Network built from scratch using only NumPy for generating synthetic alphabet letters. Implemented generator and discriminator networks with binary cross-entropy loss and custom backpropagation. Designed custom dataset of 28×28 alphabet glyphs with noise and transformations.

NumPyDeep LearningGANsPython
GAN Alphabet From Scratch preview

Pi CV Home Security System

View on GitHub

Real-time face detection and recognition system on Raspberry Pi using OpenCV. Implemented custom logic to differentiate known individuals and strangers, triggering email alerts with face image attachments. Optimized frame processing with background subtraction and contour detection.

OpenCVPythonRaspberry PiComputer Vision
Pi CV Home Security System preview

ThoseInNeed

View on GitHub

React-based web platform integrating Google Maps and Places APIs to display nearby aid resources geographically. Implemented dynamic location-based services for food banks, shelters, and mental health clinics. Designed accessible UI with clear messaging for users without location access.

ReactGoogle Maps APIJavaScriptWeb
ThoseInNeed preview

RoadFun Game

View on GitHub

Fast-paced mobile racing game in Unity with multi-lane traffic avoidance gameplay. Implemented dynamic obstacle spawning and smooth player controls. Integrated coin collection mechanics enabling vehicle and parts upgrades for player progression.

UnityC#Game DevelopmentMobile
RoadFun Game preview

Selenium Webscrape Youtube Playlist Maker

View on GitHub

Python tool using Selenium to create and play YouTube playlists offline with automatic ad skipping. Used YouTube API to get video details and calculate total playlist duration. Automated browser actions to control YouTube playback and handle ads.

PythonSeleniumYouTube APIAutomation
Selenium Webscrape Youtube Playlist Maker preview