
Technical Articles

In-depth explorations of machine learning optimization, systems programming, and production-grade implementations. Each article combines theory with practical code examples and real-world deployment strategies.

Published

TinyLlama Q8K Quantization Engine - CPU-Optimized LLM with Rust/Candle

A Q8K quantization implementation for the TinyLlama-1.1B-Chat model, built with Rust and the Candle framework. It features permutation strategies (SVD-Importance, QR-Pivot), a 3-tier validation pipeline, and a production Docker deployment with an interactive Angular chat interface. Quantization reduces model size by roughly 4x (from ~5GB to ~1.3GB) while keeping mean relative error below 0.1%.

Rust | Candle Framework | Q8K Quantization | Docker | Angular 19 | LLM Optimization
Read Full Article
Coming Soon

Node.js Backend Architecture - Production Patterns

A deep dive into building scalable Node.js backends with Express, Redis, MongoDB, and WebSockets. Covers authentication strategies, rate limiting, spam prevention, and multi-tier validation.

Node.js | Express | Redis | MongoDB
Article in Progress
Coming Soon

Rust for Systems Programming - Memory Safety Without Garbage Collection

An exploration of Rust's ownership model, borrowing rules, and zero-cost abstractions, with practical examples of building high-performance systems without garbage-collection overhead.

Rust | Memory Management | Performance
Article in Progress
Coming Soon

Multi-Instance Docker Orchestration with Node.js

Building a production-grade Docker container pool manager with Node.js, covering load balancing, health checks, graceful degradation, and automated cleanup strategies.

Docker | Node.js | Load Balancing
Article in Progress