Technical Blog Posts
Intel TPP for LLM Inference: How Tensor Processing Primitives Accelerate Every Transformer Block on CPU
Intel TPP LLM Inference
March 02, 2026 Updated March 02, 2026
GPU Memory Profiling Tools (NVIDIA and Intel)
A practical guide to observing GPU memory stats on NVIDIA and Intel GPUs (monitors, profilers, and attribution).
February 10, 2026 Updated February 10, 2026
Agentic LLMs and Mixture-of-Experts (MoE)
Agentic workflows and sparse Mixture-of-Experts, explained from an inference perspective.
February 10, 2026 Updated February 10, 2026
LLM Inference Introduction: the end-to-end flow and vector math
LLM Basic
February 10, 2026 Updated February 10, 2026
Adding a counter to the proc interface
A practical guide to instrumenting the Linux kernel by adding custom counters to /proc/vmstat.
April 08, 2021 Updated December 04, 2025
Understanding Linux perf: stat and record
What really happens in the kernel when you run perf stat or perf record.
December 04, 2025 Updated December 04, 2025