Blog

June 1, 2026 • Interview Review

A compact review of host/device code, grids, blocks, warps, memory hierarchy, synchronization, and CUDA compilation.

June 1, 2026 • Interview Review

Review notes for explaining GPU kernels through thread mapping, memory access, synchronization, and bottleneck hypotheses.

June 1, 2026 • Interview Review

A walkthrough of Michael-Scott queues, CAS linearization points, memory ordering, ABA, and hazard-pointer reclamation.

June 1, 2026 • Interview Review

The thread-pool layer around a queue: packaged tasks, futures, stop tokens, condition variables, graceful shutdown, and lifecycle locks.

June 1, 2026 • Interview Review

How bit-packing, M4RI-style table methods, layout experiments, and Fenwick-tree CTMC checks fit into one engineering loop.

June 1, 2026 • Interview Review

A systems-oriented refresher on reverse-mode autodiff, micrograd-style scalar graphs, PyTorch vocabulary, and GPU placement.