
The Cache That Thinks in Embeddings
Why matching exact strings is a waste of tokens, and how to cache LLM responses by intent instead.
Articles, guides, and thoughts on web development and technology

Why matching exact strings is a waste of tokens, and how to cache LLM responses by intent instead.

Stop giving every NPM dependency full access to your machine and start using the Node.js native permission model to restrict your runtime's blast radius.
High-performance web apps are outgrowing the limitations of IndexedDB, and the Origin Private File System is the low-latency alternative we've been waiting for.

Examine the technical trade-offs between TCP-bound WebSockets and the new HTTP/3-powered WebTransport API for ultra-low latency data transfer.

The ES2024 arrival of Promise.withResolvers finally standardizes the 'Deferred' pattern, allowing for much cleaner management of asynchronous state outside of the constructor.

Stop guessing when to yield and start using the browser's native priority queue to interleave heavy background work without dropping a single frame.

A deep dive into the internals of the Node.js permission system and whether it can truly mitigate supply chain risks without the overhead of a container.

The new HTML invoker attributes are quietly revolutionizing how we handle UI interactions by moving state management and event listeners directly into the browser's native engine.

If you're tired of the Long Tasks API telling you that your UI stalled without explaining why, it's time to look at the script-level attribution provided by the Long Animation Frame API.

I finally found the escape hatch for nested dependency bugs and security vulnerabilities that don't require waiting for a maintainer's PR.