Skip to content

glossary

The terms behind a system that holds up

Plain-English definitions for the reliability, performance, AI, and security concepts I work with every day. Each one links to a free tool that puts the number to work on your own stack.

reliability_scale

Reliability & scale

The targets, budgets, and capacity math behind a service that stays up under load.

performance_delivery

Performance & delivery

How fast a page feels, where the time goes, and how rendering and caching change it.

Cache Hit Ratio

Cache hit ratio is the share of requests served from cache rather than the origin: a higher ratio means less origin load, lower latency, and lower egress and compute bills.

Read definition

Cold Start

A cold start is the extra latency the first request pays when a serverless instance or container has to be created and initialised from nothing before it can serve traffic.

Read definition

Content Delivery Network (CDN)

A CDN is a network of edge servers that cache and serve your content close to users, cutting latency and origin load by answering most requests without a round trip to your servers.

Read definition

Core Web Vitals

Core Web Vitals are Google's three user-experience metrics: Largest Contentful Paint (loading), Interaction to Next Paint (responsiveness), and Cumulative Layout Shift (visual stability).

Read definition

Latency Budget

A latency budget is a total p95 response-time target split across the hops a request takes (network, app, database, cache, third parties) so each layer knows the time it is allowed to spend.

Read definition

p95 Latency

p95 latency is the response time that 95% of requests come in under: a tail-latency measure that, unlike an average, reflects what your slowest and most-affected users actually experience.

Read definition

Rendering Strategies (SSR, SSG, ISR, CSR)

Rendering strategies decide where and when your HTML is built: at build time (SSG), per request on the server (SSR), regenerated on a schedule (ISR), or in the browser (CSR). Each trades freshness, speed, and cost differently.

Read definition

Time to First Byte (TTFB)

Time to First Byte is how long from a request until the first byte of the response arrives: it captures DNS, connection, and server processing, and sets the floor for every page-load metric after it.

Read definition

ai_agents

AI & agents

What you pay for, what the model can see, and what breaks when an LLM gets tools.

Past the definitions, where does your stack actually stand?

I run a fixed-scope review across reliability, performance, cost, and AI readiness, and hand you a prioritized roadmap. Book a call to talk it through.

Book a call