When not to use an LLM
The most useful thing I do on many AI projects is talk a team out of using a model where a deterministic rule, a query, or a small classifier would be cheaper, faster, and far easier to operate.
Questions before the model
- Is the task actually ambiguous? If the rules can be written down, write them down. Regex and SQL do not hallucinate.
- What is the cost of a wrong answer? High-stakes, low-tolerance tasks need verification layers that often cost more than the model itself.
- Can you evaluate it? If you cannot define “good” with examples, you cannot improve it or detect when it breaks.
- What is the steady-state cost? A demo runs a few times. A pipeline runs millions. Token cost and latency compound.
A reasonable default
Use the smallest mechanism that solves the problem. Reach for an LLM when the input is genuinely unstructured language and the output tolerates review. Everything else is usually a system-design problem wearing an AI costume.
Saying no to AI in the right place is what makes the yes credible everywhere else.