AI Jargon Buster
AI news and the language around it, simplified.
What is an Inference Cost?
An inference cost is the price a business pays every time an AI model performs a task or generates a response. Once a model is built and trained, it does not run for free. Every time a user asks a question or requests an action, the system uses computing power and electricity to process that request. These individual costs are small, but they add up quickly when thousands of employees or customers use the tool every day. Managing these costs is a primary concern for companies trying to balance the benefits of AI with their operational budgets.
Why this matters to you
If the cost to run an AI tool is higher than the value it creates, the project will lose money. Understanding these costs helps managers decide if an AI solution is financially sustainable for their specific business needs.
How you might hear this
We need to optimize our prompts to reduce the inference cost of our internal research assistant before we roll it out to the entire company.
AI Jargon Buster
Search any AI term, explained in plain English.
Type a term below and search. You will be taken straight to the tool.
Related terms
See how your CV performs against the ATS algorithms that screen candidates before a human ever reads your application.
Try the CV Optimiser →How AI job displacement actually works, what it means for your career, and what to do about it. Written by someone who has been in recruitment for 25 years.
When the Ground Shifts →