Not every little thing wants an LLM: A framework for evaluating when AI is sensible

May 4, 2025

2

Be part of our day by day and weekly newsletters for the most recent updates and unique content material on industry-leading AI protection. Be taught Extra

Query: What product ought to use machine studying (ML)?
Venture supervisor reply: Sure.

Jokes apart, the appearance of generative AI has upended our understanding of what use instances lend themselves finest to ML. Traditionally, we’ve all the time leveraged ML for repeatable, predictive patterns in buyer experiences, however now, it’s doable to leverage a type of ML even with out a complete coaching dataset.

Nonetheless, the reply to the query “What buyer wants requires an AI resolution?” nonetheless isn’t all the time “sure.” Giant language fashions (LLMs) can nonetheless be prohibitively costly for some, and as with all ML fashions, LLMs are usually not all the time correct. There’ll all the time be use instances the place leveraging an ML implementation isn’t the suitable path ahead. How will we as AI mission managers consider our prospects’ wants for AI implementation?

The important thing issues to assist make this resolution embody:

The inputs and outputs required to satisfy your buyer’s wants: An enter is offered by the client to your product and the output is offered by your product. So, for a Spotify ML-generated playlist (an output), inputs may embody buyer preferences, and ‘preferred’ songs, artists and music style.
Combos of inputs and outputs: Buyer wants can differ based mostly on whether or not they need the identical or totally different output for a similar or totally different enter. The extra permutations and combos we have to replicate for inputs and outputs, at scale, the extra we have to flip to ML versus rule-based programs.
Patterns in inputs and outputs: Patterns within the required combos of inputs or outputs make it easier to determine what sort of ML mannequin you might want to use for implementation. If there are patterns to the combos of inputs and outputs (like reviewing buyer anecdotes to derive a sentiment rating), contemplate supervised or semi-supervised ML fashions over LLMs as a result of they may be more cost effective.
Value and Precision: LLM calls are usually not all the time low-cost at scale and the outputs are usually not all the time exact/precise, regardless of fine-tuning and immediate engineering. Typically, you’re higher off with supervised fashions for neural networks that may classify an enter utilizing a set set of labels, and even rules-based programs, as a substitute of utilizing an LLM.

I put collectively a fast desk under, summarizing the issues above, to assist mission managers consider their buyer wants and decide whether or not an ML implementation looks as if the suitable path ahead.

Sort of buyer want	Instance	ML Implementation (Sure/No/Relies upon)	Sort of ML Implementation
Repetitive duties the place a buyer wants the identical output for a similar enter	Add my electronic mail throughout varied types on-line	No	Making a rules-based system is greater than adequate that can assist you together with your outputs
Repetitive duties the place a buyer wants totally different outputs for a similar enter	The client is in “discovery mode” and expects a brand new expertise once they take the identical motion (comparable to signing into an account): — Generate a brand new paintings per click on —StumbleUpon (keep in mind that?) discovering a brand new nook of the web by means of random search	Sure	–Picture era LLMs –Suggestion algorithms (collaborative filtering)
Repetitive duties the place a buyer wants the identical/comparable output for various inputs	–Grading essays –Producing themes from buyer suggestions	Relies upon	If the variety of enter and output combos are easy sufficient, a deterministic, rules-based system can nonetheless be just right for you. Nevertheless, in case you start having a number of combos of inputs and outputs as a result of a rules-based system can not scale successfully, contemplate leaning on: –Classifiers –Matter modelling However provided that there are patterns to those inputs. If there aren’t any patterns in any respect, contemplate leveraging LLMs, however just for one-off eventualities (as LLMs are usually not as exact as supervised fashions).
Repetitive duties the place a buyer wants totally different outputs for various inputs	–Answering buyer assist questions –Search	Sure	It’s uncommon to come back throughout examples the place you possibly can present totally different outputs for various inputs at scale with out ML. There are simply too many permutations for a rules-based implementation to scale successfully. Contemplate: –LLMs with retrieval-augmented era (RAG) –Determination bushes for merchandise comparable to search
Non-repetitive duties with totally different outputs	Evaluate of a resort/restaurant	Sure	Pre-LLMs, this sort of state of affairs was tough to perform with out fashions that had been skilled for particular duties, comparable to: –Recurrent neural networks (RNNs) –Lengthy short-term reminiscence networks (LSTMs) for predicting the subsequent phrase LLMs are an important match for this sort of state of affairs.

The underside line: Don’t use a lightsaber when a easy pair of scissors may do the trick. Consider your buyer’s want utilizing the matrix above, considering the prices of implementation and the precision of the output, to construct correct, cost-effective merchandise at scale.

Sharanya Rao is a fintech group product supervisor. The views expressed on this article are these of the creator and never essentially these of their firm or group.

Every day insights on enterprise use instances with VB Every day

If you wish to impress your boss, VB Every day has you lined. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for max ROI.

Learn our Privateness Coverage

Thanks for subscribing. Try extra VB newsletters right here.

An error occured.

Not every little thing wants an LLM: A framework for evaluating when AI is sensible

Related Articles

All-in-One Enterprise Website Builder, CRM, Mission Administration and Extra, Now $399

New research finds self-driving automobiles safer than human drivers

ESSER to the Rescue – Training Subsequent

LEAVE A REPLY Cancel reply

Latest Articles

All-in-One Enterprise Website Builder, CRM, Mission Administration and Extra, Now $399

New research finds self-driving automobiles safer than human drivers

ESSER to the Rescue – Training Subsequent

Love Knots, Could 5, 2025

Taylor Sheridan’s Landman Is Impressed By A Podcast That Followers Want To Test Out