RelayMag
ReportNo. 55

Do Online Reviews Actually Move the Needle?

RelayMagJune 20267 min read
Key takeaways

The honest answer is yes, but not in the way most marketing decks suggest. Reviews clearly influence what people buy, and in the right conditions they can move real money. What they do not do is reward a business in a straight line, where every additional star or every fresh five-star rave adds another increment of sales forever. The evidence points somewhere more interesting and more useful, which is that reviews matter most when they make a business feel credible and current, and that credibility has a ceiling rather than an open runway.

The Finding That Started The Serious Conversation

Much of the modern case for reviews traces back to research from Harvard Business School by Michael Luca, who looked at how Yelp ratings related to restaurant revenue. For independent restaurants, a one-star increase in rating was associated with roughly a 5% to 9% rise in revenue. That is a large swing for a single point on a five-point scale, and it gave the whole field a number to rally around.

The part that often gets dropped from the retelling is the other half of the result. Chains saw little to no effect. People already know what they are getting at a national chain, so they lean on the brand rather than on a stranger's rating. That contrast is the first clue that reviews are not a universal lever. They do the heavy lifting where buyers are uncertain and have something real to learn from other customers.

It is also worth saying plainly that this is a correlation drawn from observational data, not a clean experiment. Luca's work is careful, but the broad lesson holds better than any single decimal. Treat the 5% to 9% range as a well-supported signal of magnitude, not a guarantee you can bank on for your own storefront.

More Stars Is Not Always Better

If reviews worked in a straight line, the goal would be obvious. Chase a perfect average and never stop. Research from Northwestern's Spiegel Research Center complicates that instinct in a way every marketer should sit with.

The Spiegel work found that simply displaying reviews can lift conversion, and that the lift is strongest for higher-priced, higher-consideration purchases, the things people actually deliberate over. The more surprising finding is about the shape of the curve. Purchase likelihood tends to peak when average ratings sit somewhere around 4.0 to 4.7, not at a flawless 5.0. Past a certain point, a perfect score starts to read as too good to be true. Shoppers have learned to be suspicious of unanimity, and a sea of identical raves can trip the same alarm as no reviews at all.

That reframes the whole optimization problem. The target is not perfection. It is believability. A 4.5 with a few thoughtful criticisms mixed in often outperforms a spotless 5.0, because the criticism is what convinces a reader the praise is real.

Volume, Recency, And Simply Existing

Averages get all the attention, but they are only one input. Annual consumer review surveys, the kind firms like BrightLocal run year after year, consistently show a few patterns that matter as much as the headline rating.

Put those together and a clearer picture emerges. A handful of recent, specific reviews can do more work than a huge pile of stale ones. Having any credible reviews at all often matters more than squeezing the average up another tenth of a point, because the absence of reviews reads as a risk while their presence reads as a signal that other people have already taken the chance.

Who Actually Gets Moved

The Yelp result already hinted at this, and it deserves its own point. Reviews are not equally powerful across every business or category.

So the question is never just whether reviews work. It is whether they work for this kind of decision. A boutique law firm, a mattress, a roofing contractor, and a managed software platform all live in the zone where reviews carry weight. A pack of gum does not.

The Caveats That Keep You Honest

Anyone using reviews to make decisions has to hold a few uncomfortable facts at the same time.

Extreme-response bias is the big one. The people most likely to leave a review are the ones with strong feelings, delighted or furious, while the quietly satisfied middle often says nothing. That means the visible record is not a clean sample of how customers actually feel. It is a sample of who felt strongly enough to type.

Causation is genuinely hard to pin down here. Better businesses tend to earn better reviews, so when high ratings and high revenue move together, you cannot always tell how much the reviews caused the sales versus simply reflecting a good operation that would have sold well anyway. The Yelp study works hard to isolate the effect, and the directional story is convincing, but the precise size will vary by context.

Then there is outright manipulation. Fake reviews, paid or incentivized reviews, and review gating, where a business quietly steers happy customers to post while discouraging unhappy ones, are all real and well documented. They erode the very trust that makes reviews valuable, and platforms and regulators have moved against them. The irony is sharp. The tactics meant to manufacture a perfect score are exactly the ones that push a rating into the too-good-to-be-true territory the Spiegel research warns about.

Category differences cut across all of this too. What counts as a strong review profile for a restaurant looks nothing like one for enterprise software, and comparing across categories will mislead you fast.

What Actually Seems To Drive Results

Strip away the folklore and a practical pattern is left standing. The thing that moves buying behavior is not a manufactured perfect average. It is having enough recent, credible, specific reviews that a wary stranger decides to trust you.

Each of those words is doing work. Enough, because a single review reassures no one and volume signals that real customers exist. Recent, because trust decays and last year's praise answers a question nobody is asking. Credible, because believable beats flawless, and a little visible imperfection is what makes the rest land. Specific, because a detailed account of an actual experience persuades in a way that a vague five stars never will.

For anyone managing a review presence, that reorders the to-do list. Steadily collecting fresh, honest reviews from real customers beats engineering a spotless number. Responding to criticism in public does more for trust than scrubbing it. And accepting a few less-than-perfect ratings is not damage control, it is part of what makes the good ones believable.

The Measured Takeaway

Reviews are not magic and they are not noise. They are a credibility instrument that works hardest for unfamiliar businesses and considered purchases, that rewards volume and recency alongside the average, and that quietly punishes anything that looks manufactured. The research supports a real revenue effect in the right settings, while the honest reading of that same research warns against chasing perfection or reading too much into a single number.

The businesses that get the most from reviews are the ones that treat them as a running record of real customer experience rather than a score to game. Keep that record current, specific, and believable, and the needle tends to move. Try to fake it to a perfect five, and you may find the needle moving the other way.

Sources
R
RelayMag is an independent publication on marketing, search, and how companies get found.