How Google Search Actually Works
- A page must pass three separate gates in order, crawling then indexing then ranking.
- Near-duplicate URLs make Google pick one canonical page and set the rest aside.
- Ranking blends relevance, quality, links, page experience, and freshness, weighted differently by query.
Most marketers know the outcome they want from Google. They want their pages to show up when someone searches for the thing they sell. Fewer people have a clear picture of how a page actually travels from being published to being shown to a searcher. The path runs through three jobs Google does, and they happen in order. Google has to find your page, decide whether to keep it, and then decide where it belongs for a given search. Crawling, indexing, ranking. Each one is a separate gate, and a page can pass one and fail the next.
Here is the plain version, written for someone who has to make decisions about content and not someone who has to write the code.
Crawling
Crawling is discovery and fetching. Google runs an automated program, usually called Googlebot, that visits pages and reads what is there. It does not magically know your site exists. It learns about pages mostly by following links. When a page Google already knows about links to a new page, the crawler can follow that link and find the new one. This is a big reason internal links and links from other sites matter beyond their effect on ranking. They are the roads the crawler drives on.
You can also hand Google a map. A sitemap is a file that lists the URLs you want considered, and you can submit it so the crawler has a clear starting list rather than relying only on links it stumbles across. A sitemap does not force anything to be crawled or kept, but it helps Google find pages it might otherwise miss, especially on a large or poorly linked site.
A few practical truths about crawling.
- Pages buried with no internal links pointing to them are hard for Google to find. If nothing links to a page, the crawler may never reach it.
- Crawling is not instant or unlimited. Google decides how much to crawl a given site based on signals like how often content changes and how the server holds up. A slow or unstable site can get crawled less.
- Being crawled is not the same as being kept. The crawler fetching your page is only the first gate.
Indexing
The index is Google's giant library of pages it has processed and is willing to consider showing. To be in the index means Google has fetched the page, made sense of its content, and stored a representation of it. If a page is not in the index, it cannot appear in results at all, no matter how good it is.
Plenty of pages get crawled but never indexed. Google looks at a page and decides it is not worth keeping, or that it is a near-copy of something already stored. Thin pages with almost no unique content, pages Google considers low value, and pages blocked by your own settings can all be crawled and then left out.
Duplication is one of the most common reasons. Lots of sites generate many URLs that show essentially the same content. A product reachable through several filter combinations, a printer-friendly version, a page with tracking parameters tacked on. Google does not want ten copies of one thing in its results, so it groups pages it sees as duplicates and picks one to represent the set. That chosen page is the canonical one. The others still exist, but Google folds their signals into the canonical version and shows that one. You can tell Google which version you prefer, though Google makes the final call. The point for a marketer is simple. If you spread the same content across many URLs, you are not multiplying your chances, you are making Google choose one and ignore the rest.
Ranking
Once Google has a pile of indexed pages that could answer a query, it has to order them. There is no single score that decides this. Ranking is many signals combined, weighted differently depending on the query, and the mix is constantly being adjusted. It helps to think in big buckets rather than a magic number.
- Relevance to the query. Does the page actually cover what the person asked about. Google looks at the words on the page, how they relate to the search, and whether the page seems to be about the topic rather than just mentioning it once.
- Content quality and helpfulness. Is the page genuinely useful, reasonably complete, and trustworthy for the subject. Google has spent years trying to reward content that helps people and demote content that exists mainly to rank.
- Links and authority. Links from other reputable sites still act as a vote of confidence, and a site with a strong reputation in a topic tends to do better for related queries.
- Page experience. Does the page load reasonably fast, work on a phone, and not bury the content under junk. This is usually a tiebreaker rather than the main event, but it counts.
- Freshness, for some queries. For news, prices, sports scores, and anything that changes often, newer can beat older. For a question whose answer has not changed in years, freshness barely matters.
No outsider can tell you the exact weight of any of these, and the honest answer is that the weighting shifts by query and over time.
Search intent
Behind every query is a reason someone typed it, and Google tries to read that reason. Marketers usually sort intent into three rough types.
| Informational | The person wants to learn something. Searches like how to clean a cast iron pan. |
| Navigational | The person wants a specific site or page. Searches like a brand name or a login page. |
| Transactional | The person wants to do something, often buy. Searches like buy running shoes or a product model number. |
Intent shapes what Google shows. For an informational query you tend to see explanatory articles and quick answers. For a transactional query you see product pages, shops, and listings. This is why a beautifully written guide can fail to rank for a buying query, and a product page can fail for a how-to query. The page is fine. It just answers a different question than the one being asked. Matching the format and angle of a page to the intent behind the query is often the difference between ranking and not.
How the results page changed
The list of ten blue links is not what most searches look like anymore. Over the years Google has added features that answer questions right on the results page. Featured snippets pull a short answer out of a page and show it at the top. The knowledge panel shows facts about a person, place, or company in a box off to the side. More recently, AI-generated overviews have started appearing above the regular results for some queries, summarizing an answer drawn from multiple sources.
The effect on clicks is real and worth understanding. When Google answers the question on the page itself, some searchers get what they need and never click through. That has been true since featured snippets arrived, and the newer summaries push it further for certain query types. It does not mean the results below stop mattering. It means the value of a click is concentrating, and ranking well is necessary but no longer a guarantee of the traffic it once delivered. The job of making genuinely good, findable pages is the same. The payoff per ranking has simply gotten less even.
A closing note on the algorithm
Nobody outside Google knows the exact algorithm. Google publishes general guidance and confirms broad ideas, but the precise formula is not public and changes constantly. A large part of what circulates as search advice is rumor, old news, or someone overfitting to one site's experience. Chasing a supposedly secret signal is usually a waste of effort. The durable approach has not changed in a long time. Make pages that genuinely help the person searching, make sure Google can find and read them, and match what you publish to what people are actually trying to do. Everything in this piece is downstream of that.
A few common questions
Why is my page not showing up even though it is published?
It may not be in the index. Check whether Google has crawled and kept it before worrying about where it ranks. A page that is crawled but not indexed, or never crawled at all, cannot appear no matter how good it is. Weak internal linking, duplication, or thin content are common causes.
Do I need a sitemap?
It helps, especially for larger sites or sites where pages are not well linked internally. A sitemap gives Google a clean list of URLs to consider. It does not force indexing or improve ranking on its own, so treat it as a way to be found, not a shortcut to the top.
Does more pages mean more traffic?
Not if those pages overlap. Publishing many near-duplicate URLs tends to make Google pick one and set the rest aside rather than rank them all. Fewer strong, distinct pages usually beat a pile of thin variations.