SEO glossary: plain-English definitions
We believe in removing the mystery from search marketing. Explore our definitive guide to the terminology that matters for your business.
Canonical Tag
A canonical tag is an HTML element (rel="canonical") that tells search engines which version of a page is the definitive one when multiple URLs serve similar or identical content. The mechanism operates within the head section of your HTML code. When a search bot crawls a page, it looks for this tag to understand the relationship between the current URL and other possible variants. Search engines use this signal to consolidate ranking signals, such as backlinks and engagement metrics, into a single "master" URL. This ensures that your search equity is not mirrored across multiple versions of the same information. By specifying a preferred version, you prevent the risk of "keyword cannibalisation," where multiple pages on your site compete for the same search term, effectively splitting your ranking potential. Duplicate content problems are common in modern web architecture. Common variants include differences between www and non-www versions, HTTP and HTTPS protocols, and whether a URL ends with a trailing slash. Ecommerce sites often face challenges with URL parameters for sorting and filtering products. For instance, a category page for "Running Shoes" might have dozens of variants based on size, colour, or price filters. Without a canonical tag pointing back to the main category URL, Google may view each filtered view as a separate page, leading to internal competition and crawl inefficiency. Cross-domain canonicals are also used to tell Google that a piece of content published on multiple different websites should only have one primary version credited for rankings. If canonical tags are configured incorrectly, the consequences for visibility are significant. Google may ignore your preferences and choose a different page to rank, which often results in the wrong version appearing for your target keywords. In some cases, authority dilution can cause your primary page to drop in rankings entirely. This is why self-referencing canonicals are critical. Every indexable page should have a tag pointing to itself as the primary version. This provides a clear signal to robots and protects your content if it is republished or scraped elsewhere. A common mistake is using a canonical tag when a 301 redirect is more appropriate; a canonical is a "hint" to the engine, whereas a redirect is a directive that physically moves the user and the bot. Understanding this concept is a vital part of [technical SEO fundamentals](/learning-hub/technical-seo-fundamentals). It is one of the primary tools used to manage how machines interpret your site architecture. By being precise with your canonical logic, you ensure that every drop of ranking power is focused on the pages that drive your commercial success. It works in tandem with your internal linking strategy to reinforce which pages are your "pillar" assets.
Core Web Vitals
Core Web Vitals are three Google-defined metrics that measure real-world page experience: Largest Contentful Paint (LCP), Cumulative Layout Shift (CLS), and Interaction to Next Paint (INP). Largest Contentful Paint measures how long the largest visible element on your screen takes to load. This might be a hero image or a large block of text. To provide a good user experience, Google suggests a target of under 2.5 seconds. Cumulative Layout Shift calculates how much the content of a page moves around unexpectedly during the loading process. A score below 0.1 is required to ensure that users do not accidentally click the wrong button when a page jumps. Interaction to Next Paint tracks how quickly the page responds to a physical user interaction, such as clicking a link or a menu. The target for this metric is under 200 milliseconds. This recently replaced First Input Delay (FID) as it provides a more comprehensive view of responsiveness throughout the entire page lifecycle. Google has used these metrics as an official ranking signal since 2021. They are weighted alongside traditional signals like content quality and brand reputation. While they are not usually the most dominant factor, they serve as a critical tiebreaker between otherwise similar pages. If two sites have equally good content, the one with superior Core Web Vitals will almost certainly rank higher. This is Google's way of forcing site owners to prioritise the actual human experience of the web over technical shortcuts that only benefit robots. Common causes of failure are often technical in nature. Render-blocking JavaScript and unoptimised large images frequently slow down the loading process. For LCP, ensuring that your main image or text block is preloaded and not "lazy-loaded" is a common fix. For CLS, defining explicit width and height attributes in your CSS for all media elements prevents the browser from guessing their size and causing jumps. Low-quality hosting or slow server response times (TTFB) can also drag down all three scores simultaneously. Measuring these vitals is possible through several official tools. Google PageSpeed Insights provides a lab-based view of your performance. The Core Web Vitals report in Search Console shows real-world data from actual users over a 28-day rolling period. You can also monitor the Chrome User Experience Report (CrUX) for historical trends. For a deeper dive into remediation strategies and how these metrics impact your commercial conversion rates, see our guide on [core web vitals explained](/learning-hub/core-web-vitals-explained). Ensuring your site is "passing" these tests is the baseline for competitive performance in 2026.
Crawl Budget
Crawl budget is the number of pages Googlebot will crawl on a website within a given time period, determined by the site's crawl rate limit and crawl demand. The mechanism consists of two specific components that dictate how bots behave. The crawl rate limit is a technical constraint that prevents Google from crawling so fast that it overwhelms your server resources. Crawl demand is a measure of how often Google actually wants to re-visit your site based on factors like content freshness and domain importance. If your site is popular and frequently updated, your demand will be higher. This budget is finite; once Googlebot hits its limit for your session, it stops crawling and moves to another site, potentially leaving your newest content undiscovered. Crawl budget management is not a priority for every website. Small sites with a few hundred pages rarely face issues because Google can easily crawl every URL regularly. However, large websites like massive ecommerce stores, daily news publishers, or extensive directories must manage this resource actively. If Google wastes its budget on low-value pages, your new products or high-margin services may go unindexed for weeks. This is especially critical for programmatic SEO sites that generate thousands of pages dynamically. Several factors typically waste this finite resource. Infinite scroll mechanisms, faceted navigation that generates millions of filter combinations, and soft 404 errors are common culprits. Redirect chains also force bots to make multiple requests for a single outcome, which slows down the discovery of new content. Duplicate URLs created by tracking parameters further dilute the effectiveness of each bot visit. JavaScript-heavy sites also consume more "processing budget" because Google must render the page before it can discover the links, which is a more hardware-intensive task for search engines than parsing static HTML. Managing your budget requires several technical interventions. You can use robots.txt to block bots from accessing low-value folders. Implementing noindex tags on thin pages and using canonical tags to consolidate variants also helps. Managing URL parameters directly within Google Search Console provides another layer of control. You should regularly monitor the "Crawl Stats" report in Google Search Console to see if the bot is encountering server errors or spending too much time on irrelevant file types. These technical steps are explored in detail within our [technical SEO fundamentals](/learning-hub/technical-seo-fundamentals) guide. By focusing the bot on your best work, you ensure your revenue-driving pages remain visible and relevant.
Ready to improve your search visibility?
Good SEO helps customers find your business when they are already looking for your services.
Get your free SEO audit
No obligation. Response within one working day.