SEO glossary: plain-English definitions

We believe in removing the mystery from search marketing. Explore our definitive guide to the terminology that matters for your business.

Anchor Text

Anchor text is the visible, clickable text of a hyperlink. Search engines use it as a primary relevance signal, reading the text as a definitive description of the content found on the destination page. Several types of anchor text exist within a natural link profile. Exact match anchors contain the specific target keyword you want to rank for. Partial match variants include a keyword along with other descriptive words. Branded anchors use your company name, while naked URLs simply display the web address. Generic anchors like "click here" offer very little value to search engines. Finally, image alt text functions as anchor text for links embedded within visual elements. A diverse profile also includes "LSI" (Latent Semantic Indexing) keywords that help the engine understand the broader context of the destination. The importance of this concept for search visibility is significant. Google uses the words in the link to understand what the receiving page is about. Links with descriptive, relevant text pass more context than generic phrases. For example, a link that says "technical SEO audit" tells Google exactly what the target page offers. This helps the engine categorise and rank the content for the correct queries. Furthermore, the text surrounding the link—the "semantic context"—is increasingly used by AI-driven algorithms to verify the link's legitimacy and relevance to the user's intent. There is a major risk associated with over-optimisation. A link profile with too many exact-match anchors looks manipulative and unnatural to algorithms. Google's Penguin algorithm specifically targets this pattern to prevent users from gaming the results. A healthy, natural profile should have a diverse mix of branded and descriptive anchors. Natural growth always results in variety; it is highly unlikely that every person linking to your site would independently choose to use the exact same keyword. Over-optimisation can lead to ranking suppression or a complete manual penalty in extreme cases. When you are managing your own internal links, you should aim for descriptive and varied text. You should never use "read more" or "click here" because these phrases provide no information to the machine or the human user. Instead, use phrases that naturally incorporate your secondary keywords. More information on managing your link profile can be found in our [link building fundamentals](/learning-hub/link-building-fundamentals) hub. Consistent, natural variety is the key to long-term ranking stability and resilience against algorithm updates.

Full Definition →

Backlink

A backlink is a hyperlink on an external website that points to your domain. Search engines treat these links as votes of confidence, using them as a primary signal of a site's authority and trustworthiness. Quality is far more important than quantity in a modern search strategy. One editorially placed link from a relevant, high-authority domain is often more valuable than fifty low-quality directory submissions. This is because topical relevance and editorial standards matter to the algorithms. If a respected industry journal cites your work, it carries significant weight. If a random, unrelated forum links to you, the value is almost zero. The concept of "Link Equity" or "Link Juice" describes the flow of authority from a strong page to a weaker one through these connections. Links are generally categorized by attributes including dofollow, nofollow, rel="sponsored", and rel="ugc". Dofollow links pass ranking authority to the destination page. Nofollow links include a specific attribute that tells Google not to pass this authority, though Google now treats this as a hint. Sponsored tags are used for paid links, while UGC tags are for user-generated content like comments. A healthy, natural indexable profile usually contains a mix of all these types, reflecting a diverse range of mentions across the social and editorial web. Toxic backlinks represent a genuine risk to your search visibility. These are links from spam farms, private blog networks (PBNs), or irrelevant mass submissions. If Google perceives that you are trying to manipulate the results with these links, you may face a manual or algorithmic penalty. In some cases, you may need to use the Google Disavow tool to tell the engine to ignore these harmful signals, though Google's AI is increasingly capable of simply ignoring spam without penalising the target site. Earning high-quality backlinks is a result of producing content that naturally attracts references. Digital PR, active outreach, and original data studies are effective methods for building your profile. You can learn more about how to manage your reputation in our [link building fundamentals](/learning-hub/link-building-fundamentals) guide. Building a durable authority profile is a slow, strategic process that requires a focus on quality and relevance at every stage of your growth.

Full Definition →

Canonical Tag

A canonical tag is an HTML element (rel="canonical") that tells search engines which version of a page is the definitive one when multiple URLs serve similar or identical content. The mechanism operates within the head section of your HTML code. When a search bot crawls a page, it looks for this tag to understand the relationship between the current URL and other possible variants. Search engines use this signal to consolidate ranking signals, such as backlinks and engagement metrics, into a single "master" URL. This ensures that your search equity is not mirrored across multiple versions of the same information. By specifying a preferred version, you prevent the risk of "keyword cannibalisation," where multiple pages on your site compete for the same search term, effectively splitting your ranking potential. Duplicate content problems are common in modern web architecture. Common variants include differences between www and non-www versions, HTTP and HTTPS protocols, and whether a URL ends with a trailing slash. Ecommerce sites often face challenges with URL parameters for sorting and filtering products. For instance, a category page for "Running Shoes" might have dozens of variants based on size, colour, or price filters. Without a canonical tag pointing back to the main category URL, Google may view each filtered view as a separate page, leading to internal competition and crawl inefficiency. Cross-domain canonicals are also used to tell Google that a piece of content published on multiple different websites should only have one primary version credited for rankings. If canonical tags are configured incorrectly, the consequences for visibility are significant. Google may ignore your preferences and choose a different page to rank, which often results in the wrong version appearing for your target keywords. In some cases, authority dilution can cause your primary page to drop in rankings entirely. This is why self-referencing canonicals are critical. Every indexable page should have a tag pointing to itself as the primary version. This provides a clear signal to robots and protects your content if it is republished or scraped elsewhere. A common mistake is using a canonical tag when a 301 redirect is more appropriate; a canonical is a "hint" to the engine, whereas a redirect is a directive that physically moves the user and the bot. Understanding this concept is a vital part of [technical SEO fundamentals](/learning-hub/technical-seo-fundamentals). It is one of the primary tools used to manage how machines interpret your site architecture. By being precise with your canonical logic, you ensure that every drop of ranking power is focused on the pages that drive your commercial success. It works in tandem with your internal linking strategy to reinforce which pages are your "pillar" assets.

Full Definition →

Core Web Vitals

Core Web Vitals are three Google-defined metrics that measure real-world page experience: Largest Contentful Paint (LCP), Cumulative Layout Shift (CLS), and Interaction to Next Paint (INP). Largest Contentful Paint measures how long the largest visible element on your screen takes to load. This might be a hero image or a large block of text. To provide a good user experience, Google suggests a target of under 2.5 seconds. Cumulative Layout Shift calculates how much the content of a page moves around unexpectedly during the loading process. A score below 0.1 is required to ensure that users do not accidentally click the wrong button when a page jumps. Interaction to Next Paint tracks how quickly the page responds to a physical user interaction, such as clicking a link or a menu. The target for this metric is under 200 milliseconds. This recently replaced First Input Delay (FID) as it provides a more comprehensive view of responsiveness throughout the entire page lifecycle. Google has used these metrics as an official ranking signal since 2021. They are weighted alongside traditional signals like content quality and brand reputation. While they are not usually the most dominant factor, they serve as a critical tiebreaker between otherwise similar pages. If two sites have equally good content, the one with superior Core Web Vitals will almost certainly rank higher. This is Google's way of forcing site owners to prioritise the actual human experience of the web over technical shortcuts that only benefit robots. Common causes of failure are often technical in nature. Render-blocking JavaScript and unoptimised large images frequently slow down the loading process. For LCP, ensuring that your main image or text block is preloaded and not "lazy-loaded" is a common fix. For CLS, defining explicit width and height attributes in your CSS for all media elements prevents the browser from guessing their size and causing jumps. Low-quality hosting or slow server response times (TTFB) can also drag down all three scores simultaneously. Measuring these vitals is possible through several official tools. Google PageSpeed Insights provides a lab-based view of your performance. The Core Web Vitals report in Search Console shows real-world data from actual users over a 28-day rolling period. You can also monitor the Chrome User Experience Report (CrUX) for historical trends. For a deeper dive into remediation strategies and how these metrics impact your commercial conversion rates, see our guide on [core web vitals explained](/learning-hub/core-web-vitals-explained). Ensuring your site is "passing" these tests is the baseline for competitive performance in 2026.

Full Definition →

Crawl Budget

Crawl budget is the number of pages Googlebot will crawl on a website within a given time period, determined by the site's crawl rate limit and crawl demand. The mechanism consists of two specific components that dictate how bots behave. The crawl rate limit is a technical constraint that prevents Google from crawling so fast that it overwhelms your server resources. Crawl demand is a measure of how often Google actually wants to re-visit your site based on factors like content freshness and domain importance. If your site is popular and frequently updated, your demand will be higher. This budget is finite; once Googlebot hits its limit for your session, it stops crawling and moves to another site, potentially leaving your newest content undiscovered. Crawl budget management is not a priority for every website. Small sites with a few hundred pages rarely face issues because Google can easily crawl every URL regularly. However, large websites like massive ecommerce stores, daily news publishers, or extensive directories must manage this resource actively. If Google wastes its budget on low-value pages, your new products or high-margin services may go unindexed for weeks. This is especially critical for programmatic SEO sites that generate thousands of pages dynamically. Several factors typically waste this finite resource. Infinite scroll mechanisms, faceted navigation that generates millions of filter combinations, and soft 404 errors are common culprits. Redirect chains also force bots to make multiple requests for a single outcome, which slows down the discovery of new content. Duplicate URLs created by tracking parameters further dilute the effectiveness of each bot visit. JavaScript-heavy sites also consume more "processing budget" because Google must render the page before it can discover the links, which is a more hardware-intensive task for search engines than parsing static HTML. Managing your budget requires several technical interventions. You can use robots.txt to block bots from accessing low-value folders. Implementing noindex tags on thin pages and using canonical tags to consolidate variants also helps. Managing URL parameters directly within Google Search Console provides another layer of control. You should regularly monitor the "Crawl Stats" report in Google Search Console to see if the bot is encountering server errors or spending too much time on irrelevant file types. These technical steps are explored in detail within our [technical SEO fundamentals](/learning-hub/technical-seo-fundamentals) guide. By focusing the bot on your best work, you ensure your revenue-driving pages remain visible and relevant.

Full Definition →

Domain Authority

Domain authority is a third-party metric (created by Moz) that predicts how likely a website is to rank in search results, scored from 1 to 100 based on the quality and quantity of backlinks pointing to the domain. An important caveat is that this is not an official Google metric. Google does not use domain authority or similar scores like Domain Rating within its own algorithms. This distinction matters because a site can have a high third-party score while failing to rank for its target keywords. Clients should interpret this number as a useful proxy for relative strength rather than an absolute rule of search performance. Many SEOs use this score to quickly vet the potential of a link prospect or to gauge the competitive landscape of a new niche. It is essentially a measurement of the "ranking potential" of a whole site based on its link ecosystem. The score reflects several specific data points regarding your link profile. These include the total count of unique referring domains, the perceived quality of those linking sites, and the diversity of the anchor text used to link to you. A domain with links from established news organisations and academic institutions will generally have a higher score than one with thousands of links from low-quality directories. It is a measurement of your digital reputation as seen through the lens of external citations. The "velocity" of your link acquisition also plays a role in how these third-party tools calculate your growth and stability over time. A high score does not guarantee ranking success because topical relevance is equally important. A link from a high-authority fitness site may do very little for a law firm because there is no semantic connection between the two brands. This is why a high Domain Rating link sometimes leads to no movement in rankings. Contextual authority is often more powerful than a high generic score from an irrelevant source. In the era of AI search, "niche authority"—being highly relevant to a narrow topic—is often more valuable than a high generic DA score across unrelated subjects. Practically, you should use domain authority as a comparative benchmark against your direct rivals. If your competitors all have scores in the fifties and you are at twenty, you know you have an authority gap to close. Similar metrics include Ahrefs Domain Rating and Semrush Authority Score. While they use different calculations, they all aim to quantify the same concept of search power. Monitoring your DA over time can help you track the long-term impact of your [link building fundamentals](/learning-hub/link-building-fundamentals) efforts. However, always prioritize building "E-E-A-T" (Experience, Expertise, Authoritativeness, and Trustworthiness) rather than just chasing a number on a dashboard.

Full Definition →

Meta Description

A meta description is an HTML element that provides a short summary of a page's content. It appears directly below the page title in search results and influences whether users choose to click on your listing. It is important to understand what this element does and what it does not do within an SEO strategy. Meta descriptions are not a direct ranking factor. Google does not use the keywords within a description to determine your position in the results. However, they have a massive impact on your click-through rate (CTR). A compelling, benefit-driven description can drive more traffic to your site even if you are not in the top position. It functions essentially as your "ad copy" for organic search, persuading the searcher that your page is the most relevant answer to their query. The optimal length for a description is between 140 and 160 characters. If you write under 140 characters, you are wasting valuable space in the search results. If you go over 160 characters, Google will often truncate your text with an ellipsis, which can cut off your Call to Action (CTA). You should always count in characters rather than words and test how your snippet looks on both mobile and desktop devices. In some cases, for long-tail queries, Google may pull a unique snippet from your content that is much longer than 160 characters. A high-quality description should be specific and directly address the searcher's question. Including a natural use of your target keyword is helpful because Google often bolds those terms when they match the user's query, making your result stand out visually. You should also include a clear call to action, such as "Read our expert guide" or "Get a free audit now," to encourage the click. Your goal is to give the user a clear reason to choose your site over the other options. Google will sometimes rewrite your description if the one you provided does not match the user's specific query. The engine will pull a snippet of text from your page that it feels is more relevant to the searcher's intent. Writing unique, high-quality descriptions for every indexable page reduces the frequency of these rewrites and ensures you maintain control over your brand voice. Duplicate descriptions across your site are a wasted opportunity to communicate with your potential customers. Every page deserves its own unique summary to maximise its traffic potential. For more on crafting high-performing content, see our [content strategy for SEO](/learning-hub/content-strategy-for-seo) guide.

Full Definition →

Schema Markup

Schema markup is structured data added to a page's HTML using a standardised vocabulary (schema.org) that helps search engines understand the specific meaning and context of the content on the page. The most common implementation method is JSON-LD, which is explicitly recommended by Google. This is added as a script block in the head section of your page and does not affect the visible content for human users. Other methods include Microdata and RDFa, but these are less efficient and harder to maintain. By using this code, you provide search engines with a machine-readable layer of information that removes the need for them to guess what a page is about, effectively helping to populate the "Knowledge Graph" with verified data about your brand, products, and services. This structured data enables several advanced features in the search results known as rich results. These include star ratings for reviews, price and stock information for products, and event dates. Beyond these visual enhancements, schema is also vital for how AI search tools and "Generative Search" (like Perplexity and Google AI Overviews) interpret your information. Visibility in these new areas is often the result of having clear, validated markup that allows an LLM to cite your page as a source for specific facts. Common types of schema include Article for news or blog posts, FAQPage for structured questions, and LocalBusiness for geographical information. Ecommerce sites rely heavily on Product and BreadcrumbList markup to ensure their inventory is accurately represented. Advanced users may also implement "Speakable" markup for voice search or "Organization" schema to define their social profiles and logo to Google. Each type serves a specific purpose in building your digital context and improving your citation probability in AI search. There is a strong connection between schema and AI citations. Structured data provides precise points that AI tools can query directly. Using FAQPage markup makes your answers easily extractable for conversational engines. More information on how to implement these technical signals can be found in our guide on [structured data and schema markup](/learning-hub/structured-data-and-schema-markup). You should always use the Schema Validator and Google's Rich Results Test to ensure your code is error-free. This is one of the most powerful technical tools available for modern search dominance in the AI era.

Full Definition →

Ready to improve your search visibility?

Good SEO helps customers find your business when they are already looking for your services.

Get your free SEO audit

Claim your audit

No obligation. Response within one working day.