Cookies disclaimer

I agree Our site saves small pieces of text information (cookies) on your device in order to deliver better content and for statistical purposes. You can disable the usage of cookies by changing the settings of your browser. By browsing our website without changing the browser settings you grant us permission to store that information on your device.

Page is Canonical Data

Page is Canonical

Page is Canonical
Simple Definition
Values Returned
Canonical Pages in More Detail:
Examples of How Canonical URL Data is Used
- Example 1: Is a Page Canonical?
- Example 2: Duplicate Pages
- Example 3: Missing Canonical Tags
A bit more on URLs
A Little More About Canonical Tags
Related Content
Benefits of Our Data

 

Page is Canonical

Determining whether a page is canonical or not is highly valuable information and a time saver for you! Raptor checks whether the URL of the canonical tag matches the URL crawled, if they do match then the page is canonical.

Consequently, only URLs capable of containing a canonical tag are considered canonical pages, this means just HTML/Text pages.

 

Simple Definition

Canonical tag URL matches the UR of the page crawled.

Our data allows you to check that and help provide insights that could direct a digital strategy.

HTML/Text pages with a self-referential canonical tag. This means a canonical URL that matches the URL crawled.

 

Values Returned

The possible values returned in this column are as follows:


Non-Canonical Page

If the URL and the canonical URL do not match due to canonical differences, such including or not including the ‘www’ or a trailing slash.

Canonical Duplicate

If the canonical  URL contains a different directory, directory, sub directory or file.

Canonical Page

The URL and the canonical URL match exactly

Missing Tag

The canonical tag is missing

Error

Tag is incorrectly implemented

 

Canonical Pages in More Detail:

This is a simple check that looks to see if the canonical URL matches the URL of the page crawled, if they match exactly (with no difference at all), the page is determined to be canonical. If a page is non-canonical it will be marked accordingly.

If there is no canonical URL, the field for this column will be marked as per the above system. The page must meet the following criteria to be determined / defined as a canonical page:

  • HTML/Text
  • Accessible
  • Return a ‘200’ status code
  • Linked to at least once
  • Contain a canonical URL that matches it’s URL

File types such as CSS or Images will shows as blank fields as they do not contain a canonical tag.

 

Examples of How Canonical URL Data is Used

All the data our web scraper extracts is used in various ways by SEOs or web devs. We provide some examples of how Canonical URL data can be used in different ways below:

 

Example 1: Is a Page Canonical?

You can determine which pages are canonical based on whether the Canonical URL matches the URL of the page. If they don’t match, the page is canonical, often these pages are not the focus of an SEO assessment or audit.

 

Example 2: Duplicate Pages

You can use the data we provide here to identify pages that are legitimate canonical duplicates of other pages. These pages can often be excluded from any analysis as they are non-canonical and hence do not require optimising in the same way as canonical pages.

 

Example 3: Missing Canonical Tags

A standard SEO check you can do with this data; is ensuring that every page that has a canonical tag, and that it is implemented correctly on page.

 

A bit more on URLs

The URL of a page is the only data / value returned so we thought it worth explaining in a little more detail. below is an example of a URL:

https://www.example.com/dir/sub-dir/dir/page/

  • Internet Application Protocol (Protocol)
  • Sub-Domain
  • Top Level Domain (TLD)
  • Directory (Dir)
  • Sub-Directory (Sub)
  • Page or file / Resource

 

A Little More About Canonical Tags

Canonical stuff can have a huge effect on a site, using Raptor and the data our web crawler provides enables you to identify all canonical issues.

To cover some basics. This is what a canonical tag looks like, we have colour coded this to make it a little easier to understand.

<link rel="canonical" href="https://example.com/dir/page/ "/>

Red = This is the HTML (the code) and encompasses the canonical tag

Grey = This is the ‘Canonical URL’ (variable), the part of the tag that you stipulate

The canonical URL needs to be both absolute (see example below) and consistent with the canonical URLs used across the site.

https://example.com/relevant/page-about-something/

Red = The part of the URL required to be canonically consistent.

Grey = The various pages of the site… Subject to rules and best practice.

 

Related Content

The list of guides below might be useful if you are analysing this data and want to know more about it:

Related column headers in Raptor website crawler reports:

 

Benefits of Our Data

Any identified group of pages that are marked as anything other than ‘blank’ or ‘canonical page’ could be worth investigation. Conversely you can remove large groups of pages quickly from your analysis under certain circumstances.

  • Identify canonical pages
  • Identify non-canonical pages
  • Identify pages missing canonical tags
  • Identify pages with canonical errors

 

Sign up for a Free 30-Day Trial Now! Check for All Canonical Issues on Your site.