Cookies disclaimer

I agree Our site saves small pieces of text information (cookies) on your device in order to deliver better content and for statistical purposes. You can disable the usage of cookies by changing the settings of your browser. By browsing our website without changing the browser settings you grant us permission to store that information on your device.

Header Data

Header Data

Contents

Contents
Headers
What We Give You
- H1 (First)
- H1 (Second)
- H2 (first)
- H2 (second)
- H2 (third)
- H2 (fourth)
- H2 (fifth)
- H tag word count
- H Tag count
- H1 count
- H2 count
- H3 count
- H4 count
- H5 count
- H6 count
What We Crawl, Scrape, Check & Analyse for You
What This Can Tell You
Structural Analysis of Headers
Related Content
Benefits of this Data

 

Headers

Headers are a significant on page direct ranking factor and as such scraping headers during a website crawl is an essential component of the data scraping.

 

What We Give You

Raptorbot crawls and scrapes all the headers on a page and you can choose to include all or some of these in the reports and exports that we produce. Below we detail all of the data that we craw, scrape and export.

 

H1 (First)

The first H1 header from a HTML page.

Detail: 

If there is only one h1 header, this is the column where you would find it. We scrape this from the first <h1> tag we find on a page. We simply scrape the contents of this tag, for example if we found this tag first of a page:

<h1>header</h1>

The data / text we would show in field would be “header”.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H1 (Second)

The second H1 header from each HTML page.

Detail: 

If there are multiple h1 header on a page, we would show the text of the second header that we located in this field. This is scraped and presented in the same way as with the H1 header.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H2 (first)

The first H2 header from each HTML page.

Detail: 

If there is only one h2 header, this is the column where you would find it. We scrape this from the first <h2> tag we find on a page. We simply scrape the contents of this tag, for example if we found this tag first of a page:

<h2>example header</h2>

The data / text we would show in field would be “example header”.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.


H2 (second)

The second H2 header from a HTML page.

Detail: 

If there are multiple h2 header on a page, we would show the text of the second header that we located in this field. This is scraped and presented in the same way as with the H2 header.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H2 (third)

The third H2 header from each HTML page.

Detail: 

If there are multiple h2 header on a page, we would show the text of the third header that we located in this field. This is scraped and presented in the same way as with the H2 header.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H2 (fourth)

The fourth H2 header from each HTML page.

Detail: 

If there are multiple h2 header on a page, we would show the text of the fourth header that we located in this field. This is scraped and presented in the same way as with the H2 header.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H2 (fifth)

The fifth H2 header from each HTML page.

Detail: 

If there are multiple h2 header on a page, we would show the text of the fifth header that we located in this field. This is scraped and presented in the same way as with the H2 header.

We scrape pages from top to bottom, and this reflects the order by which we determine what header is first to last.

 

H tag word count

The number of words within the Headers of a page.

Detail: 

We aggregate the word count of all H header tags on a page in a single number, which is what we represent with this metric / number.

This can be useful when trying to gauge the volume of words on a page within the main content. Deducting headers or dividing the content by the ‘h tag count’ (see below) can give you an idea of how much your content is segmented into digestible sections.

 

H Tag count

The number of Headers found on a page.

Detail: 

We aggregate the amount of all header tags into a total number. These tags must be open and closed correctly.

 

H1 count

The number of H1 Headers found on a page

Detail: 

We aggregate the amount of <h1>example</h1> tags on a tag in a single number. These tags must be open and closed correctly.

 

H2 count

The number of H2 Headers found on a page.

Detail: 

We aggregate the amount of <h2>example</h2> tags on a tag in a single number. These tags must be open and closed correctly.

 

H3 count

The number of H3 Headers found on a page.

Detail: 

We aggregate the amount of <h3>example</h3> tags on a tag in a single number. These tags must be open and closed correctly.


H4 count

The number of H4 Headers found on a page.

Detail: 

We aggregate the amount of <h4>example</h4> tags on a tag in a single number. These tags must be open and closed correctly.

 

H5 count

The number of H5 Headers found on a page.

Detail: 

We aggregate the amount of <h5>example</h5> tags on a tag in a single number. These tags must be open and closed correctly.

 

H6 count

The number of H6 Headers found on a page.

Detail: 

We aggregate the amount of <h6>example</h6> tags on a tag in a single number. These tags must be open and closed correctly.

 

What We Crawl, Scrape, Check & Analyse for You

We do all the following with the web crawler’s scraped data, to identify:

  • All H1 headers and content of tags
  • All H2 headers and content of tags
  • All H3 headers and content of tags
  • All H4 headers and content of tags
  • All H5 headers and content of tags
  • All H6 headers and content of tags
  • No. of each type of header
  • Word count for all headers

 

What This Can Tell You

This canonical data can be used for several things, but with just a small amount of filtering columns you can easily:

  • Find pages missing a H1 header
  • Find all pages with multiple H1 headers
  • Identify poor header structure
  • Identify poor or incorrect use of headers

 

Structural Analysis of Headers

Something with is very easy to see in the data are pages with structural problems with headers. Although some of this work needs to be manually checked, there are some quick wins you can often make from looking at the crawl data.

Because headers should be structured hierarchically, there should always be a H1 if there is a H2, and there should always be a H2 if there is a H3, etc. Simply looking at the data to see if this logic is broken, is a quick and easy way to use the aggregate of this data to derive useful insights.

 

Related Content

The list of guides below might be useful if you are analysing this data and want to know more about it:

 

Benefits of this Data

There are several main benefits to analysing canonical data, some of the more popular ones we’ve listed below:

  • Optimise headers for SEO
  • Fix headers with issues
  • Identify structural issues with headers
  • Audit a site’s headers
  • Scrape competitor headers

 

Sign up for a Free 30-Day Trial today and check for any header Issues on your site & Improve your SEO!