How to use crawl report filters
-
Create a crawl
-
Crawling report
-
Outbound links
-
Top words
-
Compare crawl results
-
Delete
How crawl report filters work
The crawl report supports field-based filters with logical operators. You can combine multiple rules to isolate technical SEO issues quickly.
Practical filter examples
Find internal source pages where a URL was discovered
- Open the crawl report.
- Use the From URL column. It shows the source page where the crawler found the link to the current URL.
Find duplicate title tags
- Open the crawl report.
-
Set the following filters:
Column Operator Value content-type start with text/html and title [only_duplicate] and http status = 200 - Click Find.
- The report will show URLs with duplicate title tags in this crawl job.
Find pages with missing title tags
- Open the crawl report.
-
Set the following filters:
Column Operator Value content-type start with text/html and title empty and http status = 200 or content-type start with text/html and title count = 0 and http status = 200 - Click Find.
Find pages where the title tag exists but is empty
- Open the crawl report.
-
Set the following filters:
Column Operator Value content-type start with text/html and title count > 0 and title str length less 1 and http status = 200 - Click Find.
Find pages with missing H1
- Open the crawl report.
-
Set the following filters:
Column Operator Value content-type start with text/html and H1 1 empty and http status = 200 or content-type start with text/html and H1 count = 0 and http status = 200 - Click Find.
Find duplicate H1 tags
- Open the crawl report.
-
Set the following filters:
Column Operator Value content-type start with text/html and H1 1 [only_duplicate] and http status = 200 - Click Find.
- The report will show URLs that share duplicate H1 values.
Find HTTP URLs (non-HTTPS)
- Open the crawl report.
-
Set the following filter:
Column Operator Value url not start with https - Click Find.
- The report will return URLs that do not start with
https.
Other fields you can use for duplicate/missing checks
url, found on, redirect to, content-type, last modified, http version, title, h1 1, h1 2, h1 3, h1 4, h1 5, h1 6, h2 1, h2 2, h2 3, h2 4, h2 5, h2 6, h3 1, h3 2, h3 3, h3 4, h3 5, h3 6, meta descriptions, meta keywords, canonical, robots, http canonical, http robots, og title, og descriptions, metrika, google analytics, hreflang, http hreflang
The report also includes quick templates in the filter panel for common investigations:
Issues
- 200 URLs
- Non 200 URLs
- 301 Redirects
- Non-301 Redirects
- 4xx Client errors
- 5xx Server errors
- Timeout
- HTTPS to HTTP redirect
Indexation
- HTML Canonical ≠ URL
- HTML Canonical on non-200
- HTTP Canonical ≠ URL
- HTTP Canonical on non-200
- HTML Meta noindex Pages
- HTTP noindex Pages
- URLs with Nosnippet attribute
- More than one HTML canonical on page
- HTML canonical is missing
- HTML canonical is empty
- HTML canonical from HTTP to HTTPS
- HTML canonical from HTTPS to HTTP
- HTTP canonical is missing
- HTTP canonical is empty
- HTTP canonical from HTTP to HTTPS
- HTTP canonical from HTTPS to HTTP
- HTTP canonical ≠ HTML canonical
- HTTP robots ≠ HTML robots
Content
- Duplicate Title
- Duplicate H1
- Duplicate Meta description
- Uppercase Title
- Uppercase H1
- More than one Title on page
- Title is empty
- Title is missing
- Title too long
- Title too short
- More than one H1 on page
- H1 is empty
- H1 is missing
- H1 too long
- H1 too short
- More than one meta description on page
- Meta description is empty
- Meta description is missing
- Meta description too long
- Meta description too short