Comments on: The 4 Pillars of Mastering Google Website Crawl https://diggitymarketing.com/site-crawlability/ Tue, 27 Jun 2023 20:07:33 +0000 hourly 1 https://wordpress.org/?v=6.4.3 By: Adetunji Abel Akandedayo https://diggitymarketing.com/site-crawlability/#comment-1044773 Fri, 12 Jun 2020 21:46:22 +0000 http://diggitymarketing.com/?p=5093#comment-1044773 Please can you make a video for this

]]>
By: Matt Diggity https://diggitymarketing.com/site-crawlability/#comment-29372 Mon, 29 Jul 2019 00:35:42 +0000 http://diggitymarketing.com/?p=5093#comment-29372 In reply to T. Adebiyi.

Good input. I wouldn’t say its the most important… if you don’t link to a page or include it in a sitemap, it won’t get crawled, but you’re certainly onto something.

]]>
By: T. Adebiyi https://diggitymarketing.com/site-crawlability/#comment-29369 Sun, 28 Jul 2019 23:34:43 +0000 http://diggitymarketing.com/?p=5093#comment-29369 The most important crawling signal in 2019 is fantastic, quality content that the bots cannot miss. If you churn out fantastic content on a regular basis, it is a signal for the spiders to crawl.

i had issues recently where more than half of my client’s pages were crawled, but not index. You can guess: thin, lack-of-effort content .

]]>
By: peter https://diggitymarketing.com/site-crawlability/#comment-20326 Tue, 27 Nov 2018 03:35:51 +0000 http://diggitymarketing.com/?p=5093#comment-20326 I have two e-commerce websites, one is about 10,000 pages in the sitemap, have been indexed in google at the first month, but it was dropping the indexed only 300 in the sitemap at the Third month, and it was the same happened at my second websites.
could you tell me what is the problem? and what should I do?
thanks

]]>
By: abu anas https://diggitymarketing.com/site-crawlability/#comment-20109 Mon, 19 Nov 2018 07:32:25 +0000 http://diggitymarketing.com/?p=5093#comment-20109 Nice post, very good article to learn webmaster,
I have one question, what is cache error, when ever i checked my website cache its show 404 error,
please guide me.

]]>
By: Viktor https://diggitymarketing.com/site-crawlability/#comment-17870 Mon, 03 Sep 2018 16:17:28 +0000 http://diggitymarketing.com/?p=5093#comment-17870 Hello! Very interesting article. There was a question – pages with an attribute rel = “canonical” should be closed with a tag “noindex”? Will this save the crawling budget? Thank you!

]]>
By: don https://diggitymarketing.com/site-crawlability/#comment-15373 Thu, 21 Jun 2018 14:08:18 +0000 http://diggitymarketing.com/?p=5093#comment-15373 Great write up,

I’ve been spending a lot of time in the new search console. What are your suggestions for pages that are being crawled, but not indexed?

I’ve seen a large increase in this across several websites over the past 90-120 days and I think this is tied into some of Google’s recent algo changes.

I’m wondering if the best approach is to delete these from the website if they were designed as supplemental pages (blogs, newsletters) and are not the core keyword targets?

]]>
By: Tobi https://diggitymarketing.com/site-crawlability/#comment-15117 Mon, 11 Jun 2018 21:28:50 +0000 http://diggitymarketing.com/?p=5093#comment-15117 Great article. What would be the best way to block sites with a specific word in it? My client has many duplicate pages with _copy in it….so i want to block all sites that contains “_copy”.
Thanks

]]>
By: Rowan Collins https://diggitymarketing.com/site-crawlability/#comment-15112 Mon, 11 Jun 2018 16:30:58 +0000 http://diggitymarketing.com/?p=5093#comment-15112 In reply to Alex.

Hey Alex,

Great observations for sure. Hopefully you got it all fixed up and ranking.

]]>
By: Alex https://diggitymarketing.com/site-crawlability/#comment-15073 Sun, 10 Jun 2018 09:51:49 +0000 http://diggitymarketing.com/?p=5093#comment-15073 I was struggling for almost one year to get a 100,000 pages site fully indexed. While there is no short cut and it will naturally need time I found a lot in practice what Rowan is reporting

1) speed, but not only server speed but fast loading in general. Especially make images as small as possible. Otherwise they will eat a lot of your crawling capacity. I could see in the search console that after I reduced the average page size the “Kilobytes downloaded per day” in the search console essentially stayed the same (same crawling budget) but the “Pages crawled per day” went up as each page had gotten smaller

2) when I enhanced usability of the site and in general increased the value this site brought on the table for users I could see in analytics that the average time spent on site by USERs went up. At the same time also the number of pages indexed went up. Although there is certainly no simple correlation it is probably safe to say that the more users like your site the more pages google will be willing to crawl and index

3) strong internal linking. Internal linking is even more important than a sitemap.

Tests:

a) new domain with 10,000 pages having a sitemap but no internal linking
b) new domain with 10,000 pages having no sitemap but 1-5 internal links per page

I performed this test several times. In ALL cases b) outperformed a)

4) Rowan mentioned backlinks which is very important for sure. Also an expired domain with strong backlinks coming in will get its pages getting indexed much faster at the start than a new domain as of my findings

]]>
By: Bostjan https://diggitymarketing.com/site-crawlability/#comment-14988 Fri, 08 Jun 2018 16:32:33 +0000 http://diggitymarketing.com/?p=5093#comment-14988 In reply to Matt Diggity.

Tnx. Will have to take some time to do this.

]]>
By: Gundeep https://diggitymarketing.com/site-crawlability/#comment-14840 Wed, 06 Jun 2018 12:25:40 +0000 http://diggitymarketing.com/?p=5093#comment-14840 Hello Matt,

Great post, I have learned too much today.

Just a bit confused about the crawl budget, will research on this for sure.

]]>
By: Matt Diggity https://diggitymarketing.com/site-crawlability/#comment-14809 Wed, 06 Jun 2018 01:57:49 +0000 http://diggitymarketing.com/?p=5093#comment-14809 In reply to PC.

Yoast has fixed the issue with the latest version.

]]>
By: Rowan Collins https://diggitymarketing.com/site-crawlability/#comment-14771 Tue, 05 Jun 2018 14:00:30 +0000 http://diggitymarketing.com/?p=5093#comment-14771 In reply to Harold Crow.

Hey Harold,

I think John Mueller is being very transparent on this matter. In the article I mention that people who have crawl budget problems often have site problems. Fixing the root of the problem will ultimately fix the crawl problem.

If this is because your content is low quality, then fixing this will be a bigger help.

If you lack links or have too many links, then fixing this will also have a positive impact. It’s about finding the right balances between these signals.

My personal approach is that crawl budget doesn’t really exist in practice, but only in principle. It’s interesting conceptually, but the main thing is to improve value on the core pages that you want users to engage with.

]]>
By: Rowan Collins https://diggitymarketing.com/site-crawlability/#comment-14770 Tue, 05 Jun 2018 13:55:11 +0000 http://diggitymarketing.com/?p=5093#comment-14770 In reply to Terry O’Connor.

Hey Terry,

I tackle each problem at the root, and look for whatever is going to achieve the best results with the minimal input and collateral.

Sometimes I’ll use a plugin, other times I will edit the php or liquid files directly. It depends largely on which platform you’re using and the capabilities of it.

]]>
By: Rowan Collins https://diggitymarketing.com/site-crawlability/#comment-14769 Tue, 05 Jun 2018 13:53:20 +0000 http://diggitymarketing.com/?p=5093#comment-14769 In reply to Kris Rivenburgh.

Hey Kris,

I personally use several different page speed tools, and load on 4G through my mobile phone.

I’m mostly looking for site speed differences between regions, the client’s target location, and whether site speed is likely a problem for their website.

If your website is consistently performing worse from the United States than any other region; you may wish to tackle this and see if it brings any uplift. Some people will benefit more than others.

In regards to Content Distribution Networks, I work with clients that use tons of different providers. I’ve not seen that one outperforms others from an SEO perspective, but you should definitely look into where the nodes are located and if this aligns with your goals for the CDN.

Remember that pricing and implementation may depend on each website, so this will also be a factor.

]]>
By: Rowan Collins https://diggitymarketing.com/site-crawlability/#comment-14768 Tue, 05 Jun 2018 13:43:20 +0000 http://diggitymarketing.com/?p=5093#comment-14768 In reply to Steev Bar.

Hey Steev,

The Yoast automatic sitemap gets the job done, but it doesn’t really hold any advantages over a simplified sitemap.

This will depend largely on your website. The bigger it is, the more benefit you’re going to have from compartmentalised sitemaps.

The main advantage is that it’s done server side and will adapt based on your page and posts. If you go down the road of using Screaming Frog, then you will need to do it manually whenever you make changes.

]]>
By: PC https://diggitymarketing.com/site-crawlability/#comment-14747 Tue, 05 Jun 2018 08:13:29 +0000 http://diggitymarketing.com/?p=5093#comment-14747 Nice post!! complete and it has useful info. And what is with Yoast now ?

]]>
By: Matt Diggity https://diggitymarketing.com/site-crawlability/#comment-14746 Tue, 05 Jun 2018 07:13:53 +0000 http://diggitymarketing.com/?p=5093#comment-14746 In reply to Terry O’Connor.

Easier to use a plugin. Better to do it from the file level.

]]>
By: Raghuveer Singh Rao https://diggitymarketing.com/site-crawlability/#comment-14744 Tue, 05 Jun 2018 06:17:07 +0000 http://diggitymarketing.com/?p=5093#comment-14744 Yes, I have faced the same issue with my wordpress website. Thank you so much Matt, You always comes out with solution. Once again thank you for your helpful insights.

]]>
By: Terry O'Connor https://diggitymarketing.com/site-crawlability/#comment-14741 Tue, 05 Jun 2018 04:42:43 +0000 http://diggitymarketing.com/?p=5093#comment-14741 This is great.

I am just wondering whether it’s better/more efficient to control index and crawl from robot.txt/htaccess level, or just us a plugin like Yoast to control the pages manually.

]]>
By: Matt Diggity https://diggitymarketing.com/site-crawlability/#comment-14728 Tue, 05 Jun 2018 00:15:40 +0000 http://diggitymarketing.com/?p=5093#comment-14728 In reply to Bostjan.

You can use the robots.txt file’s wilkdcard feature to handle many at a time. Or use the redirection plugin if you have a few hours available in a bored afternoon.

]]>
By: Kris Rivenburgh https://diggitymarketing.com/site-crawlability/#comment-14724 Mon, 04 Jun 2018 23:02:25 +0000 http://diggitymarketing.com/?p=5093#comment-14724 Excellent write-up, Rowan.

As far as distance from Google servers, if your website loads quickly otherwise but shows as slower in Google page speed, would this be an effective measure of being further away?

Is MaxCDN your preferred content distribution network? Do you consider it best practice to have a CDN anyways?

Thank You!

]]>
By: Steev Bar https://diggitymarketing.com/site-crawlability/#comment-14719 Mon, 04 Jun 2018 21:51:57 +0000 http://diggitymarketing.com/?p=5093#comment-14719 So the Yoast automatic sitemap isn’t good for google to recognize the important pages ?

]]>
By: Harold Crow https://diggitymarketing.com/site-crawlability/#comment-14703 Mon, 04 Jun 2018 15:22:39 +0000 http://diggitymarketing.com/?p=5093#comment-14703 Nice post!! complete and it has useful info.
Days ago i read this post https://www.seroundtable.com/google-crawl-budget-overrated-25825.html where John Muller talks about crawl budget, maybe you can read it and let me know what do you think.
Thank you very much for the effort

]]>
By: Bostjan https://diggitymarketing.com/site-crawlability/#comment-14701 Mon, 04 Jun 2018 15:13:25 +0000 http://diggitymarketing.com/?p=5093#comment-14701 Very useful article. I am just checking for my errors and I have found a lot of not found errors for pages!
exp.:
https://www.example.com/page/4/article/

How do I solve this part or do I have to do redirects for all those URLs (there is over 200 of them)?

Thanks
B

]]>
By: Louie Modling https://diggitymarketing.com/site-crawlability/#comment-14700 Mon, 04 Jun 2018 15:11:40 +0000 http://diggitymarketing.com/?p=5093#comment-14700 This is a GREAT article, thanks for delivering it.

]]>
By: Lekan https://diggitymarketing.com/site-crawlability/#comment-14698 Mon, 04 Jun 2018 14:34:38 +0000 http://diggitymarketing.com/?p=5093#comment-14698 Hello Rowan,

Thank you very much for this insightful post.

One other thing that affects crawling and indexing as you rightly said is the quality of the content on the page.

If the contents doesn’t seem to provide value perhaps because they are spun, Google might crawl it but won’t index it

]]>