In one of Matt Cutts’ latest Google Videos he covers the topic of dynamic URLs and what effect they have on indexing the site basically.
Well, I have heard a lot of things about this topic, generally ending up at the conclusion that it wasn’t of great importance what sort of / number of variables you put into the URL and if I really wanted a dynamically generated page indexed then I could link to it from a static page indexed by Google and I should be right.
Interestingly Matt does touch on this mentioning something like ‘there were search engines that would only crawl one deep into dynamically generated content’ – such as what I just suggested, the spider would visit dynamic URLs linked to from static pages, but go no further. This is where the one deep comes into it. I guess this was to stop spiders from getting mixed up into those never ending loops of indexing dynamic pages with similar content over and over again, wasting resources on often times duplicate content.
But it appears this is no longer the case, matt suggests 2 or 3 parameters at most in the URL string, and no large numbers that look like session ids.
He also suggests to use Apache’s mod_rewrite if you have instances where you have the same format of parameters being passed through. I’m sure I will write a mod_rewrite post because it is such a useful tool, but for now, here are a few useful links…
- Apache mod_rewrite reference
- Test mod_rewrite enabled
- Forum post guide
- Perl Regular Expressions Syntax
One Response
Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.
Continuing the Discussion