|
A major factor with search engines is the number of pages that they find on your site.
This raises intriguing questions as to:
- How do they find out?
- How to they get it right?
The way they find out is at the basis level simple enough. They start at a page of your site that they already know about. Probably via a link from another site, and they index that page, examine the links on that page and visit those pages, repeating the process until the site is fully indexed. This is known as "spidering".
But this simplistic explanation falls over very rapidly. If the web was made up of static pages written in plain html it would work, but just look at the page you are on now! In common with almost all of our pages, it is prefixed index.php and a set of parameters decide which content to show, and what is more that content is dynamically generated from a database!
Thus we could easily create a page:
randomquotes.php?lasthash=1
This page would display a random few quotes from a quote database and have a link to the page:
randomquotes.php?lasthash=2
and so on ad-infinitum.
How does a search engine deal with this situation? Will it spider your pages for ever believing you to have an infinite number of differerent pages? Well clearly not!
One way a search engine will judge, is by looking at the meta tags and page titles. If these are identical it will be able to limit the number of pages it visits.
But I hear you say, I could easily make these dynamic as well! that would screw up the search indexing system. The spider might stop eventually, but it will think I have a massive site and this will improve my search engine rankings!
Well maybe?, maybe not? Search Engines employ a lot of smart techniques that they are certainly not going to disclose for the very reason that people in the SEO Search Engine Optimisation business will be quick to find ways to work around.
It is also worth considering that they may flag suspect sites for human intervention and they may fry your sites butt for attempting to spoof them.
That's the bad news, the good news is that there are legitimate techniques that you should be considering, techniques that help your pages show up for accurate indexing.
Consider your page design and navigation. If you are using Mambo as your content managment system (similar rules are likely to apply to other portal systems) then you have two basic choices when displaying content.
- Blog style, a series of articles appears on an individual page for easy perusal by the casual viewer.
- An indexed style where article titles appear and you have to click on each article to see the article in its own individual page.
For short articles the Blog style is far more user friendly in showing off dynamic web content. Each time the user revisits a page, the page (assuming you are writing new content) will have clearly changed and they can start reading from the top. In the indexed style there is a static header and a list the user must look at and decide if the headlines are new and worth looking at.
In essences for humans you often want Blogs, but for search engines the situation is reversed. A Blogged page will lack the clear titles and reference tags that help define the page for the search engine.
The answer to this dilemma is really blindingly obvious and at least in Mambo very easy. You make the main navigation into a page a "blog" style, but you also provide a link to the same content in the indexed style.
Where once you had a single page with 6 articles, none of which might index well with the search engines, you now have seven pages, with six of those pages capable of having very accurate title and meta tags.
|