|
|
Optimizing Dynamic Content
By Jill Whalen and Alan Perkins
Every week, I receive a few questions from people who have dynamically
generated sites and are having trouble getting them spidered and
ranked in the major search engines. I've written about this subject
before, as it's been a long-standing problem in the SEO world.
Instead of simply reiterating my thoughts on the subject, I thought it
would be smart to provide you with some information from someone with
a more technical background than I have. Who better than my
colleague, Alan Perkins?
Alan has been working with search engines since 1995. He holds patents in search
engine technology and was lead developer of
Search Mechanics, a product to help webmasters make their sites more
search-engine-friendly. Alan is also the co-founder of
e-Brand Management, a company dedicated to
helping people build and maintain a successful online presence.
Alan and I are currently working together to optimize a dynamically
generated Web site for one of my clients. It's been quite a learning
experience for me (and the client), and there's a whole lot to it.
The good news is that dynamic sites *can* be optimized to be found in
the search engines, but you do have to know what you're doing in order
to make it happen.
So without further ado, here's my interview with Alan:
---
Jill: Can you explain what dynamic content and dynamic URLs are?
Alan: The terms dynamic URL and dynamic content are frequently used
interchangeably. However, this can lead to confusion because they are
two separate, but related, terms. A URL is not content - a URL is the
address of some content.
Dynamic content is information that is delivered to the Web browser in
a different form than it exists on the server. It is usually pulled
from a database and created on the fly at the server level through CGI
programming, ASP, PHP, or by a content management system such as
BroadVision(tm) or ATG Dynamo(tm).
Dynamic URLs, on the other hand, are simply Web site addresses that
contain a question mark (?).
In contrast, static content is stored on the Web server in the same
format that is delivered to the Web browser. And static URLs do not
contain question marks.
In general, dynamic URLs are addresses of dynamic content, and static
URLs are addresses of static content. However, this need not be the
case, as we shall see later.
Jill: We often hear that search engines have a problem indexing
dynamic content; why is this?
Alan: It boils down to two issues -- the same core content seen at
different URLs, and different core content seen at the same URL.
When the same core content is at different URLs, a small site can
appear to be very large because an unlimited number of URLs can be
used to provide essentially the same content. Spiders can fall into
"dynamic spider traps," crawling through thousands of URLs when only a
few really needed to be crawled. Since a dynamic URL usually indicates
dynamic content, the simplest way for a search engine to avoid these
spider traps is to avoid dynamic URLs altogether. Remember, search
engines want to index any given core content just once.
Now let's consider different core content at the same URL. There are a
number of ways in which this might happen. For example, a site may
have content that may be viewed at the same URL in multiple languages
depending on the browser settings. Another example would be content
that gets updated every few minutes or so.
Whatever the means, search engines typically index only one copy of a
specific URL once every few weeks or so. Therefore, if a search engine
indexes your English content at a given URL, the same search engine
will not index your Spanish content at the same URL (during the same indexing period). And if your content is frequently updated, the
search engine's copy of your content will not be fresh. A search
engine prefers that the visitors to a particular URL see the same
content its spider saw.
Jill: Sounds like sites with dynamic content have an uphill climb when
it comes to the search engines. So what can we do to help them get
indexed?
Alan: The general answer is to give each search engine what it wants:
unique core content at a unique URL, plus the same core content seen
by all visitors.
But I'm guessing you want specifics. So here they are!
1. Use static URLs to reference dynamic content.
If a search engine sees a static URL, it is more likely to index the
content at that URL than if it found the same content under a dynamic
URL. Therefore, you can turn your dynamic URLs into static URLs
despite the fact that you are serving dynamic content. There are a
number of ways of achieving this, and your method will vary depending
upon your server and other factors. To go into all of these methods is
beyond the scope of this interview; however, you can visit the
following sites for two popular servers:
Apache
ASP
2. Link to dynamic URLs from static URL content.
With limited resources, it may prove difficult or impossible for you
to implement a solution based on static URLs. Don't worry! There are
other things you can do.
Over the years, the engines have tried to find ways of crawling
dynamic content while avoiding dynamic spider traps. One technique
they use is crawling dynamic URLs that are linked to from pages with
static URLs. For example, if you give your site map page a static URL,
but have links to dynamic URLs within its content, there's a good
chance that the leading engines will crawl those links. If they like
the content they find there, they will index that content. The search
engines' reasoning here seems to be, "If you're prepared to link to
this content, then so are we."
You can reinforce this reasoning by negotiating links to your dynamic
URLs from pages on other sites (especially high-quality pages which
are already indexed). Again, the search engines' reasoning here is "If
other sites are prepared to link to your site, then so will we." If
others won't link to your dynamic content, that might give you some
idea why search engines won't either! If it proves impossible to get
links to your dynamic content from other sites, then you can't expect
a search engine to link to your site either.
3. Pay for inclusion whenever possible.
AltaVista, Ask Jeeves/TEOMA, FAST and Inktomi offer one or more means
of paying for individual URLs to be spidered. You can use these
paid-inclusion programs to get your dynamic URLs indexed.
Paid-inclusion programs only affect inclusion and do not influence
ranking, so it is still important to make sure your dynamic content is
well optimized. For more details see the Add-URL pages of the
respective search engines.
Conclusions:
1. Search engines have problems creating links to dynamic content.
2. If you can recognize these problems, you are halfway to getting
your dynamic content indexed.
3. Where practical, use static URLs to reference dynamic content.
4. Otherwise, try to ensure your dynamic URL is linked to by content
referenced by static URLs.
5. Consider using paid-inclusion programs.
Jill: Thanks for your answers and your time, Alan!
Jill Whalen of High Rankings is an internationally recognized
search engine optimization consultant and host of the free weekly High Rankings Advisor
search engine marketing newsletter.
She specializes in search engine optimization, SEO consultations and seminars. Jill's handbook,
"The Nitty-gritty of Writing for the Search Engines"
teaches business owners how and where to place relevant keyword phrases on their Web sites so that they make
sense to users and gain high rankings in the major search engines.
Contact WingsDove for affordable and effective small business web design and
web site optimization.
|
|
|
|
|