Thursday, April 17, 2008

Google Crawling Past HTML Forms

Regular readers of our blog will probably attest to the idea that we are big proponents of the Google search engine. Let's face it, we abide by Google's suggested rules of engagements and our efforts result in quality listings for our clients.

It is rare that we speak negatively of the Google "system" outside of rising Pay Per Click costs and a general lack of competition.

We have a new gripe.

Google Is Crawling Past HTML Forms

A recent post to the Google Webmaster Central Blog tells all about how Google is programming their spider to explore content behind HTML forms. The main reason for this work is to "discover new web pages and URLs that we otherwise couldn't find and index..."

They go on to speak about adhering to good Internet citizenry practices such as following noindex, nofollow, and robots.txt directives. They obviously won't get past a login form but they will quickly bypass a GET form pointing to database results.

This is an SEO nightmare.

Optimized Your Post-Form Pages Much?

If you've done your homework, your database site has a navigational flow to the deep content that you want indexed. Your .php and .asp form results are undoubtedly pure user functionality with little care to metatags and headers.

The forms are just a way for your visitors to get right to the goods, right?

Well, now your non-optimized, form-result pages with non-specific metatags (if they exist) are available to the Googlebot.

Don't build pages for Google!

How many times have you heard that statement? Don't build content for Google. Don't optimize content for Google. Yeah, right.

By definition their search engine looks for, and appropriately ranks, relevant content. The only way that content gets relevant is if someone makes it relevant. They're about to climb through some forms and find a wealth of unoptimized, irrelevant trash.

In their own words, "This experiment is part of Google's broader effort to increase its coverage of the web."

Unfortunately this effort will uncover some real garbage. I have a feeling most of that garbage is titled "Search Results".

No comments: