
May. 3, 2007 at 12:19pm Eastern by Barry Schwartz

Google Features Volkswagen, Which Happens To Be Search Spamming

Volkswagen Hiding Text

The Google Enterprise Blog recently featured the Volkswagen web site for using Google Enterprise search to power a new feature on the VW web site. As you can see, the Volkswagen home page has a huge search box in the middle of the page. Cool, right?

Danny and I think so. As Danny was explaining the news on the Daily Search Cast today, he noticed that the site loads the box up in Flash. Looking at the source code, he discovered hidden text! Yes, hidden text on a page that was featured by an official Google blog.

Here is the text that is clearly not visible on the page. It's kept invisible using a special style class called "invisibleContent":

<div class="invisibleContent">Volkswagen of America presents U.S. vehicle information, pricing, incentives, deals, comparisons on Eos, GTI, Jetta, New Beetle, New Beetle Convertible, Passat, Passat Wagon, Touareg, Rabbit, R32 and the GLI with links to VW dealers, owner information, Volkswagen merchandise, and VW accessories. homepage, volkswagen, volkswagon, vw.com, home, landing, top, volkswagen.com, home page, home, top, back, VWofAmerica, Volkswagen of America, Volkswagon of America, VWoA, VWofA, volkswagon.com</div>
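For readers who want to check a page for this kind of thing themselves, here is a rough sketch of how such text could be spotted automatically. It assumes you already have the page's HTML and CSS as strings, and it only understands simple class rules like the `.invisibleContent` rule a reader quotes in the comments. The function names (`hidden_classes`, `find_hidden_text`) and the list of "hiding" declarations are my own illustration, not anything Google or Volkswagen uses.

```python
from html.parser import HTMLParser

# Declarations that commonly hide text from visitors (illustrative, not exhaustive).
HIDING_DECLARATIONS = {("visibility", "hidden"), ("display", "none")}

# Tags that never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


def hidden_classes(css):
    """Return class names whose rule hides content.

    Assumption: only simple `.name { prop: value; ... }` rules are handled.
    """
    found = set()
    for rule in css.split("}"):
        selector, _, body = rule.partition("{")
        selector = selector.strip()
        if not selector.startswith("."):
            continue
        declarations = set()
        for declaration in body.split(";"):
            prop, _, value = declaration.partition(":")
            declarations.add((prop.strip().lower(), value.strip().lower()))
        if declarations & HIDING_DECLARATIONS:
            found.add(selector[1:])
    return found


class HiddenTextFinder(HTMLParser):
    """Collect text that sits inside elements carrying a hidden class."""

    def __init__(self, classes):
        super().__init__()
        self.classes = classes
        self.depth = 0  # greater than 0 while inside a hidden element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        class_attr = dict(attrs).get("class") or ""
        if self.depth or self.classes & set(class_attr.split()):
            self.depth += 1

    def handle_endtag(self, tag):
        if self.depth and tag not in VOID_TAGS:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.chunks.append(data.strip())


def find_hidden_text(html, css):
    """Return the text chunks found inside elements hidden by a class rule."""
    finder = HiddenTextFinder(hidden_classes(css))
    finder.feed(html)
    return finder.chunks
```

Fed the div above and the stylesheet rule, `find_hidden_text` would return the keyword list as a single chunk. A real checker would also need to handle inline styles, off-screen positioning, and text colored to match the background, none of which this sketch attempts.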

Google has guidelines against using hidden text. In fact, such use got a different car maker, BMW, banned briefly from Google last year. YADAC: Yet Another Debate About Cloaking Happens Again covers both of these points.

Even Google has violated its own rules. Back in 2005, text meant for internal indexing was showing up on public pages, causing one part of Google to file for a reinclusion request with another part of Google. From what Google said at the time in a WebmasterWorld discussion:

Those pages were primarily intended for the Google Search Appliances that do site search on individual help center pages. For example, http://adwords.google.com/support has a search box, and that search is powered by a Google Search Appliance. In order to help the Google Search Appliance find answers to questions, the user support system checked for the user agent of "Googlebot" (the Google Search Appliance uses "Googlebot" as a user agent), and if it found it, it added additional information from the user support database into the title.

The issue is that in addition to being accessed via the internal site-search at each help center, these pages can be accessed by static links via the web. When the web-crawl Googlebot visits, the user support system thinks that it's the Google Search Appliance (the code only checks for "Googlebot") and adds these additional keywords.

That's the background, so let me talk about what we're doing. To be consistent with our guidelines, we're removing these pages from our index. I think the pages are already gone from most of our data centers--a search like [site:google.com/support] didn't return any of these pages when I checked. Once the pages are fully changed, people will have to follow the same procedure that anyone else would (email webmaster at google.com with the subject "Reinclusion request" to explain the situation).
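To make the mix-up in that statement concrete, here is a minimal sketch of the kind of check being described. Everything in it is hypothetical: the function names, the title format, and the internal address range are my assumptions for illustration, not Google's actual code. The first function adds keywords for any user agent containing "Googlebot", which is the behavior the statement describes. The second shows one possible way to tell the two apart, by also requiring that the request come from an assumed internal appliance network.

```python
import ipaddress

# Hypothetical address range for the internal site-search appliance (assumption).
APPLIANCE_NETWORK = ipaddress.ip_network("10.0.0.0/8")


def title_naive(title, extra_keywords, user_agent):
    """Add keywords for anything calling itself Googlebot.

    This mirrors the described behavior: the web crawler and the search
    appliance share the user agent, so both receive the extra keywords.
    """
    if "Googlebot" in user_agent:
        return f"{title} | {extra_keywords}"
    return title


def title_stricter(title, extra_keywords, user_agent, remote_ip):
    """Add keywords only when the request also comes from the appliance network.

    One possible fix, sketched under the assumption that the appliance
    has a known internal address.
    """
    from_appliance = ipaddress.ip_address(remote_ip) in APPLIANCE_NETWORK
    if "Googlebot" in user_agent and from_appliance:
        return f"{title} | {extra_keywords}"
    return title
```

With the naive version, a public crawl and an internal site-search crawl get the same keyword-stuffed title, which is cloaking as far as the public index is concerned. With the stricter version, a request from outside the assumed internal range sees the same title as any visitor.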

Postscript: The Google Enterprise blog followed up with a post saying that they contacted the Volkswagen team, and that Volkswagen removed the hidden text from the page and placed it in the page's meta description.

Reader Comments

At least they disguised the fact it was hidden well with their choice of class name!

I don't think I've ever seen anyone spam the search engines for highly monetizable words like "homepage" and "home page" before.

I wonder what their homepage-based spam revenues have been like?

.invisibleContent { width:1px; height:1px; visibility:hidden; overflow:hidden;}

Comment by savage [TypeKey Profile Page] | May 3, 2007 12:50 PM

The better news is that they don't just do this on the home page, they do this throughout the site with unique "content" in the invisibleContent area depending on where you are. Wonder if they are doing this for the "exposure"? Could that be the reason they were so obvious?

Nice catch, but in all fairness to Google, this certainly isn't the first time they've had two sets of rules for the little people and for those who have connections. What would be news is if they treated all sites the same.

Barry, it's another good Google "gotcha" and shows that Google often implements the rules inconsistently. This is one of many reasons it would be nice to see pressure from you, Danny, SEW, WMW, SearchEngineLand to get Google to create a "Chief of Collateral Damage" who would help examine the ways the algo sometimes punishes "good" sites while rewarding "cheating" sites as they have here.

I think I already visited that site and it was a really cool site... Changing, adding, replacing your chosen VW aftermarket part and automatically computing the cost of the VW was the coolest part of that site.. I did not notice that they already broke some of Google's rules regarding the use of HTML code...

In fairness to Google, you cannot expect them to check every site before they will accept ad or partnership dollars?

Perhaps the Googlers like their VWs but not their BMWs?
:)

Odd

It would be OK if they were sniffing for Flash and displaying the text to a non-Flash browser.

VW do seem to have a habit of breaking the rules.

Though I wonder if it's a lame attempt to handle non-Flash browsers.

What is it with car manufacturers and Flash? What does, say, ttremastered do for Audi's brand?

Google pinged them and VW quickly changed it:
http://googleenterprise.blogspot.com/2007/05/vw-website-updates.html

Question: Does Google reach out to all sites with hidden text before they ban them, or just to big ones?

Why do they hide content? There is a "legal" way to hide it. They can just place a small button, and an onclick handler will show the content :)

Comment by Eugeniu [TypeKey Profile Page] | May 4, 2007 6:08 AM

Just one small point -
The Car Geek....
I read your comments above (and of course noted the nifty little link that you have added in your post)
Thanks for the 3x spam comments which you tried to add to my blog this am.
It's getting a little tiresome...

///
I think I already visited that site and it was a really cool site... Changing, adding, replacing your chosen VW aftermarket part and automatically computing the cost of the VW was the coolest part of that site.. I did not notice that they already broke some of Google's rules regarding the use of HTML code...
///

Comment by follett [TypeKey Profile Page] | May 4, 2007 10:40 AM

Hi,

I can't see the div with the class "invisibleContent" in the VW source code. I'm on http://www.vw.com and viewing the "Page Source" using Firefox. When I look at the CSS file http://www.vw.com/global/css/html_default_v2.css, I do see the "invisibleContent" class, but I'd like to know how to see the div we're talking about in the source code as well.

Can you let me know what I'm missing?

Thanks!

Comment by ceggi [TypeKey Profile Page] | May 4, 2007 12:04 PM

Re-posting my question since it came out wrong--sorry for the duplication.

I'd like to know how to view the div we're talking about in the source code. I can see the "invisibleContent" class in the CSS, but not the div that contains the invisible text. I'm using "View Source" in the Firefox browser. Do I need a different tool to see the div? Or has VW already removed it?

Thanks!

Comment by ceggi [TypeKey Profile Page] | May 4, 2007 12:22 PM

ceggi, yes they took it down already. read all comments before leaving your own.

"In fairness to Google, you cannot expect them to check every site before they will accept ad or partnership dollars?"

i thought google's core function was "checking sites" ?? a company who is going to "organize the world's information" should be able to handle "checking every website" right?

Comment by savage [TypeKey Profile Page] | May 4, 2007 1:27 PM

It looks like VW is quite embarrassed over this. Did they hire a shady SEO firm or was hidden text the brilliant idea of the VP of marketing?

Either way, you can see the old cached version here (it won't last long):
vw cached spam

the current page disallows caching. This will be interesting: VW doesn't want search engines to cache its pages? They go from spamming for dollars to blocking indexing spiders?

their code:
meta http-equiv="Pragma" content="no-cache"

A good new design, but the tag cloud never changes? Is it a static tag cloud?
