Google's New Spider - Friend or Foe?

20 Feb | Search Engines

by Rob Sullivan
http://www.enquiro.com

Ever since we first heard of Big Daddy, Google's new data infrastructure, I've been watching for anomalies across the Googlesphere.

And there was something interesting that began even before Big Daddy was announced.

There is a new Googlebot roaming the web, and it acts like no other Googlebot before it.

I've seen reports of this Googlebot doing weird things that no other crawler seems able to do.
First, let's start with identification. The new Googlebot announces itself with this user agent string:

Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
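If you want to spot this new bot in your own server logs, a simple user-agent check will do it. This is a minimal sketch of my own; the function name and regex are illustrations, not anything Google publishes, though the user agent string itself is the one shown above.

```python
import re

# The user agent string is Google's published one; the regex and function
# name below are my own illustration for log filtering.
NEW_GOOGLEBOT = re.compile(
    r"Mozilla/5\.0 \(compatible; Googlebot/2\.1; "
    r"\+http://www\.google\.com/bot\.html\)"
)

def is_new_googlebot(user_agent: str) -> bool:
    """Return True if a request's User-Agent matches the new Mozilla-based Googlebot."""
    return bool(NEW_GOOGLEBOT.search(user_agent))

# The new Mozilla-based bot matches; the old-style UA does not.
print(is_new_googlebot("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"))  # True
print(is_new_googlebot("Googlebot/2.1 (+http://www.google.com/bot.html)"))  # False
```

Because the new bot identifies itself as Mozilla-compatible while the old one does not, this one test is enough to split the two crawlers' traffic apart in a log file.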

Now the first difference you'll notice is that it identifies itself as Mozilla-compatible. This makes sense, since Google has been busy hiring Firefox developers, and the Firefox browser is built on the Mozilla rendering engine; it makes sense to put those developers to work building a new crawler.

Originally the industry speculated that Google was going to get into the browser wars, but now that doesn't appear to be the case (at least to me).

So why would Google want to build a new crawler? More importantly why would they want to do it on the Mozilla engine?

Well, consider that the old Googlebot is built on Lynx, a fairly old text-based web browser.

Lynx is limited in what it can do: because of its text-based nature it can't execute JavaScript and it can't render CSS. It is a nice, small, fast web browser, but as a web user you give up too much by using it.

And it is these shortcomings that have been issues for all crawlers, not just Google. You see, all the engines use similar frameworks for their crawlers, therefore they have similar limitations.

Why use Mozilla?

Well, for one thing, it's open source, which means anyone can use the codebase. For another, the Mozilla base is much more advanced than Lynx: Mozilla CAN handle JavaScript, CSS and more.

In fact, Mozilla is a modern browser engine capable of rendering essentially any modern web content.

Right away you can see why Google would want to build this new crawler: its increased capabilities allow for more advanced crawling and indexing of the web.

So what anomalies am I seeing?

While I can't confirm too much of this at this point, my gut is telling me it's mostly true.

For one thing, this new spider is a spider on steroids: it is hyperactive in its crawling. I've already had two clients whose websites went down because of its activity.

To put this in context, let me tell you how the new Googlebot compares to the old Googlebot on just one client's site:

In a random three-day sample of Googlebot activity on that site, the Mozilla bot requested almost 99,000 pages. Over the same period the old bot requested only 14,500 pages; the new bot requested almost seven times as many.
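You can reproduce this kind of comparison yourself by tallying hits per user agent in your access logs. A quick sketch, using the two user agent strings and the figures from the site above as stand-in data (the function and constants are my own, not from any log tool):

```python
from collections import Counter

# The two published Googlebot user agent strings.
OLD_UA = "Googlebot/2.1 (+http://www.google.com/bot.html)"
NEW_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

def crawl_ratio(user_agents):
    """Given one User-Agent string per logged request, return new-bot hits / old-bot hits."""
    counts = Counter(user_agents)
    return counts[NEW_UA] / counts[OLD_UA]

# Stand-in data matching the three-day figures quoted above: 99,000 new-bot
# requests vs 14,500 old-bot requests.
sample = [NEW_UA] * 99_000 + [OLD_UA] * 14_500
print(round(crawl_ratio(sample), 1))  # 6.8
```

That ratio, 99,000 / 14,500 ≈ 6.8, is where the "almost seven times" figure comes from.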

Not only that, there have been reports of the new bot filling out and submitting forms. I've asked a few clients of mine who run forms whether they can confirm this, and my gut tells me it's so. It's not a random act, or someone spoofing an IP or user agent: this is truly an intelligent spider that can emulate human actions.

And it makes sense, considering that Google wants its users' experience to be the best on the web. They are going to want to ensure that the sites which show up at the top of the SERPs are also the most user friendly: the CSS must look right, the JavaScript must work properly, and forms must not contain buggy code.

In the end I think this new crawler is going to catch the web by surprise. Most of the sites we work on, we build anticipating how the old Googlebot will react to them; if there is indeed a new Googlebot out there (and I do believe there is), we are going to have to rethink how we code sites.

If we are hiding things in CSS or JavaScript, those tactics will no longer work. If we are hiding text in div layers or behind images, those too will no longer work.
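To see why those tactics stop working, consider the simplest case: text hidden with an inline CSS rule. A text-based crawler never evaluates the style, but a rendering crawler can. Here is a toy check of my own devising (the regex and function are illustrations, not anything Google has documented) for the kind of hidden text a rendering bot could flag:

```python
import re

# My own illustrative check: flag elements whose inline style attribute
# hides them via display:none or visibility:hidden.
HIDDEN_STYLE = re.compile(
    r'style\s*=\s*"[^"]*(display\s*:\s*none|visibility\s*:\s*hidden)',
    re.IGNORECASE,
)

def has_css_hidden_text(html: str) -> bool:
    """Return True if the page contains inline-CSS-hidden markup."""
    return bool(HIDDEN_STYLE.search(html))

print(has_css_hidden_text('<div style="display:none">keyword keyword keyword</div>'))  # True
print(has_css_hidden_text('<div>visible content</div>'))  # False
```

A real rendering crawler would of course go much further, resolving external stylesheets and computed styles, but even this toy version catches what a Lynx-style bot is structurally blind to.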

Essentially what Google is doing is closing a few more loopholes. This will make it harder for some black hatters but as long as you follow Google's Webmaster Guidelines you will be alright.


Rob Sullivan
Head Organic Search Strategist
Enquiro Full Service Search Engine Marketing

Copyright 2003 - 2006 - Searchengineposition Inc.

