Home Clients Contacts

Pavel Lebedev

Crawling Web 2.0: Content Hunt

Pavel Lebedev | 2009-06-26

Anyone tracking search engine bots (crawlers) for a few last years has certainly noticed dramatic changes in crawling activity.

First, previously it took relatively long time for a new web site to get into the indexes of major search engines, but nowadays it is just a matter of days really. Often web analytics detects first ever visit by search engine crawler within days from the moment a brand new site was submitted (or linked from already indexed site).

Second, frequency of crawling a site by major search engines has increased substantially. It is not uncommon to see crawlers revisiting same web page several times a day. Nowadays there is an intense crawlers activity on a web site (activity that is often invisible in web analytics or, at least, not transparent to web site owners).

Furthermore, having been retrieved by crawler, new or updated web page becomes searchable by entire world (gets updated in search index) within days.

All these are the signs of Web 2.0 Era we are now living in.

Before Web 2.0 Era content was mostly static – there was no need for crawlers to revisit same web page (recrawl site) often since in most cases it did not change anyway.

Currently, things are entirely different. There are plenty of web publishing technology available that make content dynamic, easy to change, update, etc. It is now easy to publish online as never before.

In particular, the advent of blogging technology (you read a blog post now) has contributed to explosive growth of dynamic content.

In addition, the adoption of syndication feeds technology (Atom, RSS) substantially increased reach of and, hence, demand for dynamic content. The supply has followed.

As a result, web is now flooded with dynamic content.

On the other hand, web has become a true commercial venue, unleashing more and more of its economic potential. As a result, competition between search engines has increased. To be competitive, search engines need to keep their ... Read more...

Comments | Tweet this

Add to Google Reader: TrackSite Web Analytics

Pavel Lebedev, web analytics expert/consultant (pavellebedeff yandex.ru)

Read me in Web Analytics Blog

Have questions? Let our consultants answer:
, Skype: tracksite, ICQ 552016042 or chat on-site...

Blog

Twitter, online marketing and online advertising
Website Optimization and Conversion Rate Boost
Web Analytics Is In Details
Firefox vs. IE: Web Analytics Perspective
What is web-analytics?
Who are web analysts anyway?

Web Analytics of Most Popular Browsers
Own Your Own Web Analytics
Browsers Market: Web Analytics Report
Web Analytics Done Right: Know Your Customer
Crawling Web 2.0: Content Hunt

Publications

Add to Google Reader: TrackSite Web Analytics
TrackSite - advanced web analytics software