Google is all we need!
jean | 09.05.2003 10:06
And then there were four:
Why we target Google
Google is one of about four search engines that matter. There are many more than four engines, but only about four have the technology to crawl most of the web on a regular basis. Alltheweb (now owned by Overture) does the best overall crawling, followed by Google and then Inktomi (now owned by Yahoo). Google's bizarre crawling can be unfair for large sites with low or average PageRank. They may not get to many of the pages each month on such sites, even though by the end of that crawl Google is grabbing pages in spasms, at a rate of several per second. Then the next month Google's crawlers start all over again and do the exact same thing.Of these top three crawlers, Alltheweb has the smallest number of users in the U.S. Many webmasters wouldn't notice if Alltheweb disappeared -- despite the fact that they have good technology, both for crawling and for searching. Hopefully Overture can do something with Alltheweb and AltaVista (which they also bought in 2003). While AltaVista has good search technology, their crawling is extremely poor. Now with both under the same roof, Overture has the assets it needs to compete.
Yahoo has never done any crawling. Their purchase of Inktomi makes them potentially competitive as well, and perhaps by 2004 they may rely less on Google for their results. Inktomi provides the algorithmic results for MSN, but even so, their market share is less than one-third of Google's. Microsoft might never be a competitor because they don't have the technology to crawl or search the web. They have the money to buy anything, of course, but the last thing they'd want to buy is something like Google's network, which uses some 15,000 cheap Linux boxes. It is hard to imagine Microsoft's software scaling reliably to this level, and we don't even count them.
That gives us Google, Yahoo, and Overture. The last one worth watching is Teoma/AskJeeves. Their search technology is good, and they have begun to expand their crawl. It remains to be seen how deeply and consistently they will be able to crawl websites with thousands of pages.
Google is easily top dog. They provide about 75 percent of the external referrals for most websites. There is no point in putting up a website apart from Google. It's do or die with Google. If we're all very lucky, one of the other three will offer some competition within several years. If we're not lucky, we will be uploading our websites to Google's servers by then, much like the bloggers do at blogger.com (which was bought by Google in 2003). It would mean the end of the web as we know it.
It is worthwhile to understand the pressures that the average, independent webmaster is under. And given that Google is so dominant, it's important to understand the pressures that are being brought to bear on Google, Inc. It does not take too much imagination to recognize that there's a struggle going on for the soul of the web, and the focal point of this struggle is Google itself.
At one level, it's a struggle for advertising revenue. The pundits look at only this level, and they are unanimous that the only advertising model on the web with any sort of future is one where little ads appear after being triggered by keyword searches, or by the non-ad content of the page. For example, a search for Google Watch may show some ads on the right side of the screen for wrist watches.
While the technique is doesn't work for this example, more often it serves its purpose. There is only so much pixeled real estate that the average user can be expected to survey for a given search. Today up to one-third of each screen is dedicated to paid ads on Google, as compared to the ad-free results of the original Google. We programmed our Google proxy in August 2002 and stripped out the ads because it was easy, but the reason we started the project was due to privacy issues. If it had merely been a question of ads we wouldn't have written the proxy program in the first place. Now it's only seven months later, and we consider the ad stripping to be one of our proxy's best features. That's how quickly everything has changed. Today everyone wants a piece of this new wave in web advertising. Google has become very profitable, and is even having growing pains.
At another level, it's a struggle over who will have the predominant influence over the massive amounts of user data that Google collects. In the past, discussions about privacy issues and the web have been about consumer protection. That continues to be of interest, but since 9/11 there is a new threat to privacy -- the federal government. Google has not shown any inclination to declare for the rights of its users across the globe, as opposed to the rights of the spies in Washington who would love to have access to Google's user data.
Much of the struggle at this new level is unarticulated. For one thing, the spies in Washington don't talk about it. Congress has given them new powers, without debating the issues. Google, Inc. itself never comments about things that matter, and as a private corporation is largely unaccountable. The struggle recognized by Google Watch has to do with the clash of real forces, but right now all we can say is that potentially this struggle could manifest itself in Google's boardroom:
The privacy struggle, which includes both the old issue of consumer protection and this new issue of government surveillance, means that the question of how Google treats the data it collects from users becomes critical. Given that Google is so central to the web, whatever attitude it takes toward privacy has massive implications for the rest of the web in general, and for other search engines in particular.Call it class warfare, if you like. Because that brings up the other major gripe that Google Watch has with Google. That's the PageRank problem -- the fact that Google's primary ranking algorithm has less to do with the quality of web pages, than it has to do with the "power popularity" of web pages. Their approach to ranking is anti-democratic, in that already-powerful pages are mathematically granted extra power to anoint other pages as powerful.
It's not that we believe Google is evil. What we believe is that Google, Inc. is at a fork in the road, and they have some big decisions to make. This Google Watch site is trying to articulate, publicize, and even dramatize the situation at Google, and encourage more scrutiny of their operations. By doing this, we hope to play a small part in maintaining the web as an information tool that is more useful for the masses, than it is for the elites.
That's why we nominated Google for a Big Brother award in 2003. The nine points we raised in connection with this nomination necessarily focused on privacy issues. By the time the 2004 nominations are open, we hope that this list will be shorter rather than longer. But don't count on it.
1. Google's immortal cookie:
Google was the first search engine to use a cookie that expires in 2038. This was at a time when federal websites were prohibited from using persistent cookies altogether. Now it's years later, and immortal cookies are commonplace among search engines; Google set the standard because no one bothered to challenge them. This cookie places a unique ID number on your hard disk. Anytime you land on a Google page, you get a Google cookie if you don't already have one. If you have one, they read and record your unique ID number.
2. Google records everything they can:
For all searches they record the cookie ID, your Internet IP address, the time and date, your search terms, and your browser configuration. Increasingly, Google is customizing results based on your IP number. This is referred to in the industry as "IP delivery based on geolocation."
3. Google retains all data indefinitely:
Google has no data retention policies. There is evidence that they are able to easily access all the user information they collect and save.
4. Google won't say why they need this data:
Inquiries to Google about their privacy policies are ignored. When the New York Times (2002-11-28) asked Sergey Brin about whether Google ever gets subpoenaed for this information, he had no comment.
5. Google hires spooks:
Matt Cutts, a key Google engineer, used to work for the National Security Agency. Google wants to hire more people with security clearances, so that they can peddle their corporate assets to the spooks in Washington.
6. Google's toolbar is spyware:
With the advanced features enabled, Google's free toolbar for Explorer phones home with every page you surf. Yes, it reads your cookie too, and sends along the last search terms you used in the toolbar. Their privacy policy confesses this, but that's only because Alexa lost a class-action lawsuit when their toolbar did the same thing, and their privacy policy failed to explain this. Worse yet, Google's toolbar updates to new versions quietly, and without asking. This means that if you have the toolbar installed, Google essentially has complete access to your hard disk every time you phone home. Most software vendors, and even Microsoft, ask if you'd like an updated version. But not Google.
7. Google's cache copy is illegal:
Judging from Ninth Circuit precedent on the application of U.S. copyright laws to the Internet, Google's cache copy appears to be illegal. The only way a webmaster can avoid having his site cached on Google is to put a "noarchive" meta in the header of every page on his site. Surfers like the cache, but webmasters don't. Many webmasters have deleted questionable material from their sites, only to discover later that the problem pages live merrily on in Google's cache. The cache copy should be "opt-in" for webmasters, not "opt-out."
8. Google is not your friend:
Young, stupid script kiddies and many bloggers still think Google is "way kool," so by now Google enjoys a 75 percent monopoly for all external referrals to most websites. No webmaster can avoid seeking Google's approval these days, assuming he wants to increase traffic to his site. If he tries to take advantage of some of the known weaknesses in Google's semi-secret algorithms, he may find himself penalized by Google, and his traffic disappears. There are no detailed, published standards issued by Google, and there is no appeal process for penalized sites. Google is completely unaccountable. Most of the time they don't even answer email from webmasters.
9. Google is a privacy time bomb:
With 150 million searches per day, most from outside the U.S., Google amounts to a privacy disaster waiting to happen. Those newly-commissioned data-mining bureaucrats in Washington can only dream about the sort of slick efficiency that Google has already achieved.
Index of /heinzreport 1999,2000,2001,2002,2003
Name Last modified Size Description
Parent Directory 04-Jan-2002 11:48 - America.html 28-Jan-2003 10:55 33k Arizona.mid 21-Feb-2003 07:08 54k Beingstupid.html 21-Feb-2003 10:38 21k BinLaden.jpg 22-Oct-2002 09:12 20k BinLondonwotpanel.gif 27-Oct-2002 10:51 5k Blairmotivatorpanel.gif 27-Oct-2002 10:51 9k Blairs-Bloddy-Hands.jpg 29-Jan-2003 03:54 51k BlueRibbon.gif 15-Feb-2003 07:54 40k Britain.html 28-Jan-2003 08:16 16k Bush-Aznar.jpg 27-Oct-2002 10:38 15k Bush-in-the-Church.jpg 08-Feb-2003 02:25 2k Bush-shut-door.jpg 08-Feb-2003 02:25 4k Bush-watch-Powell-Tv..> 08-Feb-2003 02:25 3k Bushbritneypanel.gif 27-Oct-2002 10:51 4k Chemical-Weapons-of-..> 01-May-2003 03:37 23k Deutscher-US-Ministe..> 21-Apr-2003 02:45 20k ElBaradai.jpg 08-Feb-2003 02:25 3k Entbuerokratisierung..> 27-Dec-2002 09:30 41k EpidosodeII.jpg 08-Feb-2003 01:18 286k Fischer-Catholic.jpg 08-Feb-2003 02:52 3k Fischer-Pope.jpg 08-Feb-2003 02:52 3k Fischer-Powell.jpg 08-Feb-2003 02:52 3k HEINZREPORT-WAR-AND-..> 15-Mar-2003 23:59 47k HEINZREPORT2003.html 02-Mar-2003 07:31 28k HagueConvention.html 01-Feb-2003 04:05 103k Indonesia.jpg 16-Oct-2002 06:45 15k Ireland-Yes.jpg 20-Oct-2002 05:43 30k KaspischesOil.jpg 19-Oct-2002 10:40 9k LeCarre.jpg 16-Jan-2003 11:24 12k Lula-Brazil.jpg 28-Oct-2002 10:12 19k Man-of-Peace.jpg 21-Apr-2003 02:45 25k Money.html 21-Feb-2003 11:48 113k Muhammed_Adman.jpg 24-Mar-2003 18:20 41k Nada_Adman.jpg 24-Mar-2003 18:20 21k Powell-Sraw.jpg 16-Oct-2002 07:09 9k Powell.html 08-Feb-2003 02:25 11k Saddam.jpg 21-Apr-2003 02:45 13k Sharon.jpg 05-Nov-2002 06:20 4k Spain.html 02-Mar-2003 07:31 296k The-Course-of-the-Na..> 29-Jan-2003 03:54 14k TwinTowers.jpg 17-Oct-2002 06:46 3k UN-Spies.jpg 02-Mar-2003 06:05 8k US-IrakPetrol-Chief.jpg 21-Apr-2003 02:45 28k US-to-the-Philippine..> 21-Feb-2003 05:47 29k US-warplanes.jpg 05-Mar-2003 12:33 6k Voltaire.gif 08-May-2002 10:58 7k Where-is-Bin-Laden-M..> 01-May-2003 03:38 21k Where-is-Mr.Saddam-M..> 01-May-2003 03:37 18k acreditcardfraud.html 14-Feb-2003 00:37 8k advanced-creditcardf..> 14-Feb-2003 00:19 103k ahardday.mid 21-Feb-2003 07:08 36k alltheweb.html 29-Jan-2003 03:54 16k alo-ahe.mid 21-Feb-2003 07:08 37k anti-bush.jpg 27-Oct-2002 10:17 17k archive-heinzreport...> 28-Feb-2003 12:12 63k argentin.mid 21-Feb-2003 07:05 38k arrivederciroma.MID 21-Feb-2003 08:55 37k au-front.html 01-May-2003 03:05 178k austria.html 31-Oct-2002 02:56 23k axeofevil.html 25-Apr-2003 07:11 10k beingstupid.html 28-Feb-2003 12:12 21k belfast-parliament.jpg 16-Oct-2002 07:09 9k belgium.html 18-Oct-2002 03:17 23k blaircowboy.jpg 24-Mar-2003 20:58 3k bluespanisheyes.mid 21-Feb-2003 08:55 26k buerokratie-abbau.html 15-Jan-2003 08:39 41k bush-newswire.html 24-Jan-2003 05:27 34k bush_nostradamus.jpg 27-Oct-2002 12:09 24k bush_oscar.jpg 27-Oct-2002 10:46 29k bushbush.jpg 18-Oct-2002 03:51 14k bushes_iraqoil.jpg 27-Oct-2002 12:09 40k bushjr.jpg 13-Jan-2003 04:46 8k chirac_schroeder.jpg 15-Oct-2002 00:26 10k createwar.html 21-Oct-2002 10:47 20k creditcardfraud.jpg 14-Feb-2003 00:20 15k deathmanwalking.html 01-Feb-2003 04:05 15k denmark.html 22-Oct-2002 09:19 24k deutschland.html 19-Oct-2002 11:07 24k display.html 25-Apr-2003 07:32 119k espana.html 18-Oct-2002 13:17 24k eu-immigration.html 05-Nov-2002 08:09 14k euro-terror.html 12-Feb-2003 06:03 22k euroflag.gif 15-Feb-2003 07:54 1k europa.mid 21-Feb-2003 07:05 18k europe.gif 19-Oct-2002 09:55 22k europeanflag.gif 18-Apr-2003 11:43 1k europeanunionbookmar..> 18-Apr-2003 12:23 62k eyewire.jpg 23-Oct-2002 08:01 11k finland-bomb.jpg 16-Oct-2002 07:03 13k finnland.html 24-Oct-2002 19:31 24k fischer.jpg 20-Jan-2003 04:24 14k france.html 18-Oct-2002 03:18 23k front.html 25-Apr-2003 07:11 27k front2.html 25-Apr-2003 07:32 28k fuckracism.jpg 01-May-2003 02:47 26k general.html 29-Jan-2003 04:28 11k gerdschroeder.jpg 12-Feb-2003 05:11 8k germany.html 08-Feb-2003 02:25 30k google.html 29-Jan-2003 03:54 17k grass-award.jpg 16-Jan-2003 11:24 45k grass-lecture.jpg 16-Jan-2003 11:24 3k gulf203-flights.jpg 03-Mar-2003 07:57 11k heinz.swf 20-Oct-2002 08:13 70k heinzbanner.gif 27-Jan-2003 12:19 5k heinzlogo.gif 13-Jan-2003 05:05 14k heinzreport-war-and-..> 14-Mar-2003 06:35 47k homepage.html 21-Feb-2003 11:47 20k hungry-in-ny.html 17-Oct-2002 06:21 4k iht.gif 13-Mar-2003 18:53 5k img_good-bye.gif 15-Apr-2003 10:58 4k index2.html 21-Apr-2003 03:29 30k index3.html 25-Apr-2003 07:41 31k indymedia1.jpg 25-Apr-2003 07:11 78k indymedia2.jpg 25-Apr-2003 07:11 47k intheghetto.mid 21-Feb-2003 07:10 49k ira.html 29-Jan-2003 04:28 24k iraks-weapen-of-mass..> 21-Apr-2003 02:54 1k iraqi_cards.gif 21-Apr-2003 02:54 27k irish.html 27-Oct-2002 12:09 22k is-front.html 01-May-2003 03:05 165k israel-woman.html 12-Feb-2003 05:11 28k italy.html 29-Oct-2002 10:14 22k karachi-bomb.jpg 16-Oct-2002 06:53 13k kohle1.jpg 08-Feb-2003 01:18 7k krieg2002.jpg 28-Feb-2003 12:12 60k laha18.jpg 08-Feb-2003 01:18 17k linkheinzreport.html 27-Jan-2003 12:19 4k listen-usreport.html 21-Feb-2003 10:39 12k long.schroeder.jpg 14-Mar-2003 03:32 7k lycos.html 29-Jan-2003 03:54 30k manofhonor.html 25-Apr-2003 07:24 113k mastercard.gif 02-Mar-2003 09:48 1k million-americans.html 18-Oct-2002 03:21 7k napalmNthemorning.ram 05-Feb-2003 09:46 1k national_anthem_-_eu..> 25-Apr-2003 07:24 3k netherlands.html 21-Oct-2002 10:57 24k nuclaire-en-france.html 08-Feb-2003 03:11 39k nuclearbanner.jpg 13-Mar-2003 02:47 18k ny-horror.html 12-Feb-2003 05:11 30k olli_s3.gif 08-Feb-2003 01:19 12k orelhinha.gif 28-Oct-2002 11:54 2k original.html 24-Mar-2003 20:58 23k password-req.html 23-Oct-2002 21:53 3k ph_emergencyservices..> 02-Mar-2003 09:28 23k powell-withould-news..> 08-Feb-2003 02:25 47k pressreleaseno.html 13-Jan-2003 07:30 24k rayanim3.gif 21-Apr-2003 01:59 12k registered-only.html 27-Jan-2003 12:19 2k richest-us-jews.html 12-Feb-2003 05:11 57k rockmeamadeus.mid 21-Feb-2003 08:06 22k satisfac.mid 21-Feb-2003 08:06 51k schroederchirac.jpg 15-Oct-2002 00:02 3k schroederfischer.jpg 16-Oct-2002 06:45 11k schweiz.html 20-Oct-2002 05:52 24k showdown.jpg 02-Mar-2003 06:05 7k smalllogo-visa.gif 02-Mar-2003 09:18 1k spain_right.gif 02-Mar-2003 09:18 1k spielmirdasliedvomto..> 21-Feb-2003 07:52 16k spread-the-word.png 18-Apr-2003 11:43 96k stan_s3.gif 08-Feb-2003 01:19 12k stickerEye.jpg 25-Apr-2003 07:16 4k story.guns.jpg 31-Oct-2002 02:46 7k sweden.html 23-Oct-2002 08:32 28k testsite.jpg 08-Feb-2003 03:11 22k toparticles.html 12-Feb-2003 05:54 115k trinty1a.gif 08-Feb-2003 03:06 9k uk-front2.html 01-May-2003 03:05 20k ukbnn.jpg 02-Mar-2003 07:18 19k ukfront.html 01-May-2003 03:05 174k united-kingdom.html 28-Oct-2002 12:06 23k untitled.bmp 01-May-2003 02:49 42k us-children-poverty-..> 18-Oct-2002 03:21 78k us-fighterplane.jpg 05-Nov-2002 06:20 11k us-front.html 12-Feb-2003 06:01 30k us-profitgames.html 21-Apr-2003 02:54 6k us-wargame.jpg 14-Jan-2003 10:10 17k usconconcentrationca..> 21-Feb-2003 02:35 15k vietnamhospital.jpg 15-Mar-2003 23:52 15k warcrimes.html 27-Jan-2003 08:44 73k washing_hands_in_a_b..> 18-Oct-2002 02:31 5k weapons300.jpg 17-Jan-2003 06:30 12k wrh2.gif 08-Mar-2003 03:27 49k yahoo-india.html 29-Jan-2003 01:43 14k
|
<!-- nedstatbasic("AB1WZA0VlrLGo28HbDy7DhaIMbWQ", 0); // -->
Oh la la mon coupin George! Don't copy Dad! <!-- Original: Martin Webb (martin@irt.org) --> function right(e) { if (navigator.appName == 'Netscape' && (e.which == 3 || e.which == 2)) return false; else if (navigator.appName == 'Microsoft Internet Explorer' && (event.button == 2 || event.button == 3)) { alert ("Oh la la mon coupin George! Don't copy Dad!"); return false; } return true; } document.onmousedown=right; if (document.layers) window.captureEvents(Event.MOUSEDOWN); window.onmousedown=right;
jean
e-mail:
euronews@int.ms
Homepage:
http://www.euronews.int.ms