Posts tagged with "bing"


Google's non-existent whitelists... exist

Icon from the Tango Desktop Project

Google has been caught manipulating search results again, but at least they admitted what they were doing!

Google mostly do no evil :)

Along with IPv6 and cloud computing in their current forms, I've viewed most of my concerns with Google in light of potential government and law enforcement abuses. Whereas even more privacy and security obsessed people than me have been quick to dismiss Google as hypocritically evil, I've been willing to give them the benefit of the doubt.

Take the street view controversy. While I think Google engineers were short sighted by not closely studying the source code of the software they put on their trucks and drove around the world, I don't believe they did it maliciously. These signals were being broadcast in the open, and while the scale of Google's downloading may warrant further scrutiny, it skips the real issue that people are still broadcasting unencrypted data out of their homes for anyone to gain access to. Rather than blaming consumers (which is always an easy thing to do) however, I place the blame on network hardware manufacturers for selling devices that didn't make this clearer.

Basically, while I've taken issue with some of their policies (labelling Android as "open" and broadly copying the iOS interface, the Verizon net neutrality thing, Eric Schmidt's baffling statements about not having things to hide, pushing WebM and Flash, claiming redundancy is built into Gmail only to have all that redundancy fail), I still believe they're a force for good online and we benefit from their presence. Clearly, Bing does too ;).

Whoopsiedooddle

Unfortunately, it's come to light that Google has been misrepresenting aspects of their search algorithm's operation. After steadfastly and repeatedly denying in court that they use whitelists to inform their search results, both Google and Bing have admitted their existence. These so-called "exception lists" are claimed to consist of sites that Google and Bing engineers deem to have been false positives in their farmer site targeting algorithms.

As with all stories of this nature, how its being reported is almost as fascinating as the story itself. Barry Schwartz at SearchEngineLand catches out Google for denying whitelists are used in their anti farming efforts but otherwise sticks to reporting the facts.

Matt Cutts explained that there is no global whitelist but for some algorithms that have a negative impact on a site in Google’s search results, Google may make an exception for individual sites. But didn’t Matt tell us there is no whitelist for the Farmer update?

Cade Mets at The Register was less charitable and accused Google of lying to European Union anti-trust regulators.

The trouble is that Google spent years refusing to acknowledge that these manual interventions exist – and, in some cases, outright denying them. Google spent nearly a decade telling the world that its search engine was completely objective, and it has only recently begun to freely admit that this is not the case, presumably as a result of the EU investigation.

Ruben gets nervous talking to people

I've talked to people at university and my old work about this, and as usual the opinions are pretty evenly divided between "Google is guilty of denying the true nature of their operations and must be punished", to "Google's exception lists are a valid approach to dealing with false positives, these stories are just sensationalist".

Sensationalist sounds like it'd be a good thing. What a sensation! How sensational! YAYS! The English language is weird.

I'm of the opinion that while Google have been guilty of search result rigging in the past, we don't have enough evidence yet to suggest Google are abusing these exception lists. If they are as benign as they suggest, I'm sure Google shouldn't have any problem with providing us with further details as to their operation. If they remain obtuse, then we have severe cause for concern.


Bing copying Google, apologies to Jack Johnson

Photo by me during Jack Johnson's concert in Sydney in 2010

It's like pictures of people taking pictures of people... but with search engine results!

The backstory

So the gist of this latest high school-esque media circus is Google asserts Microsoft's MSN Search, wait no Windows Live Search, no wait Chanandler Bing search engine has been actively stealing their search results. As I understand it, in a highly publicised sting, Google engineers:

  1. developed a honeypot by creating a series of pseudo-random search queries, then linked them to search results that were entirely unrelated.

  2. installed the Bing Toolbar on a series of machines with Windows and Internet Explorer installed, and opted into the data gathering clause or whatever it was

  3. performed Google searches for those pseudo-random terms in Internet Explorer and clicked the entirely irrelevant results, knowing the Bing Toolbar was checking what they were doing

  4. In 9% of cases, those same results later appeared in Bing for those queries

I'm of two minds about this. Or two brain hemispheres about this. No wait, that would imply I'm using the creative and mathematical parts for different things. You know what, let's end this unfortunate metaphor before it starts. Mmm, honeypot.

Thar be copyin' going on

Public opinion over whether this constitutes stealing has been all over the place, but I'm of the opinion it is.

Bing and Google are indexing the same interwebs so its entirely possible (and one could argue very likely) that the two engines would index and return similar results, particularly for high traffic or valuable sites. The key is whether the engines return these similar results:

  1. by using their own independent algorithms

  2. by copying results

No matter how Microsoft apologists attempt to spin this ("the users opted into the toolbar and not all the results appeared lol!"), its pretty obvious Bing is performing both. Unique search results from Google are appearing in Bing. Copying is going on. End of story.

You'd think that would be the end of it, but no.

Google bro, you're being a tad hypocritical

Google has built their empire copying other people's data and wrapping it with ads, unless you specifically opt out with a robots.txt file or rel=nofollow or their web admin tools. Google News aggregates information from hundreds of news sources without compensation, Google Books is scanning thousands of books without regard to copyright law, however broken such laws are. Heck even their latest search user interface was a copy of Bing, which they were deservedly called out on. Whether these efforts have been positive or not is inconsequential: the bottom line is they copy.

Google's Eric Schmidt was on the board of directors at Apple during the development of the iPhone, and shortly after the iconic device's launch, Android was suddenly transformed from an imitation Blackberry into an iPhone clone, and was based on enough of Java to leech of its success without having to pay licensing fees to Sun/Oracle. To their credit at least they haven't charged handset manufacturers for the OS, otherwise they'd be repeating Microsoft's history.

Google is often credited with creating the idea of keyword based advertising, but this was developed at Overture, formerly GoTo. Overture was later purchased by Yahoo, and Google paid a substantial out of court settlement to allow them to continue to operate. It bears consideration when one remembers AdSense is overwhelmingly their primary source of revenue.

Google, Microsoft, a word please!

Ultimately, while all these are examples of copying which in light of their claims against Microsoft are hypocritical, they're not my primary concern.

Despite its cash cow search algorithm being more of a closed secret than the Kernel's secret herbs and spices (sorry, my attempt at a a bad pun), Google has always claimed to be "open" and facilitating universal access to human knowledge. Microsoft is absolutely shameless for claiming some of their search results aren't lifted from Google, but why is Microsoft selectively not allowed universal access? It's one thing for Google to claim Microsoft isn't be honest about where they're getting their data from, its quite another to tell them to cease entirely.

Google has every right to voice their frustration and encourage Microsoft to innovate rather than piggybacking on their own work, but they need to assure us that when they say they're open and universally accessible, that applies to everyone.

Free and open source software advocates are well aware that their software may be used in ways they don't like, but the trade off is worth it for getting applications in the hands of as many people as possible and to allow people to modify and redistribute them. If Google is all for doing no evil and is as open and free as they claim, they need to acknowledge this. I'm confident they can take the moral high ground again.

Finally, Microsoft, you guys need to stop beating about the bush and just admit that you're copying search terms and results from your competitor. You're not fooling anybody. Well okay you're clearly fooling some people, but I don't think IDG really counts.

I'll be sticking with Google for their searches, because if I use Bing I'll just be getting them anyway ;).


Rubenerd Show 273 2009.07.27

Larger version of cover artThe incompetent internets episode!

Moving lots of blogs into one blog; people like Zombie Plan and Neal from IntoYourHead going the opposite ways; working and living places after university; iTelephone cases and downloading; The Googles and I losing track of core competencies; too much self respect to use Chandler Bing and vacuous people getting angry at Twitter for losing spam followers!

Download MP3 to listen 19:18 8.9MiB

You can also view previous episodes, subscribe via iTunes or another client, stream this episode and view its Internet Archive page.