Free Republic
Browse · Search
News/Activism
Topics · Post Article

Skip to comments.

Microsoft Crawling Google Results For New Search Engine?
WebProNews ^ | 11.11.04

Posted on 11/11/2004 1:35:03 PM PST by mhking

Microsoft Crawling Google Results For New Search Engine?


Jason Dowdell | Contributing Writer

2004-11-11



I was questioned today by a developer who was watching a particular IP address scan his site. The IP was 65.54.188.86 and is registered to Microsoft Corp. located at One Microsoft Way, Redmond, Washington 98052. This visitor was not sending the normal header information associated with a crawler to the web server such as an http robot name or identifying info or even a browser name.

MSN Spiders
Is MSN Crawling Google?

Is Microsoft "using" Google's search results to populate their index? Discuss Microsoft's behavior at WebProWorld.

The behavior it demonstrated made it look like a crawler, especially since it was spidering urls that were no longer in existence (search engine spiders crawl site segments at regular intervals and often come back when an initial crawl left urls uncrawled) and doing so at the rate of 1 page every 3 - 5 seconds. The visitor started their visit at 7:37 am and was still on the site at 12:00 pm.

Correction, the data was there after all, here's the crawler info... msnbot/0.3 (+http://search.msn.com/msnbot.htm)

Here's the kicker

So now you're saying, so what, big deal. But this really is a big deal. It's a big deal not only because the urls this visitor was making requests to don't exist any longer but because the only place these urls can be found is in Google's search results using site:www.sitename.com. A similar query on MSN Search doesn't show the urls at all, even on the beta version of their new Microsoft search engine. But then within just hours of the visitors exit from the site the new same search at Microsoft's new search engine shows all of the urls in question being fully indexed within its results.



My Theory On This Mysterious Microsoft Crawler

The old msn required a fee to be crawled by its spider. But a few months back MSN dropped the fee and said they were going to begin crawling the entire web and doing it without charge. However, that's no easy task. So I believe MSN is using the results from Google and possibly even Yahoo to get all of the pages they've indexed on sites that have a relatively low page count in the current msn search engine.

First off, that's the fastest way to get the relevant pages from a web site. Sure they could just go to the site directly and start crawling but in doing so they're going to get tons of duplicate urls and urls that seem different but point to the same content. Crawling Google's results will eliminate the bandwidth to some extent but will not completely take care of the duplicate content issue their spider will encounter.

Secondly, crawling Google's results can act as a qualitative measure for their new search engine. By creating a baseline number of pages per site when the new Microsoft Search is launched and running a comparison on a regular interval for the next 6 months, they'll be able to determine internally if their engine is finding and indexing the same links and as many links as Google. Call it competitive analysis or whatever you want.

So Microsoft's Screen Scraping?

Obviously my conclusion should be taken as a grain of salt but it's a definite possibility. Microsoft very well could be screen scraping Google (or maybe even using their API, LOL) and crawling the urls it finds. It makes sense from a business case but I wonder if there are any legal issues there. I doubt it. It's like putting garbage out to the curb. Once it's out there it's fair game but I bet Google's lawyers would have more to say than that on the case.

Has anyone out there seen similar behavior on their own sites? Please comment with your qualitative/objective data if so.

Jason's article first appeared on his blog MarketingShift.com.


TOPICS: Business/Economy; Culture/Society; News/Current Events
KEYWORDS: google; internetexploiter; microsnot; underweartootight
Navigation: use the links below to view more comments.
first previous 1-2021-4041-6061-80 ... 181-200 next last
To: KwasiOwusu

Bill Gates is no Republican, but Microsoft gave no political contributions and had no Washington lobbiests until the government of Bill Clinton sued them.


21 posted on 11/11/2004 1:53:37 PM PST by js1138 (D*mn, I Missed!)
[ Post Reply | Private Reply | To 19 | View Replies]

To: mhking
Not likely. Besides, they still have a long way to go.

msn for "freerepublic" Results 1-15 of about 16,239 containing freerepublic
google for "freerepublic" Results 1-10 of about 2,970,000 for freerepublic

msn     Results 1-15 of about   2,142,700 containing "George W. Bush"
google Results 1-10 of about 12,700,000 for "George W. Bush"

22 posted on 11/11/2004 1:55:29 PM PST by rit
[ Post Reply | Private Reply | To 1 | View Replies]

To: KwasiOwusu

You're a bit new to be calling established members of this forum trolls.


23 posted on 11/11/2004 1:56:34 PM PST by Sofa King (MY rights are not subject to YOUR approval.)
[ Post Reply | Private Reply | To 19 | View Replies]

To: KwasiOwusu

Politics isn't EVERYTHING, you know. I eat Ben and Jerry's because I like the ice cream. I use Google because it's an awesome search engine.

Walter Mossberg is a failure as a person and a columnist. He has no more technological knowledge than my dad (which isn't considerably bad, but not quite enough to understand the cutting edge).

I'd be very, VERY surprised if Microsoft took down Google. Heck, I'd be very surprised if they gave them anything of a fight. M$ won the OS war because they were first and best to market. No matter what innovation they have in their new search engine (WinFS was pretty impressive, though - then it got pulled from Longhorn - blech), Google's already established in the market.

Google will win, and M$'s search engine will have disappeared in a year and folded within three, and you can quote me on that.


24 posted on 11/11/2004 1:56:44 PM PST by K1avg
[ Post Reply | Private Reply | To 19 | View Replies]

To: Revel
"Microsoft did not write the original Dos. They purchased at some rediculous price from someone else." True but did you know that when Bill Gates sold it to IBM he did not own it?
25 posted on 11/11/2004 1:57:18 PM PST by reagandemo (The battle is near are you ready for the sacrifice?)
[ Post Reply | Private Reply | To 8 | View Replies]

To: KwasiOwusu

And, oh yes, I'm willing to bet most universities across the country have "more and better Ph.D's" than FreeRepublic, but are you about to blindly say they are more politically apt than us?

Of course not. Argument defeated.


26 posted on 11/11/2004 1:58:33 PM PST by K1avg
[ Post Reply | Private Reply | To 19 | View Replies]

To: js1138
Gates contributed $2000 to the Bush-Cheney campaign.
The Google boys on the other hand gave massively to Hanoi John Kerry and the DNC.
Plus Gogle topped themselves by coming up with President Bush's name whenever anyone typed in the words "Pathetic failure" in Google search.
It must be noted that neither Yahoo nor msn search came up with such a result.
Did Gates back Bush because of the "good" outcome of the antitrust case?
Maybe.
Bottom line, Gates backed Bush, and the Google guys were Kerry supporters.
27 posted on 11/11/2004 2:00:39 PM PST by KwasiOwusu
[ Post Reply | Private Reply | To 21 | View Replies]

To: Sofa King
"You're a bit new to be calling established members of this forum trolls"


Its an anti-Microsoft piece based on very flimsy evidence.
We get this type of nonsense from the Old Media all the time when it comes to Microsoft.
This post was clearly trolling for Google.
No question about that.
I call it as it is.
28 posted on 11/11/2004 2:04:38 PM PST by KwasiOwusu
[ Post Reply | Private Reply | To 23 | View Replies]

To: K1avg

"I'd be very, VERY surprised if Microsoft took down Google. Heck, I'd be very surprised if they gave them anything of a fight. M$ won the OS war because they were first and best to market. No matter what innovation they have in their new search engine (WinFS was pretty impressive, though - then it got pulled from Longhorn - blech), Google's already established in the market."

Need I say more to respond to this than "Netscape"?


29 posted on 11/11/2004 2:06:06 PM PST by Frank L
[ Post Reply | Private Reply | To 24 | View Replies]

Comment #30 Removed by Moderator

To: K1avg
"And, oh yes, I'm willing to bet most universities across the country have "more and better Ph.D's" than FreeRepublic, but are you about to blindly say they are more politically apt than us? "

No
Because politics is not advanced mathematics.
To write a really great search engine needs some serious Math PHD's.
Its like being a brain surgeon. If you don't have the advanced training and qualifications in neurosurgery, you can't just get up and go open someone's head .
Your argument does not hold water.
31 posted on 11/11/2004 2:10:21 PM PST by KwasiOwusu
[ Post Reply | Private Reply | To 26 | View Replies]

To: Revel

In 1982 I saw a machine that had most of the components of Windoze 3.1

A "Lisa" by Apple/McIntosh


32 posted on 11/11/2004 2:11:07 PM PST by djf
[ Post Reply | Private Reply | To 8 | View Replies]

To: KwasiOwusu

Ah, then you missed the point.

The point I was making was: you do not need to have a Ph.D to understand the material, and having a Ph.D doesn't always mean you do.


33 posted on 11/11/2004 2:12:38 PM PST by K1avg
[ Post Reply | Private Reply | To 31 | View Replies]

To: jra
"Are you always like this, or is it PMS?
Take a midol and settle down, sister"

Have you tried following your own advice?
Seems you need it more than I do.
And while you are about it, lay off the booze, will you?
34 posted on 11/11/2004 2:12:47 PM PST by KwasiOwusu
[ Post Reply | Private Reply | To 30 | View Replies]

To: mhking

The best search tool is still the meta-search application Copernic. (www.copernic.com)

It comes bundled with other nifty tools, like a summarizer that will produce an accurate summary of any web page, a tracker tool to follow changes on specific websites, and an application to search your own files on your desktop, without uploading details of all your documents, like Google's desktop search tool requires you to do.

It costs $100 or so, but it is absolutely worth it.


35 posted on 11/11/2004 2:13:10 PM PST by LouD
[ Post Reply | Private Reply | To 1 | View Replies]

To: KwasiOwusu

mhking is a well-known poster around here. You, on the other hand, have been here for little over a month and are accusing him of being a troll because he dared suggest that Microsoft (which isn't exactly right-leaning company) may be doing something slightly underhanded. If I had to pick a troll here, it would be you.


36 posted on 11/11/2004 2:14:17 PM PST by Sofa King (MY rights are not subject to YOUR approval.)
[ Post Reply | Private Reply | To 28 | View Replies]

To: KwasiOwusu; jra

Personal insults get us nowhere around here.


37 posted on 11/11/2004 2:15:15 PM PST by K1avg
[ Post Reply | Private Reply | To 34 | View Replies]

To: K1avg
"The point I was making was: you do not need to have a Ph.D to understand the material, and having a Ph.D doesn't always mean you do."

To write serious search engine programming you do.
That's why Google is busy trying to harvest as many math PHD's as they can, and have been in a fight with Microsoft over the past year trying to get the best math PHD's into their company, straight from the universities.
38 posted on 11/11/2004 2:16:02 PM PST by KwasiOwusu
[ Post Reply | Private Reply | To 33 | View Replies]

To: KwasiOwusu
It would have been trolling if the original poster (mhking) had offered anti-Microsoft rhetoric, which he didn't.

He posts, you decide/discuss. Maybe you should back off the accusatory tone vs mhking, one of the most respected members of this forum.

39 posted on 11/11/2004 2:16:06 PM PST by xrp (Executing assigned posting duties flawlessly -- ZERO mistakes)
[ Post Reply | Private Reply | To 28 | View Replies]

To: KwasiOwusu; hellinahandcart; trussell; MEG33; MeekOneGOP; petuniasevan; Hillarys nightmare; ...
You're a moron.

mhking just posted the piece for debate, not because he wrote it.

Are you a shill for Gates, or just an anti-Google troll out to spread anti-Google FUD, sugah?

40 posted on 11/11/2004 2:16:44 PM PST by JoJo Gunn (More than two lawyers in any Country constitutes a terrorist organization. ©)
[ Post Reply | Private Reply | To 19 | View Replies]


Navigation: use the links below to view more comments.
first previous 1-2021-4041-6061-80 ... 181-200 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
News/Activism
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson