Powerset Licenses Xerox PARC Natural Language Tech


Both VentureBeat and the New York Times are out today with an update on Powerset, which has won the rights to natural language search technology from Xerox PARC that it hopes will make it into the new Google, when Powerset finally launches. VentureBeat also gives us news of at least one person from Yahoo now working there – Tim Converse — who left Yahoo in December. I know others from Yahoo are also at Powerset, as well.

Last time we had news about Powerset, I did a big long rant – Hello Natural Language Search, My Old Over-Hyped Search Friend — about the hype and history often associated with natural language search. VentureBeat mentions that rant, as does TechCrunch today in its post, Powerset Hype To Boiling Point. So I felt I should give an update.

Unlike Matt Marshall over at VentureBeat, I’ve yet to see Powerset (right now, you can only see it within the Powerset offices, and I’m in England, not Silicon Valley). I have been in touch with CEO Barney Pell, and I’m looking forward to using a working demo in the future.

I also have to stress that not having seen the tool firsthand, it could be that I’ll be blown away and as impressed as some others are. Then again, right now I still feel that it’s more hype than actual Google threat.

Technically, I’m "off" from writing on Fridays. So for now, I’ll just share what I sent over to both VentureBeat and Barney last night:

Frankly, I can use Hakia right now and have been very impressed with much of what they deliver, though not all of that is due to the natural language stuff. I have an article I’m working up on it to talk about how they are better for some queries, in some respects.

I knew about some of the Yahoo people going over, including Tim. Interesting, certainly gives more to the Powerset story to consider, but I’m still firmly in the prove it category by actually rolling it out.

who acquired ibm
[a search cited as Powerset doing better than Google in Matt's story]

OK, I see Surfaid, Lenovo listed, so that’s two "IBM" things acquired. I’m not sure who searches for that [particular phrase] to find out who acquired particular IBM units. That type of query to me feels like something you were suggested to try, and if so, then usually it’s suggested because it’s known that query will produce something good. Go search for travel, cars and casino and compare those to Google and see if you’re still impressed. Those are real searches.

Also, what’s the size of the index Powerset is searching against? Barney?

Google has in excess of 10 billion documents from across the entire web, and that’s a low estimate. If Powerset is hitting fewer documents — and especially a subset of say business documents — just that filtering along might give you better results.

So later, when you ask Powerset: "What do liberal democrats say about healthcare policy?" And that sounds really compelling. But guess what? On many pages that say liberal democrats, you’ll also find Hillary Clinton also mentioned.

Maybe that special document I needed was in the 500 million that Google’s not picking up, because it’s not making the association. Chances are, it’s not.

"Powerset showed it can answer more complex questions, such as “Who did IBM acquire in 1996?�? Here, Google completely breaks down
[quote from the story]

Breaks down because what did Powerset show? I mean, a screenshot? A list of what was there?

FYI, if I do these:

Both bring me to Wikipedia first [via Google] which answers the question in short order and is anything other than a failure.

Now ask yourself. If you’re looking for an IBM acquisition in 1996, don’t you think you’re likely to wonder about a particular one? Or if you’re after things they’ve acquired, are you that likely to put in a particular year?

Sure, some refinement suggestions would help. But you’re picking out one query and saying Google breaks down when that might not even be a query someone would do.

Clearly, Powerset faces challenges. Even if its technology does prove to be useful, it isn’t clear how long it will keep its lead in the face of an onslaught from Google. Another challenge is changing peoples’ search behavior, which is used to keyword searches.
[quote from the story]

What lead? With respect, it doesn’t have any lead. It has some experts, a product pitch and a product a few can play with and only internally. When it launches, then it has a serious challenge to take on even Ask, much less Google. I’m not saying it can’t be done — but wow, you’ve already got it winning while it hasn’t even entered the race. Not only does it have to attract traffic, but it needs to prove it can scale to handle that traffic and the processing. Will it stand up to doing all this natural language processing (which often will be unnecessary) in the face of a lot of traffic.

NOTE: Matt also replied to me:

By lead, I actually meant lead in natural language search. Clearly they don’t have a lead in search. (though, even in natural language, I suppose you could say that is speculation, given that [Google] tells me they’re got people working on this).



Danny Sullivan is editor-in-chief of Search Engine Land. He’s a widely cited authority on search engines and search marketing issues who has covered the space since 1996. Danny also oversees Search Engine Land’s SMX: Search Marketing Expo conference series, maintains a personal blog called Daggle and microblogs on Twitter as @dannysullivan.

See more articles by Danny Sullivan >


Share, Bookmark & Discuss This Article
More:


Keep Updated: News Via Email | News Via RSS Feed | News Via Twitter


See more stories like this in the Members Library! Check out the Business Issues: General, Search Engines: Hakia, Search Engines: Other Search Engines, Search Features: Natural Language sections of the Members Library where this story is filed. Members also get access to exclusive video content, a members-only weekly & monthly newsletter, plus more. Check out all the benefits!

ONE COMMENT ON Powerset Licenses Xerox PARC Natural Language Tech

Steve Morsa,

As Don Dodge explains in his excellent post this morning, if you want to beat Google, you’ve got to C.H.A.N.G.E. T.H.E. G.A.M.E.

So let’s do just that:

As promising as their natural language platform sounds; and it does; the greatest threat to Google’s growing hegemony in the search/paid search arenas…given that about 1/2 of all searches are known to be for products and services…may actually spring from patent pending (#11/250,908) paid match, which will target people’s actual demographic and psychographic traits and characteristics (keytraits) instead of just the words we all type into little search boxes.

Though paid match is not yet an operating system, our own US Dept of Labor does run a very popular service (over 500,000 users/month) which provides an enlightening and instructive peak at the potential that such a paid match search/ad platform possesses.

Called GovBenefits (available at govbenefits.gov), it utilizes a personal profile and a match engine to determine what government benefit programs people qualify for.

Were such a system populated with the 100’s of thousands to millions of products and services companies provide nation/worldwide instead of just the 400-odd government programs it includes now, one can only imagine what its public popularity would be…

…and with the world’s advertisers having the ability to pinpoint target and control; via bidding directly on those keytraits most relevant and applicable to their products and services, exactly who sees their ads (goodbye click fraud); one can also only imagine the deleterious effects that such an elegant and superior system/platform would have on a 95% PPC income dependent company like Google…




RECENT COMMENTS

  • Chris Silver Smith said " Wow - my coincidental timing couldn't have been better, what with Rupert Murdoch getting quoted this"
  • George Michie said " Clearly they're promoting the product. Why wouldn't they? The only odd piece of the equation is that"
  • LinkLift said " Eric, this seems a bit too optimistic. From our experience, the losses for simultaneous domain and s"

See All »


FREE DAILY SEARCH NEWS RECAP!

Stay on top of all the search news with our daily summary, the SearchCap newsletter. View a sample ›

STAY CURRENT THROUGHOUT THE DAY

RSS Feeds

The Search Engine Land feed keeps you informed as news happens. SEE ALL FEEDS »

Upcoming Search Engine Land Conferences

Advertise With Us »

Search Engine Land produces SMX, the Search Marketing Expo conference series. SMX events deliver the most comprehensive educational and networking experiences - whether you're just starting in search marketing or you're a seasoned expert.


SMX Web Site » | SMX Difference » | SMX News »


Join us at an upcoming SMX event:

Search Marketing Now Learn more about search marketing with our free online webcasts and webinars from our sister site, Search Marketing Now. Upcoming online events include:


See more webcast topics »

TRACK US SOCIALLY
Upcoming Search Engine Land Conferences

Get Your Search Engine Land
Premium Membership!

Become a premium member today and receive:

  • Express commenting privileges & photo.
  • Exclusive videos & newsletters.
  • Discounts to our SMX conferences.
  • Access to "How To" & Other Archives.

Learn More

Upcoming Search Engine Land Conferences
Add to GoogleAdd to My Yahoo!Add to BloglinesAdd to NetvibesAdd to Windows Live