Thursday, February 24, 2011

Watson--more


"Watson’s a Kindle, humans are iPads"

February 22nd, 2011

REUTERS

I missed the first and last days of IBM Watson’s assault on humanity, played out innocently on a game show. But Tuesday’s edition of Jeopardy alone was as demoralizing for my human side as it was exhilarating for my android side.

Part of the fun is what the IBM Language Team came up with to make humans comfortable in Watson’s presence. The supercomputer had just a tad of inflection and a tone of voice that put one in mind of HAL 9000 before, well, you know. Watson mixed up the banter at least once with a “Let’s finish out … ,” instead of just naming the category and amount. Watson also had its frailty on display, giving the same wrong answer human competitor Ken Jennings had given just before — I have seen humans do this, so why not a supercomputer?

For many, though, Watson’s weakness wasn’t something with which to commiserate but a way to cling to a small hope that we weren’t sowing the seeds of our own destruction. As Wired put it on Twitter during day two’s massacre: “For those not watching @IBMWatson on Jeopardy, we won’t spoil it, but you might want to stock up on provisions. #skynet”

Watson was remarkable in many mundane ways. It showed the value of pure R&D. It continued an entrepreneurial tradition of showmanship to dramatize technology. It demonstrated, within limits, the kind of natural language processing power that Star Trek fans have always known is the future of computing.

One of those limits was that Watson doesn’t hear — it doesn’t respond to voice commands at all. This is just as well, since a voice interface isn’t ready for prime time. We humans are still required to adapt by providing the verbal equivalent of command-line instructions: particular words in a particular order. However, when I can say “I could really use a burrito” and trigger my car’s on-board computer to start tracking down menus, restaurants and phone numbers — or have it suggest a nice salad instead — then we’ll have something to talk about.
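
To make the “verbal equivalent of command-line instructions” concrete, here is a minimal, hypothetical Python sketch of the kind of rigid voice-command grammar the column is describing: only fixed keywords in a fixed order are recognized, while a free-form request like the burrito line falls through. The command list and phrasings are invented for illustration and do not belong to any real in-car system.

# Hypothetical sketch of a rigid, "command-line style" voice grammar:
# only fixed keywords in a fixed order are recognized. The commands and
# responses below are invented for illustration, not from any real system.

COMMANDS = {
    ("call", "home"): "dialing home...",
    ("navigate", "work"): "routing to work...",
    ("find", "restaurants"): "searching nearby restaurants...",
}

def rigid_parse(utterance: str) -> str:
    """Accept only exact two-word commands; anything else is rejected."""
    words = tuple(utterance.lower().split())
    return COMMANDS.get(words, "Sorry, I didn't understand that command.")

print(rigid_parse("find restaurants"))              # recognized
print(rigid_parse("I could really use a burrito"))  # free-form intent fails

The point of the sketch is simply that today’s interfaces match exact patterns; inferring the burrito-sized intent behind a casual sentence is the much harder problem the column is waiting on.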

Even if interpretation is perfect, voice input isn’t always preferable. It could be great in a passenger car and for advanced avionics or a hospital’s operating theater — places where acting as fast as you can think is important, or where you need your hands for other things. But it’s the last thing you’d want in an office’s cubicle city.

Which is why Watson’s achievements in bridging the enormous gap between semantic and programmatic language are far more significant than its ability to quickly produce a correct fact, or even sound like one of the guys (it has a male simulated voice, for some reason).

Watson was designed for this specific showdown: It was prepared for simple questions, and “knew” they were questions (well, “answers” in the inverted world of Jeopardy). But the questions themselves were not tailored to accommodate Watson. Rather, it was the other way around — witness the first Final Jeopardy answer debacle in which Watson’s question implied it thought Toronto was a U.S. city.

In the end, we have to ask ourselves what really was the wow factor. There are internet search engines which parse semantic language. The first to make this claim, Ask Jeeves, was launched way back in 1996. The latest is Wolfram Alpha. Even Google serves up relevant answers pretty darn quick.

So far it’s much simpler for a machine to process an inquiry in text form than it is to figure out how to get a machine to “hear” you and interpret our irrational and incomplete ramblings. Indeed, Watson was getting the Jeopardy answers in text form as the humans were hearing them.

For a studio-bound server farm, Watson’s Jeopardy prowess certainly unleashed all manner of man-versus-machine gallows humor. And the machines we make may someday run amok even if they don’t become self-aware, like runaway trains on steroids.

But even after Watson’s superb Jeopardy showing, I’m not worried. Machines that do one thing well — even better than any human — are nothing to be afraid of. Yet.

"Garry Kasparov on IBM's Watson"

by Garry Kasparov

February 22nd, 2011

The Atlantic

Unless IBM's Watson can do more than play Jeopardy!, Garry Kasparov sees it as little more than a complicated toy.

That's what the former world chess champion said when asked for his thoughts on last week's Jeopardy! contest between two champions, Ken Jennings and Brad Rutter, and IBM's new Jeopardy!-playing supercomputer. Kasparov reviewed the three-day contest and offered his initial thoughts exclusively to The Atlantic.

The true test of Watson's significance, Kasparov says, will be whether it can be translated "into something useful, something groundbreaking"—applied in a more meaningful way, beyond the game show.

In the annals of man vs. machine competition (the topic of this month's Atlantic cover story), Kasparov holds the most prominent of historic places. The Russian world chess champion defeated IBM supercomputer Deep Blue in 1996, then lost in a six-game rematch in 1997 that surprised many and revealed a nascent truth: In closed-system contests of raw data computation, computer technology had evolved an edge over the most talented and disciplined human minds. Kasparov accused IBM of cheating in the match and requested a rematch but was denied.

Find below Kasparov's initial take on Watson, offered via e-mail through an aide:

* A convincing victory under strict parameters, and if we stay within those limits, Watson can be seen as an incremental advance in how well machines understand human language. But if you put the questions from the show into Google, you also get good answers, even better ones if you simplify the questions. To me, this means Watson is doing a good job of breaking language down into points of data it can mine very quickly, and that it does it slightly better than Google does against the entire Internet.

* Much like how computers play chess, reducing the algorithm into "crunchable" elements can simulate the way humans do things in the result even though the computer's method is entirely different. If the result—the chess move, the Jeopardy answer—is all that matters, it's a success. If how the result is achieved matters more, I'm not so sure. For example, Deep Blue had no real impact on chess or science despite the hype surrounding its sporting achievement in defeating me. If Watson's skills can be translated into something useful, something groundbreaking, that is the test. If all it can do is beat humans on a game show, Watson is just a passing entertainment akin to the wind-up automata of the 18th century.

* My concern about its utility, and I read they would like it to answer medical questions, is that Watson's performance reminded me of chess computers. They play fantastically well in maybe 90% of positions, but there is a selection of positions they do not understand at all. Worse, by definition they do not understand what they do not understand and so cannot avoid them. A strong human Jeopardy! player, or a human doctor, may get the answer wrong, but he is unlikely to make a huge blunder or category error—at least not without being aware of his own doubts. We are also good at judging our own level of certainty. A computer can simulate this by an artificial confidence measurement, but I would not like to be the patient who discovers the medical equivalent of answering "Toronto" in the "US Cities" category, as Watson did.

* I would not like to downplay the Watson team's achievement, because clearly they did something most did not yet believe possible. And IBM can be lauded for these experiments. I would only like to wait and see if there is anything for Watson beyond Jeopardy!. These contests attract the popular imagination, but it is possible that by defining the goals so narrowly they are aiming too low and thereby limit the possibilities of their creations.
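
Kasparov's point about an "artificial confidence measurement" can be made concrete with a small sketch. Watson did, in fact, buzz in only when its confidence estimate cleared a threshold; the toy Python below shows the general idea of answering versus abstaining. The candidate answers, scores, and threshold are invented for illustration and are not Watson's actual values.

# Minimal, hypothetical sketch of answering only above a confidence
# threshold and abstaining otherwise. Candidates, scores, and the
# threshold are invented for illustration; they are not Watson's.

THRESHOLD = 0.70

def answer_or_abstain(candidates: dict) -> str:
    """Pick the highest-scoring candidate, but abstain below the threshold."""
    best, confidence = max(candidates.items(), key=lambda kv: kv[1])
    if confidence >= THRESHOLD:
        return f"What is {best}? (confidence {confidence:.2f})"
    return f"(no answer: top candidate {best!r} scored only {confidence:.2f})"

# Clear-cut clue: the system "buzzes in".
print(answer_or_abstain({"Chicago": 0.91, "Toronto": 0.05}))
# Ambiguous clue: a well-calibrated system keeps quiet rather than blurt "Toronto".
print(answer_or_abstain({"Toronto": 0.45, "Chicago": 0.40}))

The limitation Kasparov flags is exactly what such a sketch cannot fix: when the model misjudges its own confidence, the threshold offers no protection, which is how a "Toronto" can end up filed under U.S. cities.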

