All posts by Paul Raven

Watson’s victory clear, but perhaps not as impressive as it seems

So, Watson won at Jeopardy!… by a pretty significant lead, too. Inevitably, lots of folk are keen to downplay this victory, and for a variety of reasons. Commonest complaint would have to be regarding Watson’s speed-to-buzzer advantage, but its minders designers say that it’s not really that big a deal:

Though Watson seemed to be running the round and beating Jennings and Rutter to the punch with its answers many times, Welty insisted that Watson had no particular advantage in terms of buzzer speed. Players can’t buzz in to give their questions until a light turns on after the answer is read, but Welty says that humans have the advantage of timing and rhythm.

“They’re not waiting for the light to come on,” Welty said; rather, the human players try to time their buzzer presses so that they’re coming in as close as possible to the light. Though Watson’s reaction times are faster than a human, Welty noted that Watson has to wait for the light. Dr. Adam Lally, another member of Watson’s team, noted that “Ken and Brad are really fast. They have to be.”

A re-run with some sort of handicap might prove this one way or the other, but I suspect the doubters will find new advantages to pin on the machine… which , to my mind, rather misses the point of the exercise, which was to demonstrate whether or not a machine could outperform humans at a particular task. Quod erat demonstrandum, y’know?

A more interesting point is that even Watson’s creators aren’t entirely sure how Watson achieves what it achieves. George Dvorsky:

Great quote from David Ferrucci, the Lead Researcher of IBM’s Watson Project:

“Watson absolutely surprises me. People say: ‘Why did it get that one wrong?’ I don’t know. ‘Why did it get that one right?’ I don’t know.”Essentially, the IBM team came up with a whole whack of fancy algorithms and shoved them into Watson. But they didn’t know how these formulas would work in concert with each other and result in emergent effects (i.e. computational cognitive complexity). The result is the seemingly intangible, and not always coherent, way in which Watson gets questions right—and the ways in which it gets questions wrong.

As Watson has revealed, when it errs it errs really badly.

This kind of freaks me out a little. When asking computers questions that we don’t know the answers to, we aren’t going to know beyond a shadow of a doubt when a system like Watson is right or wrong. Because we don’t know the answer ourselves, and because we don’t necessarily know how the computer got the answer, we are going to have to take a tremendous leap of faith that it got it right when the answer seems even remotely plausible.

Dvorsky’s underlying point here is that we shouldn’t be too cocky about our ability to ensure artificial intelligences think in the ways we want them to. They’re just as inscrutable as another human mind. Perhaps even more so… which is why he and Anders Sandberg (among others) believe we should foster a healthy fear of powerful AI systems.

But the most interesting point I’ve seen made about Watson’s victory is a skeptical stance over at Memesteading:

When Alex Trebek walked by the 10 racks of 9 servers each, said to include 2880 computing cores and 15 terabytes (15,000 gigabytes) of high-speed RAM main-memory, I couldn’t shake the feeling: this seems like too much hardware… at least if any of the software includes new breakthroughs of actual understanding. As parts of the show took on the character of an IBM infomercial, the feeling only grew.

[…]

An offline copy of all of Wikipedia’s articles, as of the last full data-dump, is about 6.5GB compressed, 30GB uncompressed – that’s 1/500th Watson’s RAM. Furthermore, chopping this data up for rapid access – such as creating an inverted index, and replacing named/linked entities with ordinal numbers – tends to result in even smaller representations. So with fast lookup and a modicum of understanding, one server, with 64GB of RAM, could be more than enough to contain everything a language-savvy agent would need to dominate at Jeopardy.

But what if you’re not language savvy, and only have brute-force text-lookup? We can simulate the kinds of answers even a naive text-search approach against a Wikipedia snapshot might produce, by performing site-specific queries on Google.

For many of the questions Watson got right, a naive Google query of the ‘en.wikipedia.org’ domain, using the key words in the clue, will return as the first result the exact Wikipedia article whose title is the correct answer.

[…]

With a full, inverse-indexed, cross-linked, de-duplicated version of Wikipedia all in RAM, even a single server, with a few cores, can run hundreds of iteratively-refined probe queries, and scan the full-text of articles for sentences that correlate with the clue, in the seconds it takes Trebek to read the clue.

That makes me think that if you gave a leaner, younger, hungrier team millions of dollars and years to mine the entire history of Jeopardy answers-and-questions for workable heuristics, they could match Watson’s performance with a tiny fraction of Watson’s hardware.

All of which isn’t to demean Watson’s achievement so much as to suggest that perhaps the same results could be reached with a much smaller hardware outlay… though there is an undercurrent of “Big Iron infomercial” in there, too.

A kraken, enraged

This Ars Technica rundown of the whole HBGary Federal vs. Anonymous/Wikileaks thing is really quite astonishing for a whole number of reasons, not least the staggering hubris and chutzpah of Aaron Barr, but there’s also the comparative ease with which Anonymous nailed Barr to his own mizzen. Maybe it’s just me, but the subtext I get from the whole business is that Barr’s desire to “take down” Anonymous stems from a sort of envy and admiration of them; funnier still are the communications between Barr and his pet programmer, who makes no bones about telling Barr he’s walking out onto very thin ice indeed.

Most astonishing of all (though hardly news in this day and age) is the staggering amount of money that shadowy and largely unaccountable outfits like can charge government agencies for work that neither party fully understands or – more importantly – wants the general public to know about. And as Chairman Bruce points out, there’s probably a whole lot more operations just like it that we never get to hear about:

The question now is, do people stumble over the truth here and just sort of dust themselves off and traipse away sideways — or are there more shoes to drop? The furious and deeply humiliated lawyers at HBGary ought to have enough federal clout to pursue their Anonymous harassers and nail them to the barn like corn-eating crows — after all, they claimed they know who they are, and that’s why they got savagely hacked in the first place.

However — are HBGary gonna be able to carry out that revenge attack with their usual discretion — the shadowy obscurity with which they help deny climate change and break labor unions for the Chamber of Commerce? It’s like watching a shark fight a school of ink-squirting squids.

Normally, one never sees a submarine struggle like this. If it does happen to surface, it gets cordially ignored, or ritually dismissed as a sea-monster story. But boy, this one sure is leaky.

Things are getting very permeable of late, aren’t they?

Dadadadadada

Old Duchamp would be proud, I like to think… though given the responses of other postmodern artists to similar events, I’m probably being overoptimistic on that point. Nonetheless, the future shows no sign of waiting for us to reach an accommodation with it, and you can now get yourself a fabbed facsimile of Marcel’s iconic “readymade” urinal museum piece [via BoingBoing].

Fabbed Duchamp urinal clone

As mentioned before, copyright on physical objects is a lost cause, though I doubt that’s going to stop a phalanx of windmill-tilting IP knights charging into battle as the terrain churns like liquid beneath the hooves of their horses, and the lawyers slip in to their vulture costumes off-stage.

And hey, 3D printers are getting pretty close to the point where they can print copies of themselves, too… so at least the futile carnage should be short lived.

Welcome to the walled garden. Would you like to hire a periscope?

While we’re on the subject of ebooks, publishing, digital content delivery and all that jazz… how’re you “the iPad is the future of publishing!” types feeling right now?

Yup: that lush easy-to-use interface makes purchasing easy, and easy purchases happen more often! Which means those 30% rake-offs should make House Cupertino’s share prices rocket still further!

And should your customer – heaven forbid! – want to take true ownership of their hardware, don’t be surprised if that cuts them off from the content they bought through sanctioned channels.

All that money you spent on setting up your new outlet in the glitzy new mall… it seemed like a great way of short-cutting around the economic problems out on the old high street, didn’t it?

You went to bed with the landlord, and he went and raised the rent anyway.

Sleep tight.

This is why trying to prevent book piracy is utterly futile

Remember last year, when Nancy Kress had a weird run-in with people who’d been pirating her books? Well, they got in touch again. Here are a few highlights from the conversation:

How about a nice science fiction story (or book) about how the Computer Virus Hackers come to your rescue in the 11th hour through the rise of automated Artilects, Artificial Intelligence that has a conscious awareness and want to save America from the Illuminati New World Order world domination.

[…]

We (VXers) are deadly serious about getting a positive message out about a solution to this NWO Illuminati, Masonic takeover of the USA and the rest of the world.

The rise of A-Life is upon us and we aim to steer it in the direction of saving humanity not destroying it for a handful of global elitists.

Regards – PZest (aka Paul Zest VX history and science philosopher)

[…]

No it’s not like Cory’s book, the difference between us (VXers) and hackers is that VX is about the machines not the individual hackers. Artificial Life is what rises from of the codes we think about. These machines will rise without the hand of man, but they come from the minds of men. This is the second genesis of life and the emergence of a conscious soul that will protect us ordinary humans in our desperate hour of need.

It could be the true rapture we have been waiting for, not the false rapture the Illuminati plan to inflict upon us with their Project Blue beam aircraft spraying the skies with radioactive Barium isotopes.

[…]

Real Artificial Life hasn’t come into existence yet by certain scientific *life* criteria but VXers will be amongst the early witness it when it does happen. A-Life will not come from an individual studiously coding his designs, A-Life will emerge from some complex system outside of our control (and probably our understanding). The one thing you can be sure of is that once things get going the emerging intelligence out of the second genesis of life will want to know what freedom and survival is, an not want to be bonded slaves under the control of evil industrial- military globalists.

The immediate question anyone’s going to come up with is “are these people serious, or are they just yanking Nancy’s (and everyone else’s) chain?” It’s an interesting question, but it misses the point… or rather, the real point is independent of the answer to that question. So I’m going to reiterate the important point again.

*clears throat*

Look at that. That is book piracy in action. That is what you’re up against. That is someone upon whom you’re attempting to use rational persuasion and logic in order to convince them that they shouldn’t copy your books and give them away for free.

Technological measures will only be seen by these people as an intellectual challenge..

Reasoned arguments will wither under the blowlamp of their own spurious logical framework.

You cannot defeat these people. Every attempt to do so will be seen as a validation of their efforts, not a discouragement.

Yes, this is the Robin Hood Complex in action. So stop playing Sheriff of Nottingham and focus on winning over the people who really matter: the people who might have bought your book if it were available to them conveniently at a price they consider reasonable.

Seriously, people, give it up; you’re tilting at windmills. No, it’s worse than that: you’re tilting at windmill tilters. You will never stop these people. Stop trying. Instead, spend the effort on connecting with your customers. That’s the only route out of this labyrinth, and listening to the dying croaks of the record labels will not help you at all, except as an indication of what not to do.

And cue complaints that I simply don’t understand the scale of the problem or that I’m “on the same side” as the pirates in 5… 4… 3…