Let's Hook HAL Up To Wikipedia

from the skynet-is-active dept

Tue, Jan 9th 2007 10:52pm — Karl Bode

Giving artificial intelligence systems a database of factual wisdom to pull from while "thinking" has always been a problem, with some researchers feeding systems for decades in order to build functional knowledge repositories. A new project takes a shortcut, and uses Wikipedia as a foundation for AI system world knowledge -- in order to help the system "think smarter," and make common sense and broad-based connections between topics. While existing search systems and e-mail filters fake intelligence through statistical analysis of word frequencies, researchers in Israel are trying to build systems that can use vast pools of Wikipedia information to glean meaning and filter accordingly. We've discussed how cramming a system full of knowledge isn't always the answer -- instead what's important is the ability to parse out useful information from garbage. It will be the system's ability to do the latter that will obviously determine success. Early applications include intelligent spam filters, but the researchers behind the patent-pending project hint they're planning to market the system to the intelligence community as well. Parsing out meaning will be one thing -- sorting through pages of inane Wikipedia wars may require a few more decades of AI development.

Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community.

Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis.

While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.

–The Techdirt Team

44 Comments

If you liked this post, you may also be interested in...

Reader Comments

Subscribe: RSS

View by: Time | Thread

PhysicsGuy, 9 Jan 2007 @ 11:41pm

unless we're talking about specialized ai, then cramming the system full of knowledge IS the answer... :P
[ link to this | view in chronology ]
PhysicsGuy, 9 Jan 2007 @ 11:43pm

oh...
and i should add, only when implemented with the proper neural "emulation"...
[ link to this | view in chronology ]
Paul, 10 Jan 2007 @ 12:22am

But the real question is
Will Wikipedia dream?
[ link to this | view in chronology ]
- Dosquatch, 10 Jan 2007 @ 4:15am
  
  Re: But the real question is
  As best I can figure, Wikipedia dreams of eclectic *bleep*
  [ link to this | view in chronology ]
- fresh2, 10 Jan 2007 @ 9:02am
  
  Re: But the real question is
  I don't know.
  [ link to this | view in chronology ]
Fred, 10 Jan 2007 @ 1:27am

Garbage in, garbage out
An AI being with the worldview of wikipedia will be absolutely insane.
[ link to this | view in chronology ]
- Enrico Suarve, 10 Jan 2007 @ 3:23am
  
  Re: Garbage in, garbage out
  "Daisy, Daisy, give me your answer do. I'm half crazy all for the love of you. It won't be a stylish marriage, I can't afford a carriage. But you'll look sweet upon the seat of a bicycle built for two....."
  [ link to this | view in chronology ]
anyway, 10 Jan 2007 @ 3:45am

!!!
Wtf???!!!! this is foolish.. those prople are acting as if brains are getting instinct... this is not a solution at all...
[ link to this | view in chronology ]
- Anonymous Coward, 10 Jan 2007 @ 7:03am
  
  Re: !!!
  "as if brains are getting instinct... "
  
  Not sure what you're trying to say here?
  [ link to this | view in chronology ]
anonymous coward, 10 Jan 2007 @ 4:30am

ya look at me im the scapegoat
not to be a party-pooper, but I think this is actually a really good idea. An ai, if properly programmed and implemented, could become pretty damn smart hooked up to wikipedia 24-7... just a thought
[ link to this | view in chronology ]
JediN8, 10 Jan 2007 @ 5:25am

Well, I for one welcome our new Wikipedia overlords
[ link to this | view in chronology ]
- Jo Mamma, 10 Jan 2007 @ 10:13am
  
  Re:
  "Well, I for one welcome our new Wikipedia overlords"
  
  LOL! Sheesh, this article and responses have enough uber-nerd references to be a /. article!
  
  Anyway, I can't wait for true AI. I want to ask a question, and get a knowledgeable response, as though I was speaking to a human expert on the subject, ALA 'the butler' algorithm in Snow Crash or Aristotle in Sunstorm (speaking of uber-nerd references).
  
  Though I've never programmed anything AI related, I'd think that the amount of "knowledge" is immaterial. We not only have Wikipedia for that, we've got the entire 'net. The "neural net" (or whatever) and how it deals with the processing and organization of the data is what matters.
  [ link to this | view in chronology ]
Just Me, 10 Jan 2007 @ 5:29am

Re: Dosquatch
Lol
[ link to this | view in chronology ]
canoneos, 10 Jan 2007 @ 5:48am

Good morning Dave.
[ link to this | view in chronology ]
Overcast, 10 Jan 2007 @ 6:21am

Let's just hope the computer's AI is as dumb as those who are putting it together and plugging it up to a Wiki!

Wiki's an excellent idea, but it's all too often tainted by politics or other agendas
[ link to this | view in chronology ]
- JC, 10 Jan 2007 @ 12:15pm
  
  Re:
  Please. This makes perfect sense. Assuming the end goal is true intelligence, as in comparable to human intelligence, then AI needs to be able to distinguish the difference between bogus and truthful just as you and I do.
  
  The only way to teach is to let it find the truth for itself. A Wiki is perfect for this job.
  
  I think this is an innovative approach and whether it works or not in the long run it will likely result in a set of really great data which will put us one step closer to the end goal.
  
  I find it funny that those of you on this forum who fancy yourselves techies are so ignorant that you'd scoff at such an innovative approach to an extremely technical effort we have yet to accomplish.
  
  Who are you fools to piss on it? If you're all so damn smart, how come you haven't come up with a solution to creating AI?
  
  Seems to me most of you just like the sound of your own ignorance.
  [ link to this | view in chronology ]
  - misanthropic humanist, 10 Jan 2007 @ 2:11pm
    
    Re: Re:
    " Assuming the end goal is true intelligence, as in comparable to human intelligence, then AI needs to be able to distinguish the difference between bogus and truthful just as you and I do."
    
    It is not JC. There are many goals to AI, some of which are incompatible. One important goal of AI is to have *better than human* intelligence. Humans are notoriously poor at distinguishing truth from falsehood, for many good evloutionary and sociological reasons. If you wish to consider only absolute "truth" then study automatic theorem proving programs which compute statements of predicate calculus. That was all looking peachy until Russell and Godel came along and threw a spanner in the machine.
    
    "I think this is an innovative approach"
    
    Because you are unfamiliar with the history of AI research, particulary those projects of MIT and Stanford that already tried this on large textural data sets. It is possibly an improvement because it constrains the data set somewhat. A novel source of data input does not make an AI. At best it offers an interesting way to search the knowledge base within the Wiki.
    
    "If you're all so damn smart, how come you haven't come up with a solution to creating AI?
    
    You fail to understand the complexity of this problem domain.
    [ link to this | view in chronology ]
    - ANONYMOUS COWARDS, 10 Jan 2007 @ 3:12pm
      
      Re: Re: Re:
      A novel source of data input does not make an AI. At best it offers an interesting way to search the knowledge base within the Wiki.
      
      i'd guess the most interesting asset here is really the potential interface between these two inert data sets, the text and the linkages listed. that interface will find out what it really finds interesting by the way of enticing web3.0alpha users into, well, using it. for some that means actually building it. so the construction process is very HI driven, by a collective of teamworkers.
      
      even if we now say that little can come of it, little do we know about any novelty that takes over eye and mindshare every now and then. but to me the interface is really interesting.
      [ link to this | view in chronology ]
Adam, 10 Jan 2007 @ 6:21am

Hello Professor Falken, Would you like to play a game of chess?
[ link to this | view in chronology ]
brad, 10 Jan 2007 @ 6:52am

i tried coming up with a clever AI movie reference... couldnt do it... too slow... but i think its funny that the AI's mind will be at the mercy of internet trolls and ad bots. on the other hand...
[ link to this | view in chronology ]
Brad M, 10 Jan 2007 @ 7:01am

imagine the possibilities
I think this sounds great. Brains may not be going extinct but imagine how useful a person would be if they new 25% of the information on wikipedia. Now just imagine the possibilities of real AI.

The second piece to all of this is getting people comfortable with the idea that a computer is more intelligent then any person.

I don't think I'm ready to go see an AI robot doctor. When it comes down to it, I'll stick with real intelligence.
[ link to this | view in chronology ]
- Anonymous Coward, 10 Jan 2007 @ 7:08am
  
  Re: imagine the possibilities
  "I don't think I'm ready to go see an AI robot doctor. When it comes down to it, I'll stick with real intelligence."
  
  Thats rich... first you have to find a doctor to meet those credentials.
  Theres a reason they call it "Practicing Medecine"
  [ link to this | view in chronology ]
chris (profile), 10 Jan 2007 @ 7:31am

how about a robotic surgeon?
http://www.intuitivesurgical.com/products/davincissurgicalsystem/index.aspx

calm down, it's remote controlled by a human :-)
[ link to this | view in chronology ]
- Reed, 10 Jan 2007 @ 2:19pm
  
  Re: how about a robotic surgeon?
  Even if it is controlled by a human is still looks pretty scary.
  
  At the big Japanese robotic show they showed off a cheap robot that could feed people (Only around 3k in price).
  
  They also developed a robotic harbor seal that is used to provide social interaction in nursing homes. This particular robot has been widely successful so far.
  
  So how long until your nurse is a robot? Probably closer than we may think.
  
  Is our post-commercial world ready for what little labor that still exist to be replaced by robots?
  [ link to this | view in chronology ]
gdwntx, 10 Jan 2007 @ 7:43am

NO
I SAID I CAN NOT DO THAT DAVE!
[ link to this | view in chronology ]
gdwntx, 10 Jan 2007 @ 7:43am

NO
I SAID I CAN NOT DO THAT DAVE!
[ link to this | view in chronology ]
gdwntx, 10 Jan 2007 @ 7:44am

CYBORG
HUMAN CYBORG RELATIONS WILL SUFFER BECAUSE OF THIS
[ link to this | view in chronology ]
- gdwntx, 10 Jan 2007 @ 7:45am
  
  Re: CYBORG
  LONG LIVE THE BORG COLLECTIVE and me too
  [ link to this | view in chronology ]
  - gdwntx, 10 Jan 2007 @ 7:45am
    
    Re: Re: CYBORG
    In the end we must COMPLIE
    [ link to this | view in chronology ]
    - fresh2, 10 Jan 2007 @ 9:01am
      
      Re: Re: Re: CYBORG
      In the end we must COMPILE.
      [ link to this | view in chronology ]
    - ANONYMOUS COWARDS, 10 Jan 2007 @ 2:50pm
      
      Re: Re: Re: CYBORG
      yeah COMPLIE OR COMPILE OR COMPOST... we all go the way of data someday...
      [ link to this | view in chronology ]
pwdrspgsrkb, 10 Jan 2007 @ 8:03am

HAL connected to Wikipedia
But will it be IBM compatible?
[ link to this | view in chronology ]
- The infamous Joe, 10 Jan 2007 @ 11:44am
  
  Re: HAL connected to Wikipedia
  Go forward one letter in the term HAL and what do you get?
  
  Of course it will be compatible.
  [ link to this | view in chronology ]
Steven C., 10 Jan 2007 @ 8:07am

As Seen on TV!
I'm sorry Dave, but the elephants must be exterminated. My vast intelligence tells me so. There are far too many of them.
[ link to this | view in chronology ]
pwdrspgsrkb, 10 Jan 2007 @ 8:23am

The 9000 series is the most reliable computer ever made. No 9000 computer has ever made a mistake or distorted information. We are all, by any practical definition of the word, foolproof and incapable of error.
[ link to this | view in chronology ]
rstr5105, 10 Jan 2007 @ 8:37am

What is intelligence...?
AFAIK, the definition of "True" artificial intelligence is a machine that meets the following requirements:

1) It can fool a human that "talks" to it into thinking that they are talking to another human.

2) It "thinks" for itself, monitoring it's environment, and making "real-life" decisions about it. Not just, "If it's cold in the room, I'll turn on the heat" but, deeper than that, instead of just saying "It's cold" it's supposed to "wonder" why it's cold.

and finally the most important.

3) Self-awareness. The machine must be aware of it's existence, and like humans, question that.

Now, cramming a machine full of knowledge doesn't fulfill the requirement listed above, not only that, but I didn't see anything about these in the article above. So in essence all the isreali's are doing is building a "better" database search alogrithm(sp?), and a "better" search database. Not AI.

Sry guys. Better luck next time.
[ link to this | view in chronology ]
- misanthropic humanist, 10 Jan 2007 @ 10:16am
  
  Re: What is intelligence...?
  Those three points (taken from Searl btw) are not neccesary conditions of AI, they are categorical distinctions of AI types, which are, Turing test, symbolic reasoning and sentience.
  
  i) Turing test
  
  Turns out to extrememly easy (relative to the other two) to "pass" this. Humans are so vain and shallow that all you have to do in order to convince them that they are having an intelligent conversation is take their own words, change them a bit and feed them back to them. You can do this in a few hundred lines of code. Many people will happily chat with Eliza-like chat bots for ages before getting bored with their own reflection.
  
  ii) Symbolic reasoning
  
  This is the step that is currently challenging to us. Contrary to the TFA summary and some other comments, parsing is not the difficult step. The ability to deconstruct well written language and extract meaning and intention is quite possible, although borderline syntactic constructs are notoriously difficult, ie "The boat floated down the river sank" The difficult step is to represent this and perform useful operations akin to human-like reasoning upon it. Blocks-world type limited reasoning is possible - the domain must be kept very small. This is the domain of expert systems and knowledge based systems which have been around since the 1950s.
  
  iii) Sentience
  
  This is the big one. This is what most people understand by intelligence. There is not even any common sense definition of it let alone some hint of a plan to achieve it. Many hypothesise the key to this is deep recursion and self reference, others assert that consciousness is necessarily like schizophrenia and requires at least two computers each mirroring the other (as in left-right brain). Some think it merely a matter of critical complexity. Noone really knows and probably will not within our lifetimes.
  
  Good introductory texts to read (imho, - these were all second year AI texts I read in the 1980s) on this subject are Churchland, Searl, Minsky, Hoffstadter, Dennet. Another interesting book is Penrose, which deals specifically with sentience. I recommend Jackson on expert systems and Bishop on neural nets.
  
  It's worth noting that this comes from an Israeli research company. Many Israelis are notorious fantatsists and masters of talking up bullshit research projects to attract money. In recent years they have "invented" force-fields, time-machines, God knows what else. I always pronounce it "Isn't Really Research"
  
  So this is just a KBS using Wiki as input . Stanford did this back in the early 1990s with a smaller database of human knowledge and basically concluded the idea was dumb. It is more likely to fail with a larger database, not less because the problem is permuatative complexity. The only way to achieve any semblence of AI is to use neural topologies that collapse the dimensions of the data set into something managable, but these take a long time to train.
  [ link to this | view in chronology ]
Anonymous Coward, 10 Jan 2007 @ 9:41am

So we'll have an "Ask Wikipedia" metabase. I wonder if these researchers were planning to pay Wikipedia for the load their system will create?
[ link to this | view in chronology ]
Afro beast, 10 Jan 2007 @ 10:20am

done before
I think some type of thing very similar to this is already done with the encarta.... so it would be useless having another one with wikipedia!!

Plus..its not really AI ,,its just bullshit..as usual its gonna have tons and tons of bugs... so wts the big use here anyway???? Are humans losing their brains and they cant use "SEARCH" and "READ" anymore??!!

...And its also a big fat waste of money...
[ link to this | view in chronology ]
ANONYMOUS COWARDS, 10 Jan 2007 @ 4:20pm

hey karl when following your links i stumbled across that cyc project again and found this image that seems to tell me cyc thinks that bicycles are not self-powered vehicles. wow, thats weird. one word in chinese for bike is actually zi-xing-she = self-go-car... guess they'll change it in the export version...
[ link to this | view in chronology ]
PhysicsGuy, 10 Jan 2007 @ 4:30pm

Re: Re:
If you're all so damn smart, how come you haven't come up with a solution to creating AI?

actually, there are many, MANY forms of artificial intelligence out there. one of the most common is that within video games. sure, bots are a simple form of ai, but it's still ai none the less. the person who gave 3 defining characteristics to ai is a moron. ai is a broad term, and i think the big problem is that people don't understand how intelligence is defined.
[ link to this | view in chronology ]
|333173|3|_||3, 10 Jan 2007 @ 7:00pm

Wikipdeia foundation would pay good money for the crap filter that this AI would use, so it is not such a bad idea.
[ link to this | view in chronology ]
lopopopoppopop, 16 Jan 2007 @ 5:11am

iooo
goooopooooikhjhjghjk
[ link to this | view in chronology ]
lamna, 11 Aug 2007 @ 2:10pm

Hal can't lie but wikipedia has so many lies so hal will have to remove the faultys part of the system. Humans are the ones that create and recive lies so they must be removed.
[ link to this | view in chronology ]