Let's Hook HAL Up To Wikipedia

from the skynet-is-active dept

Giving artificial intelligence systems a database of factual wisdom to pull from while "thinking" has always been a problem, with some researchers feeding systems for decades in order to build functional knowledge repositories. A new project takes a shortcut, and uses Wikipedia as a foundation for AI system world knowledge -- in order to help the system "think smarter," and make common sense and broad-based connections between topics. While existing search systems and e-mail filters fake intelligence through statistical analysis of word frequencies, researchers in Israel are trying to build systems that can use vast pools of Wikipedia information to glean meaning and filter accordingly. We've discussed how cramming a system full of knowledge isn't always the answer -- instead what's important is the ability to parse out useful information from garbage. It will be the system's ability to do the latter that will obviously determine success. Early applications include intelligent spam filters, but the researchers behind the patent-pending project hint they're planning to market the system to the intelligence community as well. Parsing out meaning will be one thing -- sorting through pages of inane Wikipedia wars may require a few more decades of AI development.
Hide this

Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community.

Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis.

While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.

–The Techdirt Team


Reader Comments

Subscribe: RSS

View by: Time | Thread


  • identicon
    PhysicsGuy, 9 Jan 2007 @ 11:41pm

    unless we're talking about specialized ai, then cramming the system full of knowledge IS the answer... :P

    link to this | view in chronology ]

  • identicon
    PhysicsGuy, 9 Jan 2007 @ 11:43pm

    oh...

    and i should add, only when implemented with the proper neural "emulation"...

    link to this | view in chronology ]

  • identicon
    Paul, 10 Jan 2007 @ 12:22am

    But the real question is

    Will Wikipedia dream?

    link to this | view in chronology ]

  • identicon
    Fred, 10 Jan 2007 @ 1:27am

    Garbage in, garbage out

    An AI being with the worldview of wikipedia will be absolutely insane.

    link to this | view in chronology ]

    • identicon
      Enrico Suarve, 10 Jan 2007 @ 3:23am

      Re: Garbage in, garbage out

      "Daisy, Daisy, give me your answer do. I'm half crazy all for the love of you. It won't be a stylish marriage, I can't afford a carriage. But you'll look sweet upon the seat of a bicycle built for two....."

      link to this | view in chronology ]

  • identicon
    anyway, 10 Jan 2007 @ 3:45am

    !!!

    Wtf???!!!! this is foolish.. those prople are acting as if brains are getting instinct... this is not a solution at all...

    link to this | view in chronology ]

    • identicon
      Anonymous Coward, 10 Jan 2007 @ 7:03am

      Re: !!!

      "as if brains are getting instinct... "

      Not sure what you're trying to say here?

      link to this | view in chronology ]

  • identicon
    anonymous coward, 10 Jan 2007 @ 4:30am

    ya look at me im the scapegoat

    not to be a party-pooper, but I think this is actually a really good idea. An ai, if properly programmed and implemented, could become pretty damn smart hooked up to wikipedia 24-7... just a thought

    link to this | view in chronology ]

  • identicon
    JediN8, 10 Jan 2007 @ 5:25am

    Well, I for one welcome our new Wikipedia overlords

    link to this | view in chronology ]

    • identicon
      Jo Mamma, 10 Jan 2007 @ 10:13am

      Re:

      "Well, I for one welcome our new Wikipedia overlords"

      LOL! Sheesh, this article and responses have enough uber-nerd references to be a /. article!

      Anyway, I can't wait for true AI. I want to ask a question, and get a knowledgeable response, as though I was speaking to a human expert on the subject, ALA 'the butler' algorithm in Snow Crash or Aristotle in Sunstorm (speaking of uber-nerd references).

      Though I've never programmed anything AI related, I'd think that the amount of "knowledge" is immaterial. We not only have Wikipedia for that, we've got the entire 'net. The "neural net" (or whatever) and how it deals with the processing and organization of the data is what matters.

      link to this | view in chronology ]

  • identicon
    Just Me, 10 Jan 2007 @ 5:29am

    Re: Dosquatch

    Lol

    link to this | view in chronology ]

  • identicon
    canoneos, 10 Jan 2007 @ 5:48am

    Good morning Dave.

    link to this | view in chronology ]

  • identicon
    Overcast, 10 Jan 2007 @ 6:21am

    Let's just hope the computer's AI is as dumb as those who are putting it together and plugging it up to a Wiki!

    Wiki's an excellent idea, but it's all too often tainted by politics or other agendas

    link to this | view in chronology ]

    • identicon
      JC, 10 Jan 2007 @ 12:15pm

      Re:

      Please. This makes perfect sense. Assuming the end goal is true intelligence, as in comparable to human intelligence, then AI needs to be able to distinguish the difference between bogus and truthful just as you and I do.

      The only way to teach is to let it find the truth for itself. A Wiki is perfect for this job.

      I think this is an innovative approach and whether it works or not in the long run it will likely result in a set of really great data which will put us one step closer to the end goal.

      I find it funny that those of you on this forum who fancy yourselves techies are so ignorant that you'd scoff at such an innovative approach to an extremely technical effort we have yet to accomplish.

      Who are you fools to piss on it? If you're all so damn smart, how come you haven't come up with a solution to creating AI?

      Seems to me most of you just like the sound of your own ignorance.

      link to this | view in chronology ]

      • identicon
        misanthropic humanist, 10 Jan 2007 @ 2:11pm

        Re: Re:

        " Assuming the end goal is true intelligence, as in comparable to human intelligence, then AI needs to be able to distinguish the difference between bogus and truthful just as you and I do."

        It is not JC. There are many goals to AI, some of which are incompatible. One important goal of AI is to have *better than human* intelligence. Humans are notoriously poor at distinguishing truth from falsehood, for many good evloutionary and sociological reasons. If you wish to consider only absolute "truth" then study automatic theorem proving programs which compute statements of predicate calculus. That was all looking peachy until Russell and Godel came along and threw a spanner in the machine.

        "I think this is an innovative approach"

        Because you are unfamiliar with the history of AI research, particulary those projects of MIT and Stanford that already tried this on large textural data sets. It is possibly an improvement because it constrains the data set somewhat. A novel source of data input does not make an AI. At best it offers an interesting way to search the knowledge base within the Wiki.

        "If you're all so damn smart, how come you haven't come up with a solution to creating AI?

        You fail to understand the complexity of this problem domain.

        link to this | view in chronology ]

        • identicon
          ANONYMOUS COWARDS, 10 Jan 2007 @ 3:12pm

          Re: Re: Re:

          A novel source of data input does not make an AI. At best it offers an interesting way to search the knowledge base within the Wiki.


          i'd guess the most interesting asset here is really the potential interface between these two inert data sets, the text and the linkages listed. that interface will find out what it really finds interesting by the way of enticing web3.0alpha users into, well, using it. for some that means actually building it. so the construction process is very HI driven, by a collective of teamworkers.

          even if we now say that little can come of it, little do we know about any novelty that takes over eye and mindshare every now and then. but to me the interface is really interesting.

          link to this | view in chronology ]

  • identicon
    Adam, 10 Jan 2007 @ 6:21am

    Hello Professor Falken, Would you like to play a game of chess?

    link to this | view in chronology ]

  • identicon
    brad, 10 Jan 2007 @ 6:52am

    i tried coming up with a clever AI movie reference... couldnt do it... too slow... but i think its funny that the AI's mind will be at the mercy of internet trolls and ad bots. on the other hand...

    link to this | view in chronology ]

  • identicon
    Brad M, 10 Jan 2007 @ 7:01am

    imagine the possibilities

    I think this sounds great. Brains may not be going extinct but imagine how useful a person would be if they new 25% of the information on wikipedia. Now just imagine the possibilities of real AI.

    The second piece to all of this is getting people comfortable with the idea that a computer is more intelligent then any person.

    I don't think I'm ready to go see an AI robot doctor. When it comes down to it, I'll stick with real intelligence.

    link to this | view in chronology ]

    • identicon
      Anonymous Coward, 10 Jan 2007 @ 7:08am

      Re: imagine the possibilities

      "I don't think I'm ready to go see an AI robot doctor. When it comes down to it, I'll stick with real intelligence."

      Thats rich... first you have to find a doctor to meet those credentials.
      Theres a reason they call it "Practicing Medecine"

      link to this | view in chronology ]

  • icon
    chris (profile), 10 Jan 2007 @ 7:31am

    how about a robotic surgeon?

    link to this | view in chronology ]

    • identicon
      Reed, 10 Jan 2007 @ 2:19pm

      Re: how about a robotic surgeon?

      Even if it is controlled by a human is still looks pretty scary.

      At the big Japanese robotic show they showed off a cheap robot that could feed people (Only around 3k in price).

      They also developed a robotic harbor seal that is used to provide social interaction in nursing homes. This particular robot has been widely successful so far.

      So how long until your nurse is a robot? Probably closer than we may think.

      Is our post-commercial world ready for what little labor that still exist to be replaced by robots?

      link to this | view in chronology ]

  • identicon
    gdwntx, 10 Jan 2007 @ 7:43am

    NO

    I SAID I CAN NOT DO THAT DAVE!

    link to this | view in chronology ]

  • identicon
    gdwntx, 10 Jan 2007 @ 7:43am

    NO

    I SAID I CAN NOT DO THAT DAVE!

    link to this | view in chronology ]

  • identicon
    gdwntx, 10 Jan 2007 @ 7:44am

    CYBORG

    HUMAN CYBORG RELATIONS WILL SUFFER BECAUSE OF THIS

    link to this | view in chronology ]

  • identicon
    pwdrspgsrkb, 10 Jan 2007 @ 8:03am

    HAL connected to Wikipedia

    But will it be IBM compatible?

    link to this | view in chronology ]

    • identicon
      The infamous Joe, 10 Jan 2007 @ 11:44am

      Re: HAL connected to Wikipedia

      Go forward one letter in the term HAL and what do you get?

      Of course it will be compatible.

      link to this | view in chronology ]

  • identicon
    Steven C., 10 Jan 2007 @ 8:07am

    As Seen on TV!

    I'm sorry Dave, but the elephants must be exterminated. My vast intelligence tells me so. There are far too many of them.

    link to this | view in chronology ]

  • identicon
    pwdrspgsrkb, 10 Jan 2007 @ 8:23am

    The 9000 series is the most reliable computer ever made. No 9000 computer has ever made a mistake or distorted information. We are all, by any practical definition of the word, foolproof and incapable of error.

    link to this | view in chronology ]

  • identicon
    rstr5105, 10 Jan 2007 @ 8:37am

    What is intelligence...?

    AFAIK, the definition of "True" artificial intelligence is a machine that meets the following requirements:

    1) It can fool a human that "talks" to it into thinking that they are talking to another human.

    2) It "thinks" for itself, monitoring it's environment, and making "real-life" decisions about it. Not just, "If it's cold in the room, I'll turn on the heat" but, deeper than that, instead of just saying "It's cold" it's supposed to "wonder" why it's cold.

    and finally the most important.

    3) Self-awareness. The machine must be aware of it's existence, and like humans, question that.

    Now, cramming a machine full of knowledge doesn't fulfill the requirement listed above, not only that, but I didn't see anything about these in the article above. So in essence all the isreali's are doing is building a "better" database search alogrithm(sp?), and a "better" search database. Not AI.

    Sry guys. Better luck next time.

    link to this | view in chronology ]

    • identicon
      misanthropic humanist, 10 Jan 2007 @ 10:16am

      Re: What is intelligence...?

      Those three points (taken from Searl btw) are not neccesary conditions of AI, they are categorical distinctions of AI types, which are, Turing test, symbolic reasoning and sentience.

      i) Turing test

      Turns out to extrememly easy (relative to the other two) to "pass" this. Humans are so vain and shallow that all you have to do in order to convince them that they are having an intelligent conversation is take their own words, change them a bit and feed them back to them. You can do this in a few hundred lines of code. Many people will happily chat with Eliza-like chat bots for ages before getting bored with their own reflection.

      ii) Symbolic reasoning

      This is the step that is currently challenging to us. Contrary to the TFA summary and some other comments, parsing is not the difficult step. The ability to deconstruct well written language and extract meaning and intention is quite possible, although borderline syntactic constructs are notoriously difficult, ie "The boat floated down the river sank" The difficult step is to represent this and perform useful operations akin to human-like reasoning upon it. Blocks-world type limited reasoning is possible - the domain must be kept very small. This is the domain of expert systems and knowledge based systems which have been around since the 1950s.

      iii) Sentience

      This is the big one. This is what most people understand by intelligence. There is not even any common sense definition of it let alone some hint of a plan to achieve it. Many hypothesise the key to this is deep recursion and self reference, others assert that consciousness is necessarily like schizophrenia and requires at least two computers each mirroring the other (as in left-right brain). Some think it merely a matter of critical complexity. Noone really knows and probably will not within our lifetimes.

      Good introductory texts to read (imho, - these were all second year AI texts I read in the 1980s) on this subject are Churchland, Searl, Minsky, Hoffstadter, Dennet. Another interesting book is Penrose, which deals specifically with sentience. I recommend Jackson on expert systems and Bishop on neural nets.


      It's worth noting that this comes from an Israeli research company. Many Israelis are notorious fantatsists and masters of talking up bullshit research projects to attract money. In recent years they have "invented" force-fields, time-machines, God knows what else. I always pronounce it "Isn't Really Research"

      So this is just a KBS using Wiki as input . Stanford did this back in the early 1990s with a smaller database of human knowledge and basically concluded the idea was dumb. It is more likely to fail with a larger database, not less because the problem is permuatative complexity. The only way to achieve any semblence of AI is to use neural topologies that collapse the dimensions of the data set into something managable, but these take a long time to train.

      link to this | view in chronology ]

  • identicon
    Anonymous Coward, 10 Jan 2007 @ 9:41am

    So we'll have an "Ask Wikipedia" metabase. I wonder if these researchers were planning to pay Wikipedia for the load their system will create?

    link to this | view in chronology ]

  • identicon
    Afro beast, 10 Jan 2007 @ 10:20am

    done before

    I think some type of thing very similar to this is already done with the encarta.... so it would be useless having another one with wikipedia!!

    Plus..its not really AI ,,its just bullshit..as usual its gonna have tons and tons of bugs... so wts the big use here anyway???? Are humans losing their brains and they cant use "SEARCH" and "READ" anymore??!!

    ...And its also a big fat waste of money...

    link to this | view in chronology ]

  • identicon
    ANONYMOUS COWARDS, 10 Jan 2007 @ 4:20pm

    hey karl when following your links i stumbled across that cyc project again and found this image that seems to tell me cyc thinks that bicycles are not self-powered vehicles. wow, thats weird. one word in chinese for bike is actually zi-xing-she = self-go-car... guess they'll change it in the export version...

    link to this | view in chronology ]

  • identicon
    PhysicsGuy, 10 Jan 2007 @ 4:30pm

    Re: Re:

    If you're all so damn smart, how come you haven't come up with a solution to creating AI?

    actually, there are many, MANY forms of artificial intelligence out there. one of the most common is that within video games. sure, bots are a simple form of ai, but it's still ai none the less. the person who gave 3 defining characteristics to ai is a moron. ai is a broad term, and i think the big problem is that people don't understand how intelligence is defined.

    link to this | view in chronology ]

  • identicon
    |333173|3|_||3, 10 Jan 2007 @ 7:00pm

    Wikipdeia foundation would pay good money for the crap filter that this AI would use, so it is not such a bad idea.

    link to this | view in chronology ]

  • identicon
    lopopopoppopop, 16 Jan 2007 @ 5:11am

    iooo

    goooopooooikhjhjghjk

    link to this | view in chronology ]

  • identicon
    lamna, 11 Aug 2007 @ 2:10pm

    Hal can't lie but wikipedia has so many lies so hal will have to remove the faultys part of the system. Humans are the ones that create and recive lies so they must be removed.

    link to this | view in chronology ]


Follow Techdirt
Essential Reading
Techdirt Deals
Report this ad  |  Hide Techdirt ads
Techdirt Insider Discord

The latest chatter on the Techdirt Insider Discord channel...

Loading...
Recent Stories

This site, like most other sites on the web, uses cookies. For more information, see our privacy policy. Got it
Close

Email This

This feature is only available to registered users. Register or sign in to use it.