The Anti-Turing Test

from the captchas dept

Tue, Dec 10th 2002 12:57am — Mike Masnick

A few months ago someone sent me the following which I found to be very cool: "... randomising letters in the middle of words [has] little or no effect on the ability of skilled readers to understand the text. This is easy to denmtrasote. In a pubiltacion of New Scnieitst you could ramdinose all the letetrs, keipeng the first two and last two the same, and reibadailty would hadrly be aftcfeed. My ansaylis did not come to much beucase the thoery at the time was for shape and senqeuce retigcionon. Saberi's work sugsegts we may have some pofrweul palrlael prsooscers at work. The resaon for this is suerly that idnetiyfing coentnt by paarllel prseocsing speeds up regnicoiton. We only need the first and last two letetrs to spot chganes in meniang." I wish I had a real source for it, but all I get on a Google search is other sites posting the same quote. Anyway, I was just reminded of that when reading this NY Times article about the idea of "Captchas", which are tricks to make sure someone filling out a web-form or registration page is really a human, and not a bot. In other words, it's a sort of anti-Turing test. I would think that a system using plenty of misspelled words like the above paragraph could easily fool a computer, but is understandable by humans, and could make a good captcha.

Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community.

Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis.

While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.

–The Techdirt Team

15 Comments

If you liked this post, you may also be interested in...

Reader Comments

Subscribe: RSS

View by: Time | Thread

Bob Bechtel, 10 Dec 2002 @ 5:16am

Randomize vs. swap?
How (if at all) does this finding interact with what is known of dyslexia?
[ link to this | view in chronology ]
- Karthik, 10 Apr 2003 @ 12:15pm
  
  Re: Randomize vs. swap?
  Don't you mean Slysdexic? :-p
  [ link to this | view in chronology ]
- Anonymous Coward, 10 Apr 2003 @ 3:20pm
  
  Re: Randomize vs. swap?
  I'm a reformed dyslexic. That is, I was taught strategies for dealing with it when I was very young. I've long since internalized them and can function as well as a non-dyslexic. Usually that is; if I'm emotionally agitated I start having trouble with spacial relationships..."right", "left", "inside", "outside" are very much front brain concepts to me. Get me upset and I can forget until I calm down.
  
  That said, I can say I had no trouble whatsover reading the mixed up paragraph. At least in my case, dyslexia has no effect on my ability to do such things.
  [ link to this | view in chronology ]
- michael rosenbaum, 10 Apr 2003 @ 9:10pm
  
  Re: Randomize vs. swap?
  most dyslexia occurs in english speaking countries, according to http://www.unifon.org/
  or more precisely http://66.41.60.21/research-reading-orthography.html
  
  english has so many spelling variations for individual phonemes, it makes sense we can understand this story. i wonder if readers of languages with fewer spelling variations can do this trick.
  [ link to this | view in chronology ]
2Lazy2Register, 10 Dec 2002 @ 6:22am

This was easy to read...
... but then again, I read Slashdot a lot where spelling correctly only confuses matters.
[ link to this | view in chronology ]
Anonymous Coward, 10 Dec 2002 @ 12:54pm

I fail to see
where this would be of any use in defeating a robot.

If my robot is matching "Address" now, how hard would it be to change that to match /A[dres]{5,5}s/?

Not very.
[ link to this | view in chronology ]
- Anonymous Coward, 10 Apr 2003 @ 1:39pm
  
  Re: I fail to see
  Because your robot has to know which variation to use by context (something beyond the scope of regex). And your pattern is too strict, you need to also have it accomodate missing letters (not mentioned here but another important challenge) like adress or addrss. It also needs to understand extra letters (in combination with missing letters) like addresse and adresse.
  [ link to this | view in chronology ]
  - James, 11 Apr 2003 @ 6:30am
    
    Re: I fail to see
    Actually, that isn't so hard either. Perl has a fuzzy string matching function somewhere - I remember using it. Knowing that generally, the first and last two letters will be the same, I think you could translate most of the words back into English.
    Of course, whether this helps to spot the difference between a computer and a person, like the captchas, depends on how you use it. If it's simply a matter of repeating the muddled word in English, then it's easy. If it's interpreting a sentence like "It's Friday today, and this weekend I'm having a party. Would you expect me to be happy?", or "What colour is grass?" or "My foot itches. Should I scratch it, slap it, or paint it blue?", then it's a problem. Of course, that would be a problem anyway.
    What was my point again?
    [ link to this | view in chronology ]
Timmmay!, 10 Dec 2002 @ 1:02pm

Not a suprise
Part of learning languages is learning the N-gram statistics (combinations of N letters -- usually 2 / 3 that are used in the language). For example, ea is a lot more common in english than ae. Your brain uses that to "repair" mixed up text. If you were not a fluent english speaker then you would have a great deal of difficulty doing this.

A computer can easily compensate for this by using a dictionary and N-gram statistics to correct text.
[ link to this | view in chronology ]
- Noam, 11 Apr 2003 @ 8:43am
  
  Re: Not a suprise
  In cases like these, syntax and semantic context are probably at least as important as word-by-word analysis. Linguistic research has shown that people tend to anticipate later words or grammatical structures as they read earlier words in a passage, and that set of expectations, produced by such on-the-fly progressive analysis of a sentence, speeds up our processing time. Familiarity with a language, with the lexicon and with the syntactical conventions will be useful in all instances.
  
  A related scenario comes up when letters, instead of being transposed, are substituted for the wrong letters or symbols. This tends to happen when Americans in France, using a French keyboard, write to me in English. Because the locations of the keys are transposed (QWERTY is not used there), I end up getting things like: "Deqr Noq,; It zqs reqlly greqt tqlking to you..." (This is a mild example.) If found this type of substitution very easy to pick up in real time, partly on the basis of context and partly because many key word-initial or word-final letters were not changed.
  [ link to this | view in chronology ]
Zak McKracken, 10 Dec 2002 @ 8:33pm

No Subject Given
Isn't this just a factor of dealing with twits^H^H^H^H^H users who can't be bothered learning how to either correct their spelling or just plain type accurately?<br><br>I know that I'll make the odd type-o that I don't pick up, but heck - its really not that hard to spend a little time proof reading?
[ link to this | view in chronology ]
- Anonymous Coward, 11 Apr 2003 @ 5:24am
  
  Re: No Subject Given
  That "its" was deliberate, I take it.
  [ link to this | view in chronology ]
Anonymous Coward, 11 Apr 2003 @ 9:36am

what a crock
yes, there will always be tricks to weed the X's from the Y's and the X's and Y's will continue to change and new tricks will pop up. Calling this an "anti-turing test" is glorifying stupid hacked up tests to tell things apart. if you think intelligence can be tested with some tricks then i feel sorry for you.
[ link to this | view in chronology ]
Jan, 23 Jul 2003 @ 7:42pm

Quote source
Your "reibadailty" quote is a letter to the New Scientist but I don't know what date. We have a copy of it on our staffroom wall -

"You report that reversing 50-millisecond segments of recorded sound does not greatly affect listeners' ability to understand speech (In Brief, 1 May, p27).
This reminds me of my PhD at Nottingham University (1976), which showed that randomising letters ..." etc.

Hope this helps.
[ link to this | view in chronology ]
huayangao, 28 Dec 2007 @ 3:50pm

Turing Test Two
... In Turing Test Two, two players A and B are again being questioned by a human interrogator C. Before A gave out his answer (labeled as aa) to a question, he would also be required to guess how the other player B will answer the same question and this guess is labeled as ab. Similarly B will give her answer (labeled as bb) and her guess of A's answer, ba. The answers aa and ba will be grouped together as group a and similarly bb and ab will be grouped together as group b. The interrogator will be given first the answers as two separate groups and with only the group label (a and b) and without the individual labels (aa, ab, ba and bb). If C cannot tell correctly which of the aa and ba is from player A and which is from player B, B will get a score of one. If C cannot tell which of the bb and ab is from player B and which is from player A, A will get a score of one. All answers (with the individual labels) are then made available to all parties (A, B and C) and then the game continues. At the end of the game, the player who scored more is considered had won the game and is more "intelligent". ... http://turing-test-two.com/ttt/TTT.pdf
[ link to this | view in chronology ]

Gateway Makes Store PCs Work Overtime

Capellas Has A $20 Million Month

Follow Techdirt

Essential Reading

The Techdirt Greenhouse

Read the latest posts:

read all »

Techdirt Deals

Report this ad | Hide Techdirt ads

Techdirt Insider Discord

The latest chatter on the Techdirt Insider Discord channel...

The Anti-Turing Test

from the captchas dept

Reader Comments

Randomize vs. swap?

Re: Randomize vs. swap?

Re: Randomize vs. swap?

Re: Randomize vs. swap?

This was easy to read...

I fail to see

Re: I fail to see

Re: I fail to see

Not a suprise

Re: Not a suprise

No Subject Given

Re: No Subject Given

what a crock

Quote source

Turing Test Two

The Techdirt Greenhouse

Friday

More

Tools & Services

Company

Contact

More

Friday
19:39	Important Announcement: Techdirt Is Migrating To A New Platform (0)

The Anti-Turing Test

from the captchas dept

Reader Comments

Randomize vs. swap?

Re: Randomize vs. swap?

Re: Randomize vs. swap?

Re: Randomize vs. swap?

This was easy to read...

I fail to see

Re: I fail to see

Re: I fail to see

Not a suprise

Re: Not a suprise

No Subject Given

Re: No Subject Given

what a crock

Quote source

Turing Test Two

Techdirt Daily Newsletter

The Techdirt Greenhouse

Friday

More

Tools & Services

Company

Contact

More