The Napoleon Dynamite Problem Stymies Netflix Prize Competitors
from the love-it-or-hate-it dept
We've been covering the ongoing race to claim the $1 million Netflix Prize for a while now, highlighting some surprising and unique methods for attacking the problem. Every time we write about it, it appears that the lead teams have inched just slightly closer to that 10% improvement hurdle, but progress has certainly been slow. Clive Thompson's latest NY Times piece looks at the latest standings, noting that the issue now is "The Napoleon Dynamite problem."Apparently, the algorithms cooked up by various teams seems to work great for your typical mainstream movies, but where it runs into trouble is when it hits on quirky films, like Napoleon Dynamite or Lost in Translation or I Heart Huckabees, where people tend to have a rather strong and immediate love or hate reaction to those films, with very little in-between. No one seems quite sure what leads to such a strong polar reaction, and no algorithm can yet figure out how people will react to such films, which is where all of the various algorithms seem to run into a dead end.
Some folks believe that's just the nature of taste. It really can't just be programmed like an algorithm, but takes into account a variety of other factors: including what your friends think of something, or even if you happened to go see that movie with certain friends. Basically, there are external factors that could play into taste, that isn't necessarily indicated in the fact that you may have liked some other set of quirky movies, and therefore you must love Napoleon Dynamite. In some ways, it makes you wonder if we're all putting too much emphasis on an algorithmic approach to the issue, and if other recommendation systems, including what specific friends think of a movie might be more effective. Of course, Netflix is hedging its bets. It's been pushing social networking "friend recommendation" features for a while as well.
Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community.
Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis.
While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.
–The Techdirt Team
Filed Under: movies, napoleon dynamite, netflix prize, ranking, recommendation engine
Companies: netflix
Reader Comments
Subscribe: RSS
View by: Time | Thread
Taste is in the mind of the beholder
[ link to this | view in thread ]
[ link to this | view in thread ]
humans
[ link to this | view in thread ]
Re: humans
That being said, it is easily implied that such surprises are the rarity--q.e.d. people are predictable.
[ link to this | view in thread ]
Re: humans
[ link to this | view in thread ]
Re: Re: humans
[ link to this | view in thread ]
Approaching true noise
In this case it seems the contestants have reached that threshold.
[ link to this | view in thread ]
you track what people like what movies (with what percentage), then you tie those together and you have a tree-system that tracks distance of likability, how many people liked it, and the amount they liked it by. so for example:
most people who liked old yeller liked homeward bound.
a few people who liked old yeller liked airbud
most people who liked homeward bound liked Beethoven.
a few people who liked homeward bound liked Dunstan checks in
so it would suggest them in roughly this order:
homeward bound
Beethoven/airbud
Dunstan checks in
recurse until your algorithm would give a movie 50% or less probability, adjusting for more links where most movies you watch have the have common movies that people liked. the problem would be making it fast on anything other than a beast of a machine. but as they specify accuracy, not performance...
(feel free to poke holes in my plan, I really only thought about it for a few minutes before typing it up)
[ link to this | view in thread ]
Re: Re: Re: humans
[ link to this | view in thread ]
sense
No matter what aspect of the movie, its actors, or its characters that you look at you don't find anything compelling about it that would make more than a small subset of the population want to see it based on that alone. It was purely a word of mouth promotion.
That means most people that saw it did so ONLY because they heard it was really good. That means most either agreed and gave it five stars or were sorely disappointed and gave it one or two.
That differs from movies that appeal to and attract a large portion of the audience before word of mouth gets in the mix. These folks provide a lot of the middle ratings.
[ link to this | view in thread ]
can't predict my movies!
[ link to this | view in thread ]
Newton's got nothing on intuition
[ link to this | view in thread ]
[ link to this | view in thread ]
[ link to this | view in thread ]
[ link to this | view in thread ]
Re: Approaching true noise
[ link to this | view in thread ]
Re: Re: Approaching true noise
There is going to be a difference between actual rating and declared rating. It would be easier to predict actual rating (exactly what viewer thinks) than the declared ones. Some factors affecting declared (to be different from actual) are: alcohol, company, time.......
[ link to this | view in thread ]
based on personal experience...
until one of my teenage sons started quoting incessantly from that movie. Now I *HATE* it....
Idiots...
:-)
[ link to this | view in thread ]
Obvious solution
[ link to this | view in thread ]
Ratings by different family members
[ link to this | view in thread ]
Re: algorithm
The problem isn't one that's hard to solve to a first approximation. The hard part is improving significantly from a reasonable baseline algorithm.
[ link to this | view in thread ]
[ link to this | view in thread ]
[ link to this | view in thread ]