« Good guys in sports need a dose of reality | Main | Reality check on the long tail »


Feed You can follow this conversation by subscribing to the comment feed for this post.

Max Lin

Can the user-seen pair setup that does not suffer from missing data problem be treated as a special case of the user-rateing pair, but with seen movies with rating 1 and unseen movies with with rating 0?


Max: In terms of running the algorithm, you can do as you said. What you'll notice is that the matrix is now complete as opposed to very sparse in the case of ratings.


But doesn't the user-seen pair also suffer from not distinguishing between user - 'not aware movie existed' and user - 'chose not to watch it'


Chris: Awareness is a different concept. The proposed model is based on actual watching; aware but don't watch is grouped with not aware. You can of course build a more complicated model if you so desire.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.


Post a comment

Your Information

(Name is required. Email address will not be displayed with the comment.)


Link to Principal Analytics Prep

See our curriculum, instructors. Apply.
Business analytics and data visualization expert. Author and Speaker. Founder of Principal Analytics Prep, MS Applied Analytics at Columbia. See my full bio.

Next Events

May: 2 New York Marketing Association Big Data Workshop, NYC

May: 5 NYPL Analytics Careers Talk, NYC

May: 8 Data Visualization Seminar, Denver, CO

May: 15 Data Visualization Seminar, Cambridge, MA

May: 17 Data Visualization Seminar, Philadelphia, PA

May: 22 Data Visualization Seminar, San Ramon, CA

Past Events

See here

Future Courses (New York)

Summer: Statistical Reasoning & Numbersense, Principal Analytics Prep (4 weeks)

Summer: Applied Analytics Frameworks & Methods, Columbia (6 weeks)

Junk Charts Blog

Link to junkcharts

Graphics design by Amanda Lee


  • only in Big Data