Factorized Variational Autoencoders for Modeling Audience Reactions to Movies

Authors

Zhiwei Deng (Simon Fraser University)
Rajitha Navarantha (Disney Research Pittsburgh)
Peter Carr (Disney Research Pittsburgh)
Stephan Mandt (Disney Research Pittsburgh)
Yisong Yue (Caltech)
Iain Matthews (Disney Research Pittsburgh)
Greg Mori (Simon Fraser University)

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017

July 21, 2017

Matrix and tensor factorization methods are often used for finding underlying low-dimensional patterns from noisy data. In this paper, we study non-linear tensor factorization methods based on deep variational autoencoders. Our approach is well-suited for settings where the relationship between the latent representation to be learned and the raw data representation is highly complex. We apply our approach to a large dataset of facial expressions of movie-watching audiences (over 16 million faces). Our experiments show that compared to conventional linear factorization methods, our method achieves better reconstruction of the data, and further discovers interpretable latent factors.

Download File "Factorized Variational Autoencoders for Modeling Audience Reactions to Movies-Paper"
[pdf, 6.92 MB]

Additional files

Download additional file "Supplemental-Material – Factorized Variational Autoencoders for Modeling Audience Reactions to Movies"
[pdf, 1.00 MB]

Copyright Notice

The documents contained in these directories are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.