(archive site)

Video Content Analysis Publications


AURORA (An ALADDIN Project)

Start Here

  • Benjamín Elizalde, Gerald Friedland, Howard Lei, and Ajay Divakaran. 2012. There Is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media. In Proceedings of the ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012 (MM’12), Nara, Japan, October 2012, pp. 27-32. [PDF]
  • Benjamín Elizalde, Mirco Ravanelli, and Gerald Friedland. 2013. Audio Concept Ranking for Video Event Detection on User-Generated Content. In Proceedings of the InterSpeech First Workshop on Speech, Language and Audio in Multimedia (SLAM ’13), Marseille, France, August 2013. [PDF]
  • Benjamín Elizalde, Howard Lei, and Gerald Friedland. 2013. An i-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content. In Proceedings of the IEEE International Symposium on Multimedia (ISM 2013), Anaheim, California, December 2013, pp. 114-117. [PDF]

More Publications

  • Benjamín Elizalde and Gerald Friedland. 2013. Lost in Segmentation: Three Approaches for Speech/Non-Speech Detection in Consumer-Produced Videos. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2013), San Jose, California, July 2013. [PDF]
  • Benjamín Elizalde and Gerald Friedland. 2013. Taming the Wild: Acoustic Segmentation in Consumer‐Produced Videos. ICSI Technical Report TR-12-016. Berkeley, CA: International Computer Science Institute. [PDF]
  • Benjamín Elizalde, Gerald Friedland, and Karl Ni. 2013. What You Hear Is What You Get: Audio-Based Video Content Analysis. In Proceedings of the Bay Area Machine Learning Symposium 2013 (BayLearn), Menlo Park, California, August 2013. [PDF]
  • Benjamín Elizalde, Howard Lei, Gerald Friedland, and Nils Peters. 2013. Capturing the Acoustic Scene Characteristics for Audio Scene Detection. In Proceedings of the IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events (D-CASE) at the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2013), New Paltz, New York, October 2013. [PDF]
  • Hui Cheng, Jingen Liu, Saad Ali, Omar Javed, Qian Yu, Amir Tamrakar, Ajay Divakaran, Harpreet S. Sawhney, R. Manmatha, James Allan, Alex Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Afshin Dehghan, Gerald Friedland, Benjamin Martinez Elizalde, Trevor Darrell, Michael Witbrock, and Jon Curtis. 2012. SRI-Sarnoff AURORA System at TRECVID 2012: Multimedia Event Detection and Recounting. NIST TRECVID 2012. Gaithersburg, MD: National Institute of Standards and Technology. [PDF]
  • Gerald Friedland, Benjamín Martinez Elizalde, Howard Lei, and Ajay Divakaran. 2012. There Is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media. ICSI Technical Report TR-12-006. Berkeley, CA: International Computer Science Institute. [PDF]
  • Po-Sen Huang, Robert Mertens, Ajay Divakaran, Gerald Friedland, and Mark Hasegawa-Johnson. 2012. How to Put It into Words – Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), pp. 505-508, Kyoto, Japan, March 2012. [PDF]
  • Bhiksha Raj, Benjamín Elizalde, Gerald Friedland, Juan A. Nolazco-Flores, and L. Paola Garcia-Perera. 2012. Segment and Conquer: Acoustic Segmentation on Consumer-Produced (aka “Wild”) Videos. Poster presented at 2nd Greater New York Area Multimedia and Vision Meeting, New York, NY, June 15, 2012.
  • Robert Mertens, Po-Sen Huang, Luke Gottlieb, Gerald Friedland, and Ajay Divakaran. 2011. On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval. In Proceedings of the IEEE International Symposium on Multimedia (ISM 2011), Dana Point, California, December 2011, pp. 446-51. [PDF]
  • Robert Mertens, Howard Lei, Luke Gottlieb, Gerald Friedland, and Ajay Divakaran. 2011. Acoustic Super Models for Large Scale Video Event Detection. In Proceedings of the ACM International Workshop on Events in Multimedia (EiMM11), Scottsdale, Arizona, November 2011. [PDF]

Joke-O-Mat

Start Here

  • Gerald Friedland, Adam Janin, and Luke Gottlieb. 2013. Narrative Theme Navigation for Sitcoms Supported by Fan-Generated Scripts. Multimedia Tools and Applications 63:2, pp. 387-406. [PDF]
  • Gerald Friedland, Luke Gottlieb, and Adam Janin. 2009. Joke-O-Mat: Browsing Sitcoms Punchline by Punchline (ACM Grand Challenge submission). Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, October 2009, pp. 1115-16. [PDF]

More Publications

  • Adam Janin, Luke Gottlieb, and Gerald Friedland. 2010. Joke-O-Mat HD: Browsing Sitcoms with Human Derived Transcripts. In Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, October 2010, pp. 1591-94. [PDF]
  • Gerald Friedland, Luke Gottlieb, and Adam Janin. 2010. Narrative-Theme Navigation for Sitcoms Supported by Fan-Generated Scripts. In Proceedings of the Third International Workshop on Automated Information Extraction in Media Production (AIEMPro ’10) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, October 2010, pp. 3-8. [PDF]
  • Gerald Friedland, Luke Gottlieb, and Adam Janin. 2009. Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld Episodes. In Proceedings of the 11th IEEE International Symposium on Multimedia (ISM09), San Diego, California, December 2009, Workshop on Content-Based Audio/Video Analysis for Novel TV Services, pp. 511-16. [PDF]

Video Duplicate Detection Using Acoustic Methods

  • Mary Knox, Gerald Friedland, and R. Paul Smith. 2012. Using Acoustic Diarization for Duplicate Detection. ICSI Technical Report TR-12-005. Berkeley, CA: International Computer Science Institute. [PDF]