Scalable Big Data Analysis Publications

Scalable Multimedia content AnalysiS in a High-level language (SMASH)

Start Here

  • Bart Thomee, David A. Shamma, Gerald Friedland, Benjamín Elizalde, Karl Ni, Douglas Poland, Damian Borth, and Li-Jia Li. 2015. The New Data and New Challenges in Multimedia Research. Computing Research Repository, arXiv:1503.01817. [PDF]
  • Jaeyoung Choi, Bart Thomee, Gerald Friedland, Liangliang Cao, Karl Ni, Damian Borth, Benjamín Elizalde, Luke Gottlieb, Carmen Carrano, Roger Pearce, and Doug Poland. 2014. The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images. In Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia (GeoMM ’14). [PDF]
  • Khalid Ashraf, Benjamín Elizalde, Forrest Iandola, Matthew Moskewicz, Julia Bernd, Gerald Friedland, and Kurt Keutzer. 2015. Audio-Based Multimedia Event Detection with Deep Neural Nets and Sparse Sampling. To appear in Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, June 23-26, 2015.

More Publications

  • Julia Bernd, Damian Borth, Benjamín Elizalde, Gerald Friedland, Heather Gallagher, Luke Gottlieb, Adam Janin, Sara Karabashlieva, Jocelyn Takahashi, and Jennifer Won. 2015. The YLI‐MED Corpus: Characteristics, Procedures, and Plans. ICSI Technical Report TR-15-001. Computing Research Repository, arXiv:1503.04250. [PDF]

Fast Speaker Diarization using Python

  • Ekaterina Gonina, Gerald Friedland, Henry Cook, and Kurt Keutzer. 2011. Fast Speaker Diarization Using a High-Level Scripting Language. In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 11-15, 2011, Waikoloa, Hawaii. [PDF]
  • Henry Cook, Ekaterina Gonina, Shoaib Kamil, Gerald Friedland, David Patterson, and Armando Fox. 2011. CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model Applications. In Proceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, California, May 2011. [PDF]

PyCASP (Python-based Content Analysis using SPecialization)

  • Ekaterina Gonina, Gerald Friedland, Eric Battenberg, Penporn Koanantakool, Michael Driscoll, Evangelos Georganas, and Kurt Keutzer. 2014. Scalable Multimedia Content Analysis on Parallel Platforms Using Python. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) 10:2. [PDF]