Cross-Media Video Classification, Indexing, Retrieval and Visualization

Challenging Problems:

  • Recent advances on high-performance video compression, storage and communication technologies present an extraordinary opportunity to enable evidence-based multimedia medical education by illustrating suitable medical video clips in the class. With the real clinic examples documented in the relevant medical video clips, medical instructors will have more capability and flexibility to explain medical concepts, principles and diagnosis skills in the class. As large-scale collections of medical education videos come into view, there is an urgent need for the efforts that classify and access medical education videos at the semantic level, so that the medical instructors can select the most relevant medical video clips over such large-scale video collections quickly and easily. In spite of some recent research progress, video classification and retrieval at the semantic level are still open problems with many unsolved challenging issues:
  • (a) Multi-Modal Query Concept Specification: There are three widely accepted approaches to specify query concepts: (1) video examples; (2) keywords; (3) browsing. Each approach represents a useful way of accessing a video database, but currently they all have limitations.
  • (b) Automatic Video Concept Detection: Because of semantic gap, it is very hard to detect the video concepts automatically, especially for the higher-level video concepts with large variations.
  • (c) Video Visualization for Query Evaluation: Video shots are used for video indexing, retrieval and displaying the query results, but the users may have stronger interest on identifying the most relevant lengthy video clips with complete descriptions of certain events.
  • Research Focus:

    This project will tackle these challenging problems in a specific domain of nursing education video, but we will also test and extend our algorithms for News Videos. The proposed research include:
  • develop a new framework for video content representation that is able to characterize the middle-level semantics of video contents;
  • incorporate domain knowledge to boost the video classifier training while reducing the cost and complexity significantly;
  • integrate video visualization for fast decision making and query result evaluation.
  • Current Project Achievements:

    a. Prototype System Implementation:

  • System Demo for Medical Video Classification and Summarization

  • System Demo for News Video Analysis and Visualization

  • Concept Ontology Visualization for Interactive Video Access:

    Need Java support to run this demo.

    Concept Ontology Visualization with ICONS for Interactive Video Access:

    Need Java support to run this demo.

    Concept Network Visualization for Interactive Video Access:

    Need Java support to run this demo.

    b. Algorithm Benchmarking:

  • Hierarchical Approach versus Flat Approach for Video Classification
  • Salient Objects versus Video Shots for Video Classification
  • Confident Object Volumes for Video Content Representation
  • Confidence Map for Automatic Salient Object Detection
  • Contextual Relationships for Semantic Video Interpretation and Modeling
  • c. Journal Publications:

  • J. Fan, H. Luo, Y. Gao, R. Jain, ``Incorporating Concept Ontology to Boost Hierarchical Classifier Training for Automatic Multi-Level Video Annotation", IEEE Trans. on Multimedia, special issue on Semantic Image and Video Indexing in Broad Domains, June, 2007.
  • J. Fan , H. Luo, A.K. Elmagarmid, ``Concept-Oriented Indexing of Video Database towards More Effective Retrieval and Browsing", IEEE Trans. on Image Processing, vol.13, no.7, pp.974-992, 2004.
  • J. Fan , X. Zhu, A.K. Elmagarmid, W.G. Aref, L. Wu, ``ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing", IEEE Trans. on Multimedia, vol.6, no.1, pp.70-86, 2004.
  • d. Conference Publications:

  • H. Luo, J. Fan, ``Building Concept Ontology for Medical Video Annotation", ACM Multimedia, Santa Barbara, CA, 2006.
  • Y. Gao, J. Fan, ``Incorporate Concept Ontology to Enable Probabilistic Concept Reasoning for Multi-Level Image Annotation", ACM Multimedia Workshop on Multimedia Information Retrieval (MIR'06), Santa Barbara, CA, 2006.
  • Y. Gao, J. Fan, H. Luo, X. Xue, R. Jain, ``Automatic Image Annotation by Incorporating Feature Hierarchy and Boosting to Scale up SVM Classifiers", ACM Multimedia, Santa Barbara, CA, 2006.
  • H. Luo, J. Fan, J. Yang, B. Ribarsky, S. Satoh, ``Exploring large-scale video news via interactive visualization", IEEE VAST'06 (IEEE Symposium on Visual Analytics Science and Technology 2006), 2006.
  • H. Luo, J. Fan, ``Concept-oriented video skimming via semantic video classification" (demo paper), ACM Multimedia, New York, Oct. 10-15, 2004.
  • J. Fan, H. Luo, ``Semantic video classification by integrating unlabeled samples for classifier training" (poster paper), ACM SIGIR, Sheffield, UK, July 25-29, 2004.
  • J. Fan, H. Luo, J. Xiao, L. Wu, ``Semantic video classification and feature subset selection under context and concept uncertainty", ACM/IEEE Joint Conf. on Digital Libraries (JCDL'04), Tuson, AZ, June 7-11, 2004.
  • H. Luo, J. Fan, Y. Gao, G. Xu, ``Multi-Modal salient objects: General semantic building blocks for semantic video concept interpretation", Int. Conf. on Image and Video Retrieval (CIVR'04), Dublin, Ireland, 2004.
  • If we know what we were doing, it wouldn't be research, would it? ---Albert Einstein(1879-1955)---