From multitarget tracking to event recognition in videos Public Deposited

http://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/k643b467r

Descriptions

Attribute NameValues
Creator
Abstract or Summary
  • This dissertation addresses two fundamental problems in computer vision—namely, multitarget tracking and event recognition in videos. These problems are challenging because uncertainty may arise from a host of sources, including motion blur, occlusions, and dynamic cluttered backgrounds. We show that these challenges can be successfully addressed by using a multiscale, volumetric video representation, and taking into account various constraints between events offered by domain knowledge. The dissertation presents our two alternative approaches to multitarget tracking. The first approach seeks to transitively link object detections across consecutive video frames by finding the maximum independent set of a graph of all object detections. Two maximum-independent-set algorithms are specified, and their convergence properties theoretically analyzed. The second approach hierarchically partitions the space-time volume of a video into tracks of objects, producing a segmentation graph of that video. The resulting tracks encode rich contextual cues between salient video parts in space and time, and thus facilitate event recognition, and segmentation in space and time. We also describe our two alternative approaches to event recognition. The first approach seeks to learn a structural probabilistic model of an event class from training videos represented by hierarchical segmentation graphs. The graph model is then used for inference of event occurrences in new videos. Learning and inference algorithms are formulated within the same framework, and their convergence rates theoretically analyzed. The second approach to event recognition uses probabilistic first-order logic for reasoning over continuous time intervals. We specify the syntax, learning, and inference algorithms of this probabilistic event logic. Qualitative and quantitative results on benchmark video datasets are also presented. The results demonstrate that our approaches provide consistent video interpretation with respect to acquired domain knowledge. We outperform most of the state-of-the-art approaches on benchmark datasets. We also present our new basketball dataset that complements existing benchmarks with new challenges.
Resource Type
Date Available
Date Copyright
Date Issued
Degree Level
Degree Name
Degree Field
Degree Grantor
Commencement Year
Advisor
Committee Member
Academic Affiliation
Non-Academic Affiliation
Keyword
Subject
Rights Statement
Language
Replaces
Additional Information
  • description.provenance : Approved for entry into archive by Julie Kurtz(julie.kurtz@oregonstate.edu) on 2011-05-12T20:30:43Z (GMT) No. of bitstreams: 1 BrendelWilliam2011.pdf: 40789017 bytes, checksum: e0f710e07c1a64d69f1ed56f8e33352d (MD5)
  • description.provenance : Approved for entry into archive by Laura Wilson(laura.wilson@oregonstate.edu) on 2011-05-12T22:13:29Z (GMT) No. of bitstreams: 1 BrendelWilliam2011.pdf: 40789017 bytes, checksum: e0f710e07c1a64d69f1ed56f8e33352d (MD5)
  • description.provenance : Submitted by William Brendel (brendelw@onid.orst.edu) on 2011-05-06T01:19:09Z No. of bitstreams: 1 BrendelWilliam2011.pdf: 40789017 bytes, checksum: e0f710e07c1a64d69f1ed56f8e33352d (MD5)
  • description.provenance : Made available in DSpace on 2011-05-12T22:13:29Z (GMT). No. of bitstreams: 1 BrendelWilliam2011.pdf: 40789017 bytes, checksum: e0f710e07c1a64d69f1ed56f8e33352d (MD5)

Relationships

In Administrative Set:
Last modified: 08/19/2017

Downloadable Content

Download PDF
Citations:

EndNote | Zotero | Mendeley

Items