Index Catalog // ScholarsArchive@OSU

Window Axial Vision Transformer for Image Classification

Creator:: Lee, Jisoo
Abstract:: Currently, a popular approach to image classification uses the deep Transformer architecture. In a Transformer, the attention mechanism enables the model to learn efficiently with fewer computational resources than the convolutional neural networks (CNNs). In this thesis, we study the sparse attention mechanism widely used in the Transformers developed specifically...
Resource Type:: Masters Thesis
Full Text:: [11] Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin

Part-based and Uncertainty-Aware Few-shot Object Segmentation in Images

Creator:: Nguyen, Khoi D.
Abstract:: This dissertation addresses few-shot object segmentation in images. The goal of segmentation is to label every image pixel with a class of the object occupying that pixel, where the class may represent a semantic object category or instance. In few-shot segmentation, training and test datasets have different classes. Every new...
Resource Type:: Dissertation

Action Segmentation in Videos: Deep Models, Prediction Diffusion, and Deep-Temporal Augmentations

Creator:: Aziere, Nicolas
Abstract:: This paper addresses the high model complexity and overconfident frame labeling of state-of-the-art (SOTA) action segmenters. Their complexity is typically justified by the need to sequentially refine action segmentation through multiple stages of a deep architecture. However, this multistage refinement does not take into account uncertainty of frame labeling predicted...
Resource Type:: Dissertation

Robust and Efficient Classification of Videos in the Wild

Creator:: Mahasseni, Behrooz
Abstract:: Recognizing human actions in videos is a long-standing problem in computer vision with a wide range of applications including video surveillance, content retrieval, and sports analysis. This thesis focuses on addressing efficiency and robustness of video classification in unconstrained real-world settings. The thesis work can be broadly divided into four...
Resource Type:: Dissertation

Interactive player tracking for videos in American football

Creator:: Bawaskar, Amit
Abstract:: This thesis presents an interactive software tool for tracking a moving object in a video. In particular, we focus on the problem of tracking a player in American football videos. Object tracking is one of the fundamental problems in computer vision. It is one of the most important components in...
Resource Type:: Masters Thesis

A Deep Action Segmentation and Its Explanation with a Dictionary of Meaningful Attention Maps

Creator:: Trigkakis, Dimitrios
Abstract:: This thesis addresses the problem of temporal action segmentation in videos, where the goal is to label every video frame with the appropriate action class present. We focus on the domain of NFL football videos, where action classes represent common football play types. For action segmentation, we use a temporal...
Resource Type:: Masters Thesis

Pixel- and Frame-level Video Labeling using Spatial and Temporal Convolutional Networks

Creator:: Lei, Peng
Abstract:: This dissertation addresses the problem of video labeling at both the frame and pixel levels using deep learning. For pixel-level video labeling, we have studied two problems: i) Spatiotemporal video segmentation and ii) Boundary detection and boundary flow estimation. For the problem of spatiotemporal video segmentation, we have developed recurrent...
Resource Type:: Dissertation

Semantic image segmentation using domain constraints

Creator:: Roy, Anirban
Abstract:: This dissertation addresses the problem of semantic labeling of image pixels. In the course of our work, we considered different types of semantic labels, including object classes (e.g., car, person), 3D depth values (in the range 0 to 80 meters), and affordance classes (e.g., walkable, sittable). Semantic pixel labeling is...
Resource Type:: Dissertation

Relational Networks for Visual Relationship Detection in Images

Creator:: Nguyen, Khoi
Abstract:: This thesis is about visual relationship detection. This is an important task in computer vision. The goal is to detect all visual relationships in a given image between objects. This thesis presents a new approach to this problem. Our approach does not use an object detector as a common pre-processing...
Resource Type:: Masters Thesis
Full Text:: Vision and Pattern Recognition (CVPR), July 2017. [15] Liang Lin, Guangrun Wang, Rui Zhang, Ruimao Zhang

Fine-Grained Object Recognition Under Limited Training Data

Creator:: Lam, Michael Q.
Abstract:: This dissertation addresses object recognition in challenging settings, where distinct object classes are visually very similar (e.g., species of birds and insects) and/or access to training examples of object classes is limited (e.g., due to the associated high costs of data annotation). In this dissertation, we present a variety of...
Resource Type:: Dissertation
Full Text:: hybrid of Lin- ear programming (LP) and QP [83], and search-based methods [44, 30, 74, 109, 126

ScholarsArchive@OSU

Window Axial Vision Transformer for Image Classification

Part-based and Uncertainty-Aware Few-shot Object Segmentation in Images

Action Segmentation in Videos: Deep Models, Prediction Diffusion, and Deep-Temporal Augmentations

Robust and Efficient Classification of Videos in the Wild

Interactive player tracking for videos in American football

A Deep Action Segmentation and Its Explanation with a Dictionary of Meaningful Attention Maps

Pixel- and Frame-level Video Labeling using Spatial and Temporal Convolutional Networks

Semantic image segmentation using domain constraints

Relational Networks for Visual Relationship Detection in Images

Fine-Grained Object Recognition Under Limited Training Data

Limit your search

Academic Affiliation

Advisor

Commencement Year

Committee Member

Creator

Date

Decade

Degree Field

Degree Level

Degree Name

File Format

Language

License

Non-Academic Affiliation

Peer Reviewed

Resource Type

Rights Statement

Subject

Search Constraints

Search Results

Limit your search