Index Catalog // ScholarsArchive@OSU

Window Axial Vision Transformer for Image Classification

Creator:: Lee, Jisoo
Abstract:: Currently, a popular approach to image classification uses the deep Transformer architecture. In a Transformer, the attention mechanism enables the model to learn efficiently with fewer computational resources than the convolutional neural networks (CNNs). In this thesis, we study the sparse attention mechanism widely used in the Transformers developed specifically...
Resource Type:: Masters Thesis

Pixel- and Frame-level Video Labeling using Spatial and Temporal Convolutional Networks

Creator:: Lei, Peng
Abstract:: This dissertation addresses the problem of video labeling at both the frame and pixel levels using deep learning. For pixel-level video labeling, we have studied two problems: i) Spatiotemporal video segmentation and ii) Boundary detection and boundary flow estimation. For the problem of spatiotemporal video segmentation, we have developed recurrent...
Resource Type:: Dissertation

Fine-Grained Object Recognition Under Limited Training Data

Creator:: Lam, Michael Q.
Abstract:: This dissertation addresses object recognition in challenging settings, where distinct object classes are visually very similar (e.g., species of birds and insects) and/or access to training examples of object classes is limited (e.g., due to the associated high costs of data annotation). In this dissertation, we present a variety of...
Resource Type:: Dissertation

Relational Networks for Visual Relationship Detection in Images

Creator:: Nguyen, Khoi
Abstract:: This thesis is about visual relationship detection. This is an important task in computer vision. The goal is to detect all visual relationships in a given image between objects. This thesis presents a new approach to this problem. Our approach does not use an object detector as a common pre-processing...
Resource Type:: Masters Thesis

Action Segmentation in Videos: Deep Models, Prediction Diffusion, and Deep-Temporal Augmentations

Creator:: Aziere, Nicolas
Abstract:: This paper addresses the high model complexity and overconfident frame labeling of state-of-the-art (SOTA) action segmenters. Their complexity is typically justified by the need to sequentially refine action segmentation through multiple stages of a deep architecture. However, this multistage refinement does not take into account uncertainty of frame labeling predicted...
Resource Type:: Dissertation

Part-based and Uncertainty-Aware Few-shot Object Segmentation in Images

Creator:: Nguyen, Khoi D.
Abstract:: This dissertation addresses few-shot object segmentation in images. The goal of segmentation is to label every image pixel with a class of the object occupying that pixel, where the class may represent a semantic object category or instance. In few-shot segmentation, training and test datasets have different classes. Every new...
Resource Type:: Dissertation

Robust and Efficient Classification of Videos in the Wild

Creator:: Mahasseni, Behrooz
Abstract:: Recognizing human actions in videos is a long-standing problem in computer vision with a wide range of applications including video surveillance, content retrieval, and sports analysis. This thesis focuses on addressing efficiency and robustness of video classification in unconstrained real-world settings. The thesis work can be broadly divided into four...
Resource Type:: Dissertation

Hierarchical graphical models for activity recognition in videos

Creator:: Amer, Mohamed R.
Abstract:: This dissertation addresses the problem of recognizing human activities in videos. Our focus is on activities with stochastic structure, where the activities are characterized by variable space-time arrangements of actions, and conducted by a variable number of actors. These activities occur frequently in sports and surveillance videos. They may appear...
Resource Type:: Dissertation

Semantic image segmentation using domain constraints

Creator:: Roy, Anirban
Abstract:: This dissertation addresses the problem of semantic labeling of image pixels. In the course of our work, we considered different types of semantic labels, including object classes (e.g., car, person), 3D depth values (in the range 0 to 80 meters), and affordance classes (e.g., walkable, sittable). Semantic pixel labeling is...
Resource Type:: Dissertation

Efficient Incremental Panorama Reconstruction from Multiple Videos

Creator:: Feng, Zhongyuan
Abstract:: Constructing a panorama from a set of videos is a long-standing problem in computer vision. A panorama represents an enhanced still-image representation of an entire scene captured in a set of videos, where each video shows only a part of the scene. Importantly, a panorama shows only the scene background,...
Resource Type:: Capstone Project
