The ability of plant biologists to characterize the genetic basis of physiological traits is limited by their ability to obtain quantitative data that capture precise details of trait variation and, above all, to collect these data at high throughput and low cost. Deep learning-based methods have demonstrated unprecedented potential to automate...
We model the popular board game of Clue as an MDP and evaluate Monte-Carlo policy rollout in a simulated environment pitting different agents and policies against each other. We describe the choices we made in the representation, along with some of the problems we encountered along the way. We find...
Bioacoustics analysis can be used to conduct environmental monitoring by detecting the presence of bird species. This analysis usually involves identifying the species from their calls. In most frameworks, bird song syllables are extracted from audio recordings and individual syllables are input to a classifier to identify the species. Extraction...
We investigate a search and coverage planning problem, where an area of interest has to be explored by a number of vehicles, given a fixed time budget. A good coverage plan has a low probability of a target remaining unobserved. We introduce a formal problem statement, suggest a greedy algorithm...
The scientific method applies hypothesis testing to material samples through experimentation, measurement, and data analysis, which produce representations—or features—that describe phenomena of interest. Biomolecular features come in numerous forms such as values, matrices, graphs, three-dimensional structures, trajectories, and molecular surfaces. Researchers have tested thousands of features related to protein molecules...
Modular construction is increasingly seen as an efficient construction method in terms of time, cost, and energy. The full realization of these advantages partly relies on the efficiency of the production process inside the modular factories, which currently rely on tedious manual monitoring methods or expensive automated techniques. As a...
Labeling videos is costly, time-consuming and tedious. These costs can escalate in applications such as medical diagnosis or autonomous driving where we need domain expertise for annotation. Few-shot action recognition aims to solve this problem by annotation-efficient learning mechanisms.
This thesis presents MetaUVFS as the first Unsupervised Meta-learning algorithm for...
Heatmap regression has become one of the mainstream approaches to localizing facial landmarks. As Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) become popular for solving computer vision tasks, extensive research has been done on these architectures. However, the loss function for heatmap regression is rarely studied. In...
Semantic image segmentation is a relatively difficult task in computer vision. With the advent of deep learning, semantic image segmentation is increasingly of interest to researchers because of the excellent predictions from Convolutional Neural Networks (CNNs). However, CNNs have proven to struggle with obtaining the global context of an image due to...
Sports analytics is rapidly evolving through the use of computer vision systems that automatically extract huge amounts of information inherently present in multimedia data without much human assistance. This information can facilitate a better understanding of patterns and strategies in various sports. However, for non-professional teams, due to expense...
Image classification is a difficult problem, often requiring large training sets to get satisfactory results. However, this is a task that humans perform very well, and incorporating user feedback into these learning algorithms could help reduce the dependency on large amounts of labeled training data. This process has already been...
Data can be represented in multiple views. Traditional multi-view learning methods (e.g., co-training, multi-task learning) focus on improving learning performance using information from the auxiliary view, although information from the target view is sufficient for the learning task. However, this work addresses a semi-supervised case of multi-view learning, the surrogate supervision...
Automatic painterly rendering systems have been proposed, but they select a single style to generate paintings from images and so lack the ability to creatively use multiple styles to focus on important objects and de-emphasize unimportant parts of the scene. We provide a multi-style painting framework to address this issue...
Learning to recognize objects is a fundamental and essential step in human perception and understanding of the world. Accordingly, research on object discovery across diverse modalities plays a pivotal role in computer vision. This field not only contributes significantly to enhancing our understanding of visual information but...
The advancement of artificial intelligence (AI) has led to transformative developments across multiple sectors, fostering innovation and redefining our interactions with technology. As AI matures and becomes integrated into society, it offers numerous opportunities to address global challenges and revolutionize a wide array of human endeavors. These advances are driven...
As one of the most popular data types, the point cloud is widely used in various applications, including computer vision, computer graphics and robotics. The capability to directly measure 3D point clouds is invaluable in those applications as depth information could remove a lot of the segmentation ambiguities in...
Hand detection is a fundamental step for many hand-related computer vision tasks, such as gesture recognition, hand pose estimation, hand sign language translation, and so on. However, robustly detecting hands is a challenging task because of drastic changes in appearance based on finger articulation and changes in lighting conditions, camera...
In pursuit of global sustainability, forestry has witnessed significant shifts in practices and the development of new technologies and ideas. Primary and secondary processing industries have made substantial efforts to increase wood utilization rates, improve occupational safety and the working environment for humans, and have exhibited interest in procuring raw...
In supervised learning, label information can be provided at different levels of granularity. For small datasets, it is possible to acquire a label for each data instance. However, in the big-data regime, this fine granularity approach is prohibitively costly. For example, in semi-supervised learning, only a limited number of samples...
Wood composites are an important renewable structural material which can be a net carbon sink when used in combination with sustainable forest management practices and high rates of log utilization. Adhesive bondlines are an essential part of composites, and for wood composites, they determine the moisture durability and mechanical performance...
In the field of machine learning, clustering and classification are two fundamental tasks. Traditionally, clustering is an unsupervised method, where no supervision about the data is available for learning; classification is a supervised task, where fully-labeled data are collected for training a classifier. In some scenarios, however, we may not...
This thesis focuses on the problem of object tracking. Given a video, the general objective of tracking is to track the location over time of one or more targets in the image sequence. This is a very challenging task as algorithms need to deal with problems such as appearance variations,...
The widespread use of wireless devices that we have recently been witnessing, such as smartphones, tablets, laptops, and wirelessly accessible devices in general, is causing an unprecedented growth in the required amount of the wireless radio spectrum. On the other hand, the spectrum resource has, for the last several decades,...
This research explores several novel approaches to improve visualization and segmentation of point clouds acquired with 3D laser scanning. 3D laser scanning is used in a wide variety of applications including surveying and mapping, transportation asset management, facilities management, building information modeling, crime scene investigations, cultural heritage and geologic investigations....
Protein-protein interactions underlie all biological processes, and their study has wide implications for many other fields including medicine, genetics, biology, and ecology. Proteins are the building blocks and primary actors of life. They work together to accomplish virtually every task within a cell, including metabolism, signal...
We consider two semiparametric regression models for data analysis, the stochastic additive model (SAM) for nonlinear time series data and the additive coefficient model (ACM) for randomly sampled data with nonparametric structure. We employ the SCAD-penalized polynomial spline estimation method for estimation and simultaneous variable selection in both models. It...
In this work, I examine the problem of understanding American football in video. In particular, I present several mid-level computer vision algorithms that each accomplish a different sub-task within a larger system for annotating, interpreting, and analyzing collections of American football video. The analysis of football video is useful in...
The quality of a digital image pipeline relies greatly on its color reproduction, which should at a minimum handle color constancy, and the final judgment of the excellence of the pipeline is made through subjective observations by humans.
This dissertation addresses a few topics surrounding the color processing of...