Real-world datasets are dirty and contain many errors. Examples of these issues are violations of integrity constraints, duplicates, and inconsistencies in representing data values and entities. Applying machine learning on dirty databases may lead to inaccurate results. Users have to spend a lot of time and effort repairing data errors...
The advancement of artificial intelligence (AI) has led to transformative developments across multiple sectors, fostering innovation and redefining our interactions with technology. As AI matures and becomes integrated into society, it offers numerous opportunities to address global challenges and revolutionize a wide array of human endeavors. These advances are driven...
Analysis of observations on sequential events over time is common in real life. Sequential measurements over time describing the behavior of systems are usually called time series data, which have been collected in a wide range of disciplines. Over the years there have been multiple research areas in studying stochastic...
The Gene Ontology (GO) Consortium (http://www.geneontology.org) (GOC) continues to develop,
maintain and use a set of structured, controlled
vocabularies for the annotation of genes, gene
products and sequences. The GO ontologies
are expanding both in content and in structure.
Several new relationship types have been introduced
and used, along with...
In this dissertation, the primary objective is to discover more sustainable electrode materials and study new reaction mechanisms using aqueous electrolytes. The first study conducted reveals a reversible conversion reaction from copper to Cu2CO3(OH)2. The reaction mechanism uses OH- and CO32- as charge carriers at the cathode. The results open...
An important impact of the genome technology revolution will be the elucidation of mechanisms of cancer pathogenesis, leading to improvements in the diagnosis of cancer and the selection of cancer treatment. Integrated with current well-studied massive knowledge and findings about the role of protein-coding mutations in cancer, demystifying the functional...
The objectives of this project were to investigate the critical factors impacting the physicochemical and antibacterial properties of β-chitosan based films derived from jumbo squid (Dosidicus gigas) pens, and to evaluate the feasibility of improving water solubility of β-chitosan through Maillard reaction. The studies examined the effect of molecular weight...
In the past two decades, the advancement in data collection and storage have led to the accumulation of complex datasets. Consequently, various industries have sought data-driven solutions to predict and detect anomalies. Temporal patterns have emerged as potential features in prediction models that could improve the performance of the identification...
Small unmanned aircraft systems (UAS) carrying consumer-grade nonmetric cameras are increasingly utilized to generate high-resolution 3D geospatial data. Low cost, ease of operation, widespread availability and low altitude maneuvering capabilities of UAS, as well as the rapid development of technology and methods, make UAS-based photogrammetry applicable to many civil engineering...