Algorithms for massive biological datasets Public Deposited

http://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/70795c23n

Descriptions

Attribute NameValues
Creator
Abstract or Summary
  • Within the past several years the technology of high-throughput sequencing has transformed the study of biology by offering unprecedented access to life's fundamental building block, DNA. With this transformation's potential a host of brand-new challenges have emerged, many of which lend themselves to being solved through computational methods. From de novo and reference-guided genome assembly to gene prediction and identification, from genome annotation to gene expression, a multitude of biological questions are being asked and answered using high-throughput sequencing and computational methods. In this thesis we examine topics relating to high-throughput sequencing. Beginning with de novo assembly we outline current state-of-the-art methods for stitching short reads, the output of high-throughput sequencing experiments, into cohesive genomic contigs and scaffolds. Next we present our own de novo assembly software, QSRA, created in an effort to form longer contigs even through areas of low coverage and high error. We then present an application of short-read assembly and mutation analysis in a discussion of single nucleotide polymorphism discovery in hazelnut, followed by a review of de novo gene finding, the act of identifying genes in anonymous stretches of genomic sequence. Next we outline our supersplat software, built to align short reads generated by RNA-seq experiments, which span splice junctions, followed by the presentation of our gumby software, build to construct putative gene models from purely empirical short-read data. Finally we outline current state-of-the-art methods for discovering and quantifying alternative splicing variants from RNA-seq short-read data. High-throughput sequencing has fundamentally changed the way in which we approach biological questions. While an exceptionally powerful tool, high-throughput sequencing analysis demands equally powerful algorithmic techniques. We examine these issues through the lens of computational biology.
Resource Type
Date Available
Date Copyright
Date Issued
Degree Level
Degree Name
Degree Field
Degree Grantor
Commencement Year
Advisor
Committee Member
Academic Affiliation
Non-Academic Affiliation
Keyword
Subject
Rights Statement
Peer Reviewed
Language
Replaces
Additional Information
  • description.provenance : Made available in DSpace on 2011-11-11T00:14:06Z (GMT). No. of bitstreams: 3 BryantJrDouglasW2012.pdf: 5808123 bytes, checksum: af12b691fbb2b395bb379f4b9c64fb91 (MD5) license_rdf: 22797 bytes, checksum: c87af7b95ad7d9cc7c32b1191f533229 (MD5) license_text: 23084 bytes, checksum: 24f8f2abcc0a604d988d82a6620de1dd (MD5) Previous issue date: 2011-10-04
  • description.provenance : Submitted by Douglas Bryant Jr (bryantdo@onid.orst.edu) on 2011-10-31T16:32:07Z No. of bitstreams: 3 BryantJrDouglasW2012.pdf: 5808123 bytes, checksum: af12b691fbb2b395bb379f4b9c64fb91 (MD5) license_rdf: 22797 bytes, checksum: c87af7b95ad7d9cc7c32b1191f533229 (MD5) license_text: 23084 bytes, checksum: 24f8f2abcc0a604d988d82a6620de1dd (MD5)
  • description.provenance : Submitted by Douglas Bryant Jr (bryantdo@onid.orst.edu) on 2011-10-10T15:58:45Z No. of bitstreams: 3 BryantJrDouglasW2012.pdf: 5807570 bytes, checksum: 9db843beabb0c33e35ca2541bd509b88 (MD5) license_rdf: 22797 bytes, checksum: c87af7b95ad7d9cc7c32b1191f533229 (MD5) license_text: 23084 bytes, checksum: 24f8f2abcc0a604d988d82a6620de1dd (MD5)
  • description.provenance : Rejected by Julie Kurtz(julie.kurtz@oregonstate.edu), reason: Rejecting to resubmit after the final exam and once thesis is approved. Thanks, Julie on 2011-10-10T18:09:58Z (GMT)
  • description.provenance : Approved for entry into archive by Laura Wilson(laura.wilson@oregonstate.edu) on 2011-11-11T00:14:06Z (GMT) No. of bitstreams: 3 BryantJrDouglasW2012.pdf: 5808123 bytes, checksum: af12b691fbb2b395bb379f4b9c64fb91 (MD5) license_rdf: 22797 bytes, checksum: c87af7b95ad7d9cc7c32b1191f533229 (MD5) license_text: 23084 bytes, checksum: 24f8f2abcc0a604d988d82a6620de1dd (MD5)
  • description.provenance : Approved for entry into archive by Julie Kurtz(julie.kurtz@oregonstate.edu) on 2011-11-10T16:19:19Z (GMT) No. of bitstreams: 3 BryantJrDouglasW2012.pdf: 5808123 bytes, checksum: af12b691fbb2b395bb379f4b9c64fb91 (MD5) license_rdf: 22797 bytes, checksum: c87af7b95ad7d9cc7c32b1191f533229 (MD5) license_text: 23084 bytes, checksum: 24f8f2abcc0a604d988d82a6620de1dd (MD5)

Relationships

In Administrative Set:
Last modified: 08/19/2017

Downloadable Content

Download PDF
Citations:

EndNote | Zotero | Mendeley

Items