Graduate Thesis Or Dissertation
 

Accuracy versus cost in distributed data mining

Public Deposited

Downloadable Content

Download PDF
https://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/0c483p87s

Descriptions

Attribute Name | Values
Creator
Abstract
  • A basic tradeoff in designing a distributed data-mining framework is between the cost of communication and computation resources and the accuracy of the mining results. The question is essentially whether it is more efficient to communicate all of the data to a central site for analysis, possibly increasing the accuracy of the results, or to mine the data locally at each remote site and then combine the results, possibly reducing the use of communication and computation resources. To address this tradeoff, this research attempts the design, analysis, and implementation of an efficient distributed and cumulative learning algorithm for knowledge acquisition from distributed data sources, with performance guarantees that are provable relative to its centralized or batch counterparts. The thesis also develops a systematic mathematical framework to describe this type of tradeoff, reduces the problem to a constrained optimization problem, and demonstrates techniques for balancing cost and accuracy.
License
Resource Type
Date Available
Date Issued
Degree Level
Degree Name
Degree Field
Degree Grantor
Commencement Year
Advisor
Committee Member
Academic Affiliation
Non-Academic Affiliation
Subject
Rights Statement
Publisher
Peer Reviewed
Language
File Format
File Extent
  • 226665 bytes
Replaces
Additional Information
  • description.provenance : Approved for entry into archive by Linda Kathman(linda.kathman@oregonstate.edu) on 2007-07-31T16:07:01Z (GMT) No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Rejected by Julie Kurtz(julie.kurtz@oregonstate.edu), reason: Rejecting to change the page numbering, starting the thesis on page one. Then open item that was rejected, replace with revised file and resubmit. Thanks, Julie on 2007-07-30T18:03:41Z (GMT)
  • description.provenance : Made available in DSpace on 2007-07-31T16:07:03Z (GMT). No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Submitted by Stephanie Deutschman (deutschs@onid.orst.edu) on 2007-07-30T17:49:12Z No. of bitstreams: 1 Accuracy versus cost.pdf: 520785 bytes, checksum: a4207d07b9ec6473721cc6417df5dadb (MD5)
  • description.provenance : Submitted by Stephanie Deutschman (deutschs@onid.orst.edu) on 2007-07-30T18:24:41Z No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Approved for entry into archive by Julie Kurtz(julie.kurtz@oregonstate.edu) on 2007-07-31T15:21:48Z (GMT) No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)

Relationships

Parents:

This work has no parents.

In Collection:

Items