Accuracy versus cost in distributed data mining Public Deposited

http://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/0c483p87s

Descriptions

Attribute NameValues
Creator
Abstract or Summary
  • A basic tradeoff to consider when designing a distributed data-mining framework is the need for a compromise between the cost of communication and computation resources and the accuracy of the mining results. This is essentially a decision of whether it is more efficient to communicate all of the data to a central site for analysis, possibly increasing the accuracy of the results, or is it more efficient to mine the data locally at each of the remote sites and then combine the results, possibly reducing the use of communication and computation resources. This research attempts the design, analysis, and implementation of an efficient distributed and cumulative learning algorithm with performance guarantees that are provable relative to its centralized or batch counterparts for knowledge acquisition from distributed data sources that will address this tradeoff. This thesis also develops a methodical mathematical framework to describe this type of tradeoff, describes the reduction of the problem to a constrained optimization problem, and demonstrates techniques to balance cost and accuracy levels.
Resource Type
Date Available
Date Copyright
Date Issued
Degree Level
Degree Name
Degree Field
Degree Grantor
Commencement Year
Advisor
Committee Member
Academic Affiliation
Non-Academic Affiliation
Keyword
Subject
Rights Statement
Publisher
Language
File Format
File Extent
  • 226665 bytes
Replaces
Additional Information
  • description.provenance : Approved for entry into archive by Linda Kathman(linda.kathman@oregonstate.edu) on 2007-07-31T16:07:01Z (GMT) No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Rejected by Julie Kurtz(julie.kurtz@oregonstate.edu), reason: Rejecting to change the page numbering, starting the thesis on page one. Then open item that was rejected, replace with revised file and resubmit. Thanks, Julie on 2007-07-30T18:03:41Z (GMT)
  • description.provenance : Made available in DSpace on 2007-07-31T16:07:03Z (GMT). No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Submitted by Stephanie Deutschman (deutschs@onid.orst.edu) on 2007-07-30T17:49:12Z No. of bitstreams: 1 Accuracy versus cost.pdf: 520785 bytes, checksum: a4207d07b9ec6473721cc6417df5dadb (MD5)
  • description.provenance : Submitted by Stephanie Deutschman (deutschs@onid.orst.edu) on 2007-07-30T18:24:41Z No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)
  • description.provenance : Approved for entry into archive by Julie Kurtz(julie.kurtz@oregonstate.edu) on 2007-07-31T15:21:48Z (GMT) No. of bitstreams: 1 Accuracy versus cost.pdf: 226665 bytes, checksum: 3d850f89fcf491350ca7d0754bd643dd (MD5)

Relationships

Parents:

This work has no parents.

Last modified

Downloadable Content

Download PDF

Items