mirage   mirage   mirage

code4lib Conference 2006, February 16, 2006 : morning session

DSpace/Manakin Repository

ScholarsArchive@OSU will be migrating to a new platform in the coming weeks - likely by November 1, 2017. We do not expect major service disruptions during this process, but if you encounter problems or have questions, please contact us at scholarsarchive@oregonstate.edu. Thank you for your patience.

Show simple item record

dc.creator Whitney, Colleen
dc.creator Sanderson, Robert
dc.creator Chute, Robert
dc.date.accessioned 2006-08-23T16:23:20Z
dc.date.available 2006-08-23T16:23:20Z
dc.date.issued 2006-02-16
dc.identifier.uri http://hdl.handle.net/1957/2946
dc.description Presentation given on the second day of the code4lib Conference held Feb. 15-17, 2006, at LaSells Stewart Center, Oregon State University, Corvallis, Oregon. en
dc.description.abstract Generating recommendations in OPACS: initial results and open areas for exploration : In the context of a research and prototyping project, the California Digital Library is using catalog content indexed in XTF, along with over 9 million historical circulation transaction records and other external data, to generate recommendations for an academic audience. Early results are promising. This talk will focus on methods, challenges, and plans for further development. -- Library Text Mining : Using the TeraGrid1 and the SRB DataGrid2, we have sufficient computational and storage facilities to run normally prohibitively expensive processing tasks. By integrating text and data mining tools3[4] within the Cheshire35 information architecture, we can parse the natural language present in 20 million MARC records (the University of California's MELVYL collection) and extract information to provide to search/retrieve applications. In this talk, we'll discuss the results of applying new techniques to "old" data. -- Anatomy of aDORe : The aDORe Archive is a write-once/read-many storage approach for Digital Objects and their constituent datastreams. First, XML-based representations of multiple Digital Objects are concatenated into a single, valid XML file named an XMLtape. Second, ARC files, as introduced by the Internet Archive, are used to contain the constituent datastreams of the Digital Objects. The software was developed by the LANL Digital Library Research & Prototyping Team and is available under GNU LGPL license. en
dc.format.extent 76746752 bytes
dc.format.mimetype audio/x-mpeg
dc.language.iso en_US en
dc.relation.ispartofseries code4lib Conference 2006 en
dc.subject aDORe Archive en
dc.subject Digital storage en
dc.subject Online catalogs en
dc.subject Recommendation systems en
dc.subject Text mining en
dc.subject Information retrieval systems en
dc.subject Data mining en
dc.title code4lib Conference 2006, February 16, 2006 : morning session en
dc.title.alternative Generating recommendatons in OPACS : initial results and open areas for exploration en
dc.title.alternative Library text mining en
dc.title.alternative Anatomy of aDORe en
dc.type Recording, oral en

This item appears in the following Collection(s)

Show simple item record

Search ScholarsArchive@OSU

Advanced Search


My Account