This documentation file was generated on 2021-02-15 by Clara Llebot and Diana Castillo

-------------------
GENERAL INFORMATION
-------------------

1. Title of Dataset: Research Data Policies at Doctoral Universities with very high research activity (R1 institutions)

2. Creator Information

Name: Clara Llebot Lorente
Institution: Oregon State University
College, School or Department: OSU Libraries and Press
Address: 121 The Valley Library, Corvallis, OR 97331
Email: Clara.Llebot@oregonstate.edu
ORCID: https://orcid.org/0000-0003-3211-7396
Role: PI

Name: Diana J. Castillo
Institution: Oregon State University
College, School or Department: OSU Libraries and Press
Address: 121 The Valley Library, Corvallis, OR 97331
Email: diana.castillo@oregonstate.edu
ORCID:
Role: researcher

3. Contributor information

Name: Gloria Park
Institution: Oregon State University
Email: parkg@oregonstate.edu
Role: Student intern

Name: Jen Martin
Institution: Oregon State University
College, School or Department:
Email: martjen4@oregonstate.edu
Role: Student intern

4. Contact Information

Name: Clara Llebot Lorente
Institution: Oregon State University
College, School or Department: OSU Libraries and Press
Address: 121 The Valley Library, Corvallis, OR 97331
Email: clara.llebot@oregonstate.edu
ORCID: https://orcid.org/0000-0003-3211-7396

-------------------
CONTEXTUAL INFORMATION
-------------------

1. Abstract for the dataset

This dataset compiles a list of universities classified as Doctoral Universities with very high research activity (R1) by the Carnegie Commission for Higher Education, along with their unique identifiers. The dataset also lists whether or not an institutional research data policy exists, the internal classification number used by the researchers, the title of the policy, a link to the policy, comments from the researchers, and who located the policy.
The policies included in this dataset are policies that refer to the management and retention of research data and that (a) refer exclusively to research data, (b) affect all types of research data generated in an institution, and (c) are official, university-wide policies.

2. Context of the research project that this dataset was collected for

This project was undertaken while Oregon State University was writing its own research data policy, to see whether existing research data policies shared common items or best practices.

3. Date of data collection: The Carnegie list of R1 universities was downloaded in early 2020. The data about policies was compiled between 2020-07-01 and 2020-08-30.

4. Funding sources that supported the collection of the data: OSU Libraries paid for the salary of the two students who worked on data collection.

--------------------------
SHARING/ACCESS INFORMATION
--------------------------

1. Licenses/restrictions placed on the data: This work is licensed under a Creative Commons Attribution 4.0 International License https://creativecommons.org/licenses/by/4.0/

2. Links to publications related to the dataset: There are no publications related to the dataset yet.

3. Links to other publicly accessible locations of the data: There are no other publicly accessible locations for this dataset.

4. Recommended citation for the data: Llebot, C. & Castillo, D. J. (2021) policy_data_LlebotCastillo2021 [Data set]. Oregon State University. https://doi.org/10.7267/4j03d6263

5. Dataset Digital Object Identifier (DOI): https://doi.org/10.7267/4j03d6263

--------------------------
VERSIONING AND PROVENANCE
--------------------------

1. Last modification date: 2021-02-11

2. Links/relationships to other versions of this dataset: There are no other versions of this dataset.

3. Was data derived from another source? No

--------------------------
METHODOLOGICAL INFORMATION
--------------------------

1. Description of methods used for collection/generation of data:

This dataset identifies data policies that meet all of the following criteria:

- Refer exclusively to research data. This excludes policies that cover other types of data, such as university business data, even if these policies also affect research data.
- Affect all research data and all management actions on research data in an institution, as opposed to only one type of research data (e.g. only research notebooks) or only one management activity (e.g. only data security).
- Are university-wide. This excludes policies from departments, colleges, etc.
- Are stand-alone research data policies. This excludes other policies that may also address research data, such as Intellectual Property policies.
- Are official university policies, created by University Administration. This excludes faculty senate policies, faculty workbooks, etc.

Phase 1 of data collection: Identification of universities.

This dataset focuses on policies of universities that are classified as Doctoral Universities – Very high research activity by the Carnegie classification of institutions of Higher education. According to Carnegie, this classification includes “only institutions that awarded at least 20 research/scholarship doctoral degrees and had at least $5 million in total research expenditures (as reported through the National Science Foundation (NSF) Higher Education Research & Development Survey).” These universities are referred to as R1 universities by Carnegie, and we also do so in this document. A list of R1 universities and their unitid values was downloaded from the Carnegie Commission for Higher Education website.

Phase 2 of data collection: Identification of contact information from Universities.
To ensure that the policies collected are the current and correct versions, and to find policies that may have restricted access, we identified one person at each university who we thought might know the status of data policies at their university. These contacts were, in order of preference:

- Librarians with data management responsibilities. They had titles like data management specialist, data management librarian, data librarian, data management consultant, data services librarian, etc.
- When there was no appropriate person from the library, people from the office in charge of writing policies at each university. These offices had names like University Policy and Standards Program, Policy gatekeeper, etc.
- If no other contacts were found, a person at the research office of the university.

Phase 3 of data collection: We sent the following e-mail to each of the contacts identified in phase 2.

    Dear [First_Name] [Last_Name],

    We, Clara Llebot Lorente and Diana Castillo, are a team of librarians at Oregon State University who are working on a research project analyzing the content of institutional research data policies of R1 universities. We are writing to you because we think that as the [Job Title] of [University name] you may be able to help us identify the institutional research data policy at your institution.

    We’re looking for institutional research data policies that address the management of research data at the institution, and that fulfill the following requirements:

    - The policy refers exclusively to research data. This excludes policies that talk about other types of data such as university business data, student data, etc. even if these policies also affect research data.
    - The policy is university-wide. Excludes policies from departments, colleges, etc.
    - The policy is a stand alone research data policy. Excludes other policies that may also include research data, but that have a different focus, like Intellectual Property policies.
    - It is an official university policy, created by university administration. Excludes faculty senate policies, faculty workbooks, etc.

    We would appreciate it if you could reply to this e-mail with answers to these 3 questions:

    1. Does your institution have a research data policy that fits the description above?
    2. If your university has this policy, could you please send us a link to the policy or instructions on how to access it?
    3. If your institution does not have a data policy that fits our definition, are you aware of any plans that your university has to develop one?

    Your name will not be used during the research process, we are only contacting you to make sure that we identify the current research data policy for your institution. If you are not the right person to contact, we will appreciate a referral to somebody that may know.

    Thanks so much for your help!

Phase 4: A student filled in the spreadsheet shared here with the responses received by e-mail. This data was coded as “source: Contact”.

Phase 5: A student conducted a web search to locate policies of the universities for which we had not received answers, or for which the contacts were not able to provide answers. This data was coded as “source: Intern”.

Phase 6: We compared our dataset with the policies identified in Briney et al. (2015) as a quality control procedure. All differences between the two datasets were attributed to (a) changes in the R1 classification of universities between 2015 and 2020, (b) new policies published between 2015 and 2020, or (c) policies identified by Briney et al. (2015) that did not fit the “data policy” definition used in this study.

Briney, K., Goben, A., & Zilinski, L. (2015). Do You Have an Institutional Data Policy? A Review of the Current Landscape of Library Data Services and Institutional Data Policies. Journal of Librarianship and Scholarly Communication, 3(2), eP1232.
DOI: http://doi.org/10.7710/2162-3309.1232

---------------------
DATA & FILE OVERVIEW
---------------------

1. Files

This dataset has one file only.

Filename: policy_data_LlebotCastillo2021
Short description: csv file with information on research data policies for R1 universities.

2. Formats

.csv

-----------------------------------------
TABULAR DATA-SPECIFIC INFORMATION FOR: policy_data_LlebotCastillo2021
-----------------------------------------

1. Number of variables: 10

2. Number of cases/rows: 134

3. Missing data codes: Columns unitid, Institution, PolicyExists and Source should have no missing data. In the remaining variables, missing values are left blank (e.g. if there is no policy, PolicyURL and PolicyName are blank).

4. Variable List

A. Variable name: Unitid
   Description: Unique IPEDS identification number for an institution, as used by The Carnegie Classification of Institutions of Higher Education https://carnegieclassifications.iu.edu/index.php
   Value: six digit integer

B. Variable name: Institution
   Description: Institution name, as used by The Carnegie Classification of Institutions of Higher Education https://carnegieclassifications.iu.edu/index.php
   Value: text

C. Variable name: InternalPolicyID
   Description: Identifier used internally for the work of Diana Castillo and Clara Llebot. Only universities with policies that fit the criteria of this study have values. Two numbers (11 and 26) are missing because the corresponding policies were removed from the pool after the identifiers had been generated.
   Value: two digit integer

D. Variable name: PolicyExists
   Description: Whether a policy that fits our description exists at the university.
   Value: logical. Yes / No

E. Variable name: PolicyName
   Description: Name of the policy, as it appears in the text.
   Value: text

F. Variable name: PolicyURL
   Description: Complete URL to access the policy, starting with https://
   Value: text. Blank if the policy is not accessible via URL.

G. Variable name: AlternativeAccess
   Description: If the policy is not accessible through a URL, an explanation of how it was accessed.
   Value: text

H. Variable name: Source
   Description: Source of the information that we used to fill this table.
   Value: Contact / Intern.
   Contact: we identified a person from the university who could potentially be knowledgeable about data policies at the university, and their answer allowed us to fill this table. See the methods section above for more information.
   Intern: when contacts did not or could not respond with a definitive answer, a student worker did an online search to fill this table.

I. Variable name: Comments
   Description: Any other information about policies that may be useful.
   Value: text
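
Example: checking the file against this documentation

The rules stated in the missing data codes and the variable list can be checked with a short script. The following is a minimal sketch in Python, not part of the original workflow. It assumes that the column headers in the csv file match the variable names listed above (header case is ignored, because this document writes both "unitid" and "Unitid"). The function name check_policy_rows and the three sample rows are hypothetical and only illustrate the checks.

```python
import csv
import io

# Columns that, according to this README, should never be blank.
REQUIRED = ("unitid", "institution", "policyexists", "source")


def check_policy_rows(rows):
    """Return a list of (row_number, message) for rows that break the README rules."""
    problems = []
    for number, raw in enumerate(rows, start=1):
        # Lower-case the headers and strip whitespace; skip overflow fields (key None).
        row = {k.strip().lower(): (v or "").strip()
               for k, v in raw.items() if k is not None}

        for column in REQUIRED:
            if not row.get(column):
                problems.append((number, f"{column} is blank"))

        unitid = row.get("unitid", "")
        if unitid and not (unitid.isdigit() and len(unitid) == 6):
            problems.append((number, "unitid is not a six-digit integer"))

        exists = row.get("policyexists", "")
        if exists and exists not in ("Yes", "No"):
            problems.append((number, "policyexists is not Yes or No"))

        source = row.get("source", "")
        if source and source not in ("Contact", "Intern"):
            problems.append((number, "source is not Contact or Intern"))

        # If there is no policy, PolicyName and PolicyURL are expected to be blank.
        if exists == "No" and (row.get("policyname") or row.get("policyurl")):
            problems.append((number, "policy details given although PolicyExists is No"))

        url = row.get("policyurl", "")
        if url and not url.startswith("https://"):
            problems.append((number, "policyurl does not start with https://"))
    return problems


# Invented sample rows (not taken from the dataset); the third row is deliberately wrong.
SAMPLE = """Unitid,Institution,InternalPolicyID,PolicyExists,PolicyName,PolicyURL,AlternativeAccess,Source,Comments
100001,Example University A,1,Yes,Research Data Policy,https://example.edu/policy,,Contact,
100002,Example University B,,No,,,,Intern,
10003,Example University C,,Maybe,,,,,
"""

problems = check_policy_rows(csv.DictReader(io.StringIO(SAMPLE)))
for number, message in problems:
    print(f"row {number}: {message}")
```

To run the same checks on the real file, the io.StringIO(SAMPLE) argument can be replaced with a file object opened with newline="" as the csv module recommends. The exact file name and extension on disk are an assumption to verify against the downloaded file.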