Did you know that CMU has its own institutional repository for scholarly works? KiltHub is our comprehensive institutional repository and research collaboration platform for research data and scholarly outputs produced by members of Carnegie Mellon University and their collaborators. KiltHub collects, preserves, and provides stable, long-term global open access to a wide range of research data and scholarly outputs created by faculty, staff, and student members of Carnegie Mellon University in the course of their research and teaching. Interested in depositing a dataset? Find out more information on our KiltHub Libguide!
Sample Open Datasets in KiltHub:
The CMU Libraries Data Collaboration Lab (DataCoLAB) connects the research community across disciplinary borders, and facilitates collaborations between data producers and data scientists. The program connects researchers who want more from their datasets with individuals who have data and computer science skills, creating opportunities for people with different technical and disciplinary backgrounds to work together.
Want to learn more or ask questions? Email dataCoLAB@andrew.cmu.edu.
Before sharing your data, it is important to ask yourself several questions to ensure you are ethically putting your data out there!
There are dozens of questions to consider when ensuring you are ethically sharing data. If you want help navigating these questions, please feel free to contact us at email@example.com!
Many of the places where you find data can also serve as data sharing platforms! Check with your liaison librarian (https://www.library.cmu.edu/about/people) for repositories specific to your discipline, or check out the resources below for general data repositories:
There is a wide range of licenses that can be used for data. This includes the many Creative Commons Licenses, which range from very open (CCO) to very restrictive (CCBY-NC-ND). Additionally, there are several licenses focused towards software, programs, scripts, and code. These include the MIT License, GNU General Public License GPLv3, and the Apache License 2.0.
When choosing a license, you should consider the type of data that will be licensed, how you want the data to be re-used, what (if any) commercial use you wish to grant, and if you want to receive attribution for your data. Another consideration is whether you want to place your data in the public domain (CCO), which means no restrictions to the user or requirement for attribution. If you'd like more help in navigating which license to use for your data, please reach out to our Scholarly Communication and Research Curation Consultant David Scherer at firstname.lastname@example.org.