Skip to Main Content

Storing and working with data in the cloud with Redivis: Start

This guide describes the data hosting and sharing platform, Redivis, and CMU Libraries' Redivis trial.

About CMU's Redivis Trial

CMU Libraries is committed to providing researchers with access to Redivis from January 2023 to August 2025. To gain access to the platform, please contact Kristen Scotti (kscotti@andrew.cmu.edu) or Melanie Gainey (mgainey@andrew.cmu.edu).

Until August 2025, research groups can put up to 1TB/dataset on the platform for free. Datasets exceeding 1TB will be allowed on a case-by-case basis. Early adopters can take advantage of a high level of customized one-on-one support from Redivis. 

We will gauge campus interest in the platform and further commitment to it over the next couple of years but your data will not vanish in August 2025. You will have options for migrating data or purchasing an individual subscription at that time if we can no longer support that platform.

We welcome and appreciate any feedback you might have, as it helps us make informed decisions about allocating Libraries' budgets to significant resources like this one. This is a platform we're excited about and we encourage you to use and share it with your colleagues.

What is Redivis?

Redivis is a data hosting and sharing platform. It can be used to host datasets for a variety of scenarios including:

 

  • restricted data sets (i.e. HIPAA and GDPR compliant)
  • data sets that you want to share with collaborators before publication
  • large data sets (greater than 1TB)
  • manipulating data and querying billions of records in seconds with both a GUI and programmatic interface
  • Python, R, or Stata analysis in the cloud with a Jupyter Notebook integration 
  • storing all of a research group's datasets in one place for easy access

 

3 min Demo of Redivis

 

Redivis Town Hall Demo

 

1 hr Demo of Redivis

 

Upcoming Workshop

Data Exploration with Python and SQL in Redivis

Part 1: Wednesday, March 13, 2024, 11:00AM - 12:30PM, virtual, Register here.

Part 2: Wednesday, March 20, 2024, 11:00AM - 12:30PM, virtual, Register here.

Redivis, a powerful platform for collaborative data science, facilitates seamless data sharing and collaboration. In this two-part, hands-on session, you'll learn how to leverage Redivis to access, manipulate, and analyze datasets with the flexibility of Python and the power of SQL. 

This is a two-part workshop; please try to attend both sessions. 

Part 1

  • Explore the unique features of Redivis and understand how it streamlines the data collaboration process. Learn how to create and navigate datasets and projects.
  • Learn the fundamentals of SQL, including how to craft SQL queries to filter, aggregate, and analyze data efficiently.

Part 2

  • Learn how to utilize Python within Redivis to visualize data and streamline your workflow. 
  • Learn how to collaborate within Redivis, sharing your analyses and insights with team members. Explore the platform's collaborative features that enhance teamwork in data-driven projects.

How is this different from KiltHub?

KiltHub is our institutional repository, built on the Figshare platform. KiltHub is used to archive finished research products for public access, including datasets up to 1TB. 

Unlike KiltHub, Redivis works particularly well for datasets that need to be queried or are still in the collection and analysis phases of research, have specific access permissions, or are larger than 1TB. 

Open Science Postdoctoral Associate

Profile Photo
Kristen Scotti

Librarian for Biological Sciences, Computational Biology, Biomedical Engineering, and Neuroscience

Profile Photo
Melanie Gainey
Contact:
Mellon Library
4th Floor Mellon Institute
412-268-3171