Contribute Data

GHub can be used as a repository for data products generated during your research. To host your data products on GHub as an open repository for the scientific and education communities, read on.

Contents

  1. Small Datasets
  2. Large Datasets
  3. Contributing Data
    1. Projects
    2. Resources
    3. Publications

Small datasets

Datasets made up of files of modest size (under 1 GB) can be stored with GHub in two ways: as a GHub Project or as a GHub Resource of type Data sets and collections.

How can I choose between Projects and Resources for my small dataset?

  • Choose a Project for an evolving dataset or collaborative effort that can be associated with to-do lists and other dynamic features. (Note: max individual file size 100 MB)
  • Choose a Resource for a dataset that is firmly established, yet can still feature new releases.
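As a rough illustration of the size limits mentioned above, the following Python sketch suggests a hosting option from a list of file sizes. It is not a GHub tool: the function names are hypothetical, the limits are taken from this page, and two assumptions are made that the page does not confirm, namely that the 1 GB threshold applies to the dataset total and that sizes use decimal units.

```python
from pathlib import Path

# Assumption: decimal units (1 MB = 10**6 bytes). The page does not say
# whether its limits are decimal or binary.
MB = 10**6
GB = 10**9
PROJECT_MAX_FILE = 100 * MB  # per-file limit quoted for Projects
SMALL_DATASET_MAX = 1 * GB   # "modest size" threshold, applied here to the total


def file_sizes(root):
    """Return the size in bytes of every regular file under root."""
    return [p.stat().st_size for p in Path(root).rglob("*") if p.is_file()]


def suggest_hosting(sizes):
    """Map a list of file sizes (in bytes) to a hosting suggestion."""
    total = sum(sizes)
    largest = max(sizes, default=0)
    if total >= SMALL_DATASET_MAX:
        return "large dataset: create a ticket and start a Resource"
    if largest > PROJECT_MAX_FILE:
        return "Resource (a file exceeds the Project per-file limit)"
    return "Project or Resource"
```

Calling `suggest_hosting(file_sizes("my_dataset"))` on a local folder (the folder name is a placeholder) gives a first indication only. Whether the dataset is still evolving or firmly established remains the deciding factor between a Project and a Resource.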

 

Large datasets

Much larger files and datasets can also be stored with GHub. We will use the Resources method outlined below for collecting and documenting the dataset's metadata (abstract, credits, citations, etc.). Please create a ticket to let us know about your dataset's space needs and background information, and begin the Resource metadata creation as described.

Where are my data stored?

For large datasets, the data itself will be stored at UB CCR's data center. We will create a Globus endpoint for your data and provide read and write access to you.


Contributing Data

The GHub platform offers several different ways to publish, document, cite, and upload your data to make it available to others. These options are summarized for you here, along with links to complete documentation.

About Hub Projects

[Screenshot: GHub Projects home page]

Projects provide a way to associate data files with to-do lists, notes, citations, and documents, and allow other members of the group to contribute to the collaboration without creating a new release of the dataset.

To explore and get started with Projects, select Projects from the GHub Collaborate menu. The system will display the GHub Projects home page, as pictured in the screenshot at left. From there, you can explore existing GHub projects and begin one of your own by clicking Add Project.

Any registered GHub user can create, view, and contribute to GHub Projects.

Full documentation, helpful how-to videos, and further information on using Projects are available at the HUBzero platform's Project help pages.

If you create a GHub Project, please contact us if you want to add it to GHub's dataset listing. This process does not occur automatically.

Where are my data stored?

Files you upload for your Project are stored on and accessible from GHub's dedicated Google Drive space.


About Hub Resources

[Screenshot: GHub Resources home page]

Resources enable you to associate background information, such as citations and documentation, with your dataset and release the whole package in a citable way. The Dataset resource is suitable for both small, self-contained datasets and large ones.

To explore and get started with Dataset Resources, navigate to: https://vhub.org/groups/ghub/resources. The system will display the GHub Resources home page as pictured in the screen shot at left. From there, you can explore existing GHub resources and begin one of your own by clicking Start a Contribution.

What features are included for Resources?

The Hub supports several different types of resources; Datasets are only one. All have elements in common.

Resources on the Hub can include an Abstract, citations, supporting documentation, and topic tagging. As the creator of a Resource, you can assign a development team of other GHub users, who will be able to work on the supporting material and descriptive metadata for the resource.

New versions of the resource can be released as needed. The Hub provides a fully guided release flow that enables you to collect the needed information to document your dataset. Once released, the Resource supports an area for user questions and answers, citations, a usage report, and user-contributed wishlists.

Full documentation and further information on Resources are available at the HUBzero platform's Resource help pages.

Where are my data stored?

For smaller datasets, Resources are stored directly on the GHub webserver. Large datasets are stored at UB CCR's data center, where they are accessible to users via the Globus app and command line interface.

Publications

Publications provide another way to publish your own datasets on GHub. In addition to the features described above for Resources, they offer the ability to assign curators who review your contribution prior to publication.

For further information about Publications, please refer to HUBzero user documentation: https://help.hubzero.org/documentation/current/users/publications

More information

Please contact us, or submit a ticket via UB CCR or vhub.org, for further information or to start a dataset contribution on GHub.