Getting Started with Datasets on Red Hat Marketplace
By Sara Hagen, Partner Business Development Leader, Red Hat Marketplace
February 1, 2021
In September 2020, Red Hat and IBM announced the launch of
Red Hat Marketplace,
a one-stop-shop to find, try, buy, deploy, and manage enterprise tools,
applications and services across an organization’s hybrid IT infrastructure,
including on-premises and multicloud environments. Our aim is to provide
resources to help enterprise developers using Red Hat OpenShift who are being
asked to rapidly develop strong, more secure and compliant apps in hybrid cloud
environments. Today, I’m excited to share the latest addition to Red Hat
Marketplace that will help developers use data to solve problems, innovate, and
drive new application development: public datasets.
Take advantage of more than 100 free, downloadable datasets on Red Hat
Marketplace
Together, we’ve curated an initial selection of datasets with input from our
enterprise clients and with a focus on topics at the heart of our end customers’
needs or current situations. These include high-demand datasets related to
COVID-19, the U.S. Census, employment figures, demographics, weather, city data,
and more. Our goal is to enable you to find and use raw data, as well as
Kubeflow Pipelines, and most importantly, to send your chosen dataset to the
application of your choice running on the Red Hat OpenShift cluster of your
choice. Let’s dive into some of the key questions about the datasets available
on Red Hat Marketplace.
What can developers do with the datasets and why is this important?
Developers now have a single location where they can access enterprise
applications and public datasets to build their applications. This provides a
more streamlined experience, resulting in less searching and faster development
cycles.
Datasets available via Red Hat Marketplace are ideal for developers working with
data in an open hybrid cloud environment because developers are free to use any
tool or environment when downloading the data.
An example of data you will find on Red Hat Marketplace today is the COVID-19
Vaccine Distribution Allocations by Jurisdiction for both Pfizer and Moderna
from the CDC. This dataset lists the allocations of doses that will be made
available for states and jurisdictions to order against and is updated daily.
This data can be used by developers who are, for example, building applications
for COVID-19 recovery plans or integrations into existing applications that
include specific plans for their organizations.
Will there be more datasets coming in the months ahead? Will there be any new
categories added?
Our plan is to add more datasets throughout 2021 from existing and new sources
to ensure we have a robust dataset category in Red Hat Marketplace that stays
relevant to the types of apps developers are creating. We are also planning to
offer access to models and application programming interfaces (APIs).
Are there any limitations for how I can use data from the datasets available on
Red Hat Marketplace?
All of the datasets we’ve made available on Red Hat Marketplace have licenses
provided from the source that outline how the data can be used and any relevant
copyright or distribution information. We’ve made all licensing information
available for each dataset, and we’ve provided links to the license information
and additional resources.
How can I request more information about datasets or request additional
datasets be added to Red Hat Marketplace?
We recommend reviewing the documentation on Red Hat Marketplace specifically
created for datasets. While exploring the dataset category you can find a chat
icon at the bottom right to request additional information. There is also a
“Feedback” tab on the right side of the screen which you can use to suggest
additional datasets for Red Hat Marketplace.