As use cases for big data continue to grow, enterprise customers increasingly rely on big data driven applications and analytics to power their business and are rapidly embracing a modern data architecture, that augments their existing data stores with Hadoop, in order to meet their data storage and processing challenges. They’re also rapidly integrating their data stores with Hadoop, to create new applications. These same organizations are also seeking to accelerate web and mobile application delivery by leveraging Platform as a Service (PaaS) solutions and embracing DevOps and Continuous Delivery models.
In February, Red Hat and Hortonworks announced a comprehensive open source solution to deliver infrastructure solutions that enable the next generation of data-driven applications. We are now excited to add OpenShift to this product offering by integrating OpenShift PaaS with Hortonworks Data Platform. Hortonworks and Red Hat, are combining 100-percent open source Apache Hadoop with OpenShift PaaS to speed application development
The integration of Hadoop with OpenShift PaaS will enable customers to create analytic applications that run where the big data resides, in Hadoop, via the Hortonworks Data Platform. Customers can start to expand development to the Internet of Things, where applications can tap into and make sense of sensor and machine-generated data, server logs, social, clickstream and geo-location data, and much more.
Since Hadoop clusters can be very large, with significant data gravity, applications need to be able to be deployed into a PaaS in a way that runs “In-Hadoop” (i.e. in a YARN-enabled manner). This enables those applications to leverage data access for SQL, NoSQL, Stream and other data sources. The benefits of this approach include data locality, efficient data access, and the ability to manage resource requirements along with other workloads in the cluster so no single application dominates the cluster.
For more details on how OpenShift and Hortonworks Data Platform work together to deliver on this vision, see the following example. This shows how to deploy an application in OpenShift which leverages data stored in Apache HBase on the Hortonworks Data Platform. HBase is a non-relational (NoSQL) database that runs on top of the Hadoop Distributed File System (HDFS). The OpenShift application used in this demo was created in Python using Flask and Bootstrap frameworks. This is a simple but powerful example that shows how you can combine OpenShift and Hortonworks Data Platform to build a data-driven application.
If you would like to learn more about this announcement and to see progress on this integration join us at the Red Hat Summit on April 14-17 in San Francisco.