openstack-manuals/doc/common/section_getstart_sahara.xml
Shilla Saebi aa6f0d0694 changes to section_getstart_sahara.xml
added “a” simple means
removed nodes plural
added “a” before user
changed to “provides a means to scale”
added the before ability

Change-Id: I1420db2a875b4f95bd88c5d5f62d45a545f003a4
2015-03-02 23:55:30 -05:00

<?xml version="1.0" encoding="UTF-8"?>
<section xmlns="http://docbook.org/ns/docbook"
xmlns:xi="http://www.w3.org/2001/XInclude"
xmlns:xlink="http://www.w3.org/1999/xlink"
version="5.0"
xml:id="sahara-service">
<title>Data processing service</title>
<para>The Data processing service for OpenStack (sahara) aims to provide
users with a simple means to provision data processing (Hadoop, Spark)
clusters by specifying several parameters, such as the Hadoop version,
cluster topology, and node hardware details. After a user fills in
all the parameters, the Data processing service deploys the cluster in a
few minutes. Sahara also provides a means to scale already provisioned
clusters by adding or removing worker nodes on demand.
</para>
<para>The solution addresses the following use cases:
<itemizedlist>
<listitem><para>Fast provisioning of Hadoop clusters on OpenStack for
development and QA.</para></listitem>
<listitem><para>Utilization of unused compute power from a
general-purpose OpenStack IaaS cloud.</para></listitem>
<listitem><para>Analytics-as-a-Service for ad-hoc or bursty analytic
workloads.</para></listitem>
</itemizedlist>
</para>
<para>Key features are:
<itemizedlist>
<listitem><para>Designed as an OpenStack component.</para></listitem>
<listitem><para>Managed through a REST API, with a UI available as part
of the OpenStack dashboard.</para></listitem>
<listitem><para>Support for different Hadoop distributions:
<itemizedlist>
<listitem><para>Pluggable system of Hadoop installation
engines.</para></listitem>
<listitem><para>Integration with vendor-specific management tools,
such as Apache Ambari or Cloudera Management Console.</para></listitem>
</itemizedlist>
</para></listitem>
<listitem><para>Predefined templates of Hadoop configurations with
the ability to modify parameters.</para></listitem>
<listitem><para>User-friendly UI for ad-hoc analytics queries based on
Hive or Pig.</para></listitem>
</itemizedlist>
</para>
</section>