Edits to ch_compute.xml

1. Removing the user_requirements file (unnecessary or duplicated
   content)
2. Edits to operational_considerations
3. Updating ch_compute_focus file

Change-Id: I32d10c507bd82008d4bda4c6f02b0fc3d02d31bf
Implements: blueprint arch-guide
This commit is contained in:
asettle 2015-08-13 15:09:51 +10:00
parent 4ea6f57651
commit 7bfa107ac1
3 changed files with 20 additions and 245 deletions

View File

@ -37,7 +37,6 @@
persistent block storage.</para>
</note>
<xi:include href="compute_focus/section_user_requirements_compute_focus.xml"/>
<xi:include href="compute_focus/section_tech_considerations_compute_focus.xml"/>
<xi:include href="compute_focus/section_operational_considerations_compute_focus.xml"/>
<xi:include href="compute_focus/section_architecture_compute_focus.xml"/>

View File

@ -6,8 +6,8 @@
xml:id="operational-considerations-compute-focus">
<?dbhtml stop-chunking?>
<title>Operational considerations</title>
<para>Operationally, there are a number of considerations that affect the
design of compute-focused OpenStack clouds. Some examples include:</para>
<para>There are a number of operational considerations that affect the
design of compute-focused OpenStack clouds, including:</para>
<itemizedlist>
<listitem>
<para>
@ -29,50 +29,18 @@
ensure the availability of a service. When designing an OpenStack cloud,
factoring in promises of availability implies a certain level of
redundancy and resiliency.</para>
<itemizedlist>
<listitem>
<para>Guarantees for API availability imply multiple infrastructure
services combined with appropriate, highly available load
balancers.</para>
</listitem>
<listitem>
<para>Network uptime guarantees affect the switch design and might
require redundant switching and power.</para>
</listitem>
<listitem>
<para>Factoring of network security policy requirements in to deployments.</para>
</listitem>
</itemizedlist>
<section xml:id="support-and-maintainability-compute-focus">
<title>Support and maintainability</title>
<para>OpenStack cloud management requires a certain level of
understanding and comprehension of design architecture. Specially trained,
dedicated operations organizations are more likely to manage larger
cloud service providers or telecom providers. Smaller implementations
are more inclined to rely on smaller support teams that need
to combine the engineering, design, and operation roles.</para>
<para>The maintenance of OpenStack installations requires a variety
of technical skills. To ease the operational burden, consider
incorporating features into the architecture and
design. Some examples include:</para>
<itemizedlist>
<listitem>
<para>Automating the operations functions</para>
</listitem>
<listitem>
<para>Utilizing a third party management company</para>
</listitem>
</itemizedlist>
</section>
<section xml:id="montioring-compute-focus">
<title>Monitoring</title>
<para>OpenStack clouds require appropriate monitoring platforms that
help to catch and manage errors adequately. Consider leveraging any
existing monitoring systems to see if they are able to
effectively monitor an OpenStack environment. Specific meters that
are critically important to capture include:</para>
<para>OpenStack clouds require appropriate monitoring platforms
to catch and manage errors.</para>
<note>
<para>We recommend leveraging existing monitoring systems
to see if they are able to effectively monitor an
OpenStack environment.</para>
</note>
<para>Specific meters that are critically important to capture
include:</para>
<itemizedlist>
<listitem>
<para>Image disk utilization</para>
@ -83,31 +51,12 @@
</itemizedlist>
</section>
<section xml:id="expected-unexpected-server-downtime">
<title>Expected and unexpected server downtime</title>
<para>Unexpected server downtime is inevitable, and SLAs can
address how long it takes to recover from failure.
Recovery of a failed host means restoring instances from a snapshot, or
respawning that instance on another available host.</para>
<para>It is acceptable to design a compute-focused cloud
without the ability to migrate instances from one host to
another. The expectation is that the application
developer must handle failure within the application itself.
However, provisioning a compute-focused cloud
provides extra resilience. In this scenario, the
developer deploys extra support services.</para>
</section>
<section xml:id="capacity-planning-operational">
<title>Capacity planning</title>
<para>Adding extra capacity to an OpenStack cloud is a
horizontally scaling process.</para>
<note>
<para>Be mindful, however, of additional work to place the nodes into
appropriate Availability Zones and Host Aggregates.</para>
</note>
<para>We recommend the same or very similar CPUs
when adding extra nodes to the environment because they reduce
<para>We recommend similar (or the same) CPUs
when adding extra nodes to the environment. This reduces
the chance of breaking live-migration features if they are
present. Scaling out hypervisor hosts also has a direct effect
on network and other data center resources. We recommend you
@ -120,11 +69,13 @@
capacity for running applications.</para>
<para>Another option is to assess the average workloads and
increase the number of instances that can run within the
compute environment by adjusting the overcommit ratio. While
only appropriate in some environments, it's important to
remember that changing the CPU overcommit ratio can have a
detrimental effect and cause a potential increase in a noisy
neighbor. The added risk of increasing the overcommit ratio is that
compute environment by adjusting the overcommit ratio.</para>
<note>
<para>It is important to remember that changing the CPU
overcommit ratio can have a detrimental effect and cause
a potential increase in a noisy neighbor.</para>
</note>
<para>The added risk of increasing the overcommit ratio is that
more instances fail when a compute host fails. We do not recommend
that you increase the CPU overcommit ratio in compute-focused
OpenStack design architecture, as it can increase the potential

View File

@ -1,175 +0,0 @@
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE section [
<!ENTITY % openstack SYSTEM "../../common/entities/openstack.ent">
%openstack;
]>
<section xmlns="http://docbook.org/ns/docbook"
xmlns:xi="http://www.w3.org/2001/XInclude"
xmlns:xlink="http://www.w3.org/1999/xlink"
version="5.0"
xml:id="user-requirements-compute-focus">
<?dbhtml stop-chunking?>
<title>User requirements</title>
<para>High utilization of CPU, RAM, or both defines compute
intensive workloads. User requirements determine the performance
demands for the cloud.
</para>
<variablelist>
<varlistentry>
<term>Cost</term>
<listitem>
<para>Cost is not generally a primary concern for a
compute-focused cloud, however some organizations
might be concerned with cost avoidance. Repurposing
existing resources to tackle compute-intensive tasks
instead of acquiring additional resources may
offer cost reduction opportunities.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>Time to market</term>
<listitem>
<para>Compute-focused clouds can deliver products more quickly,
for example by speeding up a company's software development
life cycle (SDLC) for building products and applications.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>Revenue opportunity</term>
<listitem>
<para>Companies that want to build services or products that
rely on the power of compute resources benefit from a
compute-focused cloud. Examples include the analysis
of large data sets (via Hadoop or Cassandra) or
completing computational intensive tasks such as
rendering, scientific computation, or
simulations.</para>
</listitem>
</varlistentry>
</variablelist>
<section xml:id="legal-requirements-compute-focus">
<title>Legal requirements</title>
<para>Many jurisdictions have legislative and regulatory
requirements governing the storage and management of data in
cloud environments. Common areas of regulation include:</para>
<itemizedlist>
<listitem>
<para>Data retention policies ensuring storage of
persistent data and records management to meet data
archival requirements.</para>
</listitem>
<listitem>
<para>Data ownership policies governing the possession and
responsibility for data.</para>
</listitem>
<listitem>
<para>Data sovereignty policies governing the storage of
data in foreign countries or otherwise separate
jurisdictions.</para>
</listitem>
<listitem>
<para>Data compliance: certain types of information need
to reside in certain locations due to regular issues and,
more importantly, cannot reside in other locations
for the same reason.</para>
</listitem>
</itemizedlist>
<para>
Examples of such legal frameworks include the <link
xlink:href="http://ec.europa.eu/justice/data-protection/">data
protection framework</link> of the European Union and the
requirements of the <link
xlink:href="http://www.finra.org/Industry/Regulation/FINRARules/">Financial
Industry Regulatory Authority</link> in the United
States. Consult a local regulatory body for more
information.</para></section>
<section xml:id="technical-considerations-compute-focus-user">
<title>Technical considerations</title>
<para>The following are some technical requirements you must consider
in the architecture design:
</para>
<variablelist>
<varlistentry>
<term>Performance</term>
<listitem>
<para>If a primary technical concern is to deliver high performance
capability, then a compute-focused design is an
obvious choice because it is specifically designed to
host compute-intensive workloads.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>Workload persistence</term>
<listitem>
<para>Workloads can be either
short-lived or long-running. Short-lived workloads
can include continuous integration and continuous
deployment (CI-CD) jobs, which create large numbers of
compute instances simultaneously to
perform a set of compute-intensive tasks. The environment then
copies the results or artifacts from each instance into
long-term storage before destroying the instance.
Long-running workloads, like a Hadoop or
high-performance computing (HPC) cluster, typically
ingest large data sets, perform the computational work
on those data sets, then push the results into long-term
storage. When the computational work finishes, the instances
remain idle until they receive another job. Environments
for long-running workloads are often larger and more complex,
but you can offset the cost of building them by keeping them
active between jobs. Another example of long-running
workloads is legacy applications that are
persistent over time.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>Storage</term>
<listitem>
<para>Workloads targeted for a compute-focused
OpenStack cloud generally do not require any
persistent block storage, although some uses of
Hadoop with HDFS may require persistent
block storage. A shared filesystem or object store
maintains the initial data sets and serves as the
destination for saving the computational results. By
avoiding the input-output (IO) overhead, you can significantly
enhance workload performance. Depending on
the size of the data sets, it may be necessary to
scale the object store or shared file system to match
the storage demand.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>User interface</term>
<listitem>
<para>Like any other cloud architecture, a
compute-focused OpenStack cloud requires an on-demand
and self-service user interface. End users must be
able to provision computing power, storage, networks,
and software simply and flexibly. This includes
scaling the infrastructure up to a substantial level
without disrupting host operations.</para>
</listitem>
</varlistentry>
<varlistentry>
<term>Security</term>
<listitem>
<para>Security is highly dependent
on business requirements. For example, a
computationally intense drug discovery application
has much higher security requirements
than a cloud for processing market
data for a retailer. As a general rule, the security
recommendations and guidelines provided in the
OpenStack Security Guide are applicable.</para>
</listitem>
</varlistentry>
</variablelist>
</section>
<section xml:id="operational-considerations-compute-focus-user">
<title>Operational considerations</title>
<para>From an operational perspective, a compute intensive cloud
is similar to a general-purpose cloud. See the general-purpose
design section for more details on operational requirements.</para>
</section>
</section>