Merge "[arch-guide-archive] Removing old arch guide from master"

@@ -33,6 +33,4 @@ declare -A SPECIAL_BOOKS=(
     ["contributor-guide"]="skip"
     ["releasenotes"]="skip"
     ["ha-guide-draft"]="skip"
-    # Skip old arch design, will be archived
-    ["arch-design-to-archive"]="skip"
 )

@@ -1,27 +0,0 @@
[metadata]
name = architecturedesignguide
summary = OpenStack Architecture Design Guide
author = OpenStack
author-email = openstack-docs@lists.openstack.org
home-page = https://docs.openstack.org/
classifier =
    Environment :: OpenStack
    Intended Audience :: Information Technology
    Intended Audience :: Cloud Architects
    License :: OSI Approved :: Apache Software License
    Operating System :: POSIX :: Linux
    Topic :: Documentation

[global]
setup-hooks =
    pbr.hooks.setup_hook

[files]

[build_sphinx]
warning-is-error = 1
build-dir = build
source-dir = source

[wheel]
universal = 1

@@ -1,30 +0,0 @@
#!/usr/bin/env python
# Copyright (c) 2013 Hewlett-Packard Development Company, L.P.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#    http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
# implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# THIS FILE IS MANAGED BY THE GLOBAL REQUIREMENTS REPO - DO NOT EDIT
import setuptools

# In python < 2.7.4, a lazy loading of package `pbr` will break
# setuptools if some other modules registered functions in `atexit`.
# solution from: http://bugs.python.org/issue15881#msg170215
try:
    import multiprocessing  # noqa
except ImportError:
    pass

setuptools.setup(
    setup_requires=['pbr'],
    pbr=True)

@@ -1 +0,0 @@
../../common

@@ -1,212 +0,0 @@
============
Architecture
============

The hardware selection covers three areas:

* Compute

* Network

* Storage

Compute-focused OpenStack clouds have high demands on processor and
memory resources, and require hardware that can handle these demands.
Consider the following factors when selecting compute (server) hardware:

* Server density

* Resource capacity

* Expandability

* Cost

Weigh these considerations against each other to determine the best
design for the desired purpose. For example, increasing server density
means sacrificing resource capacity or expandability.

A compute-focused cloud should have an emphasis on server hardware that
can offer more CPU sockets, more CPU cores, and more RAM. Network
connectivity and storage capacity are less critical.

When designing a compute-focused OpenStack architecture, you must
consider whether you intend to scale up or scale out. Selecting a
smaller number of larger hosts, or a larger number of smaller hosts,
depends on a combination of factors: cost, power, cooling, physical rack
and floor space, support warranty, and manageability.

Considerations for selecting hardware:

* Most blade servers can support dual-socket multi-core CPUs. To avoid
  this CPU limit, select ``full width`` or ``full height`` blades. Be
  aware, however, that this also decreases server density. For example,
  high density blade servers such as HP BladeSystem or Dell PowerEdge
  M1000e support up to 16 servers in only ten rack units. Half-height
  blades are twice as dense as full-height blades, which yield only
  eight servers per ten rack units.

* 1U rack-mounted servers that occupy only a single rack unit may offer
  greater server density than a blade server solution. It is possible
  to place forty 1U servers in a rack, providing space for the top of
  rack (ToR) switches, compared to 32 full-width blade servers.

* 2U rack-mounted servers provide quad-socket, multi-core CPU support,
  but with a corresponding decrease in server density (half the density
  that 1U rack-mounted servers offer).

* Larger rack-mounted servers, such as 4U servers, often provide even
  greater CPU capacity, commonly supporting four or even eight CPU
  sockets. These servers have greater expandability, but much lower
  server density, and are often more expensive.

* ``Sled servers`` are rack-mounted servers that support multiple
  independent servers in a single 2U or 3U enclosure. These deliver
  higher density than typical 1U or 2U rack-mounted servers. For
  example, many sled servers offer four independent dual-socket
  nodes in 2U for a total of eight CPU sockets in 2U.

Consider these when choosing server hardware for a compute-focused
OpenStack design architecture:

* Instance density

* Host density

* Power and cooling density

Selecting networking hardware
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Some of the key considerations for networking hardware selection
include:

* Port count

* Port density

* Port speed

* Redundancy

* Power requirements

We recommend designing the network architecture using a scalable network
model that makes it easy to add capacity and bandwidth. A good example
of such a model is the leaf-spine model. In this type of network
design, it is possible to easily add additional bandwidth as well as
scale out to additional racks of gear. It is important to select network
hardware that supports the required port count, port speed, and port
density while also allowing for future growth as workload demands
increase. It is also important to evaluate where in the network
architecture it is valuable to provide redundancy.

Operating system and hypervisor
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The selection of operating system (OS) and hypervisor has a significant
impact on the endpoint design.

OS and hypervisor selection impacts the following areas:

* Cost

* Supportability

* Management tools

* Scale and performance

* Security

* Supported features

* Interoperability

OpenStack components
~~~~~~~~~~~~~~~~~~~~

The selection of OpenStack components is important. There are certain
components that are required, for example the compute and image
services, but others, such as the Orchestration service, may not be
present.

For a compute-focused OpenStack design architecture, the following
components may be present:

* Identity (keystone)

* Dashboard (horizon)

* Compute (nova)

* Object Storage (swift)

* Image (glance)

* Networking (neutron)

* Orchestration (heat)

.. note::

   A compute-focused design is less likely to include OpenStack Block
   Storage. However, there may be some situations where the need for
   performance requires a block storage component to improve data I/O.

The exclusion of certain OpenStack components might also limit the
functionality of other components. If a design includes the
Orchestration service but excludes the Telemetry service, then the
design cannot take advantage of Orchestration's auto-scaling
functionality, as this relies on information from Telemetry.

Networking software
~~~~~~~~~~~~~~~~~~~

OpenStack Networking provides a wide variety of networking services for
instances. There are many additional networking software packages that
might be useful to manage the OpenStack components themselves. The
`OpenStack High Availability Guide <https://docs.openstack.org/ha-guide/>`_
describes some of these software packages in more detail.

For a compute-focused OpenStack cloud, the OpenStack infrastructure
components must be highly available. If the design does not include
hardware load balancing, you must add networking software packages, for
example, HAProxy.
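
For deployments without hardware load balancers, a minimal HAProxy
sketch fronting one API service on two controllers might look like the
following; the host names and addresses are placeholders, not a
recommended production configuration:

.. code-block:: none

   # Minimal illustrative fragment, not a hardened configuration.
   frontend keystone_public
       bind 192.0.2.10:5000
       default_backend keystone_api

   backend keystone_api
       balance roundrobin
       server controller1 192.0.2.11:5000 check
       server controller2 192.0.2.12:5000 check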

Management software
~~~~~~~~~~~~~~~~~~~

The selected supplemental software solution impacts the overall
OpenStack cloud design. This includes software for providing
clustering, logging, monitoring, and alerting.

Availability design requirements are the main determiner for the
inclusion of clustering software, such as Corosync or Pacemaker.

Operational considerations determine the requirements for logging,
monitoring, and alerting. Each of these sub-categories includes various
options.

Some other potential design impacts include:

OS-hypervisor combination
    Ensure that the selected logging, monitoring, or alerting tools
    support the proposed OS-hypervisor combination.

Network hardware
    The logging, monitoring, and alerting software must support the
    network hardware selection.

Database software
~~~~~~~~~~~~~~~~~

A large majority of OpenStack components require access to back-end
database services to store state and configuration information. Select
an appropriate back-end database that satisfies the availability and
fault-tolerance requirements of the OpenStack services. OpenStack
services support connecting to any database that the SQLAlchemy Python
drivers support; however, most common database deployments make use of
MySQL or some variation of it. We recommend that you make the database
that provides back-end services within a general-purpose cloud highly
available. Some of the more common software solutions include Galera,
MariaDB, and MySQL with multi-master replication.
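
As an illustration, an OpenStack service points at such a back end
through a SQLAlchemy connection URL in its configuration file; the host
name and credentials below are placeholders:

.. code-block:: ini

   [database]
   # SQLAlchemy URL against a MySQL-compatible, highly available back
   # end; host name and credentials are placeholders.
   connection = mysql+pymysql://nova:SECRET_PASSWORD@db-cluster/nova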

@@ -1,68 +0,0 @@
==========================
Operational considerations
==========================

There are a number of operational considerations that affect the design
of compute-focused OpenStack clouds, including:

* Enforcing strict API availability requirements

* Understanding and dealing with failure scenarios

* Managing host maintenance schedules

Service-level agreements (SLAs) are contractual obligations that ensure
the availability of a service. When designing an OpenStack cloud,
factoring in promises of availability implies a certain level of
redundancy and resiliency.

Monitoring
~~~~~~~~~~

OpenStack clouds require appropriate monitoring platforms to catch and
manage errors.

.. note::

   We recommend leveraging existing monitoring systems to see if they
   are able to effectively monitor an OpenStack environment.

Specific meters that are critically important to capture include:

* Image disk utilization

* Response time to the Compute API (see the sketch below)
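
One lightweight way to sample Compute API response time is to time an
authenticated request against the ``nova-api`` endpoint, as in this
minimal sketch; the endpoint URL and token are placeholders:

.. code-block:: python

   # Minimal sketch: time a single Compute API call. Not a production
   # monitoring probe; URL and token are placeholders.
   import time

   import requests

   COMPUTE_URL = 'http://controller:8774/v2.1/servers'  # hypothetical
   TOKEN = 'gAAAA...'  # obtain from the Identity service

   start = time.monotonic()
   resp = requests.get(COMPUTE_URL,
                       headers={'X-Auth-Token': TOKEN}, timeout=10)
   print('GET /servers -> %s in %.3fs'
         % (resp.status_code, time.monotonic() - start))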

Capacity planning
~~~~~~~~~~~~~~~~~

Adding extra capacity to an OpenStack cloud is a horizontal scaling
process.

We recommend similar (or the same) CPUs when adding extra nodes to the
environment. This reduces the chance of breaking live-migration features
if they are present. Scaling out hypervisor hosts also has a direct
effect on network and other data center resources. We recommend you
factor in this increase when reaching rack capacity or when requiring
extra network switches.

Changing the internal components of a Compute host to account for
increases in demand is a process known as vertical scaling. Swapping a
CPU for one with more cores, or increasing the memory in a server, can
help add extra capacity for running applications.

Another option is to assess the average workloads and increase the
number of instances that can run within the compute environment by
adjusting the overcommit ratio.

.. note::

   It is important to remember that changing the CPU overcommit ratio
   can have a detrimental effect, potentially increasing noisy neighbor
   issues.

The added risk of increasing the overcommit ratio is that more instances
fail when a compute host fails. We do not recommend that you increase
the CPU overcommit ratio in a compute-focused OpenStack design
architecture, as it can increase the potential for noisy neighbor
issues.
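
If you do adjust the ratios, they are set per compute node in
``nova.conf``. A brief sketch, using the option names current when this
guide was written; the values shown are the defaults:

.. code-block:: ini

   # nova.conf on a compute node; 16.0 and 1.5 are the default ratios
   # discussed in this guide.
   [DEFAULT]
   cpu_allocation_ratio = 16.0
   ram_allocation_ratio = 1.5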

@@ -1,126 +0,0 @@
=====================
Prescriptive examples
=====================

The Conseil Européen pour la Recherche Nucléaire (CERN), also known as
the European Organization for Nuclear Research, provides particle
accelerators and other infrastructure for high-energy physics research.

As of 2011, CERN operated the two compute centers in Europe listed
below, with plans to add a third.

+---------------------+----------------------+
| Data center         | Approximate capacity |
+=====================+======================+
| Geneva, Switzerland | - 3.5 megawatts      |
|                     |                      |
|                     | - 91000 cores        |
|                     |                      |
|                     | - 120 PB HDD         |
|                     |                      |
|                     | - 100 PB tape        |
|                     |                      |
|                     | - 310 TB memory      |
+---------------------+----------------------+
| Budapest, Hungary   | - 2.5 megawatts      |
|                     |                      |
|                     | - 20000 cores        |
|                     |                      |
|                     | - 6 PB HDD           |
+---------------------+----------------------+

To support a growing number of compute-heavy users of experiments
related to the Large Hadron Collider (LHC), CERN ultimately elected to
deploy an OpenStack cloud using Scientific Linux and RDO. This effort
aimed to simplify the management of the center's compute resources with
a view to doubling compute capacity through the addition of a data
center in 2013 while maintaining the same levels of compute staff.

The CERN solution uses :term:`cells <cell>` for segregation of compute
resources and for transparently scaling between different data centers.
This decision meant trading off support for security groups and live
migration. In addition, they must manually replicate some details, like
flavors, across cells. In spite of these drawbacks, cells provide the
required scale while exposing a single public API endpoint to users.

CERN created a compute cell for each of the two original data centers
and created a third when it added a new data center in 2013. Each cell
contains three availability zones to further segregate compute resources
and at least three RabbitMQ message brokers configured for clustering
with mirrored queues for high availability.

The API cell, which resides behind an HAProxy load balancer, is in the
data center in Switzerland and directs API calls to compute cells using
a customized variation of the cell scheduler. The customizations allow
certain workloads to route to a specific data center or all data
centers, with cell RAM availability determining cell selection in the
latter case.

.. figure:: figures/Generic_CERN_Example.png

There is also some customization of the filter scheduler that handles
placement within the cells, as sketched in the configuration example
after this list:

ImagePropertiesFilter
    Provides special handling depending on the guest operating system in
    use (Linux-based or Windows-based).

ProjectsToAggregateFilter
    Provides special handling depending on which project the instance is
    associated with.

default_schedule_zones
    Allows the selection of multiple default availability zones, rather
    than a single default.
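
Expressed as configuration, these customizations might look like the
following ``nova.conf`` sketch. This is illustrative only:
``ProjectsToAggregateFilter`` and the plural ``default_schedule_zones``
option are CERN-specific modifications, not stock nova options.

.. code-block:: ini

   # Illustrative only; the second filter and the plural
   # default_schedule_zones option are CERN customizations.
   [DEFAULT]
   scheduler_default_filters = ImagePropertiesFilter,ProjectsToAggregateFilter
   default_schedule_zones = zone-a,zone-b,zone-c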

A central database team manages the MySQL database server in each cell
in an active/passive configuration with a NetApp storage back end.
Backups run every six hours.

Network architecture
~~~~~~~~~~~~~~~~~~~~

To integrate with existing networking infrastructure, CERN made
customizations to legacy networking (nova-network). This was in the form
of a driver to integrate with CERN's existing database for tracking MAC
and IP address assignments.

The driver facilitates selection of a MAC address and IP for new
instances based on the compute node where the scheduler places the
instance.

The driver considers the compute node where the scheduler placed an
instance and selects a MAC address and IP from the pre-registered list
associated with that node in the database. The database updates to
reflect the address assignment to that instance.

Storage architecture
~~~~~~~~~~~~~~~~~~~~

CERN deploys the OpenStack Image service in the API cell and configures
it to expose version 1 (V1) of the API. This also requires the image
registry. The storage back end in use is a 3 PB Ceph cluster.

CERN maintains a small set of Scientific Linux 5 and 6 images onto which
orchestration tools can place applications. Puppet manages instance
configuration and customization.

Monitoring
~~~~~~~~~~

CERN does not require direct billing, but uses the Telemetry service to
perform metering for the purposes of adjusting project quotas. CERN uses
a sharded, replicated MongoDB back end. To spread API load, CERN
deploys instances of the nova-api service within the child cells for
Telemetry to query against. This also requires the configuration of
supporting services such as keystone, glance-api, and glance-registry in
the child cells.

.. figure:: figures/Generic_CERN_Architecture.png

Additional monitoring tools in use include
`Flume <https://flume.apache.org/>`__,
`Elasticsearch <https://www.elastic.co/>`__,
`Kibana <https://www.elastic.co/products/kibana>`__, and the
CERN-developed `Lemon <http://lemon.web.cern.ch/lemon/index.shtml>`__
project.

@@ -1,214 +0,0 @@
========================
Technical considerations
========================

In a compute-focused OpenStack cloud, the type of instance workloads you
provision heavily influences technical decision making.

Public and private clouds require deterministic capacity planning to
support elastic growth in order to meet user SLA expectations.
Deterministic capacity planning is the path to predicting the effort and
expense of making a given process perform consistently. This process is
important because, when a service becomes a critical part of a user's
infrastructure, the user's experience links directly to the SLAs of the
cloud itself.

There are two aspects of capacity planning to consider:

* Planning the initial deployment footprint

* Planning expansion of the environment to stay ahead of cloud user demands

Begin planning an initial OpenStack deployment footprint with
estimations of expected uptake and existing infrastructure workloads.

The starting point is the core count of the cloud. By applying relevant
ratios, the user can gather information about:

* The number of expected concurrent instances: (overcommit fraction ×
  cores) / virtual cores per instance

* Required storage: flavor disk size × number of instances

These ratios determine the amount of additional infrastructure needed to
support the cloud. For example, consider a situation in which you
require 1600 instances, each with 2 vCPUs and 50 GB of storage. Assuming
the default overcommit ratio of 16:1, working out the math provides an
equation of:

* 1600 = (16 × (number of physical cores)) / 2

* Storage required = 50 GB × 1600

On the surface, the equations reveal the need for 200 physical cores and
80 TB of storage for ``/var/lib/nova/instances/``. However, it is also
important to look at patterns of usage to estimate the load that the API
services, database servers, and queue servers are likely to encounter.
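
The same arithmetic, expressed as a short sketch using the inputs from
the worked example above:

.. code-block:: python

   # Back-of-the-envelope sizing from the worked example above.
   instances = 1600
   vcpus_per_instance = 2
   disk_gb_per_instance = 50
   cpu_overcommit = 16.0

   physical_cores = instances * vcpus_per_instance / cpu_overcommit
   storage_tb = instances * disk_gb_per_instance / 1000.0

   print(physical_cores)  # 200.0 physical cores
   print(storage_tb)      # 80.0 TB for /var/lib/nova/instances/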

Aside from the creation and termination of instances, consider the
impact of users accessing the service, particularly on nova-api and its
associated database. Listing instances gathers a great deal of
information, and given the frequency with which users run this
operation, a cloud with a large number of users can increase the load
significantly. This can even occur unintentionally. For example, the
OpenStack Dashboard instances tab refreshes the list of instances every
30 seconds, so leaving it open in a browser window can cause unexpected
load.

Consideration of these factors can help determine how many cloud
controller cores you require. A server with 8 CPU cores and 8 GB of RAM
would be sufficient for a rack of compute nodes, given the above
caveats.

Key hardware specifications are also crucial to the performance of user
instances. Be sure to consider budget and performance needs, including
storage performance (spindles/core), memory availability (RAM/core),
network bandwidth (Gbps/core), and overall CPU performance (CPU/core).

The cloud resource calculator is a useful tool in examining the impacts
of different hardware and instance load-outs. See `cloud-resource-calculator
<https://github.com/noslzzp/cloud-resource-calculator/blob/master/cloud-resource-calculator.ods>`_.

Expansion planning
~~~~~~~~~~~~~~~~~~

A key challenge for planning the expansion of cloud compute services is
the elastic nature of cloud infrastructure demands.

Planning for expansion is a balancing act. Planning too conservatively
can lead to unexpected oversubscription of the cloud and dissatisfied
users. Planning for cloud expansion too aggressively can lead to
unexpected underuse of the cloud and funds spent unnecessarily
on operating infrastructure.

The key is to carefully monitor the trends in cloud usage over time. The
intent is to measure the consistency with which you deliver services,
not the average speed or capacity of the cloud. Using this information
to model capacity performance enables users to more accurately determine
the current and future capacity of the cloud.

CPU and RAM
~~~~~~~~~~~

OpenStack enables users to overcommit CPU and RAM on compute nodes. This
allows an increase in the number of instances running on the cloud at
the cost of reducing the performance of the instances. OpenStack Compute
uses the following ratios by default:

* CPU allocation ratio: 16:1

* RAM allocation ratio: 1.5:1

The default CPU allocation ratio of 16:1 means that the scheduler
allocates up to 16 virtual cores per physical core. For example, if a
physical node has 12 cores, the scheduler sees 192 available virtual
cores. With typical flavor definitions of 4 virtual cores per instance,
this ratio would provide 48 instances on a physical node.

Similarly, the default RAM allocation ratio of 1.5:1 means that the
scheduler allocates instances to a physical node as long as the total
amount of RAM associated with the instances is less than 1.5 times the
amount of RAM available on the physical node.

You must select the appropriate CPU and RAM allocation ratio based on
particular use cases.
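
In practice, the binding constraint is whichever resource runs out
first. A small sketch, assuming a node with 12 cores and 128 GB of RAM
and a flavor with 4 vCPUs and 8 GB of RAM:

.. code-block:: python

   # Per-node instance capacity under the default ratios; whichever of
   # CPU or RAM is exhausted first sets the limit. The node and flavor
   # sizes here are assumptions for illustration.
   physical_cores, ram_gb = 12, 128
   cpu_ratio, ram_ratio = 16.0, 1.5
   flavor_vcpus, flavor_ram_gb = 4, 8

   by_cpu = physical_cores * cpu_ratio / flavor_vcpus  # 48.0
   by_ram = ram_gb * ram_ratio / flavor_ram_gb         # 24.0
   print(min(by_cpu, by_ram))  # RAM is the binding constraint here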

Additional hardware
~~~~~~~~~~~~~~~~~~~

Certain use cases may benefit from exposure to additional devices on the
compute node. Examples might include:

* High performance computing jobs that benefit from the availability of
  graphics processing units (GPUs) for general-purpose computing.

* Cryptographic routines that benefit from the availability of hardware
  random number generators to avoid entropy starvation.

* Database management systems that benefit from the availability of
  SSDs for ephemeral storage to maximize read/write time.

Host aggregates group hosts that share similar characteristics, which
can include hardware similarities. The addition of specialized hardware
to a cloud deployment is likely to add to the cost of each node, so
consider carefully whether all compute nodes, or just a subset targeted
by flavors, need the additional customization to support the desired
workloads.
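
For example, targeting a subset of nodes can be done with a host
aggregate plus a matching flavor property. The aggregate, host, and
flavor names below are hypothetical, and the sketch assumes the
``AggregateInstanceExtraSpecsFilter`` scheduler filter is enabled:

.. code-block:: console

   $ openstack aggregate create gpu-nodes
   $ openstack aggregate set --property gpu=true gpu-nodes
   $ openstack aggregate add host gpu-nodes compute-gpu-01
   $ openstack flavor set \
       --property aggregate_instance_extra_specs:gpu=true g1.large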

Utilization
~~~~~~~~~~~

Infrastructure-as-a-Service offerings, including OpenStack, use flavors
to provide standardized views of virtual machine resource requirements
that simplify the problem of scheduling instances while making the best
use of the available physical resources.

In order to facilitate packing of virtual machines onto physical hosts,
the default selection of flavors provides a second largest flavor that
is half the size of the largest flavor in every dimension. It has half
the vCPUs, half the vRAM, and half the ephemeral disk space. The next
largest flavor is half that size again. The following figure provides a
visual representation of this concept for a general purpose computing
design:

.. figure:: figures/Compute_Tech_Bin_Packing_General1.png

The following figure displays a CPU-optimized, packed server:

.. figure:: figures/Compute_Tech_Bin_Packing_CPU_optimized1.png

These default flavors are well suited to typical configurations of
commodity server hardware. To maximize utilization, however, it may be
necessary to customize the flavors or create new ones in order to better
align instance sizes to the available hardware.
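
The halving pattern is easy to see in a sketch; the starting sizes below
are assumed for illustration rather than taken from any particular
default flavor set:

.. code-block:: python

   # Each flavor is half the previous one in every dimension.
   vcpus, ram_gb, disk_gb = 16, 64, 160
   for name in ('xlarge', 'large', 'medium', 'small'):
       print('%-6s %2d vCPUs %3d GB RAM %4d GB disk'
             % (name, vcpus, ram_gb, disk_gb))
       vcpus, ram_gb, disk_gb = vcpus // 2, ram_gb // 2, disk_gb // 2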

Workload characteristics may also influence hardware choices and flavor
configuration, particularly where they present different ratios of CPU
versus RAM versus HDD requirements.

For more information on flavors, see `OpenStack Operations Guide:
Flavors <https://docs.openstack.org/ops-guide/ops-user-facing-operations.html#flavors>`_.

OpenStack components
~~~~~~~~~~~~~~~~~~~~

Due to the nature of the workloads in this scenario, a number of
components are highly beneficial for a compute-focused cloud. This
includes the typical OpenStack components:

* :term:`Compute service (nova)`

* :term:`Image service (glance)`

* :term:`Identity service (keystone)`

Also consider several specialized components:

* :term:`Orchestration service (heat)`

  Given the nature of the applications involved in this scenario, these
  are heavily automated deployments. Making use of Orchestration is
  highly beneficial in this case. You can script the deployment of a
  batch of instances and the running of tests, but it makes sense to
  use the Orchestration service to handle all these actions.

* :term:`Telemetry service (telemetry)`

  Telemetry and the alarms it generates support autoscaling of
  instances using Orchestration. Users that are not using the
  Orchestration service do not need to deploy the Telemetry service and
  may choose to use external solutions to fulfill their metering and
  monitoring requirements.

* :term:`Block Storage service (cinder)`

  Due to the burstable nature of the workloads and the applications
  and instances that perform batch processing, this cloud mainly uses
  memory or CPU, so the need for add-on storage to each instance is not
  a likely requirement. This does not mean that you do not use
  OpenStack Block Storage (cinder) in the infrastructure, but typically
  it is not a central component.

* :term:`Networking service (neutron)`

  When choosing a networking platform, ensure that it either works with
  all desired hypervisor and container technologies and their OpenStack
  drivers, or that it includes an implementation of an ML2 mechanism
  driver. You can mix networking platforms that provide ML2 mechanism
  drivers, as the fragment after this list illustrates.
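
In an ML2 deployment, mechanism drivers are listed in ``ml2_conf.ini``;
the following fragment is illustrative:

.. code-block:: ini

   # Illustrative ml2_conf.ini fragment; openvswitch and l2population
   # are in-tree ML2 mechanism drivers.
   [ml2]
   mechanism_drivers = openvswitch,l2population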

@@ -1,34 +0,0 @@
===============
Compute focused
===============

.. toctree::
   :maxdepth: 2

   compute-focus-technical-considerations.rst
   compute-focus-operational-considerations.rst
   compute-focus-architecture.rst
   compute-focus-prescriptive-examples.rst

Compute-focused clouds are a specialized subset of the general
purpose OpenStack cloud architecture. A compute-focused cloud
specifically supports compute-intensive workloads.

.. note::

   Compute-intensive workloads may be CPU intensive, RAM intensive,
   or both; they are not typically storage or network intensive.

Compute-focused workloads may include the following use cases:

* High performance computing (HPC)
* Big data analytics using Hadoop or other distributed data stores
* Continuous integration/continuous deployment (CI/CD)
* Platform-as-a-Service (PaaS)
* Signal processing for network function virtualization (NFV)

.. note::

   A compute-focused OpenStack cloud does not typically use raw
   block storage services, as it does not host applications that
   require persistent block storage.

@@ -1,291 +0,0 @@
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#    http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
# implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# This file is execfile()d with the current directory set to its
# containing dir.
#
# Note that not all possible configuration values are present in this
# autogenerated file.
#
# All configuration values have a default; values that are commented out
# serve to show the default.

import os
# import sys

# If extensions (or modules to document with autodoc) are in another directory,
# add these directories to sys.path here. If the directory is relative to the
# documentation root, use os.path.abspath to make it absolute, like shown here.
# sys.path.insert(0, os.path.abspath('.'))

# -- General configuration ------------------------------------------------

# If your documentation needs a minimal Sphinx version, state it here.
# needs_sphinx = '1.0'

# Add any Sphinx extension module names here, as strings. They can be
# extensions coming with Sphinx (named 'sphinx.ext.*') or your custom
# ones.
extensions = ['openstackdocstheme']

# Add any paths that contain templates here, relative to this directory.
# templates_path = ['_templates']

# The suffix of source filenames.
source_suffix = '.rst'

# The encoding of source files.
# source_encoding = 'utf-8-sig'

# The master toctree document.
master_doc = 'index'

# General information about the project.
repository_name = "openstack/openstack-manuals"
bug_project = 'openstack-manuals'
project = u'Architecture Design Guide'
bug_tag = u'arch-design-to-archive'
copyright = u'2015-2017, OpenStack contributors'

# The version info for the project you're documenting, acts as replacement for
# |version| and |release|, also used in various other places throughout the
# built documents.
#
# The short X.Y version.
version = '0.9'
# The full version, including alpha/beta/rc tags.
release = '0.9'

# The language for content autogenerated by Sphinx. Refer to documentation
# for a list of supported languages.
# language = None

# There are two options for replacing |today|: either, you set today to some
# non-false value, then it is used:
# today = ''
# Else, today_fmt is used as the format for a strftime call.
# today_fmt = '%B %d, %Y'

# List of patterns, relative to source directory, that match files and
# directories to ignore when looking for source files.
exclude_patterns = ['common/cli*', 'common/nova*', 'common/get-started-*']

# The reST default role (used for this markup: `text`) to use for all
# documents.
# default_role = None

# If true, '()' will be appended to :func: etc. cross-reference text.
# add_function_parentheses = True

# If true, the current module name will be prepended to all description
# unit titles (such as .. function::).
# add_module_names = True

# If true, sectionauthor and moduleauthor directives will be shown in the
# output. They are ignored by default.
# show_authors = False

# The name of the Pygments (syntax highlighting) style to use.
pygments_style = 'sphinx'

# A list of ignored prefixes for module index sorting.
# modindex_common_prefix = []

# If true, keep warnings as "system message" paragraphs in the built documents.
# keep_warnings = False


# -- Options for HTML output ----------------------------------------------

# The theme to use for HTML and HTML Help pages. See the documentation for
# a list of builtin themes.
html_theme = 'openstackdocs'

# Theme options are theme-specific and customize the look and feel of a theme
# further. For a list of options available for each theme, see the
# documentation.
# html_theme_options = {}

# Add any paths that contain custom themes here, relative to this directory.
# html_theme_path = [openstackdocstheme.get_html_theme_path()]

# The name for this set of Sphinx documents. If None, it defaults to
# "<project> v<release> documentation".
# html_title = None

# A shorter title for the navigation bar. Default is the same as html_title.
# html_short_title = None

# The name of an image file (relative to this directory) to place at the top
# of the sidebar.
# html_logo = None

# The name of an image file (within the static path) to use as favicon of the
# docs. This file should be a Windows icon file (.ico) being 16x16 or 32x32
# pixels large.
# html_favicon = None

# Add any paths that contain custom static files (such as style sheets) here,
# relative to this directory. They are copied after the builtin static files,
# so a file named "default.css" will overwrite the builtin "default.css".
# html_static_path = []

# Add any extra paths that contain custom files (such as robots.txt or
# .htaccess) here, relative to this directory. These files are copied
# directly to the root of the documentation.
# html_extra_path = []

# If not '', a 'Last updated on:' timestamp is inserted at every page bottom,
# using the given strftime format.
# So that we can enable "log-a-bug" links from each output HTML page, this
# variable must be set to a format that includes year, month, day, hours and
# minutes.
html_last_updated_fmt = '%Y-%m-%d %H:%M'

# If true, SmartyPants will be used to convert quotes and dashes to
# typographically correct entities.
# html_use_smartypants = True

# Custom sidebar templates, maps document names to template names.
# html_sidebars = {}

# Additional templates that should be rendered to pages, maps page names to
# template names.
# html_additional_pages = {}

# If false, no module index is generated.
# html_domain_indices = True

# If false, no index is generated.
html_use_index = False

# If true, the index is split into individual pages for each letter.
# html_split_index = False

# If true, links to the reST sources are added to the pages.
html_show_sourcelink = False

# If true, "Created using Sphinx" is shown in the HTML footer. Default is True.
# html_show_sphinx = True

# If true, "(C) Copyright ..." is shown in the HTML footer. Default is True.
# html_show_copyright = True

# If true, an OpenSearch description file will be output, and all pages will
# contain a <link> tag referring to it. The value of this option must be the
# base URL from which the finished HTML is served.
# html_use_opensearch = ''

# This is the file name suffix for HTML files (e.g. ".xhtml").
# html_file_suffix = None

# Output file base name for HTML help builder.
htmlhelp_basename = 'arch-design-to-archive'

# If true, publish source files
html_copy_source = False

# -- Options for LaTeX output ---------------------------------------------

latex_engine = 'xelatex'

latex_elements = {
    # The paper size ('letterpaper' or 'a4paper').
    # 'papersize': 'letterpaper',

    # set font (TODO: different fonts for translated PDF document builds)
    'fontenc': '\\usepackage{fontspec}',
    'fontpkg': '''\
\defaultfontfeatures{Scale=MatchLowercase}
\setmainfont{Liberation Serif}
\setsansfont{Liberation Sans}
\setmonofont[SmallCapsFont={Liberation Mono}]{Liberation Mono}
''',

    # The font size ('10pt', '11pt' or '12pt').
    # 'pointsize': '10pt',

    # Additional stuff for the LaTeX preamble.
    # 'preamble': '',
}

# Grouping the document tree into LaTeX files. List of tuples
# (source start file, target name, title,
#  author, documentclass [howto, manual, or own class]).
latex_documents = [
    ('index', 'ArchGuideRst.tex', u'Architecture Design Guide',
     u'OpenStack contributors', 'manual'),
]

# The name of an image file (relative to this directory) to place at the top of
# the title page.
# latex_logo = None

# For "manual" documents, if this is true, then toplevel headings are parts,
# not chapters.
# latex_use_parts = False

# If true, show page references after internal links.
# latex_show_pagerefs = False

# If true, show URL addresses after external links.
# latex_show_urls = False

# Documents to append as an appendix to all manuals.
# latex_appendices = []

# If false, no module index is generated.
# latex_domain_indices = True


# -- Options for manual page output ---------------------------------------

# One entry per manual page. List of tuples
# (source start file, name, description, authors, manual section).
man_pages = [
    ('index', 'ArchDesignRst', u'Architecture Design Guide',
     [u'OpenStack contributors'], 1)
]

# If true, show URL addresses after external links.
# man_show_urls = False


# -- Options for Texinfo output -------------------------------------------

# Grouping the document tree into Texinfo files. List of tuples
# (source start file, target name, title, author,
#  dir menu entry, description, category)
texinfo_documents = [
    ('index', 'ArchDesignRst', u'Architecture Design Guide',
     u'OpenStack contributors', 'ArchDesignRst',
     'To reap the benefits of OpenStack, you should plan, design, '
     'and architect your cloud properly, taking user needs into '
     'account and understanding the use cases.', 'Miscellaneous'),
]

# Documents to append as an appendix to all manuals.
# texinfo_appendices = []

# If false, no module index is generated.
# texinfo_domain_indices = True

# How to display URL addresses: 'footnote', 'no', or 'inline'.
# texinfo_show_urls = 'footnote'

# If true, do not generate a @detailmenu in the "Top" node's menu.
# texinfo_no_detailmenu = False

# -- Options for Internationalization output ------------------------------
locale_dirs = ['locale/']

[24 binary figure files removed (22 KiB to 79 KiB each)]
@ -1,483 +0,0 @@
|
|||||||
============
|
|
||||||
Architecture
|
|
||||||
============
|
|
||||||
|
|
||||||
Hardware selection involves three key areas:
|
|
||||||
|
|
||||||
* Compute
|
|
||||||
|
|
||||||
* Network
|
|
||||||
|
|
||||||
* Storage
|
|
||||||
|
|
||||||
Hardware for a general purpose OpenStack cloud should reflect a cloud
|
|
||||||
with no pre-defined usage model, designed to run a wide variety of
|
|
||||||
applications with varying resource usage requirements. These
|
|
||||||
applications include any of the following:
|
|
||||||
|
|
||||||
* RAM-intensive
|
|
||||||
|
|
||||||
* CPU-intensive
|
|
||||||
|
|
||||||
* Storage-intensive
|
|
||||||
|
|
||||||
Certain hardware form factors may better suit a general purpose
|
|
||||||
OpenStack cloud due to the requirement for equal (or nearly equal)
|
|
||||||
balance of resources. Server hardware must provide the following:
|
|
||||||
|
|
||||||
* Equal (or nearly equal) balance of compute capacity (RAM and CPU)
|
|
||||||
|
|
||||||
* Network capacity (number and speed of links)
|
|
||||||
|
|
||||||
* Storage capacity (gigabytes or terabytes as well as :term:`Input/Output
|
|
||||||
Operations Per Second (IOPS)`
|
|
||||||
|
|
||||||
Evaluate server hardware around four conflicting dimensions:
|
|
||||||
|
|
||||||
Server density
|
|
||||||
A measure of how many servers can fit into a given measure of
|
|
||||||
physical space, such as a rack unit [U].
|
|
||||||
|
|
||||||
Resource capacity
|
|
||||||
The number of CPU cores, amount of RAM, or amount of deliverable
|
|
||||||
storage.
|
|
||||||
|
|
||||||
Expandability
|
|
||||||
Limit of additional resources you can add to a server.
|
|
||||||
|
|
||||||
Cost
|
|
||||||
The relative purchase price of the hardware weighted against the
|
|
||||||
level of design effort needed to build the system.
|
|
||||||
|
|
||||||
Increasing server density means sacrificing resource capacity or
|
|
||||||
expandability, however, increasing resource capacity and expandability
|
|
||||||
increases cost and decreases server density. As a result, determining
|
|
||||||
the best server hardware for a general purpose OpenStack architecture
|
|
||||||
means understanding how choice of form factor will impact the rest of
|
|
||||||
the design. The following list outlines the form factors to choose from:
|
|
||||||
|
|
||||||
* Blade servers typically support dual-socket multi-core CPUs. Blades
|
|
||||||
also offer outstanding density.
|
|
||||||
|
|
||||||
* 1U rack-mounted servers occupy only a single rack unit. Their
|
|
||||||
benefits include high density, support for dual-socket multi-core
|
|
||||||
CPUs, and support for reasonable RAM amounts. This form factor offers
|
|
||||||
limited storage capacity, limited network capacity, and limited
|
|
||||||
expandability.
|
|
||||||
|
|
||||||
* 2U rack-mounted servers offer the expanded storage and networking
|
|
||||||
capacity that 1U servers tend to lack, but with a corresponding
|
|
||||||
decrease in server density (half the density offered by 1U
|
|
||||||
rack-mounted servers).
|
|
||||||
|
|
||||||
* Larger rack-mounted servers, such as 4U servers, will tend to offer
|
|
||||||
even greater CPU capacity, often supporting four or even eight CPU
|
|
||||||
sockets. These servers often have much greater expandability so will
|
|
||||||
provide the best option for upgradability. This means, however, that
|
|
||||||
the servers have a much lower server density and a much greater
|
|
||||||
hardware cost.
|
|
||||||
|
|
||||||
* *Sled servers* are rack-mounted servers that support multiple
|
|
||||||
independent servers in a single 2U or 3U enclosure. This form factor
|
|
||||||
offers increased density over typical 1U-2U rack-mounted servers but
|
|
||||||
tends to suffer from limitations in the amount of storage or network
|
|
||||||
capacity each individual server supports.
|
|
||||||
|
|
||||||
The best form factor for server hardware supporting a general purpose
|
|
||||||
OpenStack cloud is driven by outside business and cost factors. No
|
|
||||||
single reference architecture applies to all implementations; the
|
|
||||||
decision must flow from user requirements, technical considerations, and
|
|
||||||
operational considerations. Here are some of the key factors that
|
|
||||||
influence the selection of server hardware:
|
|
||||||
|
|
||||||
Instance density
|
|
||||||
Sizing is an important consideration for a general purpose OpenStack
|
|
||||||
cloud. The expected or anticipated number of instances that each
|
|
||||||
hypervisor can host is a common meter used in sizing the deployment.
|
|
||||||
The selected server hardware needs to support the expected or
|
|
||||||
anticipated instance density.
|
|
||||||
|
|
||||||
Host density
|
|
||||||
Physical data centers have limited physical space, power, and
|
|
||||||
cooling. The number of hosts (or hypervisors) that can be fitted
|
|
||||||
into a given metric (rack, rack unit, or floor tile) is another
|
|
||||||
important method of sizing. Floor weight is an often overlooked
|
|
||||||
consideration. The data center floor must be able to support the
|
|
||||||
weight of the proposed number of hosts within a rack or set of
|
|
||||||
racks. These factors need to be applied as part of the host density
|
|
||||||
calculation and server hardware selection.
|
|
||||||
|
|
||||||
Power density
|
|
||||||
Data centers have a specified amount of power fed to a given rack or
|
|
||||||
set of racks. Older data centers may have a power density as power
|
|
||||||
as low as 20 AMPs per rack, while more recent data centers can be
|
|
||||||
architected to support power densities as high as 120 AMP per rack.
|
|
||||||
The selected server hardware must take power density into account.
|
|
||||||
|
|
||||||
Network connectivity
|
|
||||||
The selected server hardware must have the appropriate number of
|
|
||||||
network connections, as well as the right type of network
|
|
||||||
connections, in order to support the proposed architecture. Ensure
|
|
||||||
that, at a minimum, there are at least two diverse network
|
|
||||||
connections coming into each rack.
|
|
||||||
|
|
||||||
The selection of form factors or architectures affects the selection of
|
|
||||||
server hardware. Ensure that the selected server hardware is configured
|
|
||||||
to support enough storage capacity (or storage expandability) to match
|
|
||||||
the requirements of selected scale-out storage solution. Similarly, the
|
|
||||||
network architecture impacts the server hardware selection and vice
|
|
||||||
versa.
|
|
||||||
|
|
||||||

Selecting storage hardware
~~~~~~~~~~~~~~~~~~~~~~~~~~

The storage hardware architecture is determined by the selected storage
architecture. Select the storage architecture by evaluating possible
solutions against the critical factors: user requirements, technical
considerations, and operational considerations. Incorporate the
following factors into your storage architecture:

Cost
   Storage can be a significant portion of the overall system cost. For
   an organization that is concerned with vendor support, a commercial
   storage solution is advisable, although it comes with a higher price
   tag. If minimizing initial capital expenditure is a priority, a
   design based on commodity hardware would apply. The trade-off is
   potentially higher support costs and a greater risk of
   incompatibility and interoperability issues.

Scalability
   Scalability, along with expandability, is a major consideration in a
   general purpose OpenStack cloud. It might be difficult to predict
   the final intended size of the implementation as there are no
   established usage patterns for a general purpose cloud. It might
   become necessary to expand the initial deployment in order to
   accommodate growth and user demand.

Expandability
   Expandability is a major architecture factor for storage solutions
   with general purpose OpenStack cloud. A storage solution that
   expands to 50 PB is considered more expandable than a solution that
   only scales to 10 PB. This metric is related to scalability, which
   is the measure of a solution's performance as it expands.

Using a scale-out storage solution with direct-attached storage (DAS)
in the servers is well suited for a general purpose OpenStack cloud.
Cloud services requirements determine your choice of scale-out
solution. You need to determine if a single, highly expandable and
highly vertically scalable, centralized storage array is suitable for
your design. After determining an approach, select the storage hardware
based on these criteria.

This list expands upon the potential impacts of including a particular
storage architecture (and corresponding storage hardware) in the design
for a general purpose OpenStack cloud:

Connectivity
   Ensure that, if storage protocols other than Ethernet are part of
   the storage solution, the appropriate hardware has been selected. If
   a centralized storage array is selected, ensure that the hypervisor
   will be able to connect to that storage array for image storage.

Usage
   How the particular storage architecture will be used is critical for
   determining the architecture. Some of the configurations that will
   influence the architecture include whether it will be used by the
   hypervisors for ephemeral instance storage, or whether OpenStack
   Object Storage will use it for object storage.

Instance and image locations
   Where instances and images will be stored will influence the
   architecture.

Server hardware
   If the solution is a scale-out storage architecture that includes
   DAS, it will affect the server hardware selection. This could ripple
   into the decisions that affect host density, instance density, power
   density, OS-hypervisor, management tools, and others.

A general purpose OpenStack cloud has multiple storage options. The key
factors that will have an influence on the selection of storage
hardware for a general purpose OpenStack cloud are as follows:

Capacity
   Hardware resources selected for the resource nodes should be capable
   of supporting enough storage for the cloud services. Defining the
   initial requirements and ensuring the design can support adding
   capacity is important. Hardware nodes selected for object storage
   should be capable of supporting a large number of inexpensive disks
   with no reliance on RAID controller cards. Hardware nodes selected
   for block storage should be capable of supporting high speed storage
   solutions and RAID controller cards to provide performance and
   redundancy to storage at a hardware level. Selecting hardware RAID
   controllers that automatically repair damaged arrays will assist
   with the replacement and repair of degraded or failed storage
   devices.

Performance
   Disks selected for object storage services do not need to be fast
   performing disks. We recommend that object storage nodes take
   advantage of the best cost per terabyte available for storage.
   Contrastingly, disks chosen for block storage services should take
   advantage of performance boosting features that may entail the use
   of SSDs or flash storage to provide high performance block storage
   pools. Storage performance of ephemeral disks used for instances
   should also be taken into consideration.

Fault tolerance
   Object storage resource nodes have no requirements for hardware
   fault tolerance or RAID controllers. It is not necessary to plan for
   fault tolerance within the object storage hardware because the
   object storage service provides replication between zones as a
   feature of the service. Block storage nodes, compute nodes, and
   cloud controllers should all have fault tolerance built in at the
   hardware level by making use of hardware RAID controllers and
   varying levels of RAID configuration. The level of RAID chosen
   should be consistent with the performance and availability
   requirements of the cloud.

Selecting networking hardware
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Selecting the network architecture determines which network hardware
will be used. Networking software is determined by the selected
networking hardware.

There are more subtle design impacts that need to be considered. The
selection of certain networking hardware (and the networking software)
affects the management tools that can be used. There are exceptions to
this; the rise of *open* networking software that supports a range of
networking hardware means that there are instances where the
relationship between networking hardware and networking software is not
as tightly defined.

Some of the key considerations that should be included in the selection
of networking hardware include:

Port count
   The design will require networking hardware that has the requisite
   port count.

Port density
   The network design will be affected by the physical space that is
   required to provide the requisite port count. A higher port density
   is preferred, as it leaves more rack space for compute or storage
   components that may be required by the design. This can also lead
   into concerns about fault domains and power density that should be
   considered. Higher density switches are also more expensive,
   however, so it is important not to over-design the network if it is
   not required.

Port speed
   The networking hardware must support the proposed network speed, for
   example: 1 GbE, 10 GbE, or 40 GbE (or even 100 GbE).

Redundancy
   The level of network hardware redundancy required is influenced by
   the user requirements for high availability and cost considerations.
   Network redundancy can be achieved by adding redundant power
   supplies or paired switches. If this is a requirement, the hardware
   will need to support this configuration.

Power requirements
   Ensure that the physical data center provides the necessary power
   for the selected network hardware.

   .. note::

      This may be an issue for spine switches in a leaf and spine
      fabric, or end of row (EoR) switches.

There is no single best practice architecture for the networking
hardware supporting a general purpose OpenStack cloud that will apply
to all implementations. Some of the key factors that will have a strong
influence on selection of networking hardware include:

Connectivity
   All nodes within an OpenStack cloud require network connectivity. In
   some cases, nodes require access to more than one network segment.
   The design must encompass sufficient network capacity and bandwidth
   to ensure that all communications within the cloud, both north-south
   and east-west traffic, have sufficient resources available.

Scalability
   The network design should encompass a physical and logical network
   design that can be easily expanded upon. Network hardware should
   offer the appropriate types of interfaces and speeds that are
   required by the hardware nodes.

Availability
   To ensure that access to nodes within the cloud is not interrupted,
   we recommend that the network architecture identify any single
   points of failure and provide some level of redundancy or fault
   tolerance. With regard to the network infrastructure itself, this
   often involves use of networking protocols such as LACP, VRRP or
   others to achieve a highly available network connection. In
   addition, it is important to consider the networking implications on
   API availability. In order to ensure that the APIs, and potentially
   other services in the cloud, are highly available, we recommend you
   design a load balancing solution within the network architecture to
   accommodate these requirements. A minimal sketch of one such
   redundancy protocol configuration follows this list.
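
As a minimal sketch of the VRRP approach mentioned above, the following
keepalived configuration floats a virtual IP address between a pair of
network nodes. The interface name, router ID, and addresses are
hypothetical placeholders:

.. code-block:: console

   # cat > /etc/keepalived/keepalived.conf <<'EOF'
   vrrp_instance VI_1 {
       state MASTER            # the peer node is configured as BACKUP
       interface eth0          # interface carrying the protected network
       virtual_router_id 51
       priority 100            # the BACKUP peer uses a lower priority
       virtual_ipaddress {
           192.0.2.10          # VIP that fails over between the nodes
       }
   }
   EOF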

Software selection
~~~~~~~~~~~~~~~~~~

Software selection for a general purpose OpenStack architecture design
needs to include these three areas:

* Operating system (OS) and hypervisor

* OpenStack components

* Supplemental software

Operating system and hypervisor
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The operating system (OS) and hypervisor have a significant impact on
the overall design. Selecting a particular operating system and
hypervisor can directly affect server hardware selection. Make sure the
storage hardware and topology support the selected operating system and
hypervisor combination. Also ensure the networking hardware selection
and topology will work with the chosen operating system and hypervisor
combination.

Some areas that could be impacted by the selection of OS and hypervisor
include:

Cost
   Selecting a commercially supported hypervisor, such as Microsoft
   Hyper-V, will result in a different cost model than that of
   community-supported open source hypervisors such as
   :term:`KVM<kernel-based VM (KVM)>` or :term:`Xen`. When
   comparing open source OS solutions, choosing Ubuntu over Red Hat
   (or vice versa) will have an impact on cost due to support
   contracts.

Supportability
   Depending on the selected hypervisor, staff should have the
   appropriate training and knowledge to support the selected OS and
   hypervisor combination. If they do not, training will need to be
   provided, which could have a cost impact on the design.

Management tools
   The management tools used for Ubuntu and KVM differ from the
   management tools for VMware vSphere. Although both OS and hypervisor
   combinations are supported by OpenStack, there will be very
   different impacts to the rest of the design as a result of the
   selection of one combination versus the other.

Scale and performance
   Ensure that selected OS and hypervisor combinations meet the
   appropriate scale and performance requirements. The chosen
   architecture will need to meet the targeted instance-host ratios
   with the selected OS-hypervisor combinations.

Security
   Ensure that the design can accommodate regular periodic
   installations of application security patches while maintaining
   required workloads. The frequency of security patches for the
   proposed OS-hypervisor combination will have an impact on
   performance, and the patch installation process could affect
   maintenance windows.

Supported features
   Determine which features of OpenStack are required. This will often
   determine the selection of the OS-hypervisor combination. Some
   features are only available with specific operating systems or
   hypervisors.

Interoperability
   You will need to consider how the OS and hypervisor combination
   interacts with other operating systems and hypervisors, including
   other software solutions. Operational troubleshooting tools for one
   OS-hypervisor combination may differ from the tools used for another
   OS-hypervisor combination and, as a result, the design will need to
   address whether the two sets of tools need to interoperate.

OpenStack components
~~~~~~~~~~~~~~~~~~~~

Selecting which OpenStack components are included in the overall design
is important. Some OpenStack components, like the Compute and Image
services, are required in every architecture. Other components, like
Orchestration, are not always required.

Excluding certain OpenStack components can limit or constrain the
functionality of other components. For example, if the architecture
includes Orchestration but excludes Telemetry, then the design will not
be able to take advantage of Orchestration's auto-scaling
functionality. It is important to research the component
interdependencies in conjunction with the technical requirements before
deciding on the final architecture.

Networking software
-------------------

OpenStack Networking (neutron) provides a wide variety of networking
services for instances. There are many additional networking software
packages that can be useful when managing OpenStack components. Some
examples include:

* Software to provide load balancing

* Network redundancy protocols

* Routing daemons

Some of these software packages are described in more detail in the
`OpenStack network nodes
chapter <https://docs.openstack.org/ha-guide/networking-ha.html>`__ of
the OpenStack High Availability Guide.

For a general purpose OpenStack cloud, the OpenStack infrastructure
components need to be highly available. If the design does not include
hardware load balancing, networking software packages like HAProxy will
need to be included.
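
As a minimal sketch of such a software load balancing layer, the
following HAProxy stanza balances dashboard traffic across three
controllers; the virtual IP and controller addresses are hypothetical
placeholders:

.. code-block:: console

   # cat >> /etc/haproxy/haproxy.cfg <<'EOF'
   listen dashboard_cluster
       bind 192.0.2.10:443
       balance source
       option tcpka
       option httpchk
       server controller1 10.0.0.11:443 check inter 2000 rise 2 fall 5
       server controller2 10.0.0.12:443 check inter 2000 rise 2 fall 5
       server controller3 10.0.0.13:443 check inter 2000 rise 2 fall 5
   EOF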

Management software
-------------------

The selected supplemental software solutions affect the overall
OpenStack cloud design. This includes software for providing
clustering, logging, monitoring, and alerting.

The inclusion of clustering software, such as Corosync or Pacemaker, is
determined primarily by the availability requirements. The impact of
including (or not including) these software packages is primarily
determined by the availability of the cloud infrastructure and the
complexity of supporting the configuration after it is deployed. The
`OpenStack High Availability
Guide <https://docs.openstack.org/ha-guide/>`__ provides more details
on the installation and configuration of Corosync and Pacemaker, should
these packages need to be included in the design.
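
As an illustration, a Pacemaker cluster built with the ``pcs`` tooling
can host a floating virtual IP for the API endpoints with a single
resource definition; the address shown is a hypothetical placeholder:

.. code-block:: console

   # pcs resource create vip ocf:heartbeat:IPaddr2 \
       ip=192.0.2.10 cidr_netmask=24 op monitor interval=30s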

Requirements for logging, monitoring, and alerting are determined by
operational considerations. Each of these sub-categories includes a
number of options.

If these software packages are required, the design must account for
the additional resource consumption (CPU, RAM, storage, and network
bandwidth). Some other potential design impacts include:

* OS-hypervisor combination: Ensure that the selected logging,
  monitoring, or alerting tools support the proposed OS-hypervisor
  combination.

* Network hardware: The network hardware selection needs to be
  supported by the logging, monitoring, and alerting software.

Database software
-----------------

OpenStack components often require access to back-end database services
to store state and configuration information. Selecting an appropriate
back-end database that satisfies the availability and fault tolerance
requirements of the OpenStack services is required. OpenStack services
support connecting to any database that is supported by the SQLAlchemy
Python drivers; however, most common database deployments make use of
MySQL or variations of it. We recommend that the database providing
back-end services within a general purpose cloud be made highly
available using a technology that can accomplish that goal.
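
Each service consumes the database through an SQLAlchemy connection URL
in its configuration file. A sketch for the Compute service, assuming a
``controller`` host name and a hypothetical ``NOVA_DBPASS`` password:

.. code-block:: console

   # crudini --set /etc/nova/nova.conf database connection \
       mysql+pymysql://nova:NOVA_DBPASS@controller/nova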
@ -1,124 +0,0 @@

==========================
Operational considerations
==========================

In the planning and design phases of the build out, it is important to
include the operations function. Operational factors affect the design
choices for a general purpose cloud, and operations staff are often
tasked with the maintenance of cloud environments for larger
installations.

Expectations set by the Service Level Agreements (SLAs) directly affect
knowing when and where you should implement redundancy and high
availability. SLAs are contractual obligations that provide assurances
for service availability. They define the levels of availability that
drive the technical design, often with penalties for not meeting
contractual obligations.

SLA terms that affect the design include:

* API availability guarantees implying multiple infrastructure services
  and highly available load balancers.

* Network uptime guarantees affecting switch design, which might
  require redundant switching and power.

* Networking security policy requirements that must be factored into
  deployments.

Support and maintainability
~~~~~~~~~~~~~~~~~~~~~~~~~~~

To be able to support and maintain an installation, OpenStack cloud
management requires operations staff to understand the design
architecture. The skill level of the operations and engineering staff,
and the level of separation between them, depend on the size and
purpose of the installation. Large cloud service providers, or telecom
providers, are more likely to be managed by specially trained,
dedicated operations organizations. Smaller implementations are more
likely to rely on support staff that need to take on combined
engineering, design, and operations functions.

Maintaining OpenStack installations requires a variety of technical
skills. You may want to consider using a third-party management company
with special expertise in managing OpenStack deployments.

Monitoring
~~~~~~~~~~

OpenStack clouds require appropriate monitoring platforms to ensure
errors are caught and managed appropriately. Specific meters that are
critically important to monitor include:

* Image disk utilization

* Response time to the :term:`Compute API <Compute API (Nova API)>`

Leveraging existing monitoring systems is an effective check to ensure
OpenStack environments can be monitored.
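
As a simple spot check of the second meter, the response time of the
Compute API can be sampled with ``curl``; the ``controller`` host name
and the default nova-api port are assumptions about the deployment:

.. code-block:: console

   $ curl -o /dev/null -s -w 'nova-api time_total: %{time_total}s\n' \
       http://controller:8774/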

Downtime
~~~~~~~~

To effectively run cloud installations, initial downtime planning
includes creating processes and architectures that support the
following:

* Planned (maintenance)

* Unplanned (system faults)

The resiliency of the overall system, and of individual components, is
dictated by the requirements of the SLA, meaning that designing for
:term:`high availability (HA)` can have cost ramifications.

Capacity planning
~~~~~~~~~~~~~~~~~

Capacity constraints for a general purpose cloud environment include:

* Compute limits

* Storage limits

A relationship exists between the size of the compute environment and
the number of supporting OpenStack infrastructure controller nodes
required.

Increasing the size of the supporting compute environment increases the
network traffic and messages, adding load to the controller or
networking nodes. Effective monitoring of the environment will help
with capacity decisions on scaling.

Compute nodes automatically attach to OpenStack clouds, resulting in a
horizontally scaling process when adding extra compute capacity to an
OpenStack cloud. Additional processes are required to place nodes into
appropriate availability zones and host aggregates. When adding
additional compute nodes to environments, ensure identical or
functionally compatible CPUs are used, otherwise live migration
features will break. It is necessary to add rack capacity or network
switches as scaling out compute hosts directly affects network and
data center resources.
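
As a sketch of that placement process, new hosts can be grouped with
the ``openstack`` client; the aggregate, zone, and host names below are
hypothetical:

.. code-block:: console

   $ openstack aggregate create --zone az-1 rack-1
   $ openstack aggregate add host rack-1 compute-21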

Assessing the average workloads and increasing the number of instances
that can run within the compute environment by adjusting the overcommit
ratio is another option. It is important to remember that changing the
CPU overcommit ratio can have a detrimental effect and cause a
potential increase in noisy neighbor activity. The additional risk of
increasing the overcommit ratio is that more instances fail when a
compute host fails.

Compute host components can also be upgraded to account for increases
in demand; this is known as vertical scaling. Upgrading CPUs with more
cores, or increasing the overall server memory, can add extra needed
capacity depending on whether the running applications are more CPU
intensive or memory intensive.

Insufficient disk capacity could also have a negative effect on overall
performance, including CPU and memory usage. Depending on the back-end
architecture of the OpenStack Block Storage layer, adding capacity
involves adding disk shelves to enterprise storage systems or
installing additional block storage nodes. Upgrading directly attached
storage installed in compute hosts, and adding capacity to the shared
storage for additional ephemeral storage to instances, may be
necessary.

For a deeper discussion on many of these topics, refer to the
`OpenStack Operations Guide <https://docs.openstack.org/ops>`_.
@ -1,85 +0,0 @@

====================
Prescriptive example
====================

An online classified advertising company wants to run web applications
consisting of Tomcat, Nginx, and MariaDB in a private cloud. To be able
to meet policy requirements, the cloud infrastructure will run in their
own data center. The company has predictable load requirements, but
requires scaling to cope with nightly increases in demand. Their
current environment does not have the flexibility to align with their
goal of running an open source API environment. The current environment
consists of the following:

* Between 120 and 140 installations of Nginx and Tomcat, each with 2
  vCPUs and 4 GB of RAM

* A three-node MariaDB and Galera cluster, each with 4 vCPUs and 8 GB
  RAM

The company runs hardware load balancers and multiple web applications
serving their websites, and orchestrates environments using
combinations of scripts and Puppet. The website generates large amounts
of log data daily that requires archiving.

The solution would consist of the following OpenStack components:

* A firewall, switches, and load balancers on the public facing network
  connections.

* OpenStack Controller services running Image, Identity, and
  Networking, combined with support services such as MariaDB and
  RabbitMQ, configured for high availability on at least three
  controller nodes.

* OpenStack compute nodes running the KVM hypervisor.

* OpenStack Block Storage for use by compute instances that require
  persistent storage (such as databases for dynamic sites).

* OpenStack Object Storage for serving static objects (such as images).

.. figure:: figures/General_Architecture3.png

Running up to 140 web instances and the small number of MariaDB
instances requires 292 vCPUs available, as well as 584 GB RAM. On a
typical 1U server using dual-socket hex-core Intel CPUs with
Hyperthreading, and assuming 2:1 CPU overcommit ratio, this would
require 8 OpenStack compute nodes.
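
The sizing arithmetic can be checked directly (140 web instances at 2
vCPUs and 4 GB each, plus three database nodes at 4 vCPUs and 8 GB
each):

.. code-block:: console

   $ python3 -c 'print(140*2 + 3*4, "vCPUs,", 140*4 + 3*8, "GB RAM")'
   292 vCPUs, 584 GB RAM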

The web application instances run from local storage on each of the
OpenStack compute nodes. The web application instances are stateless,
meaning that any of the instances can fail and the application will
continue to function.

MariaDB server instances store their data on shared enterprise storage,
such as NetApp or SolidFire devices. If a MariaDB instance fails,
storage would be expected to be re-attached to another instance and
rejoined to the Galera cluster.

Logs from the web application servers are shipped to OpenStack Object
Storage for processing and archiving.

Additional capabilities can be realized by moving static web content to
be served from OpenStack Object Storage containers, and backing the
OpenStack Image service with OpenStack Object Storage.

.. note::

   Increasing OpenStack Object Storage means network bandwidth needs to
   be taken into consideration. Running OpenStack Object Storage with
   network connections offering 10 GbE or better connectivity is
   advised.

Leveraging Orchestration and Telemetry services is also a potential
option for providing auto-scaling, orchestrated web application
environments. Defining the web applications in a
:term:`Heat Orchestration Template (HOT)` negates the reliance on the
current scripted Puppet solution.
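
A minimal sketch of such a template, launching a single web-tier server
(the image, flavor, and network names are hypothetical placeholders):

.. code-block:: console

   $ cat > web-tier.yaml <<'EOF'
   heat_template_version: 2015-04-30
   description: Minimal sketch of one web-tier server
   resources:
     web_server:
       type: OS::Nova::Server
       properties:
         image: web-tomcat-nginx
         flavor: m1.small
         networks:
           - network: app-net
   EOF
   $ openstack stack create -t web-tier.yaml web-tier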

OpenStack Networking can be used to control hardware load balancers
through the use of plug-ins and the Networking API. This allows users
to control hardware load balancer pools and instances as members in
these pools, but their use in production environments must be carefully
weighed against current stability.
@ -1,618 +0,0 @@

========================
Technical considerations
========================

General purpose clouds are expected to include these base services:

* Compute

* Network

* Storage

Each of these services has different resource requirements. As a
result, you must make design decisions relating directly to the
service, as well as provide a balanced infrastructure for all services.

Take into consideration the unique aspects of each service, as the
individual characteristics and scale of each service can impact the
hardware selection process. Hardware designs should be generated for
each of the services.

Hardware decisions are also made in relation to network architecture
and facilities planning. These factors play heavily into the overall
architecture of an OpenStack cloud.

Compute resource design
~~~~~~~~~~~~~~~~~~~~~~~

When designing compute resource pools, a number of factors can impact
your design decisions. Factors such as number of processors, amount of
memory, and the quantity of storage required for each hypervisor must
be taken into account.

You will also need to decide whether to provide compute resources in a
single pool or in multiple pools. In most cases, multiple pools of
resources can be allocated and addressed on demand. A compute design
that allocates multiple pools of resources makes best use of
application resources, and is commonly referred to as bin packing.

In a bin packing design, each independent resource pool provides
service for specific flavors. This helps to ensure that, as instances
are scheduled onto compute hypervisors, each independent node's
resources will be allocated in a way that makes the most efficient use
of the available hardware. Bin packing also requires a common hardware
design, with all hardware nodes within a compute resource pool sharing
a common processor, memory, and storage layout. This makes it easier to
deploy, support, and maintain nodes throughout their lifecycle.

An overcommit ratio is the ratio of available virtual resources to
available physical resources. This ratio is configurable for CPU and
memory. The default CPU overcommit ratio is 16:1, and the default
memory overcommit ratio is 1.5:1. Determining the tuning of the
overcommit ratios during the design phase is important as it has a
direct impact on the hardware layout of your compute nodes.
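
The overcommit ratios are tunable in the Compute configuration. A
sketch restating the defaults with the ``crudini`` tool, assuming the
options live in the ``[DEFAULT]`` section of ``nova.conf`` as in
releases of this era:

.. code-block:: console

   # crudini --set /etc/nova/nova.conf DEFAULT cpu_allocation_ratio 16.0
   # crudini --set /etc/nova/nova.conf DEFAULT ram_allocation_ratio 1.5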

When selecting a processor, compare features and performance
characteristics. Some processors include features specific to
virtualized compute hosts, such as hardware-assisted virtualization,
and technology related to memory paging (also known as EPT shadowing).
These types of features can have a significant impact on the
performance of your virtual machine.

You will also need to consider the compute requirements of
non-hypervisor nodes (sometimes referred to as resource nodes). This
includes controller, object storage, and block storage nodes, and
networking services.

The number of processor cores and threads impacts the number of worker
threads which can be run on a resource node. Design decisions must
relate directly to the service being run on it, as well as provide a
balanced infrastructure for all services.

Workload can be unpredictable in a general purpose cloud, so consider
including the ability to add additional compute resource pools on
demand. In some cases, however, the demand for certain instance types
or flavors may not justify individual hardware design. In either case,
start by allocating hardware designs that are capable of servicing the
most common instance requests. If you want to add additional hardware
to the overall architecture, this can be done later.

Designing network resources
~~~~~~~~~~~~~~~~~~~~~~~~~~~

OpenStack clouds generally have multiple network segments, with each
segment providing access to particular resources. The network services
themselves also require network communication paths which should be
separated from the other networks. When designing network services for
a general purpose cloud, plan for either a physical or logical
separation of network segments used by operators and projects. You can
also create an additional network segment for access to internal
services such as the message bus and database used by various services.
Segregating these services onto separate networks helps to protect
sensitive data and protects against unauthorized access to services.

Choose a networking service based on the requirements of your
instances. The architecture and design of your cloud will impact
whether you choose OpenStack Networking (neutron) or legacy networking
(nova-network).

Legacy networking (nova-network)
   The legacy networking (nova-network) service is primarily a layer-2
   networking service that functions in two modes, which use VLANs in
   different ways. In a flat network mode, all network hardware nodes
   and devices throughout the cloud are connected to a single layer-2
   network segment that provides access to application data.

   When the network devices in the cloud support segmentation using
   VLANs, legacy networking can operate in the second mode. In this
   design model, each project within the cloud is assigned a network
   subnet which is mapped to a VLAN on the physical network. It is
   especially important to remember the maximum number of 4096 VLANs
   which can be used within a spanning tree domain. This places a hard
   limit on the amount of growth possible within the data center. When
   designing a general purpose cloud intended to support multiple
   projects, we recommend the use of legacy networking with VLANs
   rather than flat network mode.

Another consideration regarding networks is the fact that legacy
networking is entirely managed by the cloud operator; projects do not
have control over network resources. If projects require the ability to
manage and create network resources such as network segments and
subnets, it will be necessary to install the OpenStack Networking
service to provide network access to instances.

Networking (neutron)
   OpenStack Networking (neutron) is a first class networking service
   that gives full control over creation of virtual network resources
   to projects. This is often accomplished in the form of tunneling
   protocols which will establish encapsulated communication paths over
   existing network infrastructure in order to segment project traffic.
   These methods vary depending on the specific implementation, but
   some of the more common methods include tunneling over GRE,
   encapsulating with VXLAN, and VLAN tags.

We recommend you design at least three network segments (a sketch of
this layout follows the list):

* The first segment is a public network, used for access to REST APIs
  by projects and operators. The controller nodes and swift proxies are
  the only devices connecting to this network segment. In some cases,
  this network might also be serviced by hardware load balancers and
  other network devices.

* The second segment is used by administrators to manage hardware
  resources. Configuration management tools also use this for deploying
  software and services onto new hardware. In some cases, this network
  segment might also be used for internal services, including the
  message bus and database services. This network needs to communicate
  with every hardware node. Due to the highly sensitive nature of this
  network segment, you also need to secure this network from
  unauthorized access.

* The third network segment is used by applications and consumers to
  access the physical network, and for users to access applications.
  This network is segregated from the one used to access the cloud APIs
  and is not capable of communicating directly with the hardware
  resources in the cloud. Compute resource nodes and network gateway
  services which allow application data to access the physical network
  from outside of the cloud need to communicate on this network
  segment.
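
As a sketch of how the project-facing portion of this layout might be
created with OpenStack Networking, using hypothetical network names and
addressing:

.. code-block:: console

   $ openstack network create --external public
   $ openstack network create app-net
   $ openstack subnet create --network app-net \
       --subnet-range 10.10.0.0/24 app-subnet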

Designing Object Storage
~~~~~~~~~~~~~~~~~~~~~~~~

When designing hardware resources for OpenStack Object Storage, the
primary goal is to maximize the amount of storage in each resource node
while also ensuring that the cost per terabyte is kept to a minimum.
This often involves utilizing servers which can hold a large number of
spinning disks. Whether choosing to use 2U server form factors with
directly attached storage or an external chassis that holds a larger
number of drives, the main goal is to maximize the storage available in
each node.

.. note::

   We do not recommend investing in enterprise class drives for an
   OpenStack Object Storage cluster. The consistency and partition
   tolerance characteristics of OpenStack Object Storage ensure that
   data stays up to date and survives hardware faults without the use
   of any specialized data replication devices.

One of the benefits of OpenStack Object Storage is the ability to mix
and match drives by making use of weighting within the swift ring. When
designing your swift storage cluster, we recommend making use of the
most cost effective storage solution available at the time.
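
As an illustration of that weighting, devices of different sizes can be
added to the object ring with proportional weights; the builder
parameters, addresses, and device names below are hypothetical:

.. code-block:: console

   $ swift-ring-builder object.builder create 10 3 1
   $ swift-ring-builder object.builder add r1z1-10.0.0.21:6200/sdb 100
   $ swift-ring-builder object.builder add r1z2-10.0.0.22:6200/sdc 150
   $ swift-ring-builder object.builder rebalance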

To achieve durability and availability of data stored as objects, it is
important to design object storage resource pools to ensure they can
provide the suggested availability. Considering rack-level and
zone-level designs to accommodate the number of replicas configured to
be stored in the Object Storage service (the default number of replicas
is three) is important when designing beyond the hardware node level.
Each replica of data should exist in its own availability zone with its
own power, cooling, and network resources available to service that
specific zone.

Object storage nodes should be designed so that the number of requests
does not hinder the performance of the cluster. The object storage
service is a chatty protocol; therefore, making use of multiple
processors that have higher core counts will ensure the IO requests do
not inundate the server.

Designing Block Storage
~~~~~~~~~~~~~~~~~~~~~~~

When designing OpenStack Block Storage resource nodes, it is helpful to
understand the workloads and requirements that will drive the use of
block storage in the cloud. We recommend designing block storage pools
so that projects can choose appropriate storage solutions for their
applications. By creating multiple storage pools of different types, in
conjunction with configuring an advanced storage scheduler for the
block storage service, it is possible to provide projects with a large
catalog of storage services with a variety of performance levels and
redundancy options.
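
As a sketch of how such a catalog is exposed, each back-end pool is
enabled in the Block Storage configuration and mapped to a volume type;
the back-end and type names below are hypothetical:

.. code-block:: console

   # crudini --set /etc/cinder/cinder.conf DEFAULT \
       enabled_backends lvm-fast,lvm-standard
   $ openstack volume type create fast
   $ openstack volume type set --property volume_backend_name=lvm-fast fast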

Block storage also takes advantage of a number of enterprise storage
solutions. These are addressed via a plug-in driver developed by the
hardware vendor. A large number of enterprise storage plug-in drivers
ship out-of-the-box with OpenStack Block Storage (and many more are
available via third party channels). General purpose clouds are more
likely to use directly attached storage in the majority of block
storage nodes, making it necessary to provide additional levels of
service to projects which can only be provided by enterprise class
storage solutions.

Redundancy and availability requirements impact the decision to use a
RAID controller card in block storage nodes. The input-output per
second (IOPS) demand of your application will influence whether or not
you should use a RAID controller, and which level of RAID is required.
Making use of higher performing RAID volumes is suggested when
considering performance. However, where redundancy of block storage
volumes is more important, we recommend making use of a redundant RAID
configuration such as RAID 5 or RAID 6. Some specialized features, such
as automated replication of block storage volumes, may require the use
of third-party plug-ins and enterprise block storage solutions in order
to provide the high demand on storage. Furthermore, where extreme
performance is a requirement, it may also be necessary to make use of
high speed SSD disk drives or high performing flash storage solutions.

Software selection
~~~~~~~~~~~~~~~~~~

The software selection process plays a large role in the architecture
of a general purpose cloud. The following have a large impact on the
design of the cloud:

* Choice of operating system

* Selection of OpenStack software components

* Choice of hypervisor

* Selection of supplemental software

Operating system (OS) selection plays a large role in the design and
architecture of a cloud. There are a number of OSes which have native
support for OpenStack including:

* Ubuntu

* Red Hat Enterprise Linux (RHEL)

* CentOS

* SUSE Linux Enterprise Server (SLES)

.. note::

   Native support is not a constraint on the choice of OS; users are
   free to choose just about any Linux distribution (or even Microsoft
   Windows) and install OpenStack directly from source (or compile
   their own packages). However, many organizations will prefer to
   install OpenStack from distribution-supplied packages or
   repositories (although using the distribution vendor's OpenStack
   packages might be a requirement for support).

OS selection also directly influences hypervisor selection. A cloud
architect who selects Ubuntu, RHEL, or SLES has some flexibility in
hypervisor; KVM, Xen, and LXC are supported virtualization methods
available under OpenStack Compute (nova) on these Linux distributions.
However, a cloud architect who selects Windows Server is limited to
Hyper-V. Similarly, a cloud architect who selects XenServer is limited
to the CentOS-based dom0 operating system provided with XenServer.

The primary factors that play into OS-hypervisor selection include:

User requirements
   The selection of OS-hypervisor combination first and foremost needs
   to support the user requirements.

Support
   The selected OS-hypervisor combination needs to be supported by
   OpenStack.

Interoperability
   The OS-hypervisor needs to be interoperable with other features and
   services in the OpenStack design in order to meet the user
   requirements.

Hypervisor
~~~~~~~~~~

OpenStack supports a wide variety of hypervisors, one or more of which
can be used in a single cloud. These hypervisors include:

* KVM (and QEMU)

* XCP/XenServer

* vSphere (vCenter and ESXi)

* Hyper-V

* LXC

* Docker

* Bare-metal

A complete list of supported hypervisors and their capabilities can be
found at `OpenStack Hypervisor Support
Matrix <https://wiki.openstack.org/wiki/HypervisorSupportMatrix>`_.

We recommend general purpose clouds use hypervisors that support the
most general purpose use cases, such as KVM and Xen. More specific
hypervisors should be chosen to account for specific functionality or a
supported feature requirement. In some cases, there may also be a
mandated requirement to run software on a certified hypervisor,
including solutions from VMware, Microsoft, and Citrix.

The features offered through the OpenStack cloud platform determine the
best choice of a hypervisor. Each hypervisor has its own hardware
requirements which may affect the decisions around designing a general
purpose cloud.

In a mixed hypervisor environment, specific aggregates of compute
resources, each with defined capabilities, enable workloads to utilize
software and hardware specific to their particular requirements. This
functionality can be exposed explicitly to the end user, or accessed
through defined metadata within a particular flavor of an instance.

OpenStack components
~~~~~~~~~~~~~~~~~~~~

A general purpose OpenStack cloud design should incorporate the core
OpenStack services to provide a wide range of services to end-users.
The OpenStack core services recommended in a general purpose cloud are:

* :term:`Compute service (nova)`

* :term:`Networking service (neutron)`

* :term:`Image service (glance)`

* :term:`Identity service (keystone)`

* :term:`Dashboard (horizon)`

* :term:`Telemetry service (telemetry)`

A general purpose cloud may also include the :term:`Object Storage
service (swift)` and the :term:`Block Storage service (cinder)`. These
may be selected to provide storage to applications and instances.

Supplemental software
~~~~~~~~~~~~~~~~~~~~~

A general purpose OpenStack deployment consists of more than just
OpenStack-specific components. A typical deployment involves services
that provide supporting functionality, including databases and message
queues, and may also involve software to provide high availability of
the OpenStack environment. Design decisions around the underlying
message queue might affect the required number of controller services,
as well as the technology to provide highly resilient database
functionality, such as MariaDB with Galera. In such a scenario,
replication of services relies on quorum.

Where many general purpose deployments use hardware load balancers to
provide highly available API access and SSL termination, software
solutions, for example HAProxy, can also be considered. It is vital to
ensure that such software implementations are also made highly
available. High availability can be achieved by using software such as
Keepalived or Pacemaker with Corosync. Pacemaker and Corosync can
provide active-active or active-passive highly available
configurations, depending on the specific service in the OpenStack
environment. Using this software can affect the design as it assumes at
least a 2-node controller infrastructure where one of those nodes may
be running certain services in standby mode.

Memcached is a distributed memory object caching system, and Redis is a
key-value store. Both are deployed on general purpose clouds to assist
in alleviating load to the Identity service. The memcached service
caches tokens, and due to its distributed nature it can help alleviate
some bottlenecks to the underlying authentication system. Using
memcached or Redis does not affect the overall design of your
architecture as they tend to be deployed onto the infrastructure nodes
providing the OpenStack services.
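
A sketch of enabling memcached-backed caching for the Identity service,
assuming two hypothetical controller hosts running memcached:

.. code-block:: console

   # crudini --set /etc/keystone/keystone.conf cache enabled true
   # crudini --set /etc/keystone/keystone.conf cache backend \
       dogpile.cache.memcached
   # crudini --set /etc/keystone/keystone.conf cache memcache_servers \
       controller1:11211,controller2:11211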

Controller infrastructure
~~~~~~~~~~~~~~~~~~~~~~~~~

The Controller infrastructure nodes provide management services to the
end-user as well as providing services internally for the operation of
the cloud. The Controllers run message queuing services that carry
system messages between each service. Performance issues related to the
message bus would lead to delays in sending that message to where it
needs to go. The result of this condition would be delays in
operational functions such as spinning up and deleting instances,
provisioning new storage volumes, and managing network resources. Such
delays could adversely affect an application's ability to react to
certain conditions, especially when using auto-scaling features. It is
important to properly design the hardware used to run the controller
infrastructure as outlined above in the Hardware Selection section.

Performance of the controller services is not limited to processing
power; restrictions may also emerge in serving concurrent users. Ensure
that the APIs and Horizon services are load tested to confirm that you
are able to serve your customers. Particular attention should be paid
to the OpenStack Identity service (keystone), which provides
authentication and authorization for all services, both internally to
OpenStack itself and to end-users. This service can lead to a
degradation of overall performance if it is not sized appropriately.

Network performance
~~~~~~~~~~~~~~~~~~~

In a general purpose OpenStack cloud, the requirements of the network
help determine performance capabilities. It is possible to design
OpenStack environments that run a mix of networking capabilities. By
utilizing the different interface speeds, the users of the OpenStack
environment can choose networks that are fit for their purpose.

Network performance can be boosted considerably by implementing
hardware load balancers to provide front-end service to the cloud APIs.
The hardware load balancers also perform SSL termination if that is a
requirement of your environment. When implementing SSL offloading, it
is important to understand the SSL offloading capabilities of the
devices selected.

Compute host
~~~~~~~~~~~~

The choice of hardware specifications used in compute nodes, including
CPU, memory, and disk type, directly affects the performance of the
instances. Other factors which can directly affect performance include
tunable parameters within the OpenStack services, for example the
overcommit ratio applied to resources. The defaults in OpenStack
Compute set a 16:1 over-commit of the CPU and a 1.5:1 over-commit of
the memory. Running at such high ratios leads to an increase in
"noisy-neighbor" activity. Care must be taken when sizing your Compute
environment to avoid this scenario. For running general purpose
OpenStack environments it is possible to keep to the defaults, but make
sure to monitor your environment as usage increases.

Storage performance
~~~~~~~~~~~~~~~~~~~

When considering the performance of Block Storage, hardware and
architecture choice is important. Block Storage can use enterprise
back-end systems such as NetApp or EMC, scale-out storage such as
GlusterFS and Ceph, or simply use the capabilities of directly attached
storage in the nodes themselves. Block Storage may be deployed so that
traffic traverses the host network, which could affect, and be
adversely affected by, the front-side API traffic performance. As such,
consider using a dedicated data storage network with dedicated
interfaces on the Controller and Compute hosts.

When considering the performance of Object Storage, a number of design
choices will affect performance. A user's access to the Object Storage
is through the proxy services, which sit behind hardware load
balancers. By the very nature of a highly resilient storage system,
replication of the data would affect performance of the overall system.
In this case, 10 GbE (or better) networking is recommended throughout
the storage network architecture.

High Availability
~~~~~~~~~~~~~~~~~

In OpenStack, the infrastructure is integral to providing services and
should always be available, especially when operating with SLAs.
Ensuring network availability is accomplished by designing the network
architecture so that no single point of failure exists. A consideration
of the number of switches, routes, and redundancies of power should be
factored into core infrastructure, as well as the associated bonding of
networks to provide diverse routes to your highly available switch
infrastructure.

The OpenStack services themselves should be deployed across multiple
servers that do not represent a single point of failure. Ensuring API
availability can be achieved by placing these services behind highly
available load balancers that have multiple OpenStack servers as
members.

OpenStack lends itself to deployment in a highly available manner where
it is expected that at least 2 servers be utilized. These can run all
the services involved, from the message queuing service, for example
RabbitMQ or QPID, to an appropriately deployed database service such as
MySQL or MariaDB. As services in the cloud are scaled out, back-end
services will need to scale too. Monitoring and reporting on server
utilization and response times, as well as load testing your systems,
will help determine scale out decisions.

Care must be taken when deciding network functionality. Currently,
OpenStack supports both the legacy networking (nova-network) system and
the newer, extensible OpenStack Networking (neutron). Both have their
pros and cons when it comes to providing highly available access.
Legacy networking, which provides networking access maintained in the
OpenStack Compute code, provides a feature that removes a single point
of failure when it comes to routing, and this feature is currently
missing in OpenStack Networking. The effect of legacy networking's
multi-host functionality restricts failure domains to the host running
that instance.

When using OpenStack Networking, the OpenStack controller servers or
separate Networking hosts handle routing. For a deployment that
requires features available only in Networking, it is possible to
remove this restriction by using third party software that helps
maintain highly available L3 routes. Doing so allows for common APIs to
control network hardware, or to provide complex multi-tier web
applications in a secure manner. It is also possible to completely
remove routing from Networking, and instead rely on hardware routing
capabilities. In this case, the switching infrastructure must support
L3 routing.

OpenStack Networking and legacy networking both have their advantages
and disadvantages. They are both valid and supported options that fit
different network deployment models described in the
`Networking deployment options table <https://docs.openstack.org/ops-guide/arch-network-design.html#network-topology>`_
of the OpenStack Operations Guide.

Ensure your deployment has adequate back-up capabilities.
Application design must also be factored into the capabilities of the
|
|
||||||
underlying cloud infrastructure. If the compute hosts do not provide a
|
|
||||||
seamless live migration capability, then it must be expected that when a
|
|
||||||
compute host fails, that instance and any data local to that instance
|
|
||||||
will be deleted. However, when providing an expectation to users that
|
|
||||||
instances have a high-level of uptime guarantees, the infrastructure
|
|
||||||
must be deployed in a way that eliminates any single point of failure
|
|
||||||
when a compute host disappears. This may include utilizing shared file
|
|
||||||
systems on enterprise storage or OpenStack Block storage to provide a
|
|
||||||
level of guarantee to match service features.
|
|
||||||
|
|
||||||
For more information on high availability in OpenStack, see the
|
|
||||||
`OpenStack High Availability
|
|
||||||
Guide <https://docs.openstack.org/ha-guide/>`_.
|
|
||||||
|
|
||||||
Security
|
|
||||||
~~~~~~~~
|
|
||||||
|
|
||||||
A security domain comprises users, applications, servers or networks
|
|
||||||
that share common trust requirements and expectations within a system.
|
|
||||||
Typically they have the same authentication and authorization
|
|
||||||
requirements and users.
|
|
||||||
|
|
||||||
These security domains are:
|
|
||||||
|
|
||||||
* Public
|
|
||||||
|
|
||||||
* Guest
|
|
||||||
|
|
||||||
* Management
|
|
||||||
|
|
||||||
* Data
|
|
||||||
|
|
||||||
These security domains can be mapped to an OpenStack deployment
|
|
||||||
individually, or combined. In each case, the cloud operator should be
|
|
||||||
aware of the appropriate security concerns. Security domains should be
|
|
||||||
mapped out against your specific OpenStack deployment topology. The
|
|
||||||
domains and their trust requirements depend upon whether the cloud
|
|
||||||
instance is public, private, or hybrid.
|
|
||||||
|
|
||||||
* The public security domain is an entirely untrusted area of the cloud
|
|
||||||
infrastructure. It can refer to the internet as a whole or simply to
|
|
||||||
networks over which you have no authority. This domain should always
|
|
||||||
be considered untrusted.
|
|
||||||
|
|
||||||
* The guest security domain handles compute data generated by instances
|
|
||||||
on the cloud but not services that support the operation of the
|
|
||||||
cloud, such as API calls. Public cloud providers and private cloud
|
|
||||||
providers who do not have stringent controls on instance use or who
|
|
||||||
allow unrestricted internet access to instances should consider this
|
|
||||||
domain to be untrusted. Private cloud providers may want to consider
|
|
||||||
this network as internal and therefore trusted only if they have
|
|
||||||
controls in place to assert that they trust instances and all their
|
|
||||||
projects.
|
|
||||||
|
|
||||||
* The management security domain is where services interact. Sometimes
|
|
||||||
referred to as the control plane, the networks in this domain
|
|
||||||
transport confidential data such as configuration parameters, user
|
|
||||||
names, and passwords. In most deployments this domain is considered
|
|
||||||
trusted.
|
|
||||||
|
|
||||||
* The data security domain is concerned primarily with information
|
|
||||||
pertaining to the storage services within OpenStack. Much of the data
|
|
||||||
that crosses this network has high integrity and confidentiality
|
|
||||||
requirements and, depending on the type of deployment, may also have
|
|
||||||
strong availability requirements. The trust level of this network is
|
|
||||||
heavily dependent on other deployment decisions.
|
|
||||||
|
|
||||||
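
One lightweight way to make this mapping explicit is to record it as
data that deployment tooling can validate against. The sketch below is
a minimal example; the network names and trust levels are assumptions
for a hypothetical private cloud, not a prescription:

.. code-block:: python

   # A hypothetical mapping of deployment networks to security domains.
   # Adjust the networks and trust levels to match your own topology.
   SECURITY_DOMAINS = {
       "external-net": {"domain": "public", "trusted": False},
       "tenant-net": {"domain": "guest", "trusted": False},
       "mgmt-net": {"domain": "management", "trusted": True},
       "storage-net": {"domain": "data", "trusted": True},
   }

   def untrusted_networks(domains):
       """List networks that must be treated as untrusted."""
       return [net for net, info in domains.items()
               if not info["trusted"]]

   print(untrusted_networks(SECURITY_DOMAINS))
   # ['external-net', 'tenant-net']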

When deploying OpenStack in an enterprise as a private cloud, it is
usually behind the firewall and within the trusted network, alongside
existing systems. Users of the cloud are employees who are bound by the
security requirements set forth by the company. This tends to push most
of the security domains towards a more trusted model. However, when
deploying OpenStack in a public-facing role, no assumptions can be made
and the attack vectors significantly increase.

Consideration must be taken when managing the users of the system for
both public and private clouds. The Identity service allows LDAP to
be part of the authentication process. Including such systems in an
OpenStack deployment may ease user management when integrating with
existing systems.

It is important to understand that user authentication requests include
sensitive information such as user names, passwords, and
authentication tokens. For this reason, placing the API services behind
hardware that performs SSL termination is strongly recommended.

For more information on OpenStack security, see the `OpenStack Security
Guide <https://docs.openstack.org/security-guide/>`_.
@ -1,99 +0,0 @@
=================
User requirements
=================

When building a general purpose cloud, you should follow the
:term:`Infrastructure-as-a-Service (IaaS)` model; a platform best suited
for use cases with simple requirements. General purpose cloud user
requirements are not complex. However, it is important to capture them
even if the project has minimum business and technical requirements, such
as a proof of concept (PoC) or a small lab platform.

.. note::

   The following user considerations are written from the perspective
   of the cloud builder, not from the perspective of the end user.

Business requirements
~~~~~~~~~~~~~~~~~~~~~

Cost
    Financial factors are a primary concern for any organization. Cost
    is an important criterion as general purpose clouds are considered
    the baseline from which all other cloud architecture environments
    derive. General purpose clouds do not always provide the most
    cost-effective environment for specialized applications or
    situations. Unless razor-thin margins and costs have been mandated
    as a critical factor, cost should not be the sole consideration when
    choosing or designing a general purpose architecture.

Time to market
    The ability to deliver services or products within a flexible time
    frame is a common business factor when building a general purpose
    cloud. Delivering a product in six months instead of two years is a
    driving force behind the decision to build general purpose clouds.
    General purpose clouds allow users to self-provision and gain access
    to compute, network, and storage resources on demand, thus decreasing
    time to market.

Revenue opportunity
    Revenue opportunities for a cloud vary greatly based on the
    intended use case of that particular cloud. Some general purpose
    clouds are built for commercial customer-facing products, but there
    are alternatives that might make the general purpose cloud the right
    choice.

Technical requirements
~~~~~~~~~~~~~~~~~~~~~~

Technical cloud architecture requirements should be weighed against the
business requirements.

Performance
    As a baseline product, general purpose clouds do not provide
    optimized performance for any particular function. While a general
    purpose cloud should provide enough performance to satisfy average
    user considerations, performance is not a general purpose cloud
    customer driver.

No predefined usage model
    The lack of a predefined usage model enables the user to run a wide
    variety of applications without having to know the application
    requirements in advance. This provides a degree of independence and
    flexibility that no other cloud scenarios are able to provide.

On-demand and self-service application
    By definition, a cloud provides end users with the ability to
    self-provision computing power, storage, networks, and software in a
    simple and flexible way. The user must be able to scale their
    resources up to a substantial level without disrupting the
    underlying host operations. One of the benefits of using a general
    purpose cloud architecture is the ability to start with limited
    resources and increase them over time as user demand grows.

Public cloud
    For a company interested in building a commercial public cloud
    offering based on OpenStack, the general purpose architecture model
    might be the best choice. Designers are not always going to know the
    purposes or workloads for which the end users will use the cloud.

Internal consumption (private) cloud
    Organizations need to determine if it is logical to create their own
    clouds internally. Using a private cloud, organizations are able to
    maintain complete control over architectural and cloud components.

    .. note::

       Users will want to combine using the internal cloud with access
       to an external cloud. If that case is likely, it might be worth
       exploring the possibility of taking a multi-cloud approach with
       regard to at least some of the architectural elements.

    Designs that incorporate the use of multiple clouds, such as a
    private cloud and a public cloud offering, are described in the
    "Multi-Cloud" scenario, see :doc:`multi-site`.

Security
    Security should be implemented according to asset, threat, and
    vulnerability risk assessment matrices. For cloud domains that
    require increased computer security, network security, or
    information security, a general purpose cloud is not considered an
    appropriate choice.
@ -1,57 +0,0 @@
===============
General purpose
===============

.. toctree::
   :maxdepth: 2

   generalpurpose-user-requirements.rst
   generalpurpose-technical-considerations.rst
   generalpurpose-operational-considerations.rst
   generalpurpose-architecture.rst
   generalpurpose-prescriptive-example.rst


An OpenStack general purpose cloud is often considered a starting
point for building a cloud deployment. General purpose clouds balance
their components and do not emphasize any particular aspect of the
overall computing environment. Cloud design must give equal weight
to the compute, network, and storage components. General purpose clouds
are found in private, public, and hybrid environments, lending
themselves to many different use cases.

.. note::

   General purpose clouds are homogeneous deployments.
   They are not suited to specialized environments or edge case situations.

Common uses of a general purpose cloud include:

* Providing a simple database
* A web application runtime environment
* A shared application development platform
* A lab test bed

Use cases that benefit from scale-out rather than scale-up approaches
are good candidates for general purpose cloud architecture.

A general purpose cloud is designed to have a range of potential
uses or functions; it is not specialized for specific use cases. General
purpose architecture is designed to address roughly 80% of potential
use cases. The infrastructure, in itself, is a specific use case,
enabling it to be used as a base model for the design process.

General purpose clouds are designed to be platforms that are suited
for general purpose applications.

General purpose clouds are limited to the most basic components,
but they can include additional resources such as:

* Virtual-machine disk image library
* Raw block storage
* File or object storage
* Firewalls
* Load balancers
* IP addresses
* Network overlays or virtual local area networks (VLANs)
* Software bundles
@ -1,149 +0,0 @@
============
Architecture
============

Map out the dependencies of the expected workloads and the cloud
infrastructures required to support them. Architecting a solution
for the broadest compatibility between cloud platforms minimizes
the need to create workarounds and processes to fill identified gaps.

For your chosen cloud management platform, note the relative
levels of support for both monitoring and orchestration.

.. figure:: figures/Multi-Cloud_Priv-AWS4.png
   :width: 100%

Image portability
~~~~~~~~~~~~~~~~~

The majority of cloud workloads currently run on instances using
hypervisor technologies. The challenge is that each of these hypervisors
uses an image format that may not be compatible with the others.
When possible, standardize on a single hypervisor and instance image format.
This may not be possible when using externally managed public clouds.

Conversion tools exist to address image format compatibility.
Examples include `virt-p2v/virt-v2v <http://libguestfs.org/virt-v2v>`_
and `virt-edit <http://libguestfs.org/virt-edit.1.html>`_.
These tools cannot serve beyond basic cloud instance specifications.
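
For straightforward format conversion, the ``qemu-img`` utility is
another common option. The sketch below wraps it in Python; the file
names are placeholders and the format pairing is one assumed example:

.. code-block:: python

   # Convert a QCOW2 image to VMDK with qemu-img (illustrative wrapper).
   # Requires qemu-img to be installed and on the PATH.
   import subprocess

   def convert_image(src, dst, src_fmt="qcow2", dst_fmt="vmdk"):
       """Convert a disk image between formats using qemu-img."""
       subprocess.check_call([
           "qemu-img", "convert",
           "-f", src_fmt,   # source format
           "-O", dst_fmt,   # output format
           src, dst,
       ])

   convert_image("web-server.qcow2", "web-server.vmdk")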

Alternatively, build a thin operating system image as the base for
new instances.
This facilitates rapid creation of cloud instances using cloud orchestration
or configuration management tools for more specific templating.
Remember that if you intend to use portable images for disaster recovery,
application diversity, or high availability, your users could move
the images and instances between cloud platforms regularly.

Upper-layer services
~~~~~~~~~~~~~~~~~~~~

Many clouds offer complementary services beyond the
basic compute, network, and storage components.
These additional services often simplify the deployment
and management of applications on a cloud platform.

When moving workloads from the source to the destination
cloud platform, consider that the destination cloud platform
may not have comparable services. In that case, you may need to
implement the workload in a different way or with a different technology.

For example, moving an application that uses a NoSQL database
service such as MongoDB could cause difficulties in maintaining
the application between the platforms.

There are a number of options that are appropriate for
the hybrid cloud use case:

* Implementing a baseline of upper-layer services across all
  of the cloud platforms. For platforms that do not support
  a given service, create a service on top of that platform
  and apply it to the workloads as they are launched on that cloud.
* For example, through the :term:`Database service <Database service
  (trove)>` for OpenStack (:term:`trove`), OpenStack supports MySQL
  as a service but not NoSQL databases in production.
  To move from or run alongside AWS, a NoSQL workload must use
  an automation tool, such as the Orchestration service (heat),
  to recreate the NoSQL database on top of OpenStack.
* Deploying a :term:`Platform-as-a-Service (PaaS)` technology that
  abstracts the upper-layer services from the underlying cloud platform.
  The unit of application deployment and migration is the PaaS.
  It leverages the services of the PaaS and only consumes the base
  infrastructure services of the cloud platform.
* Using automation tools to create the required upper-layer services
  that are portable across all cloud platforms.

For example, instead of using database services that are inherent
in the cloud platforms, launch cloud instances and deploy the
databases on those instances using scripts or configuration and
application deployment tools, as in the sketch below.
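
A minimal version of that pattern, assuming the ``openstacksdk`` library
and a cloud named ``private`` configured in ``clouds.yaml``, might look
like the following. The image and flavor names and the install script
are placeholder assumptions:

.. code-block:: python

   # Launch an instance and bootstrap a database on it via user data.
   # Illustrative sketch only; the names below are assumptions.
   import openstack

   BOOTSTRAP = """#!/bin/bash
   apt-get update && apt-get install -y mongodb
   """

   conn = openstack.connect(cloud="private")
   server = conn.create_server(
       "nosql-node-1",
       image="ubuntu-14.04",   # assumed image name
       flavor="m1.medium",     # assumed flavor name
       userdata=BOOTSTRAP,     # runs at first boot via cloud-init
       wait=True,
   )
   print("Launched", server.name)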

Network services
~~~~~~~~~~~~~~~~

Network services functionality is a critical component of
multiple cloud architectures. It is an important factor
to assess when choosing a CMP and cloud provider.
Considerations include:

* Functionality
* Security
* Scalability
* High availability (HA)

Verify and test critical cloud endpoint features.

* After selecting the network functionality framework,
  you must confirm the functionality is compatible.
  This ensures testing and functionality persist
  during and after upgrades.

  .. note::

     Diverse cloud platforms may de-synchronize over time
     if you do not maintain their mutual compatibility.
     This is a particular issue with APIs.

* Scalability across multiple cloud providers determines
  your choice of underlying network framework.
  It is important to have the network API functions presented
  and to verify that the desired functionality persists across
  all chosen cloud endpoints.

* High availability implementations vary in functionality and design.
  Examples of some common methods are active-hot-standby,
  active-passive, and active-active.
  Develop your high availability implementation and a test framework to
  understand the functionality and limitations of the environment.

* It is imperative to address security considerations, for example,
  how data is secured between the client and the endpoint, and how to
  protect traffic that traverses the multiple clouds.
  Business and regulatory requirements dictate which security
  approach to take. For more information, see the
  :ref:`Security requirements <security>` chapter.

Data
~~~~

Traditionally, replication has been the best method of protecting
object store implementations. A variety of replication methods exist
in storage architectures, for example synchronous and asynchronous
mirroring. Most object stores and back-end storage systems implement
methods for replication at the storage subsystem layer.
Object stores also tailor replication techniques
to fit a cloud's requirements.

Organizations must find the right balance between
data integrity and data availability. Replication strategy may
also influence disaster recovery methods.

Replication across different racks, data centers, and geographical
regions increases focus on determining and ensuring data locality.
The ability to guarantee data is accessed from the nearest or
fastest storage can be necessary for applications to perform well.

.. note::

   When running embedded object store methods, ensure that you do not
   instigate extra data replication, as this can cause performance issues.
@ -1,80 +0,0 @@
==========================
Operational considerations
==========================

Hybrid cloud deployments present complex operational challenges.
Differences between provider clouds can cause incompatibilities
with workloads or Cloud Management Platforms (CMP).
Cloud providers may also offer different levels of integration
with competing cloud offerings.

Monitoring is critical to maintaining a hybrid cloud, and it is
important to determine if a CMP supports monitoring of all the
clouds involved, or if compatible APIs are available to be queried
for necessary information.

Agility
~~~~~~~

Hybrid clouds provide application availability across different
cloud environments and technologies.
This availability enables the deployment to survive disaster
in any single cloud environment.
Each cloud should provide the means to create instances quickly in
response to capacity issues or failure elsewhere in the hybrid cloud.

Application readiness
~~~~~~~~~~~~~~~~~~~~~

Enterprise workloads that depend on the underlying infrastructure
for availability are not suited to running on OpenStack.
If the application cannot tolerate infrastructure failures,
it is likely to require significant operator intervention to recover.
Applications for hybrid clouds must be fault tolerant, with an SLA
that is not tied to the underlying infrastructure.
Ideally, cloud applications should be able to recover when entire
racks and data centers experience an outage.

Upgrades
~~~~~~~~

If a deployment includes a public cloud, predicting upgrades may
not be possible. Carefully examine provider SLAs.

.. note::

   At massive scale, even when dealing with a cloud that offers
   an SLA with a high percentage of uptime, workloads must be able
   to recover quickly.

When upgrading private cloud deployments, minimize disruption by
making incremental changes and providing a facility to either roll back
or continue to roll forward when using a continuous delivery model.

You may need to coordinate CMP upgrades with hybrid cloud upgrades
if there are API changes.

Network Operation Center
~~~~~~~~~~~~~~~~~~~~~~~~

Consider infrastructure control when planning the Network Operation
Center (NOC) for a hybrid cloud environment.
If a significant portion of the cloud is on externally managed systems,
prepare for situations where it may not be possible to make changes.
Additionally, providers may differ on how infrastructure must be
managed and exposed. This can lead to delays in root cause analysis
where each provider insists the blame lies with the other.

Ensure that the network structure connects all clouds to form an
integrated system, keeping in mind the state of handoffs.
These handoffs must both be as reliable as possible and
include as little latency as possible to ensure the best
performance of the overall system.

Maintainability
~~~~~~~~~~~~~~~

Hybrid clouds rely on third-party systems and processes.
As a result, it is not possible to guarantee proper maintenance
of the overall system. Instead, be prepared to abandon workloads
and recreate them in an improved state.
@ -1,155 +0,0 @@
=====================
Prescriptive examples
=====================

Hybrid cloud environments are designed for these use cases:

* Bursting workloads from private to public OpenStack clouds
* Bursting workloads from private to public non-OpenStack clouds
* High availability across clouds (for technical diversity)

This chapter provides examples of environments that address
each of these use cases.

Bursting to a public OpenStack cloud
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Company A's data center is running low on capacity.
It is not possible to expand the data center in the foreseeable future.
In order to accommodate the continuously growing need for
development resources in the organization,
Company A decides to use resources in the public cloud.

Company A has an established data center with a substantial amount
of hardware. Migrating the workloads to a public cloud is not feasible.

The company has an internal cloud management platform that directs
requests to the appropriate cloud, depending on the local capacity.
This is a custom in-house application written for this specific purpose.

This solution is depicted in the figure below:

.. figure:: figures/Multi-Cloud_Priv-Pub3.png
   :width: 100%

This example shows two clouds with a Cloud Management
Platform (CMP) connecting them. This guide does not
discuss a specific CMP, but describes how the Orchestration and
Telemetry services handle, manage, and control workloads.

The private OpenStack cloud has at least one controller and at least
one compute node. It includes metering using the Telemetry service.
The Telemetry service captures the load increase and the CMP
processes the information. If there is available capacity,
the CMP uses the OpenStack API to call the Orchestration service.
This creates instances on the private cloud in response to user requests.
When capacity is not available on the private cloud, the CMP issues
a request to the Orchestration service API of the public cloud.
This creates the instance on the public cloud.
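
In outline, the CMP's bursting decision reduces to a simple policy.
The sketch below is purely illustrative: the helper methods stand in
for whatever Telemetry queries and Orchestration calls the CMP actually
makes, and the threshold is an assumed value:

.. code-block:: python

   # Hypothetical CMP bursting policy (illustrative pseudologic).
   CAPACITY_THRESHOLD = 0.85  # assumed utilization cutoff

   def place_workload(stack_name, template, private, public):
       """Create a stack on the private cloud if it has headroom,
       otherwise burst to the public cloud."""
       # get_utilization() stands in for a Telemetry query; it is a
       # hypothetical helper, not a real OpenStack API call.
       if private.get_utilization() < CAPACITY_THRESHOLD:
           target = private
       else:
           target = public  # burst when the private cloud is full
       # create_stack() stands in for an Orchestration (heat) call.
       return target.create_stack(stack_name, template)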

In this example, Company A does not direct all deployments to an
external public cloud, due to concerns regarding resource control,
security, and increased operational expense.

Bursting to a public non-OpenStack cloud
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The second example examines bursting workloads from the private cloud
into a non-OpenStack public cloud, using Amazon Web Services (AWS)
to take advantage of additional capacity and to scale applications.

The following diagram demonstrates an OpenStack-to-AWS hybrid cloud:

.. figure:: figures/Multi-Cloud_Priv-AWS4.png
   :width: 100%

Company B states that its developers are already using AWS
and do not want to change to a different provider.

If the CMP is capable of connecting to an external cloud
provider with an appropriate API, the workflow process remains
the same as in the previous scenario.
The actions the CMP takes, such as monitoring loads and
creating new instances, stay the same.
However, the CMP performs actions in the public cloud
using applicable API calls.

If the public cloud is AWS, the CMP would use the
EC2 API to create a new instance and assign an Elastic IP.
It can then add that IP to HAProxy in the private cloud.
The CMP can also reference AWS-specific
tools such as CloudWatch and CloudFormation.
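
Using the ``boto3`` AWS SDK for Python, that burst step might look like
the following sketch. The AMI ID, instance type, and region are
placeholder assumptions:

.. code-block:: python

   # Burst one instance to AWS and give it a public Elastic IP.
   # Illustrative only; the identifiers below are placeholders.
   import boto3

   ec2 = boto3.client("ec2", region_name="us-east-1")

   # Launch the instance from an assumed AMI.
   result = ec2.run_instances(
       ImageId="ami-0123456789abcdef0",   # placeholder AMI ID
       InstanceType="t2.medium",          # assumed instance type
       MinCount=1,
       MaxCount=1,
   )
   instance_id = result["Instances"][0]["InstanceId"]

   # Allocate an Elastic IP and attach it to the new instance.
   ec2.get_waiter("instance_running").wait(InstanceIds=[instance_id])
   eip = ec2.allocate_address(Domain="vpc")
   ec2.associate_address(InstanceId=instance_id,
                         AllocationId=eip["AllocationId"])

   # The CMP could now register eip["PublicIp"] with HAProxy.
   print("Burst instance reachable at", eip["PublicIp"])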

Several open source toolkits for building CMPs are
available and can handle this kind of translation.
Examples include ManageIQ, jClouds, and JumpGate.

High availability and disaster recovery
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Company C requires its local data center to be able to
recover from failure. Some of the workloads currently in
use are running on its private OpenStack cloud.
Protecting the data involves Block Storage, Object Storage,
and a database. The architecture supports the failure of
large components of the system while ensuring that the
system continues to deliver services.
While the services remain available to users, the failed
components are restored in the background based on standard
best practice data replication policies.
To achieve these objectives, Company C replicates data to
a second cloud in a geographically distant location.
The following diagram describes this system:

.. figure:: figures/Multi-Cloud_failover2.png
   :width: 100%

This example includes two private OpenStack clouds connected with a CMP.
The source cloud, OpenStack Cloud 1, includes a controller and
at least one instance running MySQL. It also includes at least
one Block Storage volume and one Object Storage volume.
This means that data is available to the users at all times.
The method for protecting each of these sources of data differs.

Object Storage relies on the replication capabilities of
the Object Storage provider.
Company C enables OpenStack Object Storage so that it creates
geographically separated replicas that take advantage of this feature.
The company configures storage so that at least one replica
exists in each cloud. In order to make this work, the company
configures a single array spanning both clouds with OpenStack Identity.
Using Federated Identity, the array talks to both clouds, communicating
with OpenStack Object Storage through the Swift proxy.
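
In Swift terms, geographically separated replicas are expressed through
the ring: each site is modeled as a region, and the ring places replicas
across regions. A minimal sketch of building such a ring follows; the
device names, IP addresses, and part-power value are assumptions:

.. code-block:: python

   # Build a 3-replica object ring spanning two regions (two clouds).
   # Illustrative wrapper around the real swift-ring-builder CLI.
   import subprocess

   def ring(*args):
       subprocess.check_call(["swift-ring-builder", "object.builder"]
                             + list(args))

   ring("create", "10", "3", "1")  # part power 10, 3 replicas, 1h min
   # One assumed device per site; r1/r2 are the two regions.
   ring("add", "r1z1-203.0.113.10:6200/sdb", "100")
   ring("add", "r2z1-198.51.100.10:6200/sdb", "100")
   ring("rebalance")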

For Block Storage, the replication is a little more difficult,
and involves tools outside of OpenStack itself.
The OpenStack Block Storage volume is not set as the drive itself
but as a logical object that points to a physical back end.
Disaster recovery is configured for Block Storage with
synchronous backup for the highest level of data protection,
but asynchronous backup could have been chosen as an alternative
that is not as latency sensitive.
For asynchronous backup, the Block Storage API makes it possible
to export both the data and the metadata of a particular volume,
so that it can be moved and replicated elsewhere.
More information can be found here:
`Add volume metadata support to Cinder backup
<https://blueprints.launchpad.net/cinder/+spec/cinder-backup-volume-metadata-support>`_.
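
A sketch of that export step, assuming the ``python-cinderclient``
library and its backup export call, might look like this. The
credentials, endpoint, and IDs are placeholders:

.. code-block:: python

   # Export a volume backup record so it can be imported elsewhere.
   # Sketch only; the auth details and IDs below are placeholders.
   from cinderclient import client

   cinder = client.Client("2", "admin", "secret",
                          "demo", "http://203.0.113.10:5000/v2.0")

   backup = cinder.backups.create("VOLUME-UUID", name="db-volume-backup")
   # export_record returns the information needed to re-import the
   # backup record on the destination cloud.
   record = cinder.backups.export_record(backup.id)
   print(record)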

The synchronous backup creates an identical volume in both
clouds and chooses the appropriate flavor so that each cloud
has an identical back end. This is done by creating volumes
through the CMP. After this is configured, a solution
involving DRBD synchronizes the physical drives.

The database component is backed up using synchronous backups.
MySQL does not support geographically diverse replication,
so disaster recovery is provided by replicating the file itself.
As it is not possible to use Object Storage as the back end of
a database like MySQL, Swift replication is not an option.
Company C decides not to store the data on another geo-tiered
storage system, such as Ceph, as Block Storage.
This would have given another layer of protection.
Another option would have been to store the database on an OpenStack
Block Storage volume and back it up like any other Block Storage volume.
@ -1,155 +0,0 @@
========================
Technical considerations
========================

A hybrid cloud environment requires inspection and
understanding of technical issues in external data centers that may
not be in your control. Ideally, select an architecture
and CMP that are adaptable to changing environments.

Using diverse cloud platforms increases the risk of compatibility
issues, but clouds using the same version and distribution
of OpenStack are less likely to experience problems.

Clouds that exclusively use the same versions of OpenStack should
have no issues, regardless of distribution. More recent distributions
are less likely to encounter incompatibility between versions.
An OpenStack community initiative defines core functions that need to
remain backward compatible between supported versions. For example, the
DefCore initiative defines basic functions that every distribution must
support in order to use the name OpenStack.

Vendors can add proprietary customization to their distributions.
If an application or architecture makes use of these features, it can be
difficult to migrate to or use other types of environments.

If an environment includes non-OpenStack clouds, it may experience
compatibility problems. CMP tools must account for the differences in
the handling of operations and the implementation of services.

**Possible cloud incompatibilities**

* Instance deployment
* Network management
* Application management
* Services implementation

Capacity planning
~~~~~~~~~~~~~~~~~

One of the primary reasons many organizations use a hybrid cloud
is to increase capacity without making large capital investments.

Capacity and the placement of workloads are key design considerations
for hybrid clouds. The long-term capacity plan for these designs must
incorporate growth over time to prevent permanent consumption of more
expensive external clouds.
To avoid this scenario, account for future applications' capacity
requirements and plan growth appropriately.

It is difficult to predict the amount of load a particular
application might incur if the number of users fluctuates, or the
application experiences an unexpected increase in use.
It is possible to define application requirements in terms of
vCPU, RAM, bandwidth, or other resources and plan appropriately.
However, other clouds might not use the same meter or even the same
oversubscription rates.

Oversubscription is a method to emulate more capacity than
may physically be present.
For example, a physical hypervisor node with 32 GB RAM may host
24 instances, each provisioned with 2 GB RAM.
As long as all 24 instances do not concurrently use their full 2 GB,
this arrangement works well.
However, some hosts take oversubscription to extremes and,
as a result, performance can be inconsistent.
If at all possible, determine what the oversubscription rates
of each host are and plan capacity accordingly.
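
The arithmetic behind that example is worth making explicit, since the
same calculation applies to any host you are evaluating:

.. code-block:: python

   # Oversubscription ratio for the 32 GB / 24 x 2 GB example above.
   def oversubscription_ratio(instances, ram_per_instance_gb,
                              physical_ram_gb):
       """Provisioned RAM divided by physical RAM."""
       return instances * ram_per_instance_gb / physical_ram_gb

   ratio = oversubscription_ratio(24, 2, 32)
   print("RAM oversubscription: %.1f:1" % ratio)
   # RAM oversubscription: 1.5:1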

Utilization
~~~~~~~~~~~

A CMP must be aware of what workloads are running, where they are
running, and their preferred utilizations.
For example, in most cases it is desirable to run as many workloads
internally as possible, utilizing other resources only when necessary.
On the other hand, situations exist in which the opposite is true,
such as when an internal cloud is only for development and stressing
it is undesirable. A cost model of various scenarios and
consideration of internal priorities help with this decision.
To improve efficiency, automate these decisions when possible.

The Telemetry service (ceilometer) provides information on the usage
of various OpenStack components. Note the following:

* If Telemetry must retain a large amount of data, for
  example when monitoring a large or active cloud, we recommend
  using a NoSQL back end such as MongoDB.
* You must monitor connections to non-OpenStack clouds
  and report this information to the CMP.

Performance
~~~~~~~~~~~

Performance is critical to hybrid cloud deployments, and they are
affected by many of the same issues as multi-site deployments, such
as network latency between sites. Also consider the time required to
run a workload in different clouds and methods for reducing this time.
This may require moving data closer to applications or applications
closer to the data they process, and grouping functionality so that
connections that require low latency take place over a single cloud
rather than spanning clouds.
This may also require a CMP that can determine which cloud can most
efficiently run which types of workloads.

As with utilization, native OpenStack tools help improve performance.
For example, you can use Telemetry to measure performance and the
Orchestration service (heat) to react to changes in demand.

.. note::

   Orchestration requires special client configurations to integrate
   with Amazon Web Services. For other types of clouds, use CMP features.

Components
~~~~~~~~~~

Using more than one cloud in any design requires consideration of
four OpenStack tools:

OpenStack Compute (nova)
    Regardless of deployment location, hypervisor choice has a direct
    effect on how difficult it is to integrate with additional clouds.

Networking (neutron)
    Whether using OpenStack Networking (neutron) or legacy
    networking (nova-network), it is necessary to understand
    network integration capabilities in order to connect between clouds.

Telemetry (ceilometer)
    Use of Telemetry depends, in large part, on the other parts
    of the cloud that you are using.

Orchestration (heat)
    Orchestration can be a valuable tool for carrying out tasks a
    CMP decides are necessary in an OpenStack-based cloud.

Special considerations
~~~~~~~~~~~~~~~~~~~~~~

Hybrid cloud deployments require consideration of two issues that
are not common in other situations:

Image portability
    As of the Kilo release, there is no common image format that is
    usable by all clouds. Conversion or recreation of images is necessary
    if migrating between clouds. To simplify deployment, use the smallest
    and simplest images feasible, install only what is necessary, and
    use a deployment manager such as Chef or Puppet. Do not use golden
    images to speed up the process unless you repeatedly deploy the same
    images on the same cloud.

API differences
    Avoid using a hybrid cloud deployment with more than just
    OpenStack (or with different versions of OpenStack) as API changes
    can cause compatibility issues.
@ -1,178 +0,0 @@
=================
User requirements
=================

Hybrid cloud architectures are complex, especially those
that use heterogeneous cloud platforms.
Ensure that design choices match requirements so that the
benefits outweigh the inherent additional complexity and risks.

Business considerations
~~~~~~~~~~~~~~~~~~~~~~~

Cost
    A hybrid cloud architecture involves multiple vendors and
    technical architectures.
    These architectures may be more expensive to deploy and maintain.
    Operational costs can be higher because of the need for more
    sophisticated orchestration and brokerage tools than in other
    architectures. In contrast, overall operational costs might be
    lower by virtue of using a cloud brokerage tool to deploy the
    workloads to the most cost-effective platform.

Revenue opportunity
    Revenue opportunities vary based on the intent and use case of the cloud.
    For a commercial, customer-facing product, you must consider whether
    building over multiple platforms makes the design more attractive to
    customers.

Time-to-market
    One common reason to use cloud platforms is to improve the
    time-to-market of a new product or application.
    For example, using multiple cloud platforms is viable when
    there is an existing investment in several applications, as
    it is faster to tie the investments together than to migrate
    the components and refactor them to a single platform.

Business or technical diversity
    Organizations leveraging cloud-based services can embrace business
    diversity and utilize a hybrid cloud design to spread their
    workloads across multiple cloud providers. This ensures that
    no single cloud provider is the sole host for an application.

Application momentum
    Businesses with existing applications may find that it is
    more cost effective to integrate applications on multiple
    cloud platforms than to migrate them to a single platform.

Workload considerations
~~~~~~~~~~~~~~~~~~~~~~~

A workload can be a single application or a suite of applications
that work together. It can also be a duplicate set of applications that
need to run on multiple cloud environments.
In a hybrid cloud deployment, the same workload often needs to function
equally well on radically different public and private cloud environments.
The architecture needs to address these potential conflicts,
complexity, and platform incompatibilities.

Use cases for a hybrid cloud architecture
-----------------------------------------

Dynamic resource expansion or bursting
    An application that requires additional resources may suit a multiple
    cloud architecture. For example, a retailer needs additional resources
    during the holiday season, but does not want to add private cloud
    resources to meet the peak demand.
    The user can accommodate the increased load by bursting to
    a public cloud for these peak load periods. These bursts could be
    for long or short cycles ranging from hourly to yearly.

Disaster recovery and business continuity
    Cheaper storage makes the public cloud suitable for maintaining
    backup applications.

Federated hypervisor and instance management
    Adding self-service, charge back, and transparent delivery of
    the resources from a federated pool can be cost effective.
    In a hybrid cloud environment, this is a particularly important
    consideration. Look for a cloud that provides cross-platform
    hypervisor support and robust instance management tools.

Application portfolio integration
    An enterprise cloud delivers efficient application portfolio
    management and deployments by leveraging self-service features
    and rules according to use.
    Integrating existing cloud environments is a common driver
    when building hybrid cloud architectures.

Migration scenarios
    Hybrid cloud architecture enables the migration of
    applications between different clouds.

High availability
    A combination of locations and platforms enables a level of
    availability that is not possible with a single platform.
    This approach increases design complexity.

As running a workload on multiple cloud platforms increases design
complexity, we recommend first exploring options such as transferring
workloads across clouds at the application, instance, cloud platform,
hypervisor, and network levels.

Tools considerations
~~~~~~~~~~~~~~~~~~~~

Hybrid cloud designs must incorporate tools to facilitate working
across multiple clouds.

Tool functions
--------------

Broker between clouds
    Brokering software evaluates relative costs between different
    cloud platforms. Cloud Management Platforms (CMP)
    allow the designer to determine the right location for the
    workload based on predetermined criteria.

Facilitate orchestration across the clouds
    CMPs simplify the migration of application workloads between
    public, private, and hybrid cloud platforms.
    We recommend using cloud orchestration tools for managing a diverse
    portfolio of systems and applications across multiple cloud platforms.

Network considerations
~~~~~~~~~~~~~~~~~~~~~~

It is important to consider the functionality, security, scalability,
availability, and testability of the network when choosing a CMP and
cloud provider.

* Decide on a network framework and design minimum functionality tests.
  This ensures testing and functionality persist during and after
  upgrades.
* Scalability across multiple cloud providers may dictate which underlying
  network framework you choose for different cloud providers.
  It is important to present the network API functions and to verify
  that functionality persists across all chosen cloud endpoints.
* High availability implementations vary in functionality and design.
  Examples of some common methods are active-hot-standby, active-passive,
  and active-active.
  Development of high availability and test frameworks is necessary to
  ensure understanding of functionality and limitations.
* Consider the security of data between the client and the endpoint,
  and of traffic that traverses the multiple clouds.

Risk mitigation and management considerations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Hybrid cloud architectures introduce additional risk because
they are more complex than a single cloud design and may involve
incompatible components or tools. However, they also reduce
risk by spreading workloads over multiple providers.

Hybrid cloud risks
------------------

Provider availability or implementation details
    Business changes can affect provider availability.
    Likewise, changes in a provider's service can disrupt
    a hybrid cloud environment or increase costs.

Differing SLAs
    Hybrid cloud designs must accommodate differences in SLAs
    between providers, and consider their enforceability.

Security levels
    Securing multiple cloud environments is more complex than
    securing single cloud environments. We recommend addressing
    concerns at the application, network, and cloud platform levels.
    Be aware that each cloud platform approaches security differently,
    and a hybrid cloud design must address and compensate for these
    differences.

Provider API changes
    Consumers of external clouds rarely have control over provider
    changes to APIs, and changes can break compatibility.
    Using only the most common and basic APIs can minimize potential
    conflicts.
@ -1,45 +0,0 @@
======
Hybrid
======

.. toctree::
   :maxdepth: 2

   hybrid-user-requirements.rst
   hybrid-technical-considerations.rst
   hybrid-architecture.rst
   hybrid-operational-considerations.rst
   hybrid-prescriptive-examples.rst


A :term:`hybrid cloud` design is one that uses more than one cloud.
For example, designs that use both an OpenStack-based private
cloud and an OpenStack-based public cloud, or that use an
OpenStack cloud and a non-OpenStack cloud, are hybrid clouds.

:term:`Bursting <bursting>` describes the practice of creating new instances
in an external cloud to alleviate capacity issues in a private cloud.

**Example scenarios suited to hybrid clouds**

* Bursting from a private cloud to a public cloud
* Disaster recovery
* Development and testing
* Federated cloud, enabling users to choose resources from multiple providers
* Supporting legacy systems as they transition to the cloud

Hybrid clouds interact with systems that are outside the
control of the private cloud administrator, and require
careful architecture to prevent conflicts with hardware,
software, and APIs under external control.

The degree to which the architecture is OpenStack-based affects your ability
to accomplish tasks with native OpenStack tools. By definition,
this is a situation in which no single cloud can provide all
of the necessary functionality. In order to manage the entire
system, we recommend using a cloud management platform (CMP).

There are several commercial and open source CMPs available,
but there is no single CMP that can address all needs in all
scenarios, and sometimes a manually built solution is the best
option. This chapter includes discussion of using CMPs for
managing a hybrid cloud.
@ -1,35 +0,0 @@
.. meta::
   :description: This guide targets OpenStack Architects
                 for architectural design
   :keywords: Architecture, OpenStack

===================================
OpenStack Architecture Design Guide
===================================

Abstract
~~~~~~~~

To reap the benefits of OpenStack, you should plan, design,
and architect your cloud properly, taking users' needs into
account and understanding the use cases.

Contents
~~~~~~~~

.. toctree::
   :maxdepth: 2

   common/conventions.rst
   introduction.rst
   legal-security-requirements.rst
   generalpurpose.rst
   compute-focus.rst
   storage-focus.rst
   network-focus.rst
   multi-site.rst
   hybrid.rst
   massively-scalable.rst
   specialized.rst
   references.rst
   common/appendix.rst
@ -1,33 +0,0 @@
How this book is organized
~~~~~~~~~~~~~~~~~~~~~~~~~~

This book examines some of the most common uses for OpenStack clouds,
and explains the considerations for each use case. Cloud architects may
use this book as a comprehensive guide by reading all of the use cases,
but it is also possible to review only the chapters which pertain to a
specific use case. The use cases covered in this guide include:

* :doc:`General purpose<generalpurpose>`: Uses common components that
  address 80% of common use cases.

* :doc:`Compute focused<compute-focus>`: For compute intensive workloads
  such as high performance computing (HPC).

* :doc:`Storage focused<storage-focus>`: For storage intensive workloads
  such as data analytics with parallel file systems.

* :doc:`Network focused<network-focus>`: For high performance and
  reliable networking, such as a :term:`content delivery network (CDN)`.

* :doc:`Multi-site<multi-site>`: For applications that require multiple
  site deployments for geographical, reliability or data locality
  reasons.

* :doc:`Hybrid cloud<hybrid>`: Uses multiple disparate clouds connected
  either for failover, hybrid cloud bursting, or availability.

* :doc:`Massively scalable<massively-scalable>`: For cloud service
  providers or other large installations.

* :doc:`Specialized cases<specialized>`: Architectures that have not
  previously been covered in the defined use cases.
@ -1,55 +0,0 @@

Why and how we wrote this book
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

We wrote this book to guide you through designing an OpenStack cloud
architecture. This guide identifies design considerations for common
cloud use cases and provides examples.

The Architecture Design Guide was written in a book sprint format, which
is a facilitated, rapid development production method for books. The
Book Sprint was facilitated by Faith Bosworth and Adam Hyde of Book
Sprints. For more information, see the Book Sprints website
(www.booksprints.net).

This book was written in five days during July 2014 while exhausting the
M&M, Mountain Dew and healthy options supply, complete with juggling
entertainment during lunches at VMware's headquarters in Palo Alto.

We would like to thank VMware for their generous hospitality, as well as
our employers, Cisco, Cloudscaling, Comcast, EMC, Mirantis, Rackspace,
Red Hat, Verizon, and VMware, for enabling us to contribute our time. We
would especially like to thank Anne Gentle and Kenneth Hui for all of
their shepherding and organization in making this happen.

The author team includes:

* Kenneth Hui (EMC) `@hui\_kenneth <http://twitter.com/hui_kenneth>`__
* Alexandra Settle (Rackspace) `@dewsday <http://twitter.com/dewsday>`__
* Anthony Veiga (Comcast) `@daaelar <http://twitter.com/daaelar>`__
* Beth Cohen (Verizon) `@bfcohen <http://twitter.com/bfcohen>`__
* Kevin Jackson (Rackspace) `@itarchitectkev <http://twitter.com/itarchitectkev>`__
* Maish Saidel-Keesing (Cisco) `@maishsk <http://twitter.com/maishsk>`__
* Nick Chase (Mirantis) `@NickChase <http://twitter.com/NickChase>`__
* Scott Lowe (VMware) `@scott\_lowe <http://twitter.com/scott_lowe>`__
* Sean Collins (Comcast) `@sc68cal <http://twitter.com/sc68cal>`__
* Sean Winn (Cloudscaling) `@seanmwinn <http://twitter.com/seanmwinn>`__
* Sebastian Gutierrez (Red Hat) `@gutseb <http://twitter.com/gutseb>`__
* Stephen Gordon (Red Hat) `@xsgordon <http://twitter.com/xsgordon>`__
* Vinny Valdez (Red Hat) `@VinnyValdez <http://twitter.com/VinnyValdez>`__
@ -1,11 +0,0 @@

Intended audience
~~~~~~~~~~~~~~~~~

This book has been written for architects and designers of OpenStack
clouds. For a guide on deploying and operating OpenStack, please refer
to the `OpenStack Operations Guide <https://docs.openstack.org/ops-guide/>`_.

Before reading this book, we recommend prior knowledge of cloud
architecture and principles, experience in enterprise system design,
Linux and virtualization experience, and a basic understanding of
networking principles and protocols.
@ -1,146 +0,0 @@

Methodology
~~~~~~~~~~~

The best way to design your cloud architecture is through creating and
testing use cases. Planning for applications that support thousands of
sessions per second, variable workloads, and complex, changing data
requires you to identify the key meters. Identifying these key meters,
such as the number of concurrent transactions per second and the size of
the database, makes it possible to build a method for testing your
assumptions.
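For example, a throughput meter can be captured with a small harness
before committing to a design. The following sketch times concurrent
requests against an API endpoint; the URL, concurrency level, and request
count are hypothetical placeholders, not values from this guide.

.. code-block:: python

   # Minimal meter harness: measure sustained requests per second against
   # an HTTP endpoint using only the Python standard library.
   import concurrent.futures
   import time
   import urllib.request

   ENDPOINT = "http://203.0.113.10:8774/"  # hypothetical API endpoint
   WORKERS = 20
   REQUESTS = 200

   def fetch(_):
       with urllib.request.urlopen(ENDPOINT, timeout=10) as resp:
           return resp.status

   start = time.monotonic()
   with concurrent.futures.ThreadPoolExecutor(max_workers=WORKERS) as pool:
       statuses = list(pool.map(fetch, range(REQUESTS)))
   elapsed = time.monotonic() - start

   print(f"{REQUESTS / elapsed:.1f} requests/second, "
         f"{statuses.count(200)} of {REQUESTS} succeeded")

Recording such meters over time turns "the design can handle the load"
from an assumption into a testable claim.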
Use a functional user scenario to develop test cases, and to measure
overall project trajectory.

.. note::

   If you do not want to use an application to develop user
   requirements automatically, you need to create requirements to build
   test harnesses and develop usable meters.

Establishing these meters allows you to respond to changes quickly
without having to set exact requirements in advance. This creates ways
to configure the system, rather than redesigning it every time there is
a requirements change.

.. important::

   It is important to limit scope creep. Ensure you address tool
   limitations, but do not recreate the entire suite of tools. Work
   with technical product owners to establish critical features that
   are needed for a successful cloud deployment.

Application cloud readiness
---------------------------

The cloud does more than host virtual machines and their applications.
This *lift and shift* approach works in certain situations, but there is
a fundamental difference between clouds and traditional bare-metal-based
environments, or even traditional virtualized environments.

In traditional environments, with traditional enterprise applications,
the applications and the servers that run on them are *pets*. They are
lovingly crafted and cared for, the servers have names like Gandalf or
Tardis, and if they get sick someone nurses them back to health. All of
this is designed so that the application does not experience an outage.

In cloud environments, servers are more like cattle. There are thousands
of them, they get names like NY-1138-Q, and if they get sick, they get
put down and a sysadmin installs another one. Traditional applications
that are unprepared for this kind of environment may suffer outages,
loss of data, or complete failure.

There are other reasons to design applications with the cloud in mind.
Some are defensive, such as the fact that because applications cannot be
certain of exactly where or on what hardware they will be launched, they
need to be flexible, or at least adaptable. Others are proactive. For
example, one of the advantages of using the cloud is scalability.
Applications need to be designed in such a way that they can take
advantage of these and other opportunities.

Determining whether an application is cloud-ready
-------------------------------------------------

There are several factors to take into consideration when looking at
whether an application is a good fit for the cloud.

Structure
   A large, monolithic, single-tiered, legacy application typically is
   not a good fit for the cloud. Efficiencies are gained when load can
   be spread over several instances, so that a failure in one part of
   the system can be mitigated without affecting other parts of the
   system, or so that scaling can take place where the app needs it.

Dependencies
   Applications that depend on specific hardware, such as a particular
   chip set or an external device such as a fingerprint reader, might
   not be a good fit for the cloud, unless those dependencies are
   specifically addressed. Similarly, if an application depends on an
   operating system or set of libraries that cannot be used in the
   cloud, or cannot be virtualized, that is a problem.

Connectivity
   Self-contained applications, or those that depend on resources that
   are not reachable by the cloud in question, will not run. In some
   situations, you can work around these issues with custom network
   setup, but how well this works depends on the chosen cloud
   environment.

Durability and resilience
   Despite the existence of SLAs, things break: servers go down,
   network connections are disrupted, or too many projects on a server
   make a server unusable. An application must be sturdy enough to
   contend with these issues.
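The following sketch shows one common way an application can contend
with these issues: retrying a transient failure with exponential
backoff instead of crashing. The ``flaky_service`` function stands in
for any real network call and is purely illustrative.

.. code-block:: python

   # Retry with exponential backoff: a basic durability pattern for
   # cloud applications facing transient failures.
   import random
   import time

   def flaky_service():
       """Placeholder for a network call that sometimes fails."""
       if random.random() < 0.5:
           raise ConnectionError("transient failure")
       return "ok"

   def call_with_retries(func, attempts=5, base_delay=0.5):
       for attempt in range(attempts):
           try:
               return func()
           except ConnectionError:
               if attempt == attempts - 1:
                   raise
               # Sleep 0.5s, 1s, 2s, ... plus jitter to avoid
               # synchronized retry storms.
               time.sleep(base_delay * 2 ** attempt + random.random() * 0.1)

   print(call_with_retries(flaky_service))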
Designing for the cloud
-----------------------

Here are some guidelines to keep in mind when designing an application
for the cloud (a sketch illustrating several of them follows this list):

* Be a pessimist: Assume everything fails and design backwards.

* Put your eggs in multiple baskets: Leverage multiple providers,
  geographic regions and availability zones to accommodate for local
  availability issues. Design for portability.

* Think efficiency: Inefficient designs will not scale. Efficient
  designs become cheaper as they scale. Kill off unneeded components or
  capacity.

* Be paranoid: Design for defense in depth and zero tolerance by
  building in security at every level and between every component.
  Trust no one.

* But not too paranoid: Not every application needs the platinum
  solution. Architect for different SLAs, service tiers, and security
  levels.

* Manage the data: Data is usually the most inflexible and complex area
  of a cloud and cloud integration architecture. Do not short change
  the effort in analyzing and addressing data needs.

* Hands off: Leverage automation to increase consistency and quality
  and reduce response times.

* Divide and conquer: Pursue partitioning and parallel layering
  wherever possible. Make components as small and portable as possible.
  Use load balancing between layers.

* Think elasticity: Increasing resources should result in a
  proportional increase in performance and scalability. Decreasing
  resources should have the opposite effect.

* Be dynamic: Enable dynamic configuration changes such as auto
  scaling, failure recovery, and resource discovery to adapt to changing
  environments, faults, and workload volumes.

* Stay close: Reduce latency by moving highly interactive components
  and data near each other.

* Keep it loose: Loose coupling, service interfaces, separation of
  concerns, abstraction, and well-defined APIs deliver flexibility.

* Be cost aware: Autoscaling, data transmission, virtual software
  licenses, reserved instances, and similar costs can rapidly increase
  monthly usage charges. Monitor usage closely.
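A minimal sketch of the "be dynamic" and "think elasticity" guidelines:
a scaling policy that adds or removes workers based on observed load.
The thresholds, capacities, and the source of the load figures are
hypothetical; a real deployment would drive this from a telemetry
service.

.. code-block:: python

   # Toy autoscaling policy: keep per-worker utilization inside a band.
   def plan_scaling(current_workers, total_load,
                    low_water=0.3, high_water=0.7, capacity_per_worker=100):
       """Return the desired worker count for the observed load."""
       utilization = total_load / (current_workers * capacity_per_worker)
       if utilization > high_water:
           # Scale out proportionally to the overload.
           return max(current_workers + 1,
                      round(total_load / (high_water * capacity_per_worker)))
       if utilization < low_water and current_workers > 1:
           return current_workers - 1
       return current_workers

   print(plan_scaling(4, 350))  # 5 -> scale out
   print(plan_scaling(4, 100))  # 3 -> scale in
   print(plan_scaling(4, 200))  # 4 -> steady

The point is not the arithmetic but the shape: scaling decisions are a
pure function of observed meters, which makes them automatable and
testable.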
@ -1,15 +0,0 @@

============
Introduction
============

.. toctree::
   :maxdepth: 2

   introduction-intended-audience.rst
   introduction-how-this-book-is-organized.rst
   introduction-how-this-book-was-written.rst
   introduction-methodology.rst

:term:`OpenStack` is a fully-featured, self-service cloud. This book takes you
through some of the considerations you have to make when designing your
cloud.
@ -1,254 +0,0 @@

===============================
Security and legal requirements
===============================

This chapter discusses the legal and security requirements you
need to consider for the different OpenStack scenarios.

Legal requirements
~~~~~~~~~~~~~~~~~~

Many jurisdictions have legislative and regulatory
requirements governing the storage and management of data in
cloud environments. Common areas of regulation include:

* Data retention policies ensuring storage of persistent data
  and records management to meet data archival requirements.
* Data ownership policies governing the possession and
  responsibility for data.
* Data sovereignty policies governing the storage of data in
  foreign countries or otherwise separate jurisdictions.
* Data compliance policies governing certain types of
  information that must reside in certain locations due to
  regulatory issues, and more importantly, must not reside in
  other locations for the same reason.

Examples of such legal frameworks include the
`data protection framework <http://ec.europa.eu/justice/data-protection/>`_
of the European Union and the requirements of the
`Financial Industry Regulatory Authority
<http://www.finra.org/Industry/Regulation/FINRARules/>`_
in the United States.
Consult a local regulatory body for more information.

.. _security:

Security
~~~~~~~~

When deploying OpenStack in an enterprise as a private cloud,
despite activating a firewall and binding employees with security
agreements, cloud architecture should not make assumptions about
safety and protection.
In addition to considering the users, operators, and administrators
who will use the environment, consider also negative or hostile users who
would attack or compromise the security of your deployment regardless
of firewalls or security agreements.

Attack vectors increase further in a public-facing OpenStack deployment.
For example, the API endpoints and the software behind them become
vulnerable to hostile entities attempting to gain unauthorized access
or prevent access to services.
This can result in loss of reputation and you must protect against
it through auditing and appropriate filtering.

It is important to understand that user authentication requests
contain sensitive information such as user names, passwords, and
authentication tokens. For this reason, place the API services
behind hardware that performs SSL termination.
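As a quick sanity check that an endpoint really is behind TLS
termination with a valid certificate, something like the following can
be run from any client; the host and port are hypothetical examples.

.. code-block:: python

   # Verify that an API endpoint presents a valid TLS certificate.
   import socket
   import ssl

   HOST, PORT = "api.cloud.example.com", 443  # hypothetical endpoint

   context = ssl.create_default_context()  # verifies cert and hostname
   with socket.create_connection((HOST, PORT), timeout=10) as sock:
       with context.wrap_socket(sock, server_hostname=HOST) as tls:
           cert = tls.getpeercert()
           print("TLS version:", tls.version())
           print("Expires:", cert["notAfter"])

If the certificate is invalid or the hostname does not match, the
handshake raises ``ssl.SSLCertVerificationError``, which is exactly the
failure you want to see before your users do.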
.. warning::

   Be mindful of consistency when utilizing third party
   clouds to explore authentication options.

Security domains
~~~~~~~~~~~~~~~~

A security domain comprises users, applications, servers, or networks
that share common trust requirements and expectations within a system.
Typically, security domains have the same authentication and
authorization requirements and users.

You can map security domains individually to the installation,
or combine them. For example, some deployment topologies combine both
guest and data domains onto one physical network.
In other cases these networks are physically separate.
Map out the security domains against your specific OpenStack
deployment topology.
The domains and their trust requirements depend on whether the cloud
instance is public, private, or hybrid.

Public security domains
-----------------------

The public security domain is an untrusted area of the cloud
infrastructure. It can refer to the internet as a whole or simply
to networks over which the user has no authority.
Always consider this domain untrusted. For example,
in a hybrid cloud deployment, any information traversing between and
beyond the clouds is in the public domain and untrustworthy.

Guest security domains
----------------------

Typically used for compute instance-to-instance traffic, the
guest security domain handles compute data generated by
instances on the cloud but not services that support the
operation of the cloud, such as API calls. Public cloud
providers and private cloud providers who do not have
stringent controls on instance use or who allow unrestricted
internet access to instances should consider this domain to be
untrusted. Private cloud providers may want to consider this
network as internal and therefore trusted only if they have
controls in place to assert that they trust instances and all
their projects.

Management security domains
---------------------------

The management security domain is where services interact.
The networks in this domain transport confidential data such as
configuration parameters, user names, and passwords. Trust this
domain when it is behind an organization's firewall in deployments.

Data security domains
---------------------

The data security domain is concerned primarily with
information pertaining to the storage services within OpenStack.
The data that crosses this network has integrity and
confidentiality requirements. Depending on the type of deployment there
may also be availability requirements. The trust level of this network
is heavily dependent on deployment decisions and does not have a default
level of trust.

Hypervisor security
~~~~~~~~~~~~~~~~~~~

The hypervisor also requires a security assessment. In a
public cloud, organizations typically do not have control
over the choice of hypervisor. Properly securing your
hypervisor is important. An attack made upon an
unsecured hypervisor is called a **hypervisor breakout**.
Hypervisor breakout describes the event of a
compromised or malicious instance breaking out of the resource
controls of the hypervisor and gaining access to the bare
metal operating system and hardware resources.

Hypervisor security is not an issue if the security of instances is
not important. However, enterprises that need to avoid this
vulnerability can only do so by avoiding situations in which their
instances run on shared hardware in a public cloud. That does not mean
that there is a need to own all of the infrastructure on which an
OpenStack installation operates; it means avoiding situations in which
hardware is shared with others.

Baremetal security
~~~~~~~~~~~~~~~~~~

There are other services worth considering that provide a
bare metal instance instead of a cloud. In other cases, it is
possible to replicate a second private cloud by integrating
with a private Cloud-as-a-Service deployment. The
organization does not buy the hardware, but also does not share it
with other projects. It is also possible to use a provider that
hosts a bare-metal public cloud instance for which the
hardware is dedicated only to one customer, or a provider that
offers private Cloud-as-a-Service.

.. important::

   Each cloud implements services differently.
   What keeps data secure in one cloud may not do the same in another.
   Be sure to know the security requirements of every cloud that
   handles the organization's data or workloads.

More information on OpenStack Security can be found in the
`OpenStack Security Guide <https://docs.openstack.org/security-guide>`_.

Networking security
~~~~~~~~~~~~~~~~~~~

Consider security implications and requirements before designing the
physical and logical network topologies. Make sure that the networks are
properly segregated and traffic flows are going to the correct
destinations without crossing through locations that are undesirable.
Consider the following example factors:

* Firewalls
* Overlay interconnects for joining separated project networks
* Routing through or avoiding specific networks

How networks attach to hypervisors can expose security
vulnerabilities. To limit the impact of hypervisor breakouts,
separate networks from other systems and schedule instances for the
network onto dedicated compute nodes. This prevents attackers
from having access to the networks from a compromised instance.

Multi-site security
~~~~~~~~~~~~~~~~~~~

Securing a multi-site OpenStack installation brings
extra challenges. Projects may expect a project-created network
to be secure. In a multi-site installation the use of a
non-private connection between sites may be required. This may
mean that traffic would be visible to third parties and, in
cases where an application requires security, this issue
requires mitigation. In these instances, install a VPN or
encrypted connection between sites to conceal sensitive traffic.

Another security consideration with regard to multi-site
deployments is Identity. Centralize authentication within a
multi-site deployment. Centralization provides a
single authentication point for users across the deployment,
as well as a single point of administration for traditional
create, read, update, and delete operations. Centralized
authentication is also useful for auditing purposes because
all authentication tokens originate from the same source.
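With a centralized Identity service, one credential set spans all
sites; clients simply select a region. A brief hedged sketch using the
``openstacksdk`` library, where the cloud entry comes from a local
``clouds.yaml`` and the region names are hypothetical:

.. code-block:: python

   # One Identity service, many regions: authenticate with the same
   # credentials, then list instances in each site.
   import openstack

   for region in ("site-east", "site-west"):  # hypothetical region names
       conn = openstack.connect(cloud="mycloud", region_name=region)
       servers = list(conn.compute.servers())
       print(f"{region}: {len(servers)} instances")

Because every token originates from the same Identity service, the audit
trail for both regions leads back to a single source, as described above.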
Just as projects in a single-site deployment need isolation
from each other, so do projects in multi-site installations.
The extra challenges in multi-site designs revolve around
ensuring that project networks function across regions.
OpenStack Networking (neutron) does not presently support
a mechanism to provide this functionality, therefore an
external system may be necessary to manage these mappings.
Project networks may contain sensitive information requiring
that this mapping be accurate and consistent to ensure that a
project in one site does not connect to a different project in
another site.

OpenStack components
~~~~~~~~~~~~~~~~~~~~

Most OpenStack installations require a bare minimum set of
pieces to function. These include OpenStack Identity
(keystone) for authentication, OpenStack Compute
(nova) for compute, OpenStack Image service (glance) for image
storage, OpenStack Networking (neutron) for networking, and
potentially an object store in the form of OpenStack Object
Storage (swift). Bringing multi-site into play also demands extra
components in order to coordinate between regions. A centralized
Identity service is necessary to provide the single authentication
point. A centralized dashboard is also recommended to provide a
single login point and a mapped experience to the API and CLI
options available. If needed, use a centralized Object Storage service,
installing the required swift proxy service alongside the Object
Storage service.

It may also be helpful to install a few extra options in
order to facilitate certain use cases. For instance,
installing DNS service may assist in automatically generating
DNS domains for each region with an automatically-populated
zone full of resource records for each instance. This
facilitates using DNS as a mechanism for determining which
region would be selected for certain applications.

Another useful tool for managing a multi-site installation
is Orchestration (heat). The Orchestration service allows
the use of templates to define a set of instances to be launched
together or for scaling existing sets.
It can set up matching or differentiated groupings based on regions.
For instance, if an application requires an equally balanced
number of nodes across sites, the same heat template can be used
to cover each site with small alterations to only the region name.
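A hedged sketch of that pattern using ``openstacksdk``: the same template
file is deployed once per region, with only the connection's region
changing. The cloud name, region names, template path, and stack
parameter are all hypothetical.

.. code-block:: python

   # Deploy one Heat template to every region of a multi-site cloud.
   import openstack

   REGIONS = ("site-east", "site-west")   # hypothetical region names
   TEMPLATE = "app-tier.yaml"             # hypothetical template file

   for region in REGIONS:
       conn = openstack.connect(cloud="mycloud", region_name=region)
       stack = conn.create_stack(
           f"app-tier-{region}",
           template_file=TEMPLATE,
           wait=True,        # block until the stack completes
           node_count=3,     # passed through as a template parameter
       )
       print(region, stack.id)

Keeping the template identical across sites and pushing all variation
into parameters is what makes an "equally balanced number of nodes"
requirement easy to audit.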
@ -1,85 +0,0 @@

Operational considerations
~~~~~~~~~~~~~~~~~~~~~~~~~~

In order to run efficiently at massive scale, automate as many of the
operational processes as possible. Automation includes the configuration of
provisioning, monitoring and alerting systems. Part of the automation process
includes the capability to determine when human intervention is required and
who should act. The objective is to decrease the ratio of operational staff to
running systems as much as possible in order to reduce maintenance costs. In a
massively scaled environment, it is very difficult for staff to give each
system individual care.

Configuration management tools such as Puppet and Chef enable operations staff
to categorize systems into groups based on their roles and thus create
configurations and system states that the provisioning system enforces.
Systems that fall out of the defined state due to errors or failures are
quickly removed from the pool of active nodes and replaced.

At large scale the resource cost of diagnosing failed individual systems is
far greater than the cost of replacement. It is more economical to replace the
failed system with a new system, provisioning and configuring it automatically
and adding it to the pool of active nodes. By automating tasks that are
labor-intensive, repetitive, and critical to operations, cloud operations
teams can work more efficiently because fewer resources are required for these
common tasks. Administrators are then free to tackle tasks that are not easy
to automate and that have longer-term impacts on the business, for example,
capacity planning.
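The replace-rather-than-diagnose policy is simple enough to express
directly. A minimal sketch, assuming hypothetical ``is_healthy``,
``deprovision``, and ``provision_replacement`` hooks supplied by the
deployment's own tooling:

.. code-block:: python

   # "Cattle" remediation loop: failed nodes are not repaired in place,
   # they are dropped from the pool and replaced automatically.
   def reconcile(pool, is_healthy, deprovision, provision_replacement):
       """Return a pool containing only healthy nodes, topped back up."""
       healthy, failed = [], []
       for node in pool:
           (healthy if is_healthy(node) else failed).append(node)
       for node in failed:
           deprovision(node)                        # remove from service
           healthy.append(provision_replacement())  # restore desired state
       return healthy

Run on a schedule, a loop like this keeps the active pool at its defined
state; humans are paged only when replacements themselves keep failing.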
The bleeding edge
-----------------

Running OpenStack at massive scale requires striking a balance between
stability and features. For example, it might be tempting to run an older
stable release branch of OpenStack to make deployments easier. However, when
running at massive scale, known issues that may be of some concern or only
have minimal impact in smaller deployments could become pain points. Recent
releases may address well-known issues. The OpenStack community can help
resolve reported issues by applying the collective expertise of the OpenStack
developers.

The number of organizations running at massive scale is a small proportion of
the OpenStack community, therefore it is important to share related issues
with the community and be a vocal advocate for resolving them. Some issues
only manifest when operating at large scale, and the number of organizations
able to duplicate and validate an issue is small, so it is important to
document and dedicate resources to their resolution.

In some cases, the resolution to the problem is ultimately to deploy a more
recent version of OpenStack. Alternatively, when you must resolve an issue in
a production environment where rebuilding the entire environment is not an
option, it is sometimes possible to deploy updates to specific underlying
components in order to resolve issues or gain significant performance
improvements. Although this may appear to expose the deployment to increased
risk and instability, in many cases the undiscovered issue itself poses the
greater risk.

We recommend building a development and operations organization that is
responsible for creating desired features, diagnosing and resolving issues,
and building the infrastructure for large scale continuous integration tests
and continuous deployment. This helps catch bugs early and makes deployments
faster and easier. In addition to development resources, we also recommend the
recruitment of experts in the fields of message queues, databases, distributed
systems, networking, cloud, and storage.

Growth and capacity planning
----------------------------

An important consideration in running at massive scale is projecting growth
and utilization trends in order to plan capital expenditures for the short and
long term. Gather utilization meters for compute, network, and storage, along
with historical records of these meters. While securing major anchor projects
can lead to rapid jumps in the utilization rates of all resources, the steady
adoption of the cloud inside an organization or by consumers in a public
offering also creates a steady trend of increased utilization.

Skills and training
-------------------

Projecting growth for storage, networking, and compute is only one aspect of a
growth plan for running OpenStack at massive scale. Growing and nurturing
development and operational staff is an additional consideration. Sending team
members to OpenStack conferences and meetup events, and encouraging active
participation in the mailing lists and committees, is a very important way to
maintain skills and forge relationships in the community. For a list of
OpenStack training providers in the marketplace, see the `OpenStack Marketplace
<https://www.openstack.org/marketplace/training/>`_.
@ -1,110 +0,0 @@

Technical considerations
~~~~~~~~~~~~~~~~~~~~~~~~

Repurposing an existing OpenStack environment to be massively scalable is a
formidable task. When building a massively scalable environment from the
ground up, ensure you build the initial deployment with the same principles
and choices that apply as the environment grows. For example, a good approach
is to deploy the first site as a multi-site environment. This enables you to
use the same deployment and segregation methods as the environment grows to
separate locations across dedicated links or wide area networks. In a
hyperscale cloud, scale trumps redundancy. Modify applications with this in
mind, relying on the scale and homogeneity of the environment to provide
reliability rather than redundant infrastructure provided by non-commodity
hardware solutions.

Infrastructure segregation
--------------------------

OpenStack services support massive horizontal scale. Be aware that this is
not the case for the entire supporting infrastructure. This is particularly a
problem for the database management systems and message queues that OpenStack
services use for data storage and remote procedure call communications.

Traditional clustering techniques typically provide high availability and some
additional scale for these environments. In the quest for massive scale,
however, you must take additional steps to relieve the performance pressure on
these components in order to prevent them from negatively impacting the
overall performance of the environment. Keep all the components in balance, so
that if the environment does reach its limits, every component is near maximum
capacity and no single component causes the failure prematurely.

Regions segregate completely independent installations linked only by an
Identity and Dashboard (optional) installation. Services have separate API
endpoints for each region, and include separate database and queue
installations. This exposes some awareness of the environment's fault domains
to users and gives them the ability to ensure some degree of application
resiliency, while also imposing the requirement to specify which region to
apply their actions to.

Environments operating at massive scale typically need their regions or sites
subdivided further without exposing the requirement to specify the failure
domain to the user. This provides the ability to further divide the
installation into failure domains while also providing a logical unit for
maintenance and the addition of new hardware. At hyperscale, instead of adding
single compute nodes, administrators can add entire racks or even groups of
racks at a time, with each new addition of nodes exposed via one of the
segregation concepts mentioned herein.

:term:`Cells <cell>` provide the ability to subdivide the compute portion of
an OpenStack installation, including regions, while still exposing a single
endpoint. Each region has an API cell along with a number of compute cells
where the workloads actually run. Each cell has its own database and message
queue setup (ideally clustered), providing the ability to subdivide the load
on these subsystems, improving overall performance.

Each compute cell provides a complete compute installation, complete with full
database and queue installations, scheduler, conductor, and multiple compute
hosts. The cells scheduler handles placement of user requests from the single
API endpoint to a specific cell from those available. The normal filter
scheduler then handles placement within the cell.

Unfortunately, Compute is the only OpenStack service that provides good
support for cells. In addition, cells do not adequately support some standard
OpenStack functionality such as security groups and host aggregates. Due to
their relative newness and specialized use, cells receive relatively little
testing in the OpenStack gate. Despite these issues, cells play an important
role in well-known OpenStack installations operating at massive scale, such as
those at CERN and Rackspace.

Host aggregates
---------------

Host aggregates enable partitioning of OpenStack Compute deployments into
logical groups for load balancing and instance distribution. You can also use
host aggregates to further partition an availability zone. Consider a cloud
which might use host aggregates to partition an availability zone into groups
of hosts that either share common resources, such as storage and network, or
have a special property, such as trusted computing hardware. You cannot target
host aggregates explicitly. Instead, select instance flavors that map to host
aggregate metadata. These flavors target host aggregates implicitly.
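The implicit targeting works by matching flavor extra specs against
aggregate metadata, in the manner of Compute's
``AggregateInstanceExtraSpecsFilter``. A simplified model of that match,
with hypothetical aggregate and flavor definitions:

.. code-block:: python

   # Simplified model of scheduling via host-aggregate metadata: a host
   # is eligible only if one of its aggregates satisfies the flavor's
   # extra specs.
   AGGREGATES = {
       "ssd-nodes": {"hosts": {"node1", "node2"},
                     "metadata": {"ssd": "true"}},
       "gpu-nodes": {"hosts": {"node3"},
                     "metadata": {"gpu": "true"}},
   }

   def eligible_hosts(flavor_extra_specs):
       hosts = set()
       for agg in AGGREGATES.values():
           if all(agg["metadata"].get(k) == v
                  for k, v in flavor_extra_specs.items()):
               hosts |= agg["hosts"]
       return hosts

   # A flavor carrying {"ssd": "true"} lands only on the SSD aggregate.
   print(sorted(eligible_hosts({"ssd": "true"})))  # ['node1', 'node2']

The real filter also honors namespaced keys and comparison operators, but
the principle is the same: users pick a flavor, never a host aggregate.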
Availability zones
------------------

Availability zones provide another mechanism for subdividing an installation
or region. They are, in effect, host aggregates exposed for (optional)
explicit targeting by users.

Unlike cells, availability zones do not have their own database server or
queue broker but represent an arbitrary grouping of compute nodes. Typically,
nodes are grouped into availability zones using a shared failure domain based
on a physical characteristic such as a shared power source or physical network
connections. Users can target exposed availability zones; however, this is not
a requirement. An alternative approach is to set a default availability zone
so that instances are scheduled to a zone other than the default ``nova``
availability zone.

Segregation example
-------------------

In this example, the cloud is divided into two regions, an API cell and
three child cells for each region, with three availability zones in each
cell based on the power layout of the data centers.
The figure below describes the relationship between them within one region.

.. figure:: figures/Massively_Scalable_Cells_regions_azs.png

A number of host aggregates enable targeting of virtual machine instances
using flavors that require special capabilities shared by the target hosts,
such as SSDs, 10 GbE networks, or GPU cards.
@ -1,91 +0,0 @@

User requirements
~~~~~~~~~~~~~~~~~

Defining user requirements for a massively scalable OpenStack design
architecture dictates approaching the design from two different, yet sometimes
opposing, perspectives: the cloud user, and the cloud operator. The
expectations and perceptions of the consumption and management of resources of
a massively scalable OpenStack cloud from these two perspectives are
distinctly different.

Massively scalable OpenStack clouds have the following user requirements:

* The cloud user expects repeatable, dependable, and deterministic processes
  for launching and deploying cloud resources. You could deliver this through
  a web-based interface or publicly available API endpoints. All appropriate
  options for requesting cloud resources must be available through some type
  of user interface, a command-line interface (CLI), or API endpoints.

* Cloud users expect a fully self-service and on-demand consumption model.
  When an OpenStack cloud reaches the massively scalable size, expect
  consumption as a service in each and every way.

* For a user of a massively scalable OpenStack public cloud, there are no
  expectations for control over security, performance, or availability. Users
  expect only SLAs related to uptime of API services, and very basic SLAs for
  services offered. It is the user's responsibility to address these issues on
  their own. The exception to this expectation is the rare case of a massively
  scalable cloud infrastructure built for a private or government organization
  that has specific requirements.

The cloud user's requirements and expectations that determine the cloud design
focus on the consumption model. The user expects to consume cloud resources in
an automated and deterministic way, without any need for knowledge of the
capacity, scalability, or other attributes of the cloud's underlying
infrastructure.

Operator requirements
---------------------

While the cloud user can be completely unaware of the underlying
infrastructure of the cloud and its attributes, the operator must build and
support the infrastructure for operating at scale. This presents a very
demanding set of requirements for building such a cloud from the operator's
perspective:

* Everything must be capable of automation: everything from the compute
  hardware, storage hardware, and networking hardware, to the installation
  and configuration of the supporting software. Manual processes are
  impractical in a massively scalable OpenStack design architecture.

* The cloud operator requires that capital expenditure (CapEx) is minimized at
  all layers of the stack. Operators of massively scalable OpenStack clouds
  require the use of dependable commodity hardware and freely available open
  source software components to reduce deployment costs and operational
  expenses. Initiatives like OpenCompute (more information available at the
  `Open Compute Project <http://www.opencompute.org>`_ website)
  provide additional information and pointers. To cut costs, many operators
  sacrifice redundancy, for example, redundant power supplies, network
  connections, and rack switches.

* Companies operating a massively scalable OpenStack cloud also require that
  operational expenditures (OpEx) be minimized as much as possible. We
  recommend using cloud-optimized hardware when managing operational overhead.
  Some of the factors to consider include power, cooling, and the physical
  design of the chassis. Through customization, it is possible to optimize the
  hardware and systems for this type of workload because of the scale of these
  implementations.

* Massively scalable OpenStack clouds require extensive metering and
  monitoring functionality to maximize operational efficiency by keeping
  the operator informed about the status and state of the infrastructure. This
  includes full-scale metering of the hardware and software status. A
  corresponding framework of logging and alerting is also required to store
  and enable operations to act on the meters provided by the metering and
  monitoring solutions. The cloud operator also needs a solution that uses the
  data provided by the metering and monitoring solution to provide capacity
  planning and capacity trending analysis.

* Invariably, massively scalable OpenStack clouds extend over several sites.
  Therefore, the user-operator requirements for a multi-site OpenStack
  architecture design are also applicable here. This includes various legal
  requirements; other jurisdictional legal or compliance requirements; image
  consistency and availability; storage replication and availability (both
  block and file/object storage); and authentication, authorization, and
  auditing (AAA). See :doc:`multi-site` for more details on requirements and
  considerations for multi-site OpenStack clouds.

* The design architecture of a massively scalable OpenStack cloud must address
  considerations around physical facilities such as space, floor weight, rack
  height and type, environmental considerations, power usage and power usage
  efficiency (PUE), and physical security.
@ -1,57 +0,0 @@

==================
Massively scalable
==================

.. toctree::
   :maxdepth: 2

   massively-scalable-user-requirements.rst
   massively-scalable-technical-considerations.rst
   massively-scalable-operational-considerations.rst

A massively scalable architecture is a cloud implementation
that is either a very large deployment, such as a commercial
service provider might build, or one that has the capability
to support user requests for large amounts of cloud resources.

An example is an infrastructure in which requests to service
500 or more instances at a time are common. A massively scalable
infrastructure fulfills such a request without exhausting the
available cloud infrastructure resources. While the high capital
cost of implementing such a cloud architecture means that it
is currently in limited use, many organizations are planning for
massive scalability in the future.

A massively scalable OpenStack cloud design presents a unique
set of challenges and considerations. For the most part it is
similar to a general purpose cloud architecture, as it is built
to address a non-specific range of potential use cases or
functions. Typically, it is rare that particular workloads determine
the design or configuration of massively scalable clouds. The
massively scalable cloud is most often built as a platform for
a variety of workloads. Because private organizations rarely
require or have the resources for them, massively scalable
OpenStack clouds are generally built as commercial, public
cloud offerings.

Services provided by a massively scalable OpenStack cloud
include:

* Virtual-machine disk image library
* Raw block storage
* File or object storage
* Firewall functionality
* Load balancing functionality
* Private (non-routable) and public (floating) IP addresses
* Virtualized network topologies
* Software bundles
* Virtual compute resources

Like a general purpose cloud, the instances deployed in a
massively scalable OpenStack cloud do not necessarily use
any specific aspect of the cloud offering (compute, network, or storage).
As the cloud grows in scale, the number of workloads can cause
stress on all the cloud components. This adds further stresses
to supporting infrastructure such as databases and message brokers.
The architecture design for such a cloud must account for these
performance pressures without negatively impacting user experience.
@ -1,118 +0,0 @@

============
Architecture
============

:ref:`ms-openstack-architecture` illustrates a high level multi-site
OpenStack architecture. Each site is an OpenStack cloud, but the sites
may need to run different versions of OpenStack. For example,
if the second site is intended to be a replacement for the first site,
the two sites would run different versions. Another common design would
be a private OpenStack cloud with a replicated site that would be used
for high availability or disaster recovery. The most important design
decision is configuring storage as a single shared pool or separate
pools, depending on user and technical requirements.

.. _ms-openstack-architecture:

.. figure:: figures/Multi-Site_shared_keystone_horizon_swift1.png

   **Multi-site OpenStack architecture**

OpenStack services architecture
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The Identity service, which is used by all other OpenStack components
for authorization and the catalog of service endpoints, supports the
concept of regions. A region is a logical construct used to group
OpenStack services in close proximity to one another. The concept of
regions is flexible; it may contain OpenStack service endpoints located
within a distinct geographic region or regions. It may be smaller in
scope, where a region is a single rack within a data center, with
multiple regions existing in adjacent racks in the same data center.

The majority of OpenStack components are designed to run within the
context of a single region. The Compute service is designed to manage
compute resources within a region, with support for subdivisions of
compute resources by using availability zones and cells. The Networking
service can be used to manage network resources in the same broadcast
domain or collection of switches that are linked. The OpenStack Block
Storage service controls storage resources within a region with all
storage resources residing on the same storage network. Like the
OpenStack Compute service, the OpenStack Block Storage service also
supports the availability zone construct which can be used to subdivide
storage resources.

The OpenStack dashboard, OpenStack Identity, and OpenStack Object
Storage services are components that can each be deployed centrally in
order to serve multiple regions.

Storage
~~~~~~~

With multiple OpenStack regions, it is recommended to configure a single
OpenStack Object Storage service endpoint to deliver shared object storage
for all regions. The Object Storage service internally replicates files
to multiple nodes, which can be used by applications or workloads in
multiple regions. This simplifies high availability failover and
disaster recovery rollback.

In order to scale the Object Storage service to meet the workload of
multiple regions, multiple proxy workers are run and load-balanced,
storage nodes are installed in each region, and the entire Object
Storage service can be fronted by an HTTP caching layer. This is done so
client requests for objects can be served out of caches rather than
directly from the storage nodes themselves, reducing the actual load
on the storage network. In addition to an HTTP caching layer, use a
caching layer like Memcache to cache objects between the proxy and
storage nodes.

If the cloud is designed with a separate Object Storage service endpoint
made available in each region, applications are required to handle
synchronization (if desired) and other management operations to ensure
consistency across the nodes. For some applications, having multiple
Object Storage service endpoints located in the same region as the
application may be desirable due to reduced latency, cross-region
bandwidth, and ease of deployment.

.. note::

   For the Block Storage service, the most important decisions are the
   selection of the storage technology, and whether a dedicated network
   is used to carry storage traffic from the storage service to the
   compute nodes.

Networking
~~~~~~~~~~

When connecting multiple regions together, there are several design
considerations. The overlay network technology choice determines how
packets are transmitted between regions and how the logical network and
addresses present to the application. If there are security or
regulatory requirements, encryption should be implemented to secure the
traffic between regions. For networking inside a region, the overlay
network technology for project networks is equally important. The overlay
technology and the network traffic that an application generates or
receives can be either complementary or serve cross purposes. For
example, using an overlay technology for an application that transmits a
large amount of small packets could add excessive latency or overhead to
each packet if not configured properly.

Dependencies
~~~~~~~~~~~~

The architecture for a multi-site OpenStack installation is dependent on
a number of factors. One major dependency to consider is storage. When
designing the storage system, the storage mechanism needs to be
determined. Once the storage type is determined, how it is accessed is
critical. For example, we recommend that storage should use a dedicated
network. Another concern is how the storage is configured to protect the
data, for example, the Recovery Point Objective (RPO) and the Recovery
Time Objective (RTO). How quickly recovery from a fault can be
completed determines how often the replication of data is required.
Ensure that enough storage is allocated to support the data protection
strategy.

Networking decisions include the encapsulation mechanism that can be
used for the project networks, how large the broadcast domains should be,
and the contracted SLAs for the interconnects.
@ -1,156 +0,0 @@

==========================
Operational considerations
==========================

Multi-site OpenStack cloud deployment using regions requires that the
service catalog contains per-region entries for each service deployed
other than the Identity service. Most off-the-shelf OpenStack deployment
tools have limited support for defining multiple regions in this
fashion.

Deployers should be aware of this and provide the appropriate
customization of the service catalog for their site either manually, or
by customizing deployment tools in use.
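Defining those per-region entries manually amounts to registering one
endpoint per service per region. A hedged sketch with ``openstacksdk``;
the cloud entry, per-region URLs, and region names are hypothetical:

.. code-block:: python

   # Register a Compute endpoint in each region's service catalog.
   import openstack

   conn = openstack.connect(cloud="mycloud")  # admin credentials assumed

   ENDPOINTS = {  # hypothetical per-region API URLs
       "site-east": "https://east.cloud.example.com:8774/v2.1",
       "site-west": "https://west.cloud.example.com:8774/v2.1",
   }

   compute = conn.identity.find_service("nova")
   for region, url in ENDPOINTS.items():
       conn.identity.create_endpoint(
           service_id=compute.id,
           interface="public",
           url=url,
           region_id=region,
       )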
|
|
||||||
|
|
||||||

.. note::

   As of the Kilo release, documentation for implementing this feature
   is in progress. See this bug for more information:
   https://bugs.launchpad.net/openstack-manuals/+bug/1340509.

Licensing
~~~~~~~~~

Multi-site OpenStack deployments present additional licensing
considerations over and above regular OpenStack clouds, particularly
where site licenses are in use to provide cost efficient access to
software licenses. The licensing for host operating systems, guest
operating systems, OpenStack distributions (if applicable),
software-defined infrastructure including network controllers and
storage systems, and even individual applications needs to be evaluated.

Topics to consider include:

* The definition of what constitutes a site in the relevant licenses,
  as the term does not necessarily denote a geographic or otherwise
  physically isolated location.

* Differentiations between "hot" (active) and "cold" (inactive) sites,
  where significant savings may be made in situations where one site is
  a cold standby for disaster recovery purposes only.

* Certain locations might require local vendors to provide support and
  services for each site, which may vary with the licensing agreement in
  place.

Logging and monitoring
~~~~~~~~~~~~~~~~~~~~~~

Logging and monitoring does not significantly differ for a multi-site
OpenStack cloud. The tools described in the `Logging and monitoring
chapter <https://docs.openstack.org/ops-guide/ops-logging-monitoring.html>`__
of the OpenStack Operations Guide remain applicable. Logging and monitoring
can be provided on a per-site basis, and in a common centralized location.

When attempting to deploy logging and monitoring facilities to a
centralized location, care must be taken with the load placed on the
inter-site networking links.

Upgrades
~~~~~~~~

In multi-site OpenStack clouds deployed using regions, sites are
independent OpenStack installations which are linked together using
shared centralized services such as OpenStack Identity. At a high level,
the recommended order of operations to upgrade an individual OpenStack
environment is (see the `Upgrades
chapter <https://docs.openstack.org/ops-guide/ops-upgrades.html>`__
of the OpenStack Operations Guide for details):

#. Upgrade the OpenStack Identity service (keystone).

#. Upgrade the OpenStack Image service (glance).

#. Upgrade OpenStack Compute (nova), including networking components.

#. Upgrade OpenStack Block Storage (cinder).

#. Upgrade the OpenStack dashboard (horizon).

The process for upgrading a multi-site environment is not significantly
different:

#. Upgrade the shared OpenStack Identity service (keystone) deployment.

#. Upgrade the OpenStack Image service (glance) at each site.

#. Upgrade OpenStack Compute (nova), including networking components, at
   each site.

#. Upgrade OpenStack Block Storage (cinder) at each site.

#. Upgrade the OpenStack dashboard (horizon), at each site or in the
   single central location if it is shared.
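
A sketch of how the per-site steps might be scripted is shown below;
``upgrade-service`` is a hypothetical wrapper around your deployment
tooling, and the site names are placeholders:

.. code-block:: console

   # "upgrade-service" is a hypothetical helper, not an OpenStack command.
   $ for site in site1 site2 site3; do
   >   ssh deploy@$site "upgrade-service glance && upgrade-service nova"
   > done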

Compute upgrades within each site can also be performed in a rolling
fashion. Compute controller services (API, Scheduler, and Conductor) can
be upgraded prior to upgrading individual compute nodes. This allows
operations staff to keep a site operational for users of Compute
services while performing an upgrade.

Quota management
~~~~~~~~~~~~~~~~

Quotas are used to set operational limits to prevent system capacities
from being exhausted without notification. They are currently enforced
at the project level rather than at the user level.

Quotas are defined on a per-region basis. Operators can define identical
quotas for projects in each region of the cloud to provide a consistent
experience, or even create a process for synchronizing allocated quotas
across regions. It is important to note that only the operational limits
imposed by the quotas will be aligned; consumption of quotas by users
will not be reflected between regions.

For example, given a cloud with two regions, if the operator grants a
user a quota of 25 instances in each region then that user may launch a
total of 50 instances spread across both regions. They may not, however,
launch more than 25 instances in any single region.
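
A sketch of setting an identical quota in every region; the region and
project names are placeholders:

.. code-block:: console

   $ for region in RegionOne RegionTwo; do
   >   openstack --os-region-name $region quota set --instances 25 demo
   > done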

For more information on managing quotas refer to the `Managing projects
and users
chapter <https://docs.openstack.org/ops-guide/ops-projects-users.html>`__
of the OpenStack Operations Guide.

Policy management
~~~~~~~~~~~~~~~~~

OpenStack provides a default set of Role Based Access Control (RBAC)
policies, defined in a ``policy.json`` file, for each service. Operators
edit these files to customize the policies for their OpenStack
installation. If the application of consistent RBAC policies across
sites is a requirement, then it is necessary to ensure proper
synchronization of the ``policy.json`` files to all installations.

This must be done using system administration tools such as rsync, as
functionality for synchronizing policies across regions is not currently
provided within OpenStack.
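
For example, a policy change made at the primary site could be pushed to
another site with a plain ``rsync`` invocation; the service, path, and
host name are placeholders:

.. code-block:: console

   $ rsync -av /etc/nova/policy.json site2.example.com:/etc/nova/policy.json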

Documentation
~~~~~~~~~~~~~

Users must be able to leverage cloud infrastructure and provision new
resources in the environment. It is important that user documentation is
accessible to users to ensure they are given sufficient information to
help them leverage the cloud. As an example, by default OpenStack
schedules instances on a compute node automatically. However, when
multiple regions are available, the end user needs to decide in which
region to schedule the new instance. The dashboard presents the user
with the first region in your configuration. The API and CLI tools do
not execute commands unless a valid region is specified. It is therefore
important to provide documentation to your users describing the region
layout as well as calling out that quotas are region-specific. If a user
reaches their quota in one region, OpenStack does not automatically
build new instances in another. Documenting specific examples helps
users understand how to operate the cloud, thereby reducing calls and
tickets filed with the help desk.
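
User documentation might, for example, show how to target a region
explicitly from the CLI; the region, image, flavor, and network names
are placeholders:

.. code-block:: console

   $ openstack --os-region-name RegionTwo server create \
     --image cirros --flavor m1.small --network private web-01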
@ -1,192 +0,0 @@
=====================
Prescriptive examples
=====================

There are multiple ways to build a multi-site OpenStack installation,
based on the needs of the intended workloads. Below are example
architectures based on different requirements. These examples are meant
as a reference, and not a hard and fast rule for deployments. Use the
previous sections of this chapter to assist in selecting specific
components and implementations based on specific needs.

A large content provider needs to deliver content to customers that are
geographically dispersed. The workload is very sensitive to latency and
needs a rapid response to end-users. After reviewing the user, technical,
and operational considerations, it is determined beneficial to build a
number of regions local to the customer's edge. Rather than build a few
large, centralized data centers, the intent of the architecture is to
provide a pair of small data centers in locations that are closer to the
customer. In this use case, spreading applications out allows for a
different type of horizontal scaling than a traditional compute workload
requires. The intent is to scale by creating more copies of the
application in closer proximity to the users that need it most, in order
to ensure faster response time to user requests. This provider deploys
two data centers at each of the four chosen regions. The implications of
this design are based around the method of placing copies of resources
in each of the remote regions. Swift objects, Glance images, and block
storage need to be manually replicated into each region. This may be
beneficial for some systems, such as the case of content service, where
only some of the content needs to exist in some but not all regions. A
centralized Keystone is recommended to provide a single point of
authentication and easily manageable access to the API endpoints.

It is recommended that you install an automated DNS system such as
Designate. Application administrators need a way to manage the mapping
of which application copy exists in each region and how to reach it,
unless an external Dynamic DNS system is available. Designate assists by
making the process automatic and by populating the records in each
region's zone.
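
A sketch of what record management with Designate could look like from
the CLI; the zone, email, and record values are illustrative only:

.. code-block:: console

   $ openstack zone create --email admin@example.com app.example.com.
   $ openstack recordset create --type A --record 203.0.113.10 \
     app.example.com. edge1.app.example.com.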

Telemetry for each region is also deployed, as each region may grow
differently or be used at a different rate. Ceilometer collects each
region's meters from each of the controllers and reports them back to a
central location. This is useful both to the end user and the
administrator of the OpenStack environment. The end user will find this
method useful, as it makes it possible to determine if certain locations
are experiencing higher load than others, and take appropriate action.
Administrators also benefit by possibly being able to forecast growth
per region, rather than expanding the capacity of all regions
simultaneously, therefore maximizing the cost-effectiveness of the
multi-site design.

One of the key decisions of running this infrastructure is whether or
not to provide a redundancy model. Two types of redundancy and high
availability models can be implemented in this configuration. The first
type is the availability of central OpenStack components. Keystone can
be made highly available in three central data centers that host the
centralized OpenStack components. This prevents a loss of any one of the
regions causing an outage in service. It also has the added benefit of
being able to run a central storage repository as a primary cache for
distributing content to each of the regions.

The second redundancy type is the edge data center itself. A second data
center in each of the edge regional locations houses a second region near
the first region. This ensures that the application does not suffer
degraded performance in terms of latency and availability.

:ref:`ms-customer-edge` depicts the solution designed to have both a
centralized set of core data centers for OpenStack services and paired edge
data centers:

.. _ms-customer-edge:

.. figure:: figures/Multi-Site_Customer_Edge.png

   **Multi-site architecture example**

Geo-redundant load balancing
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

A large-scale web application has been designed with cloud principles in
mind. The application is designed to provide service to an application
store on a 24/7 basis. The company has a typical two-tier architecture
with a web front-end servicing the customer requests, and a NoSQL
database back end storing the information.

Lately, there have been several outages at a number of major public
cloud providers due to applications running out of a single geographical
location. The design therefore should mitigate the chance of a single
site causing an outage for the business.

The solution would consist of the following OpenStack components:

* A firewall, switches, and load balancers on the public facing network
  connections.

* OpenStack Controller services running Networking, dashboard, Block
  Storage, and Compute locally in each of the three regions. The
  Identity service, Orchestration service, Telemetry service, Image
  service, and Object Storage service can be installed centrally, with
  nodes in each of the regions providing a redundant OpenStack
  Controller plane throughout the globe.

* OpenStack compute nodes running the KVM hypervisor.

* OpenStack Object Storage for serving static objects such as images
  can be used to ensure that all images are standardized across all the
  regions, and replicated on a regular basis.

* A distributed DNS service available to all regions that allows for
  dynamic update of DNS records of deployed instances.

* A geo-redundant load balancing service can be used to service the
  requests from the customers based on their origin.

An autoscaling heat template can be used to deploy the application in
the three regions. This template includes:

* Web servers, running Apache.

* Appropriate ``user_data`` to populate the central DNS servers upon
  instance launch.

* Appropriate Telemetry alarms that maintain the state of the
  application and allow for handling of region or instance failure.
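
The same template can then be launched in every region with the
Orchestration CLI, as sketched below; the template file, stack, and
region names are placeholders:

.. code-block:: console

   $ for region in RegionOne RegionTwo RegionThree; do
   >   openstack --os-region-name $region stack create \
   >     -t autoscaling-web.yaml web-app-$region
   > done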

Another autoscaling Heat template can be used to deploy a distributed
MongoDB shard over the three locations, with the option of storing
required data on a globally available swift container. According to the
usage and load on the database server, additional shards can be
provisioned according to the thresholds defined in Telemetry.

Two regions would have been sufficient to meet the basic requirements,
but three regions are selected here to avoid abnormal load on a single
region in the event of a failure.

Orchestration is used because of the built-in functionality of
autoscaling and auto healing in the event of increased load. Additional
configuration management tools, such as Puppet or Chef, could also have
been used in this scenario, but were not chosen since Orchestration had
the appropriate built-in hooks into the OpenStack cloud, whereas the
other tools were external and not native to OpenStack. In addition,
external tools were not needed since this deployment scenario was
straightforward.

OpenStack Object Storage is used here to serve as a back end for the
Image service since it is the most suitable solution for a globally
distributed storage solution with its own replication mechanism.
Homegrown solutions could also have been used, including the handling of
replication, but were not chosen, because Object Storage is already an
integral part of the infrastructure and a proven solution.

An external load balancing service was used and not the LBaaS in
OpenStack because the solution in OpenStack is not redundant and does
not have any awareness of geo location.

.. _ms-geo-redundant:

.. figure:: figures/Multi-site_Geo_Redundant_LB.png

   **Multi-site geo-redundant architecture**

Location-local service
~~~~~~~~~~~~~~~~~~~~~~

A common use for multi-site OpenStack deployment is creating a Content
Delivery Network. An application that uses a location-local architecture
requires low network latency and proximity to the user to provide an
optimal user experience and reduce the cost of bandwidth and transit.
The content resides on sites closer to the customer, instead of a
centralized content store that requires utilizing higher cost
cross-country links.

This architecture includes a geo-location component that routes user
requests to the closest possible node. In this scenario, 100% redundancy
of content across every site is a goal rather than a requirement, with
the intent to maximize the amount of content available within a minimum
number of network hops for end users. Despite these differences, the
storage replication configuration has significant overlap with that of a
geo-redundant load balancing use case.

In :ref:`ms-shared-keystone`, a location-aware application utilizing
this multi-site OpenStack installation launches web server or content
serving instances on the compute cluster in each site. Requests from
clients are first sent to a global services load balancer that
determines the location of the client, then routes the request to the
closest OpenStack site where the application completes the request.

.. _ms-shared-keystone:

.. figure:: figures/Multi-Site_shared_keystone1.png

   **Multi-site shared keystone architecture**
@ -1,164 +0,0 @@
========================
Technical considerations
========================

There are many technical considerations to take into account with regard
to designing a multi-site OpenStack implementation. An OpenStack cloud
can be designed in a variety of ways to handle individual application
needs. A multi-site deployment has additional challenges compared to
single-site installations and is therefore a more complex solution.

When determining capacity options, be sure to take into account not just
the technical issues, but also the economic or operational issues that
might arise from specific decisions.

Inter-site link capacity describes the capabilities of the connectivity
between the different OpenStack sites. This includes parameters such as
bandwidth, latency, whether or not a link is dedicated, and any business
policies applied to the connection. The capability and number of the
links between sites determine what kind of options are available for
deployment. For example, if two sites have a pair of high-bandwidth
links available between them, it may be wise to configure a separate
storage replication network between the two sites to support a single
Swift endpoint and a shared Object Storage capability between them. An
example of this technique, as well as a configuration walk-through, is
available at `Dedicated replication network
<https://docs.openstack.org/developer/swift/replication_network.html#dedicated-replication-network>`_.
Another option in this scenario is to build a dedicated set of project
private networks across the secondary link, using overlay networks with
a third party mapping the site overlays to each other.
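
A sketch of how a dedicated replication network is expressed when
adding a device to a Swift ring; the IP addresses, ports, device, and
weight are illustrative values:

.. code-block:: console

   $ swift-ring-builder object.builder add \
     --region 1 --zone 1 --ip 203.0.113.10 --port 6000 \
     --replication-ip 10.0.0.10 --replication-port 6000 \
     --device sda1 --weight 100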

The capacity requirements of the links between sites are driven by
application behavior. If the link latency is too high, certain
applications that use a large number of small packets, for example RPC
calls, may encounter issues communicating with each other or operating
properly. Additionally, OpenStack may encounter similar types of issues.
To mitigate this, Identity service call timeouts can be tuned to prevent
issues authenticating against a central Identity service.

Another network capacity consideration for a multi-site deployment is
the amount and performance of overlay networks available for project
networks. If using shared project networks across zones, it is imperative
that an external overlay manager or controller be used to map these
overlays together. It is necessary to ensure that the number of possible
IDs between the zones is identical.

.. note::

   As of the Kilo release, OpenStack Networking was not capable of
   managing tunnel IDs across installations. So if one site runs out of
   IDs, but another does not, that project's network is unable to reach
   the other site.

Capacity can take other forms as well. The ability for a region to grow
depends on scaling out the number of available compute nodes. This topic
is covered in greater detail in the section for compute-focused
deployments. However, it may be necessary to grow cells in an individual
region, depending on the size of your cluster and the ratio of virtual
machines per hypervisor.

A third form of capacity comes in the multi-region-capable components of
OpenStack. Centralized Object Storage is capable of serving objects
through a single namespace across multiple regions. Since this works by
accessing the object store through the swift proxy, it is possible to
overload the proxies. There are two options available to mitigate this
issue:

* Deploy a large number of swift proxies. The drawback is that the
  proxies are not load-balanced and a large file request could
  continually hit the same proxy.

* Add a caching HTTP proxy and load balancer in front of the swift
  proxies. Since swift objects are returned to the requester via HTTP,
  this load balancer would alleviate the load required on the swift
  proxies.

Utilization
~~~~~~~~~~~

While constructing a multi-site OpenStack environment is the goal of
this guide, the real test is whether an application can utilize it.

The Identity service is normally the first interface for OpenStack users
and is required for almost all major operations within OpenStack.
Therefore, it is important that you provide users with a single URL for
Identity service authentication, and document the configuration of
regions within the Identity service. Each of the sites defined in your
installation is considered to be a region in Identity nomenclature. This
is important for users, as they must specify the region name when
directing actions to an API endpoint or in the dashboard.
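
Users can inspect the configured regions and the per-region endpoints
directly from the CLI; the exact output depends on the deployment:

.. code-block:: console

   $ openstack region list
   $ openstack catalog list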

Load balancing is another common issue with multi-site installations.
While it is still possible to run HAproxy instances with
Load-Balancer-as-a-Service, these are defined to a specific region. Some
applications can manage this using internal mechanisms. Other
applications may require the implementation of an external system,
including global services load balancers or anycast-advertised DNS.

Depending on the storage model chosen during site design, storage
replication and availability are also a concern for end-users. If an
application can support regions, then it is possible to keep the object
storage system separated by region. In this case, users who want to have
an object available to more than one region need to perform cross-site
replication. However, with a centralized swift proxy, the user may need
to benchmark the replication timing of the Object Storage back end.
Benchmarking allows the operational staff to provide users with an
understanding of the amount of time required for a stored or modified
object to become available to the entire environment.

Performance
~~~~~~~~~~~

Determining the performance of a multi-site installation involves
considerations that do not come into play in a single-site deployment.
Because a multi-site cloud is by nature a distributed deployment,
performance can be affected in situations that a single site never
encounters.

Since multi-site systems can be geographically separated, there may be
greater latency or jitter when communicating across regions. This can
especially impact systems like the OpenStack Identity service when
making authentication attempts from regions that do not contain the
centralized Identity implementation. It can also affect applications
which rely on Remote Procedure Call (RPC) for normal operation. An
example of this can be seen in high performance computing workloads.

Storage availability can also be impacted by the architecture of a
multi-site deployment. A centralized Object Storage service requires
more time for an object to be available to instances locally in regions
where the object was not created. Some applications may need to be tuned
to account for this effect. Block Storage does not currently have a
method for replicating data across multiple regions, so applications
that depend on available block storage need to manually cope with this
limitation by creating duplicate block storage entries in each region.

OpenStack components
~~~~~~~~~~~~~~~~~~~~

Most OpenStack installations require a bare minimum set of pieces to
function. These include OpenStack Identity (keystone) for
authentication, OpenStack Compute (nova) for compute, OpenStack Image
service (glance) for image storage, OpenStack Networking (neutron) for
networking, and potentially an object store in the form of OpenStack
Object Storage (swift). Deploying a multi-site installation also demands
extra components in order to coordinate between regions. A centralized
Identity service is necessary to provide the single authentication
point. A centralized dashboard is also recommended to provide a single
login point and a mapping to the API and CLI options available. A
centralized Object Storage service may also be used, but will require
the installation of the swift proxy service.

It may also be helpful to install a few extra options in order to
facilitate certain use cases. For example, installing Designate may
assist in automatically generating DNS domains for each region with an
automatically-populated zone full of resource records for each instance.
This facilitates using DNS as a mechanism for determining which region
will be selected for certain applications.

Another useful tool for managing a multi-site installation is
Orchestration (heat). The Orchestration service allows the use of
templates to define a set of instances to be launched together or for
scaling existing sets. It can also be used to set up matching or
differentiated groupings based on regions. For instance, if an
application requires an equally balanced number of nodes across sites,
the same heat template can be used to cover each site with small
alterations to only the region name.
@ -1,168 +0,0 @@
=================
User requirements
=================

Workload characteristics
~~~~~~~~~~~~~~~~~~~~~~~~

An understanding of the expected workloads for a desired multi-site
environment and use case is an important factor in the decision-making
process. In this context, ``workload`` refers to the way the systems are
used. A workload could be a single application or a suite of
applications that work together. It could also be a duplicate set of
applications that need to run in multiple cloud environments. Often in a
multi-site deployment, the same workload will need to work identically
in more than one physical location.

This multi-site scenario likely includes one or more of the other
scenarios in this book with the additional requirement of having the
workloads in two or more locations. The following paragraphs describe
some possible scenarios.

For many use cases the proximity of the user to their workloads has a
direct influence on the performance of the application and therefore
should be taken into consideration in the design. Certain applications
require zero to minimal latency that can only be achieved by deploying
the cloud in multiple locations. These locations could be in different
data centers, cities, countries, or geographical regions, depending on
the user requirement and location of the users.

Consistency of images and templates across different sites
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

It is essential that the deployment of instances is consistent across
the different sites and built into the infrastructure. If OpenStack
Object Storage is used as a back end for the Image service, it is
possible to create repositories of consistent images across multiple
sites. Having central endpoints with multiple storage nodes allows
consistent centralized storage for every site.

Not using a centralized object store increases the operational overhead
of maintaining a consistent image library. This could include
development of a replication mechanism to handle the transport of images
and the changes to the images across multiple sites.
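
Without a centralized store, such a replication mechanism could be as
simple as exporting an image from one region and importing it into
another; the region, file, and image names are placeholders:

.. code-block:: console

   $ openstack --os-region-name RegionOne image save --file web.qcow2 web-image
   $ openstack --os-region-name RegionTwo image create --file web.qcow2 \
     --disk-format qcow2 --container-format bare web-image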

High availability
~~~~~~~~~~~~~~~~~

If high availability is a requirement to provide continuous
infrastructure operations, a basic requirement of high availability
should be defined.

The OpenStack management components need to have a basic and minimal
level of redundancy. The simplest example is that the loss of any single
site should have minimal impact on the availability of the OpenStack
services.

The `OpenStack High Availability
Guide <https://docs.openstack.org/ha-guide/>`_ contains more information
on how to provide redundancy for the OpenStack components.

Multiple network links should be deployed between sites to provide
redundancy for all components. This includes storage replication, which
should be isolated to a dedicated network or VLAN with the ability to
assign QoS to control the replication traffic or provide priority for
this traffic. Note that if the data store is highly changeable, the
network requirements could have a significant effect on the operational
cost of maintaining the sites.

The ability to maintain object availability in both sites has
significant implications on the object storage design and
implementation. It also has a significant impact on the WAN network
design between the sites.

Connecting more than two sites increases the challenges and adds more
complexity to the design considerations. Multi-site implementations
require planning to address the additional topology used for internal
and external connectivity. Some options include full mesh topology, hub
and spoke, spine and leaf, and 3D torus.

If applications running in a cloud are not cloud-aware, there should be
clear measures and expectations to define what the infrastructure can
and cannot support. An example would be shared storage between sites. It
is possible; however, such a solution is not native to OpenStack and
requires a third-party hardware vendor to fulfill such a requirement.
Another example can be seen in applications that are able to consume
resources in object storage directly. These applications need to be
cloud aware to make good use of an OpenStack Object Store.

Application readiness
~~~~~~~~~~~~~~~~~~~~~

Some applications are tolerant of the lack of synchronized object
storage, while others may need those objects to be replicated and
available across regions. Understanding how the cloud implementation
impacts new and existing applications is important for risk mitigation,
and the overall success of a cloud project. Applications may have to be
written or rewritten for an infrastructure with little to no redundancy,
or with the cloud in mind.

Cost
~~~~

A greater number of sites increases cost and complexity for a multi-site
deployment. Costs can be broken down into the following categories:

* Compute resources

* Networking resources

* Replication

* Storage

* Management

* Operational costs

Site loss and recovery
~~~~~~~~~~~~~~~~~~~~~~

Outages can cause partial or full loss of site functionality. Strategies
should be implemented to understand and plan for recovery scenarios.

* The deployed applications need to continue to function and, more
  importantly, you must consider the impact on the performance and
  reliability of the application when a site is unavailable.

* It is important to understand what happens to the replication of
  objects and data between the sites when a site goes down. If this
  causes queues to start building up, consider how long these queues
  can safely exist until an error occurs.

* After an outage, ensure the method for resuming proper operations of
  a site is implemented when it comes back online. We recommend you
  architect the recovery to avoid race conditions.

Compliance and geo-location
~~~~~~~~~~~~~~~~~~~~~~~~~~~

An organization may have certain legal obligations and regulatory
compliance measures which could require certain workloads or data to not
be located in certain regions.

Auditing
~~~~~~~~

A well thought-out auditing strategy is important in order to be able to
quickly track down issues. Keeping track of changes made to security
groups and project changes can be useful in rolling back the changes if
they affect production. For example, if all security group rules for a
project disappeared, the ability to quickly track down the issue would be
important for operational and legal reasons.

Separation of duties
~~~~~~~~~~~~~~~~~~~~

A common requirement is to define different roles for the different
cloud administration functions. An example would be a requirement to
segregate the duties and permissions by site.

Authentication between sites
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

It is recommended to have a single authentication domain rather than a
separate implementation for each and every site. This requires an
authentication mechanism that is highly available and distributed to
ensure continuous operation. Authentication server locality might be
required and should be planned for.
@ -1,26 +0,0 @@
==========
Multi-site
==========

.. toctree::
   :maxdepth: 2

   multi-site-user-requirements.rst
   multi-site-technical-considerations.rst
   multi-site-operational-considerations.rst
   multi-site-architecture.rst
   multi-site-prescriptive-examples.rst

OpenStack is capable of running in a multi-region configuration. This
enables some parts of OpenStack to effectively manage a group of sites
as a single cloud.

Some use cases that might indicate a need for a multi-site deployment of
OpenStack include:

* An organization with a diverse geographic footprint.

* Geo-location sensitive data.

* Data locality, in which specific data or functionality should be
  close to users.
@ -1,184 +0,0 @@
Architecture
~~~~~~~~~~~~

Network-focused OpenStack architectures have many similarities to other
OpenStack architecture use cases. There are several factors to consider
when designing for a network-centric or network-heavy application
environment.

Networks exist to serve as a medium of transporting data between
systems. It is inevitable that an OpenStack design has
inter-dependencies with non-network portions of OpenStack as well as on
external systems. Depending on the specific workload, there may be major
interactions with storage systems both within and external to the
OpenStack environment. For example, in the case of a content delivery
network, there is twofold interaction with storage. Traffic flows to and
from the storage array for ingesting and serving content in a
north-south direction. In addition, there is replication traffic flowing
in an east-west direction.

Compute-heavy workloads may also induce interactions with the network.
Some high performance compute applications require network-based memory
mapping and data sharing and, as a result, induce a higher network load
when they transfer results and data sets. Others may be highly
transactional and issue transaction locks, perform their functions, and
revoke transaction locks at high rates. This also has an impact on
network performance.

Some network dependencies are external to OpenStack. While OpenStack
Networking is capable of providing network ports, IP addresses, some
level of routing, and overlay networks, there are some other functions
that it cannot provide. For many of these, you may require external
systems or equipment to fill in the functional gaps. Hardware load
balancers are an example of equipment that may be necessary to
distribute workloads or offload certain functions. OpenStack Networking
provides a tunneling feature; however, it is constrained to a
Networking-managed region. If the need arises to extend a tunnel beyond
the OpenStack region to either another region or an external system,
implement the tunnel itself outside OpenStack or use a tunnel management
system to map the tunnel or overlay to an external tunnel.

Depending on the selected design, Networking itself might not support
the required :term:`layer-3 network<Layer-3 network>` functionality. If
you choose to use the provider networking mode without running the layer-3
agent, you must install an external router to provide layer-3 connectivity
to outside systems.

Interaction with orchestration services is inevitable in larger-scale
deployments. The Orchestration service is capable of allocating network
resources defined in templates to map to project networks and for port
creation, as well as allocating floating IPs. If there is a requirement
to define and manage network resources when using orchestration, we
recommend that the design include the Orchestration service to meet the
demands of users.

Design impacts
--------------

A wide variety of factors can affect a network-focused OpenStack
architecture. While there are some considerations shared with a general
use case, specific workloads related to network requirements influence
network design decisions.

One decision includes whether or not to use Network Address Translation
(NAT) and where to implement it. If there is a requirement for floating
IPs instead of public fixed addresses then you must use NAT. An example
of this is a DHCP relay that must know the IP of the DHCP server. In
these cases it is easier to automate the infrastructure to apply the
target IP to a new instance rather than to reconfigure legacy or
external systems for each new instance.

NAT for floating IPs managed by Networking resides within the hypervisor,
but there are also versions of NAT that may be running elsewhere. If
there is a shortage of IPv4 addresses, there are two common methods to
mitigate this externally to OpenStack. The first is to run a load
balancer either within OpenStack as an instance, or use an external load
balancing solution. In the internal scenario, Networking's
Load-Balancer-as-a-Service (LBaaS) can manage load balancing software,
for example HAproxy. This is specifically to manage the Virtual IP (VIP)
while a dual-homed connection from the HAproxy instance connects the
public network with the project private network that hosts all of the
content servers. In the external scenario, a load balancer needs to
serve the VIP and also connect to the project overlay network through
external means or through private addresses.

Another kind of NAT that may be useful is protocol NAT. In some cases it
may be desirable to use only IPv6 addresses on instances and operate
either an instance or an external service to provide a NAT-based
transition technology such as NAT64 and DNS64. This provides the ability
to have a globally routable IPv6 address while only consuming IPv4
addresses as necessary or in a shared manner.

Application workloads affect the design of the underlying network
architecture. If a workload requires network-level redundancy, the
routing and switching architecture has to accommodate this. There are
differing methods for providing this that are dependent on the selected
network hardware, the performance of the hardware, and which networking
model you deploy. Examples include link aggregation (LAG) and Hot
Standby Router Protocol (HSRP). Also consider whether to deploy
OpenStack Networking or legacy networking (nova-network), and which
plug-in to select for OpenStack Networking. If using an external system,
configure Networking to run :term:`layer-2<Layer-2 network>` with a provider
network configuration. For example, implement HSRP to terminate layer-3
connectivity.

Depending on the workload, overlay networks may not be the best
solution. Where application network connections are small, short lived,
or bursty, running a dynamic overlay can generate as much bandwidth as
the packets it carries. It also can induce enough latency to cause
issues with certain applications. There is an impact to the device
generating the overlay which, in most installations, is the hypervisor.
This causes performance degradation on packet per second and connection
per second rates.

Overlays also come with a secondary option that may not be appropriate
to a specific workload. While all of them operate in full mesh by
default, there might be good reasons to disable this function because it
may cause excessive overhead for some workloads. Conversely, other
workloads operate without issue. For example, most web services
applications do not have major issues with a full mesh overlay network,
while some network monitoring tools or storage replication workloads
have performance issues with throughput or excessive broadcast traffic.

Many people overlook an important design decision: the choice of layer-3
protocols. While OpenStack was initially built with only IPv4 support,
Networking now supports IPv6 and dual-stacked networks. Some workloads
are possible through the use of IPv6 and IPv6 to IPv4 reverse transition
mechanisms such as NAT64 and DNS64 or :term:`6to4`. This alters the
requirements for any address plan as single-stacked and transitional IPv6
deployments can alleviate the need for IPv4 addresses.

OpenStack has limited support for dynamic routing; however, there are a
number of options available by incorporating third party solutions to
implement routing within the cloud, including network equipment, hardware
nodes, and instances. Some workloads perform well with nothing more than
static routes and default gateways configured at the layer-3 termination
point. In most cases this is sufficient; however, some cases require the
addition of at least one type of dynamic routing protocol if not
multiple protocols. Having a form of interior gateway protocol (IGP)
available to the instances inside an OpenStack installation opens up the
possibility of use cases for anycast route injection for services that
need to use it as a geographic location or failover mechanism. Other
applications may wish to directly participate in a routing protocol,
either as a passive observer, as in the case of a looking glass, or as
an active participant in the form of a route reflector. Since an
instance might have a large amount of compute and memory resources, it
is trivial to hold an entire unpartitioned routing table and use it to
provide services such as network path visibility to other applications
or as a monitoring tool.

Path maximum transmission unit (MTU) failures are lesser known but
harder to diagnose. The MTU must be large enough to handle normal
traffic, overhead from an overlay network, and the desired layer-3
protocol. Adding externally built tunnels further reduces the effective
MTU. In this case, you must pay attention to the fully calculated MTU
size because some systems ignore or drop path MTU discovery packets.
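
As a rough worked example (exact overhead depends on the encapsulation
and options in use): GRE over IPv4 adds a 20-byte outer IP header plus a
GRE header, so deployment guides of this era commonly lowered the
instance MTU on a 1500-byte network to 1454 bytes, for example:

.. code-block:: console

   # Lower the guest-facing MTU to leave room for tunnel overhead.
   $ ip link set dev eth0 mtu 1454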

Tunable networking components
-----------------------------

When designing for network intensive workloads, consider the
configurable networking components of an OpenStack architecture, which
include MTU and QoS. Some workloads require a larger MTU than normal due
to the transfer of large blocks of data. When providing network service
for applications such as video streaming or storage replication, we
recommend that you configure both OpenStack hardware nodes and the
supporting network equipment for jumbo frames where possible. This
allows for better use of available bandwidth. Configure jumbo frames
across the complete path the packets traverse. If one network component
is not capable of handling jumbo frames then the entire path reverts to
the default MTU.
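
A sketch of enabling jumbo frames on one node interface; the interface
name is a placeholder, and the same MTU must also be configured on the
switches and routers along the path:

.. code-block:: console

   $ ip link set dev eth1 mtu 9000
   $ ip link show dev eth1 | grep mtu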

:term:`Quality of Service (QoS)` also has a great impact on network intensive
workloads as it provides instant service to packets which have a higher
priority due to the impact of poor network performance. In applications
such as Voice over IP (VoIP), differentiated services code points are a
near requirement for proper operation. You can also use QoS in the
opposite direction for mixed workloads to prevent low priority but high
bandwidth applications, for example backup services, video conferencing,
or file sharing, from blocking bandwidth that is needed for the proper
operation of other workloads. It is possible to tag file storage traffic
as a lower class, such as best effort or scavenger, to allow the higher
priority traffic through. In cases where regions within a cloud might be
geographically distributed it may also be necessary to plan accordingly
to implement WAN optimization to combat latency or packet loss.
@ -1,64 +0,0 @@
Operational considerations
~~~~~~~~~~~~~~~~~~~~~~~~~~

Network-focused OpenStack clouds have a number of operational
considerations that influence the selected design, including:

* Dynamic routing of static routes

* Service level agreements (SLAs)

* Ownership of user management

An initial network consideration is the selection of a telecom company
or transit provider.

Make additional design decisions about monitoring and alarming. This can
be an internal responsibility or the responsibility of the external
provider. In the case of using an external provider, service level
agreements (SLAs) likely apply. In addition, other operational
considerations such as bandwidth, latency, and jitter can be part of an
SLA.

Consider the ability to upgrade the infrastructure. As demand for
network resources increases, operators add additional IP address blocks
and additional bandwidth capacity. In addition, consider managing
hardware and software lifecycle events, for example upgrades,
decommissioning, and outages, while avoiding service interruptions for
projects.

Factor maintainability into the overall network design. This includes
the ability to manage and maintain IP addresses as well as the use of
overlay identifiers including VLAN tag IDs, GRE tunnel IDs, and MPLS
tags. As an example, if you need to change all of the IP addresses
on a network, a process known as renumbering, then the design must
support this function.

Address network-focused applications when considering certain
operational realities. For example, consider the impending exhaustion of
IPv4 addresses, the migration to IPv6, and the use of private networks
to segregate different types of traffic that an application receives or
generates. In the case of IPv4 to IPv6 migrations, applications should
follow best practices for storing IP addresses. We recommend you avoid
relying on IPv4 features that did not carry over to the IPv6 protocol or
have differences in implementation.

To segregate traffic, allow applications to create a private project
network for database and storage network traffic. Use a public network
for services that require direct client access from the internet. Upon
segregating the traffic, consider :term:`quality of service (QoS)` and
security to ensure each network has the required level of service.
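
For example, a project can create a dedicated network and subnet for
database traffic from the CLI; the names and CIDR are placeholders:

.. code-block:: console

   $ openstack network create db-net
   $ openstack subnet create --network db-net \
     --subnet-range 192.168.10.0/24 db-subnet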

Finally, consider the routing of network traffic. For some applications,
develop a complex policy framework for routing. To create a routing
policy that satisfies business requirements, consider the economic cost
of transmitting traffic over expensive links versus cheaper links, in
addition to bandwidth, latency, and jitter requirements.

Additionally, consider how to respond to network events. As an example,
how load transfers from one link to another during a failure scenario
could be a factor in the design. If you do not plan network capacity
correctly, failover traffic could overwhelm other ports or network links
and create a cascading failure scenario. In this case, traffic that
fails over to one link overwhelms that link and then moves to the
subsequent links until all network traffic stops.
@ -1,165 +0,0 @@
Prescriptive examples
~~~~~~~~~~~~~~~~~~~~~

An organization designs a large-scale web application with cloud
principles in mind. The application scales horizontally in a bursting
fashion and generates a high instance count. The application requires an
SSL connection to secure data and must not lose connection state to
individual servers.

The figure below depicts an example design for this workload. In this
example, a hardware load balancer provides SSL offload functionality and
connects to project networks in order to reduce address consumption. This
load balancer links to the routing architecture as it services the VIP
for the application. The router and load balancer use the GRE tunnel ID
of the application's project network and an IP address within the project
subnet but outside of the address pool. This is to ensure that the load
balancer can communicate with the application's HTTP servers without
requiring the consumption of a public IP address.

Because sessions persist until closed, the routing and switching
architecture provides high availability. Switches mesh to each
hypervisor and each other, and also provide an MLAG implementation to
ensure that layer-2 connectivity does not fail. Routers use VRRP and
fully mesh with switches to ensure layer-3 connectivity. Since GRE
provides an overlay network, Networking is present and uses the Open
vSwitch agent in GRE tunnel mode. This ensures all devices can reach all
other devices and that you can create project networks for private
addressing links to the load balancer.

.. figure:: figures/Network_Web_Services1.png
A web service architecture has many options and optional components. Due
|
|
||||||
to this, it can fit into a large number of other OpenStack designs. A
|
|
||||||
few key components, however, need to be in place to handle the nature of
|
|
||||||
most web-scale workloads. You require the following components:
|
|
||||||
|
|
||||||
* OpenStack Controller services (Image, Identity, Networking and
|
|
||||||
supporting services such as MariaDB and RabbitMQ)
|
|
||||||
|
|
||||||
* OpenStack Compute running KVM hypervisor
|
|
||||||
|
|
||||||
* OpenStack Object Storage
|
|
||||||
|
|
||||||
* Orchestration service
|
|
||||||
|
|
||||||
* Telemetry service
|
|
||||||
|
|
||||||
Beyond the normal Identity, Compute, Image service, and Object Storage
|
|
||||||
components, we recommend the Orchestration service component to handle
|
|
||||||
the proper scaling of workloads to adjust to demand. Due to the
|
|
||||||
requirement for auto-scaling, the design includes the Telemetry service.
|
|
||||||
Web services tend to be bursty in load, have very defined peak and
|
|
||||||
valley usage patterns and, as a result, benefit from automatic scaling
|
|
||||||
of instances based upon traffic. At a network level, a split network
|
|
||||||
configuration works well with databases residing on private project
|
|
||||||
networks since these do not emit a large quantity of broadcast traffic
|
|
||||||
and may need to interconnect to some databases for content.
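
As a sketch of how the Orchestration and Telemetry services work
together here, assume a Heat template (hypothetically named
``webapp-autoscaling.yaml``) that defines an ``OS::Heat::AutoScalingGroup``
and ``OS::Heat::ScalingPolicy`` resources triggered by Telemetry alarms.
Launching and inspecting the scaling stack might look like:

.. code-block:: console

   $ openstack stack create --template webapp-autoscaling.yaml web-stack
   $ openstack stack resource list web-stack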

Load balancing
--------------

Load balancing spreads requests across multiple instances. This workload
scales well horizontally across large numbers of instances. This enables
instances to run without publicly routed IP addresses and instead to
rely on the load balancer to provide a globally reachable service. Many
of these services do not require direct server return. This aids in
address planning and utilization at scale since only the virtual IP
(VIP) must be public.
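
If you implement this pattern with software rather than hardware load
balancing, one option is the Octavia load-balancing service; this is an
alternative to the hardware balancer described above, and all names and
addresses below are placeholders:

.. code-block:: console

   $ openstack loadbalancer create --name web-lb --vip-subnet-id public-subnet
   $ openstack loadbalancer listener create --name web-in \
     --protocol HTTP --protocol-port 80 web-lb
   $ openstack loadbalancer pool create --name web-pool \
     --listener web-in --protocol HTTP --lb-algorithm ROUND_ROBIN
   $ openstack loadbalancer member create --subnet-id private-subnet \
     --address 10.0.0.11 --protocol-port 80 web-pool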

Overlay networks
----------------

The overlay functionality design includes OpenStack Networking in Open
vSwitch GRE tunnel mode. In this case, the layer-3 external routers pair
with VRRP, and switches pair with an implementation of MLAG to ensure
that you do not lose connectivity with the upstream routing
infrastructure.
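
A minimal sketch of the corresponding ML2 settings (the file path and
the use of ``crudini`` are illustrative assumptions; the same values can
be set by editing ``ml2_conf.ini`` directly):

.. code-block:: console

   $ crudini --set /etc/neutron/plugins/ml2/ml2_conf.ini ml2 type_drivers gre
   $ crudini --set /etc/neutron/plugins/ml2/ml2_conf.ini ml2 tenant_network_types gre
   $ crudini --set /etc/neutron/plugins/ml2/ml2_conf.ini ml2 mechanism_drivers openvswitch
   $ crudini --set /etc/neutron/plugins/ml2/ml2_conf.ini agent tunnel_types gre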

Performance tuning
------------------

Network level tuning for this workload is minimal. :term:`Quality of Service
(QoS)` applies to these workloads for a middle ground Class Selector
depending on existing policies. It is higher than a best effort queue
but lower than an Expedited Forwarding or Assured Forwarding queue.
Since this type of application generates larger packets with
longer-lived connections, you can optimize bandwidth utilization for
long duration TCP. Normal bandwidth planning applies here with regard
to benchmarking a session's usage multiplied by the expected number of
concurrent sessions with overhead.
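
If the deployment's Networking service supports QoS policies (a newer
capability than much of this guide assumes; names are placeholders), a
middle ground Class Selector such as CS2 (DSCP 16) might be applied as:

.. code-block:: console

   $ openstack network qos policy create web-qos
   $ openstack network qos rule create --type dscp-marking --dscp-mark 16 web-qos
   $ openstack network set --qos-policy web-qos web-net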

Network functions
-----------------

Network functions is a broad category that encompasses workloads
supporting the rest of a system's network. These workloads tend to
consist of large amounts of small packets that are very short lived,
such as DNS queries or SNMP traps. These messages need to arrive quickly
and do not tolerate packet loss well, as there can be a very large
volume of them. There are a few extra considerations to take into
account for this type of workload, and these can change a configuration
all the way down to the hypervisor level. For an application that
generates 10 TCP sessions per user, at an aggregate bandwidth of 512
kilobits per second per user, and an expected count of ten thousand
concurrent users, the expected bandwidth plan is approximately 4.88
gigabits per second.
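
The arithmetic behind that estimate, as a quick check (binary prefixes
assumed; scale the per-user figure up if each of the 10 flows sustains
that bandwidth on its own):

.. code-block:: console

   $ # 10,000 users x 512 Kb/s per user, converted to Gb/s
   $ echo "scale=2; 10000 * 512 / 1024 / 1024" | bc
   4.88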

The supporting network for this type of configuration needs to have a
low latency and evenly distributed availability. This workload benefits
from having services local to the consumers of the service. Use a
multi-site approach as well as deploying many copies of the application
to handle load as close as possible to consumers. Since these
applications function independently, they do not warrant running
overlays to interconnect project networks. Overlays also have the
drawback of performing poorly with rapid flow setup and may incur too
much overhead with large quantities of small packets, and therefore we
do not recommend them.

QoS is desirable for some workloads to ensure delivery. DNS has a major
impact on the load times of other services and needs to be reliable and
provide rapid responses. Configure rules in upstream devices to apply a
higher Class Selector to DNS to ensure faster delivery or a better spot
in queuing algorithms.

Cloud storage
-------------

Another common use case for OpenStack environments is providing a
cloud-based file storage and sharing service. You might consider this a
storage-focused use case, but its network-side requirements make it a
network-focused use case.

For example, consider a cloud backup application. This workload has two
specific behaviors that impact the network. Because this workload is an
externally-facing service and an internally-replicating application, it
has both :term:`north-south<north-south traffic>` and
:term:`east-west<east-west traffic>` traffic considerations:

north-south traffic
    When a user uploads and stores content, that content moves into the
    OpenStack installation. When users download this content, the
    content moves out from the OpenStack installation. Because this
    service operates primarily as a backup, most of the traffic moves
    southbound into the environment. In this situation, it benefits you
    to configure a network to be asymmetrically downstream because the
    traffic that enters the OpenStack installation is greater than the
    traffic that leaves the installation.

east-west traffic
    Likely to be fully symmetric. Because replication originates from
    any node and might target multiple other nodes algorithmically, it
    is less likely for this traffic to have a larger volume in any
    specific direction. However, this traffic might interfere with
    north-south traffic.

.. figure:: figures/Network_Cloud_Storage2.png

This application prioritizes the north-south traffic over east-west
traffic: the north-south traffic involves customer-facing data.

The network design in this case is less dependent on availability and
more dependent on being able to handle high bandwidth. As a direct
result, it is beneficial to forgo redundant links in favor of bonding
those connections. This increases available bandwidth. It is also
beneficial to configure all devices in the path, including OpenStack, to
generate and pass jumbo frames.
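
A minimal sketch of enabling jumbo frames end to end (the interface
name, file paths, and the use of ``crudini`` are illustrative
assumptions; every switch port in the path must be configured to match):

.. code-block:: console

   $ ip link set dev eth1 mtu 9000
   $ crudini --set /etc/neutron/neutron.conf DEFAULT global_physnet_mtu 9000
   $ crudini --set /etc/neutron/plugins/ml2/ml2_conf.ini ml2 path_mtu 9000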

@ -1,367 +0,0 @@
Technical considerations
~~~~~~~~~~~~~~~~~~~~~~~~

When you design an OpenStack network architecture, you must consider
layer-2 and layer-3 issues. Layer-2 decisions involve those made at the
data-link layer, such as the decision to use Ethernet versus Token Ring.
Layer-3 decisions involve those made about the protocol layer and the
point when IP comes into the picture. As an example, a completely
internal OpenStack network can exist at layer 2 and ignore layer 3. In
order for any traffic to go outside of that cloud, to another network,
or to the Internet, however, you must use a layer-3 router or switch.

The past few years have seen two competing trends in networking. One
trend leans towards building data center network architectures based on
layer-2 networking. Another trend treats the cloud environment
essentially as a miniature version of the Internet. This approach is
radically different from the network architecture approach in the
staging environment: the Internet only uses layer-3 routing rather than
layer-2 switching.

A network designed on layer-2 protocols has advantages over one designed
on layer-3 protocols. In spite of the difficulties of using a bridge to
perform the network role of a router, many vendors, customers, and
service providers choose to use Ethernet in as many parts of their
networks as possible. The benefits of selecting a layer-2 design are:

* Ethernet frames contain all the essentials for networking. These
  include, but are not limited to, globally unique source addresses,
  globally unique destination addresses, and error control.

* Ethernet frames can carry any kind of packet. Networking at layer-2
  is independent of the layer-3 protocol.

* Adding more layers to the Ethernet frame only slows the networking
  process down. This is known as 'nodal processing delay'.

* You can add adjunct networking features, for example class of service
  (CoS) or multicasting, to Ethernet as readily as IP networks.

* VLANs are an easy mechanism for isolating networks.

Most information starts and ends inside Ethernet frames. Today this
applies to data, voice (for example, VoIP), and video (for example, web
cameras). The concept is that if you can perform more of the end-to-end
transfer of information from a source to a destination in the form of
Ethernet frames, the network benefits more from the advantages of
Ethernet. Although it is not a substitute for IP networking, networking
at layer-2 can be a powerful adjunct to IP networking.

Layer-2 Ethernet usage has these advantages over layer-3 IP network
usage:

* Speed

* Reduced overhead of the IP hierarchy.

* No need to keep track of address configuration as systems move
  around. Whereas the simplicity of layer-2 protocols might work well
  in a data center with hundreds of physical machines, cloud data
  centers have the additional burden of needing to keep track of all
  virtual machine addresses and networks. In these data centers, it is
  not uncommon for one physical node to support 30-40 instances.

.. important::

   Networking at the frame level says nothing about the presence or
   absence of IP addresses at the packet level. Almost all ports,
   links, and devices on a network of LAN switches still have IP
   addresses, as do all the source and destination hosts. There are
   many reasons for the continued need for IP addressing. The largest
   one is the need to manage the network. A device or link without an
   IP address is usually invisible to most management applications.
   Utilities including remote access for diagnostics, file transfer of
   configurations and software, and similar applications cannot run
   without IP addresses as well as MAC addresses.

Layer-2 architecture limitations
--------------------------------

Outside of the traditional data center, the limitations of layer-2
network architectures become more obvious.

* Number of VLANs is limited to 4096.

* The number of MACs stored in switch tables is limited.

* You must accommodate the need to maintain a set of layer-4 devices to
  handle traffic control.

* MLAG, often used for switch redundancy, is a proprietary solution
  that does not scale beyond two devices and forces vendor lock-in.

* It can be difficult to troubleshoot a network without IP addresses
  and ICMP.

* Configuring :term:`ARP<Address Resolution Protocol (ARP)>` can be
  complicated on large layer-2 networks.

* All network devices need to be aware of all MACs, even instance MACs,
  so there is constant churn in MAC tables and network state changes as
  instances start and stop.

* Migrating MACs (instance migration) to different physical locations
  is a potential problem if you do not set ARP table timeouts properly.

It is important to know that layer-2 has a very limited set of network
management tools. It is very difficult to control traffic, as it does
not have mechanisms to manage the network or shape the traffic, and
network troubleshooting is very difficult. One reason for this
difficulty is that network devices have no IP addresses. As a result,
there is no reasonable way to check network delay in a layer-2 network.

On large layer-2 networks, configuring ARP learning can also be
complicated. The setting for the MAC address timer on switches is
critical and, if set incorrectly, can cause significant performance
problems. As an example, the Cisco default MAC address timer is
extremely long. Migrating MACs to different physical locations to
support instance migration can be a significant problem. In this case,
the network information maintained in the switches could be out of sync
with the new location of the instance.

In a layer-2 network, all devices are aware of all MACs, even those that
belong to instances. The network state information in the backbone
changes whenever an instance starts or stops. As a result there is far
too much churn in the MAC tables on the backbone switches.

Layer-3 architecture advantages
-------------------------------

In the layer-3 case, there is no churn in the routing tables due to
instances starting and stopping. The only time there would be a routing
state change is in the case of a Top of Rack (ToR) switch failure or a
link failure in the backbone itself. Other advantages of using a layer-3
architecture include:

* Layer-3 networks provide the same level of resiliency and scalability
  as the Internet.

* Controlling traffic with routing metrics is straightforward.

* You can configure layer 3 to use :term:`BGP<Border Gateway Protocol (BGP)>`
  confederation for scalability so core routers have state proportional to the
  number of racks, not to the number of servers or instances.

* Routing takes instance MAC and IP addresses out of the network core,
  reducing state churn. Routing state changes only occur in the case of
  a ToR switch failure or backbone link failure.

* There are a variety of well tested tools, for example ICMP, to
  monitor and manage traffic.

* Layer-3 architectures enable the use of :term:`quality of service (QoS)` to
  manage network performance.

Layer-3 architecture limitations
--------------------------------

The main limitation of layer 3 is that there is no built-in isolation
mechanism comparable to the VLANs in layer-2 networks. Furthermore, the
hierarchical nature of IP addresses means that an instance is on the
same subnet as its physical host. This means that you cannot migrate it
outside of the subnet easily. For these reasons, network virtualization
needs to use IP :term:`encapsulation` and software at the end hosts for
isolation and the separation of the addressing in the virtual layer from
the addressing in the physical layer. Other potential disadvantages of
layer 3 include the need to design an IP addressing scheme rather than
relying on the switches to keep track of the MAC addresses automatically
and to configure the interior gateway routing protocol in the switches.

Network recommendations overview
--------------------------------

OpenStack has complex networking requirements for several reasons. Many
components interact at different levels of the system stack, which adds
complexity. Data flows are complex. Data in an OpenStack cloud moves
both between instances across the network (also known as East-West), as
well as in and out of the system (also known as North-South). Physical
server nodes have network requirements that are independent of instance
network requirements, which you must isolate from the core network to
account for scalability. We recommend functionally separating the
networks for security purposes and tuning performance through traffic
shaping.

You must consider a number of important general technical and business
factors when planning and designing an OpenStack network. They include:

* A requirement for vendor independence. To avoid hardware or software
  vendor lock-in, the design should not rely on specific features of a
  vendor's router or switch.

* A requirement to massively scale the ecosystem to support millions of
  end users.

* A requirement to support indeterminate platforms and applications.

* A requirement to design for cost efficient operations to take
  advantage of massive scale.

* A requirement to ensure that there is no single point of failure in
  the cloud ecosystem.

* A requirement for high availability architecture to meet customer SLA
  requirements.

* A requirement to be tolerant of rack level failure.

* A requirement to maximize flexibility to architect future production
  environments.

Bearing in mind these considerations, we recommend the following:

* Layer-3 designs are preferable to layer-2 architectures.

* Design a dense multi-path network core to support multi-directional
  scaling and flexibility.

* Use hierarchical addressing because it is the only viable option to
  scale the network ecosystem.

* Use virtual networking to isolate instance service network traffic
  from the management and internal network traffic.

* Isolate virtual networks using encapsulation technologies.

* Use traffic shaping for performance tuning.

* Use eBGP to connect to the Internet up-link.

* Use iBGP to flatten the internal traffic on the layer-3 mesh.

* Determine the most effective configuration for the block storage
  network.

Additional considerations
-------------------------

There are several further considerations when designing a
network-focused OpenStack cloud.

OpenStack Networking versus legacy networking (nova-network) considerations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Selecting the type of networking technology to implement depends on many
factors. OpenStack Networking (neutron) and legacy networking
(nova-network) both have their advantages and disadvantages. They are
both valid and supported options that fit different use cases:

.. list-table:: **OpenStack Networking versus legacy networking
   (nova-network)**
   :widths: 50 40
   :header-rows: 1

   * - Legacy networking (nova-network)
     - OpenStack Networking
   * - Simple, single agent
     - Complex, multiple agents
   * - More mature, established
     - Newer, maturing
   * - Flat or VLAN
     - Flat, VLAN, Overlays, L2-L3, SDN
   * - No plug-in support
     - Plug-in support for 3rd parties
   * - Scales well
     - Scaling requires 3rd party plug-ins
   * - No multi-tier topologies
     - Multi-tier topologies

Redundant networking: ToR switch high availability risk analysis
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A technical consideration of networking is the idea that you should
install switching gear in a data center with backup switches in case of
hardware failure.

Research indicates the mean time between failures (MTBF) on switches is
between 100,000 and 200,000 hours. This number is dependent on the
ambient temperature of the switch in the data center. When properly
cooled and maintained, this translates to between 11 and 22 years before
failure. Even in the worst case of poor ventilation and high ambient
temperatures in the data center, the MTBF is still 2-3 years. See
`Ethernet switch reliability: Temperature vs. moving parts
<http://media.beldensolutions.com/garrettcom/techsupport/papers/ethernet_switch_reliability.pdf>`_
for further information.

In most cases, it is much more economical to use a single switch with a
small pool of spare switches to replace failed units than it is to
outfit an entire data center with redundant switches. Applications
should tolerate rack level outages without affecting normal operations,
since network and compute resources are easily provisioned and
plentiful.

Preparing for the future: IPv6 support
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

One of the most important networking topics today is the impending
exhaustion of IPv4 addresses. In early 2014, ICANN announced that they
started allocating the final IPv4 address blocks to the `Regional
Internet Registries
<http://www.internetsociety.org/deploy360/blog/2014/05/goodbye-ipv4-iana-starts-allocating-final-address-blocks/>`_.
This means the IPv4 address space is close to being fully allocated. As
a result, it will soon become difficult to allocate more IPv4 addresses
to an application that has experienced growth, or that you expect to
scale out, due to the lack of unallocated IPv4 address blocks.

For network focused applications the future is the IPv6 protocol. IPv6
increases the address space significantly, fixes long standing issues in
the IPv4 protocol, and will become essential for network focused
applications in the future.

OpenStack Networking supports IPv6 when configured to take advantage of
it. To enable IPv6, create an IPv6 subnet in Networking and use IPv6
prefixes when creating security groups.
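
For example, a minimal sketch (network, subnet, and security group names
are placeholders, and ``2001:db8::/32`` is a documentation prefix):

.. code-block:: console

   $ openstack subnet create --network app-net --ip-version 6 \
     --ipv6-ra-mode slaac --ipv6-address-mode slaac \
     --subnet-range 2001:db8:1::/64 app-subnet-v6
   $ openstack security group rule create --ethertype IPv6 \
     --protocol tcp --dst-port 443 app-secgroup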

Asymmetric links
^^^^^^^^^^^^^^^^

When designing a network architecture, the traffic patterns of an
application heavily influence the allocation of total bandwidth and the
number of links that you use to send and receive traffic. Applications
that provide file storage for customers allocate bandwidth and links to
favor incoming traffic, whereas video streaming applications allocate
bandwidth and links to favor outgoing traffic.

Performance
^^^^^^^^^^^

It is important to analyze the applications' tolerance for latency and
jitter when designing an environment to support network focused
applications. Certain applications, for example VoIP, are less tolerant
of latency and jitter. Where latency and jitter are concerned, certain
applications may require tuning of QoS parameters and network device
queues to ensure that they queue for transmit immediately or guarantee
minimum bandwidth. Since OpenStack currently does not support these
functions, consider carefully your selected network plug-in.

The location of a service may also impact the application or consumer
experience. If an application serves differing content to different
users, it must properly direct connections to those specific locations.
Where appropriate, use a multi-site installation for these situations.

You can implement networking in two separate ways. Legacy networking
(nova-network) provides a flat DHCP network with a single broadcast
domain. This implementation does not support project isolation networks
or advanced plug-ins, but it is currently the only way to implement a
distributed :term:`layer-3 (L3) agent` using the multi_host configuration.
OpenStack Networking (neutron) is the official networking implementation and
provides a pluggable architecture that supports a large variety of
network methods. Some of these include a layer-2 only provider network
model, external device plug-ins, or even OpenFlow controllers.

Networking at large scales becomes a set of boundary questions. The
determination of how large a layer-2 domain must be is based on the
number of nodes within the domain and the amount of broadcast traffic
that passes between instances. Breaking layer-2 boundaries may require
the implementation of overlay networks and tunnels. This decision is a
balancing act between the need for a smaller overhead or a need for a
smaller domain.

When selecting network devices, be aware that making this decision based
on the greatest port density often comes with a drawback. Aggregation
switches and routers have not all kept pace with Top of Rack switches
and may induce bottlenecks on north-south traffic. As a result, it may
be possible for massive amounts of downstream network utilization to
impact upstream network devices, impacting service to the cloud. Since
OpenStack does not currently provide a mechanism for traffic shaping or
rate limiting, it is necessary to implement these features at the
network hardware level.

@ -1,71 +0,0 @@
User requirements
~~~~~~~~~~~~~~~~~

Network-focused architectures vary from the general-purpose architecture
designs. Certain network-intensive applications influence these
architectures. Business requirements also influence the design: network
latency, experienced as slow page loads, degraded video streams, and
low quality VoIP sessions, directly impacts the user experience.

Users are often not aware of how network design and architecture affect
their experiences. Both enterprise customers and end-users rely on the
network for delivery of an application. Network performance problems can
result in a negative experience for the end-user, as well as
productivity and economic loss.

High availability issues
------------------------

Depending on the application and use case, network-intensive OpenStack
installations can have high availability requirements. Financial
transaction systems have a much higher requirement for high availability
than a development application. Use network availability technologies,
for example :term:`quality of service (QoS)`, to improve the network
performance of sensitive applications such as VoIP and video streaming.

High performance systems have SLA requirements for a minimum QoS with
regard to guaranteed uptime, latency, and bandwidth. The level of the
SLA can have a significant impact on the network architecture and
requirements for redundancy in the systems.

Risks
-----

Network misconfigurations
    Configuring incorrect IP addresses, VLANs, and routers can cause
    outages to areas of the network or, in the worst-case scenario, the
    entire cloud infrastructure. Automate network configurations to
    minimize the opportunity for operator error, as it can cause
    disruptive problems.

Capacity planning
    Cloud networks require management for capacity and growth over time.
    Capacity planning includes the purchase of network circuits and
    hardware that can potentially have lead times measured in months or
    years.

Network tuning
    Configure cloud networks to minimize link loss, packet loss, packet
    storms, broadcast storms, and loops.

Single Point Of Failure (SPOF)
    Consider high availability at the physical and environmental layers.
    If there is a single point of failure due to only one upstream link,
    or only one power supply, an outage can become unavoidable.

Complexity
    An overly complex network design can be difficult to maintain and
    troubleshoot. While device-level configuration can ease maintenance
    concerns and automated tools can handle overlay networks, avoid or
    document non-traditional interconnects between functions and
    specialized hardware to prevent outages.

Non-standard features
    There are additional risks that arise from configuring the cloud
    network to take advantage of vendor specific features. One example
    is multi-link aggregation (MLAG) used to provide redundancy at the
    aggregator switch level of the network. MLAG is not a standard and,
    as a result, each vendor has their own proprietary implementation of
    the feature. MLAG architectures are not interoperable across switch
    vendors, which leads to vendor lock-in, and can cause delays or
    failures when upgrading components.

@ -1,101 +0,0 @@
===============
Network focused
===============

.. toctree::
   :maxdepth: 2

   network-focus-user-requirements.rst
   network-focus-technical-considerations.rst
   network-focus-operational-considerations.rst
   network-focus-architecture.rst
   network-focus-prescriptive-examples.rst

All OpenStack deployments depend on network communication in order to
function properly because of their service-based nature. In some cases,
however, the network's role elevates beyond simple infrastructure. This
chapter discusses architectures that are more reliant or focused on
network services. These architectures depend on the network
infrastructure and require network services that perform reliably in
order to satisfy user and application requirements.

Some possible use cases include:

Content delivery network
    This includes streaming video, viewing photographs, or accessing any
    other cloud-based data repository distributed to a large number of
    end users. Network configuration affects latency, bandwidth, and the
    distribution of instances. Therefore, it impacts video streaming.
    Not all video streaming is consumer-focused. For example, multicast
    videos (used for media, press conferences, corporate presentations,
    and web conferencing services) can also use a content delivery
    network. The location of the video repository and its relationship
    to end users affects content delivery. Network throughput of the
    back-end systems, as well as the WAN architecture and the cache
    methodology, also affect performance.

Network management functions
    Use this cloud to provide network service functions built to support
    the delivery of back-end network services such as DNS, NTP, or SNMP.

Network service offerings
    Use this cloud to run customer-facing network tools to support
    services. Examples include VPNs, MPLS private networks, and GRE
    tunnels.

Web portals or web services
    Web servers are a common application for cloud services, and we
    recommend an understanding of their network requirements. The
    network requires scaling out to meet user demand and deliver web
    pages with a minimum latency. Depending on the details of the portal
    architecture, consider the internal east-west and north-south
    network bandwidth.

High speed and high volume transactional systems
    These types of applications are sensitive to network configurations.
    Examples include financial systems, credit card transaction
    applications, and trading and other extremely high volume systems.
    These systems are sensitive to network jitter and latency. They must
    balance a high volume of East-West and North-South network traffic
    to maximize efficiency of the data delivery. Many of these systems
    must access large, high performance database back ends.

High availability
    These types of use cases are dependent on the proper sizing of the
    network to maintain replication of data between sites for high
    availability. If one site becomes unavailable, the extra sites can
    serve the displaced load until the original site returns to service.
    It is important to size network capacity to handle the desired
    loads.

Big data
    Clouds used for the management and collection of big data (data
    ingest) have a significant demand on network resources. Big data
    often uses partial replicas of the data to maintain integrity over
    large distributed clouds. Other big data applications that require a
    large amount of network resources are Hadoop, Cassandra, NuoDB,
    Riak, and other NoSQL and distributed databases.

Virtual desktop infrastructure (VDI)
    This use case is sensitive to network congestion, latency, jitter,
    and other network characteristics. Like video streaming, the user
    experience is important. However, unlike video streaming, caching is
    not an option to offset the network issues. VDI requires both
    upstream and downstream traffic and cannot rely on caching for the
    delivery of the application to the end user.

Voice over IP (VoIP)
    This is sensitive to network congestion, latency, jitter, and other
    network characteristics. VoIP has a symmetrical traffic pattern and
    it requires network :term:`quality of service (QoS)` for best
    performance. In addition, you can implement active queue management
    to deliver voice and multimedia content. Users are sensitive to
    latency and jitter fluctuations and can detect them at very low
    levels.

Video Conference or web conference
    This is sensitive to network congestion, latency, jitter, and other
    network characteristics. Video Conferencing has a symmetrical
    traffic pattern, but unless the network is on an MPLS private
    network, it cannot use network :term:`quality of service (QoS)` to
    improve performance. Similar to VoIP, users are sensitive to network
    performance issues even at low levels.

High performance computing (HPC)
    This is a complex use case that requires careful consideration of
    the traffic flows and usage patterns to address the needs of cloud
    clusters. It has high east-west traffic patterns for distributed
    computing, but there can be substantial north-south traffic
    depending on the specific application.

@ -1,85 +0,0 @@
==========
References
==========

`Data Protection framework of the European Union
<http://ec.europa.eu/justice/data-protection/>`_
: Guidance on Data Protection laws governed by the EU.

`Depletion of IPv4 Addresses
<http://www.internetsociety.org/deploy360/blog/2014/05/
goodbye-ipv4-iana-starts-allocating-final-address-blocks/>`_
: Describes how the exhaustion of IPv4 addresses makes the migration to
IPv6 inevitable.

`Ethernet Switch Reliability <http://www.garrettcom.com/
techsupport/papers/ethernet_switch_reliability.pdf>`_
: Research white paper on Ethernet switch reliability.

`Financial Industry Regulatory Authority
<http://www.finra.org/Industry/Regulation/FINRARules/>`_
: Requirements of the Financial Industry Regulatory Authority in the USA.

`Image Service property keys <https://docs.openstack.org/
cli-reference/glance.html#image-service-property-keys>`_
: Glance API property keys allow the administrator to attach custom
characteristics to images.

`LibGuestFS Documentation <http://libguestfs.org>`_
: Official LibGuestFS documentation.

`Logging and Monitoring
<https://docs.openstack.org/ops-guide/ops-logging-monitoring.html>`_
: Official OpenStack Operations documentation.

`ManageIQ Cloud Management Platform <http://manageiq.org/>`_
: An open source cloud management platform for managing multiple clouds.

`N-Tron Network Availability
<https://www.scribd.com/doc/298973976/Network-Availability>`_
: Research white paper on network availability.

`Nested KVM <http://davejingtian.org/2014/03/30/nested-kvm-just-for-fun>`_
: Post on how to nest KVM under KVM.

`Open Compute Project <http://www.opencompute.org/>`_
: The Open Compute Project Foundation's mission is to design
and enable the delivery of the most efficient server,
storage and data center hardware designs for scalable computing.

`OpenStack Flavors
<https://docs.openstack.org/ops-guide/ops-user-facing-operations.html#flavors>`_
: Official OpenStack documentation.

`OpenStack High Availability Guide <https://docs.openstack.org/ha-guide/>`_
: Information on how to provide redundancy for the OpenStack components.

`OpenStack Hypervisor Support Matrix
<https://wiki.openstack.org/wiki/HypervisorSupportMatrix>`_
: Matrix of supported hypervisors and capabilities when used with OpenStack.

`OpenStack Object Store (Swift) Replication Reference
<https://docs.openstack.org/developer/swift/replication_network.html>`_
: Developer documentation of Swift replication.

`OpenStack Operations Guide <https://docs.openstack.org/ops-guide/>`_
: The OpenStack Operations Guide provides information on setting up
and installing OpenStack.

`OpenStack Security Guide <https://docs.openstack.org/security-guide/>`_
: The OpenStack Security Guide provides information on securing
OpenStack deployments.

`OpenStack Training Marketplace
<https://www.openstack.org/marketplace/training>`_
: The OpenStack Marketplace for training and vendors providing training
on OpenStack.

`PCI passthrough <https://wiki.openstack.org/wiki/
Pci_passthrough#How_to_check_PCI_status_with_PCI_api_paches>`_
: The PCI API patches extend the servers/os-hypervisor to
show PCI information for instance and compute node,
and also provide a resource endpoint to show PCI information.

`TripleO <https://wiki.openstack.org/wiki/TripleO>`_
: TripleO is a program aimed at installing, upgrading and operating
OpenStack clouds using OpenStack's own cloud facilities as the foundation.

@ -1,47 +0,0 @@
====================
Desktop-as-a-Service
====================

Virtual Desktop Infrastructure (VDI) is a service that hosts
user desktop environments on remote servers. This application
is very sensitive to network latency and requires a high
performance compute environment. Traditionally these types of
services do not use cloud environments because few clouds
support such a demanding workload for user-facing applications.
As cloud environments become more robust, vendors are starting
to offer services that provide virtual desktops in the cloud.
OpenStack may soon provide the infrastructure for these types of deployments.

Challenges
~~~~~~~~~~

Designing an infrastructure that is suitable to host virtual
desktops is a very different task from designing for most virtual
workloads. For example, the design must consider:

* Boot storms, when a high volume of logins occurs in a short period of time
* The performance of the applications running on virtual desktops
* Operating systems and their compatibility with the OpenStack hypervisor

Broker
~~~~~~

The connection broker determines which remote desktop host
users can access. Medium and large scale environments require a broker
since its service represents a central component of the architecture.
The broker is a complete management product, and enables automated
deployment and provisioning of remote desktop hosts.

Possible solutions
~~~~~~~~~~~~~~~~~~

There are a number of commercial products currently available that
provide a broker solution. However, no native OpenStack projects
provide broker services.
Not providing a broker is also an option, but managing this manually
would not suffice for a large scale, enterprise solution.

Diagram
~~~~~~~

.. figure:: figures/Specialized_VDI1.png

@ -1,43 +0,0 @@
====================
Specialized hardware
====================

Certain workloads require specialized hardware devices that
have significant virtualization or sharing challenges.
Applications such as load balancers, highly parallel brute
force computing, and direct to wire networking may need
capabilities that basic OpenStack components do not provide.

Challenges
~~~~~~~~~~

Some applications need access to hardware devices to either
improve performance or provide capabilities beyond the
virtual CPU, RAM, network, or storage that OpenStack offers.
These can be a shared resource, such as a cryptography
processor, or a dedicated resource, such as a Graphics
Processing Unit (GPU). OpenStack can provide some of these,
while others may need extra work.

Solutions
~~~~~~~~~

To provide cryptography offloading to a set of instances,
you can use Image service configuration options.
For example, assign the cryptography chip to a device node in the guest.
The OpenStack Command Line Reference contains further information on
configuring this solution in the section `Image service property keys
<https://docs.openstack.org/cli-reference/glance.html#image-service-property-keys>`_.
A challenge, however, is that this option allows all guests using the
configured images to access the hypervisor cryptography device.

If you require direct access to a specific device, PCI pass-through
enables you to dedicate the device to a single instance per hypervisor.
You must define a flavor that specifically requests the PCI device in
order to properly schedule instances, as sketched below.
More information regarding PCI pass-through, including instructions for
implementing and using it, is available at
`https://wiki.openstack.org/wiki/Pci_passthrough <https://wiki.openstack.org/
wiki/Pci_passthrough#How_to_check_PCI_status_with_PCI_api_patches>`_.
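
As a sketch, assuming a PCI alias named ``crypto1`` has already been
defined through the ``pci_alias`` option in ``nova.conf`` (the alias and
flavor names are placeholders), the flavor request might look like:

.. code-block:: console

   $ openstack flavor set m1.large --property "pci_passthrough:alias"="crypto1:1"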

.. figure:: figures/Specialized_Hardware2.png
   :width: 100%

@ -1,78 +0,0 @@
========================
Multi-hypervisor example
========================

A financial company requires its applications migrated
from a traditional, virtualized environment to an API driven,
orchestrated environment. The new environment needs
multiple hypervisors since many of the company's applications
have strict hypervisor requirements.

Currently, the company's vSphere environment runs 20 VMware
ESXi hypervisors. These hypervisors support 300 instances of
various sizes. Approximately 50 of these instances must run
on ESXi. The remaining 250 or so have more flexible requirements.

The financial company decides to manage the
overall system with a common OpenStack platform.

.. figure:: figures/Compute_NSX.png
   :width: 100%

Architecture planning teams decided to run a host aggregate
containing KVM hypervisors for the general purpose instances.
A separate host aggregate targets instances requiring ESXi.

Images in the OpenStack Image service have particular
hypervisor metadata attached. When a user requests a
certain image, the instance spawns on the relevant aggregate.
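
A minimal sketch of that wiring (aggregate, host, property, and image
names are placeholders, and the Compute scheduler must enable the
corresponding aggregate and image-property filters):

.. code-block:: console

   $ openstack aggregate create --property hypervisor_type=kvm kvm-hosts
   $ openstack aggregate add host kvm-hosts compute-kvm-01
   $ openstack image set --property hypervisor_type=kvm ubuntu-qcow2
   $ openstack image set --property hypervisor_type=vmware windows-vmdk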

Images for ESXi use the VMDK format. You can convert
QEMU disk images to the VMDK format, including VMFS flat disks,
which can be thin, thick, zeroed-thick, or eager-zeroed-thick.
After exporting a VMFS thin disk from VMFS to the
OpenStack Image service (a non-VMFS location), it becomes a
preallocated flat disk. This impacts the transfer time from the
OpenStack Image service to the data store since transfers require
moving the full preallocated flat disk rather than the thin disk.

The VMware host aggregate compute nodes communicate with
vCenter rather than spawning directly on a hypervisor.
The vCenter then requests scheduling for the instance to run on
an ESXi hypervisor.

This functionality requires that VMware Distributed Resource
Scheduler (DRS) is enabled on a cluster and set to **Fully Automated**.
vSphere requires shared storage because DRS uses vMotion,
a service that relies on shared storage.

This solution to the company's migration uses shared storage
to provide Block Storage capabilities to the KVM instances while
also providing vSphere storage. The new environment provides this
storage functionality using a dedicated data network. The
compute hosts should have dedicated NICs to support the
dedicated data network. vSphere supports OpenStack Block Storage. This
support gives storage from a VMFS datastore to an instance. For the
financial company, Block Storage in their new architecture supports
both hypervisors.

OpenStack Networking provides network connectivity in this new
architecture, with the VMware NSX plug-in driver configured. Legacy
networking (nova-network) supports both hypervisors in this new
architecture example, but has limitations. Specifically, vSphere
with legacy networking does not support security groups. The new
architecture uses VMware NSX as a part of the design. When users launch an
instance within either of the host aggregates, VMware NSX ensures the
instance attaches to the appropriate overlay-based logical networks.

The architecture planning teams also consider OpenStack Compute
integration. When running vSphere in an OpenStack environment,
nova-compute communicates with vCenter, which appears to Compute as a
single large hypervisor representing the entire ESXi cluster. Multiple
nova-compute instances can represent multiple ESXi clusters. They can
connect to multiple vCenter servers. If the process running nova-compute
crashes, it cuts the connection to the vCenter server, instance
management on the affected ESXi clusters stops, and you will not be able
to provision further instances on that vCenter, even if you enable high
availability. You must monitor the nova-compute service connected
to vSphere carefully for any disruptions as a result of this failure point.

@ -1,32 +0,0 @@
==============================
Specialized networking example
==============================

Some applications that interact with a network require
specialized connectivity. Applications such as a looking glass
require the ability to connect to a BGP peer, and route participant
applications may need to join a network at layer 2.

Challenges
~~~~~~~~~~

Connecting specialized network applications to their required
resources alters the design of an OpenStack installation.
Installations that rely on overlay networks are unable to
support a routing participant, and may also block layer-2 listeners.

Possible solutions
~~~~~~~~~~~~~~~~~~

Deploying an OpenStack installation using OpenStack Networking with a
provider network allows direct layer-2 connectivity to an
upstream networking device.
This design provides the layer-2 connectivity required to communicate
via Intermediate System-to-Intermediate System (ISIS) protocol or
to pass packets controlled by an OpenFlow controller.
Using the Modular Layer 2 (ML2) plug-in with an agent such as
:term:`Open vSwitch` allows a private connection through a VLAN
directly to a specific port in a layer-3 device.
This allows a BGP point-to-point link to join the autonomous system.
Avoid using layer-3 plug-ins as they divide the broadcast
domain and prevent router adjacencies from forming.
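
A minimal sketch of such a provider network (the physical network
label, segment ID, and network name are placeholders that must match
the ML2 configuration and the upstream switch):

.. code-block:: console

   $ openstack network create --provider-network-type vlan \
     --provider-physical-network physnet1 --provider-segment 201 \
     bgp-peering-net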

@ -1,71 +0,0 @@
======================
OpenStack on OpenStack
======================

In some cases, users may run OpenStack nested on top
of another OpenStack cloud. This scenario describes how to
manage and provision complete OpenStack environments on instances
supported by hypervisors and servers, which an underlying OpenStack
environment controls.

Public cloud providers can use this technique to manage the
upgrade and maintenance process on complete OpenStack environments.
Developers and those testing OpenStack can also use this
technique to provision their own OpenStack environments on
available OpenStack Compute resources, whether public or private.

Challenges
~~~~~~~~~~

The network aspect of deploying a nested cloud is the most
complicated aspect of this architecture.
You must expose VLANs to the physical ports on which the underlying
cloud runs because the bare metal cloud owns all the hardware.
You must also expose them to the nested levels as well.
Alternatively, you can use the network overlay technologies on the
OpenStack environment running on the host OpenStack environment to
provide the required software defined networking for the deployment.

Hypervisor
~~~~~~~~~~

In this example architecture, consider which
approach you should take to provide a nested
hypervisor in OpenStack. This decision influences which
operating systems you use for the nested
OpenStack deployments.

Possible solutions: deployment
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Deployment of a full stack can be challenging, but you can mitigate
this difficulty by creating a Heat template that deploys the
entire stack or by using a configuration management system. After creating
the Heat template, you can automate the deployment of additional stacks.
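
For example (the template name is a placeholder for a template that
defines the nested cloud's servers, networks, and configuration):

.. code-block:: console

   $ openstack stack create --template nested-openstack.yaml nested-cloud-01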
|
|
||||||
|
|
||||||
The OpenStack-on-OpenStack project (:term:`TripleO`)
|
|
||||||
addresses this issue. Currently, however, the project does
|
|
||||||
not completely cover nested stacks. For more information, see
|
|
||||||
https://wiki.openstack.org/wiki/TripleO.
|
|
||||||
|
|
||||||
Possible solutions: hypervisor
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In the case of running TripleO, the underlying OpenStack
cloud deploys the compute nodes as bare-metal. You then deploy
OpenStack on these Compute bare-metal servers with the
appropriate hypervisor, such as KVM.

In the case of running smaller OpenStack clouds for testing
purposes, where performance is not a critical factor, you can use
QEMU instead. It is also possible to run a KVM hypervisor in an instance
(see `davejingtian.org
<http://davejingtian.org/2014/03/30/nested-kvm-just-for-fun/>`_),
though this is not a supported configuration, and could be a
complex solution for such a use case.
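If you experiment with KVM-in-an-instance anyway, a quick check such
as the sketch below (an assumption of this edit, not part of the
original guide) confirms whether the kernel on a given host or
first-level instance exposes nested virtualization at all.

.. code-block:: python

   # Report whether the kvm_intel or kvm_amd module allows nesting.
   def nested_kvm_enabled():
       for module in ("kvm_intel", "kvm_amd"):
           path = "/sys/module/%s/parameters/nested" % module
           try:
               with open(path) as param:
                   # Older kernels report "Y"/"N"; newer ones "1"/"0".
                   return param.read().strip() in ("Y", "1")
           except IOError:
               continue
       return False

   if __name__ == "__main__":
       print("nested KVM available:", nested_kvm_enabled())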
Diagram
~~~~~~~

.. figure:: figures/Specialized_OOO.png
   :width: 100%
@ -1,46 +0,0 @@
===========================
Software-defined networking
===========================

Software-defined networking (SDN) is the separation of the data
plane and control plane. SDN is a popular method of
managing and controlling packet flows within networks.
SDN uses overlays or directly controlled layer-2 devices to
determine flow paths, and as such presents challenges to a
cloud environment. Some designers may wish to run their
controllers within an OpenStack installation. Others may wish
to have their installations participate in an SDN-controlled network.
Challenges
~~~~~~~~~~

SDN is a relatively new concept that is not yet standardized,
so SDN systems come in a variety of different implementations.
Because of this, a truly prescriptive architecture is not feasible.
Instead, examine the differences between an existing and a planned
OpenStack design and determine where potential conflicts and gaps exist.
Possible solutions
~~~~~~~~~~~~~~~~~~

If an SDN implementation requires layer-2 access because it
directly manipulates switches, we do not recommend running an
overlay network or a layer-3 agent.
If the controller resides within an OpenStack installation,
it may be necessary to build an ML2 plug-in and schedule the
controller instances to connect to project VLANs so that they can
talk directly to the switch hardware.
Alternatively, depending on the external device support,
use a tunnel that terminates at the switch hardware itself.
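For the VLAN approach, something like the following sketch creates a
provider VLAN network that the controller instances can attach to; the
physical network name and segment ID are illustrative assumptions for
a hypothetical deployment.

.. code-block:: python

   # Hypothetical: create a provider VLAN network for SDN controller
   # instances. Assumes python-openstackclient and admin credentials.
   import subprocess

   subprocess.check_call([
       "openstack", "network", "create",
       "--provider-network-type", "vlan",
       "--provider-physical-network", "physnet1",  # assumed mapping
       "--provider-segment", "101",                # assumed VLAN ID
       "sdn-controller-net",
   ])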
Diagram
-------

OpenStack hosted SDN controller:

.. figure:: figures/Specialized_SDN_hosted.png

OpenStack participating in an SDN controller network:

.. figure:: figures/Specialized_SDN_external.png
@ -1,39 +0,0 @@
=================
Specialized cases
=================

.. toctree::
   :maxdepth: 2

   specialized-multi-hypervisor.rst
   specialized-networking.rst
   specialized-software-defined-networking.rst
   specialized-desktop-as-a-service.rst
   specialized-openstack-on-openstack.rst
   specialized-hardware.rst

Although most OpenStack architecture designs fall into one
of the seven major scenarios outlined in other sections
(compute focused, network focused, storage focused, general
purpose, multi-site, hybrid cloud, and massively scalable),
there are a few use cases that do not fit into these categories.
This section discusses these specialized cases and provides some
additional details and design considerations for each use case:

* :doc:`Specialized networking <specialized-networking>`:
  describes running networking-oriented software that may involve reading
  packets directly from the wire or participating in routing protocols.
* :doc:`Software-defined networking (SDN)
  <specialized-software-defined-networking>`:
  describes both running an SDN controller from within OpenStack
  as well as participating in a software-defined network.
* :doc:`Desktop-as-a-Service <specialized-desktop-as-a-service>`:
  describes running a virtualized desktop environment in a cloud
  (:term:`Desktop-as-a-Service`).
  This applies to private and public clouds.
* :doc:`OpenStack on OpenStack <specialized-openstack-on-openstack>`:
  describes building a multi-tiered cloud by running OpenStack
  on top of an OpenStack installation.
* :doc:`Specialized hardware <specialized-hardware>`:
  describes the use of specialized hardware devices from within
  the OpenStack environment.
@ -1,440 +0,0 @@
Architecture
~~~~~~~~~~~~

Consider the following factors when selecting storage hardware:

* Cost

* Performance

* Reliability

Storage-focused OpenStack clouds must address I/O intensive workloads.
These workloads are not CPU intensive, nor are they consistently network
intensive. The network may be heavily utilized to transfer storage data,
but these workloads are not otherwise network intensive.

The selection of storage hardware determines the overall performance and
scalability of a storage-focused OpenStack design architecture. Several
factors impact the design process, including:
Cost
   The cost of components affects which storage architecture and
   hardware you choose.

Performance
   The latency of storage I/O requests indicates performance.
   Performance requirements affect which solution you choose.

Scalability
   Scalability refers to how the storage solution performs as it
   expands to its maximum size. Storage solutions that perform well in
   small configurations but have degraded performance in large
   configurations are not scalable. A solution that performs well at
   maximum expansion is scalable. Large deployments require a storage
   solution that performs well as it expands.
Latency is a key consideration in a storage-focused OpenStack cloud.
Using solid-state disks (SSDs) to minimize latency and reduce CPU
delays caused by waiting for the storage increases performance. Use
RAID controller cards in compute hosts to improve the performance of the
underlying disk subsystem.

Depending on the storage architecture, you can adopt a scale-out
solution, or use a highly expandable and scalable centralized storage
array. If a centralized storage array is the right fit for your
requirements, then the array vendor determines the hardware selection.
It is possible to build a storage array using commodity hardware with
Open Source software, but doing so requires people with expertise to
build such a system.

On the other hand, a scale-out storage solution that uses
direct-attached storage (DAS) in the servers may be an appropriate
choice. This requires configuration of the server hardware to support
the storage solution.
Considerations affecting storage architecture (and corresponding storage
hardware) of a storage-focused OpenStack cloud include:

Connectivity
   Based on the selected storage solution, ensure the connectivity
   matches the storage solution requirements. We recommend confirming
   that the network characteristics minimize latency to boost the
   overall performance of the design.

Latency
   Determine if the use case has consistent or highly variable latency.

Throughput
   Ensure that the storage solution throughput is optimized for your
   application requirements.

Server hardware
   Use of DAS impacts the server hardware choice and affects host
   density, instance density, power density, OS-hypervisor, and
   management tools.
Compute (server) hardware selection
-----------------------------------

Four opposing factors determine the compute (server) hardware selection:

Server density
   A measure of how many servers can fit into a given measure of
   physical space, such as a rack unit [U].

Resource capacity
   The number of CPU cores, how much RAM, or how much storage a given
   server delivers.

Expandability
   The number of additional resources you can add to a server before it
   reaches capacity.

Cost
   The relative cost of the hardware weighed against the level of
   design effort needed to build the system.
You must weigh the dimensions against each other to determine the best
design for the desired purpose. For example, increasing server density
can mean sacrificing resource capacity or expandability. Increasing
resource capacity and expandability can increase cost but decrease
server density. Decreasing cost often means decreasing supportability,
server density, resource capacity, and expandability.

Compute capacity (CPU cores and RAM capacity) is a secondary
consideration for selecting server hardware. The required
server hardware must supply adequate CPU sockets, additional CPU cores,
and more RAM; network connectivity and storage capacity are not as
critical. The hardware needs to provide enough network connectivity and
storage capacity to meet the user requirements, however they are not the
primary consideration.
Some server hardware form factors are better suited to storage-focused
designs than others. The following is a list of these form factors:

* Most blade servers support dual-socket multi-core CPUs. Choose either
  full width or full height blades to avoid this CPU limit. High density
  blade servers support up to 16 servers in only 10 rack units using
  half height or half width blades.

  .. warning::

     This decreases density by 50% (only 8 servers in 10 U) if a full
     width or full height option is used.

* 1U rack-mounted servers can offer greater server
  density than a blade server solution, but are often limited to
  dual-socket, multi-core CPU configurations.

  .. note::

     Due to cooling requirements, it is rare to see 1U rack-mounted
     servers with more than 2 CPU sockets.

  To obtain greater than dual-socket support in a 1U rack-mount form
  factor, customers need to buy their systems from Original Design
  Manufacturers (ODMs) or second-tier manufacturers.

  .. warning::

     This may cause issues for organizations that have preferred
     vendor policies or concerns with support and hardware warranties
     of non-tier 1 vendors.

* 2U rack-mounted servers provide quad-socket, multi-core CPU support,
  but with a corresponding decrease in server density (half the density
  offered by 1U rack-mounted servers).

* Larger rack-mounted servers, such as 4U servers, often provide even
  greater CPU capacity, commonly supporting four or even eight CPU
  sockets. These servers have greater expandability, but such servers
  have much lower server density and usually greater hardware cost.

* Rack-mounted servers that support multiple independent servers in a
  single 2U or 3U enclosure, "sled servers", deliver increased density
  as compared to typical 1U or 2U rack-mounted servers.
Other factors that influence server hardware selection for a
storage-focused OpenStack design architecture include:

Instance density
   In this architecture, instance density and CPU-RAM oversubscription
   are lower. You require more hosts to support the anticipated scale,
   especially if the design uses dual-socket hardware designs.

Host density
   Another option to address the higher host count is to use a
   quad-socket platform. Taking this approach decreases host density,
   which in turn increases rack count. This configuration affects the
   number of power connections and also impacts network and cooling
   requirements.

Power and cooling density
   The power and cooling density requirements might be lower than with
   blade, sled, or 1U server designs due to lower host density (by
   using 2U, 3U or even 4U server designs). For data centers with older
   infrastructure, this might be a desirable feature.

Storage-focused OpenStack design architecture server hardware selection
should focus on a "scale-up" versus "scale-out" solution. The
determination of which is the best solution (a smaller number of larger
hosts or a larger number of smaller hosts) depends on a combination of
factors including cost, power, cooling, physical rack and floor space,
support-warranty, and manageability.
Networking hardware selection
-----------------------------

Key considerations for the selection of networking hardware include:

Port count
   The user requires networking hardware that has the requisite port
   count.

Port density
   The physical space required to provide the requisite port count
   affects the network design. A switch that provides 48 10 GbE ports
   in 1U has a much higher port density than a switch that provides 24
   10 GbE ports in 2U. On a general scale, a higher port density leaves
   more rack space for compute or storage components, which is
   preferred. It is also important to consider fault domains and power
   density. Finally, higher density switches are more expensive,
   therefore it is important not to over-design the network.

Port speed
   The networking hardware must support the proposed network speed, for
   example: 1 GbE, 10 GbE, or 40 GbE (or even 100 GbE).

Redundancy
   User requirements for high availability and cost considerations
   influence the required level of network hardware redundancy. Achieve
   network redundancy by adding redundant power supplies or paired
   switches.

   .. note::

      If this is a requirement, the hardware must support this
      configuration. User requirements determine if a completely
      redundant network infrastructure is required.

Power requirements
   Ensure that the physical data center provides the necessary power
   for the selected network hardware. This is not an issue for top of
   rack (ToR) switches, but may be an issue for spine switches in a
   leaf and spine fabric, or end of row (EoR) switches.

Protocol support
   It is possible to gain more performance out of a single storage
   system by using specialized network technologies such as RDMA, SRP,
   iSER, and SCST. The specifics for using these technologies are beyond
   the scope of this book.
Software selection
------------------

Factors that influence the software selection for a storage-focused
OpenStack architecture design include:

* Operating system (OS) and hypervisor

* OpenStack components

* Supplemental software

Design decisions made in each of these areas impact the rest of the
OpenStack architecture design.
Operating system and hypervisor
-------------------------------

Operating system (OS) and hypervisor have a significant impact on the
overall design and also affect server hardware selection. Ensure the
selected operating system and hypervisor combination support the storage
hardware and work with the networking hardware selection and topology.

Operating system and hypervisor selection affect the following areas:

Cost
   Selecting a commercially supported hypervisor, such as Microsoft
   Hyper-V, results in a different cost model than a
   community-supported open source hypervisor like KVM or Xen.
   Similarly, choosing Ubuntu over Red Hat (or vice versa) impacts cost
   due to support contracts. However, business or application
   requirements might dictate a specific or commercially supported
   hypervisor.

Supportability
   Staff must have training with the chosen hypervisor. Consider the
   cost of training when choosing a solution. The support of a
   commercial product such as Red Hat, SUSE, or Windows, is the
   responsibility of the OS vendor. If an open source platform is
   chosen, the support comes from in-house resources.

Management tools
   Ubuntu and KVM use different management tools than VMware
   vSphere. Although both OS and hypervisor combinations are supported
   by OpenStack, there are varying impacts to the rest of the design as
   a result of the selection of one combination versus the other.

Scale and performance
   Ensure the selected OS and hypervisor combination meet the
   appropriate scale and performance requirements needed for this
   storage-focused OpenStack cloud. The chosen architecture must meet
   the targeted instance-host ratios with the selected OS-hypervisor
   combination.

Security
   Ensure the design can accommodate the regular periodic installation
   of application security patches while maintaining the required
   workloads. The frequency of security patches for the proposed
   OS-hypervisor combination impacts performance and the patch
   installation process could affect maintenance windows.

Supported features
   Selecting the OS-hypervisor combination often determines the
   required features of OpenStack. Certain features are only available
   with specific OSes or hypervisors. For example, if certain features
   are not available, you might need to modify the design to meet user
   requirements.

Interoperability
   The OS-hypervisor combination should be chosen based on the
   interoperability with one another, and other OS-hypervisor
   combinations. Operational and troubleshooting tools for one
   OS-hypervisor combination may differ from the tools used for another
   OS-hypervisor combination. As a result, the design must address if
   the two sets of tools need to interoperate.
OpenStack components
--------------------

The OpenStack components you choose can have a significant impact on the
overall design. While there are certain components that are always
present (Compute and Image service, for example), there are other
services that may not be required. As an example, a certain design may
not require the Orchestration service. Omitting Orchestration would not
typically have a significant impact on the overall design; however, if
the architecture uses a replacement for OpenStack Object Storage for its
storage component, this could potentially have significant impacts on
the rest of the design.

A storage-focused design might require the ability to use Orchestration
to launch instances with Block Storage volumes to perform
storage-intensive processing.

A storage-focused OpenStack design architecture uses the following
components:

* OpenStack Identity (keystone)

* OpenStack dashboard (horizon)

* OpenStack Compute (nova) (including the use of multiple hypervisor
  drivers)

* OpenStack Object Storage (swift) (or another object storage solution)

* OpenStack Block Storage (cinder)

* OpenStack Image service (glance)

* OpenStack Networking (neutron) or legacy networking (nova-network)
Excluding certain OpenStack components may limit or constrain the
functionality of other components. If a design opts to include
Orchestration but exclude Telemetry, then the design cannot take
advantage of Orchestration's auto scaling functionality (which relies on
information from Telemetry). Because you can use Orchestration to spin
up a large number of instances to perform storage-intensive processing,
we strongly recommend including Orchestration in a storage-focused
architecture design.
Networking software
-------------------

OpenStack Networking (neutron) provides a wide variety of networking
services for instances. There are many additional networking software
packages that may be useful to manage the OpenStack components
themselves. Some examples include HAProxy, Keepalived, and various
routing daemons (like Quagga). The OpenStack High Availability Guide
describes some of these software packages, HAProxy in particular. See
the `Network controller cluster stack
chapter <https://docs.openstack.org/ha-guide/networking-ha.html>`_ of
the OpenStack High Availability Guide.
Management software
-------------------

Management software includes software for providing:

* Clustering

* Logging

* Monitoring

* Alerting

.. important::

   The factors for determining which software packages in this category
   to select are outside the scope of this design guide.

The availability design requirements determine the selection of
clustering software, such as Corosync or Pacemaker. The availability of
the cloud infrastructure and the complexity of supporting the
configuration after deployment determine the impact of including these
software packages. The OpenStack High Availability Guide provides more
details on the installation and configuration of Corosync and Pacemaker.

Operational considerations determine the requirements for logging,
monitoring, and alerting. Each of these sub-categories includes options.
For example, in the logging sub-category you could select Logstash,
Splunk, Log Insight, or another log aggregation-consolidation tool.
Store logs in a centralized location to facilitate performing analytics
against the data. Log data analytics engines can also provide automation
and issue notification by providing a mechanism to both alert and
automatically attempt to remediate some of the more commonly known
issues.

If you require any of these software packages, the design must account
for the additional resource consumption. Some other potential design
impacts include:

* OS-hypervisor combination: Ensure that the selected logging,
  monitoring, or alerting tools support the proposed OS-hypervisor
  combination.

* Network hardware: The network hardware selection needs to be
  supported by the logging, monitoring, and alerting software.
Database software
-----------------

Most OpenStack components require access to back-end database services
to store state and configuration information. Choose an appropriate
back-end database which satisfies the availability and fault tolerance
requirements of the OpenStack services.

MySQL is the default database for OpenStack, but other compatible
databases are available.

.. note::

   Telemetry uses MongoDB.

The chosen high availability database solution changes according to the
selected database. MySQL, for example, provides several options. Use a
replication technology such as Galera for active-active clustering. For
active-passive use some form of shared storage. Each of these potential
solutions has an impact on the design:

* Solutions that employ Galera/MariaDB require at least three MySQL
  nodes (see the sketch after this list).

* MongoDB has its own design considerations for high availability.

* OpenStack design, generally, does not include shared storage.
  However, for some high availability designs, certain components might
  require it depending on the specific implementation.
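As a minimal sketch of checking that three-node minimum, the following
hypothetical monitor queries Galera's ``wsrep_cluster_size`` status
variable; the host name and credentials are illustrative assumptions,
and the PyMySQL driver is assumed to be installed.

.. code-block:: python

   # Confirm a Galera cluster has enough members before relying on it
   # for active-active database service.
   import pymysql

   def galera_cluster_size(host, user, password):
       conn = pymysql.connect(host=host, user=user, password=password)
       try:
           with conn.cursor() as cursor:
               cursor.execute("SHOW STATUS LIKE 'wsrep_cluster_size'")
               _name, value = cursor.fetchone()
               return int(value)
       finally:
           conn.close()

   if __name__ == "__main__":
       size = galera_cluster_size("db.example.com", "monitor", "secret")
       assert size >= 3, "Galera needs at least three nodes"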
@ -1,252 +0,0 @@
Operational Considerations
~~~~~~~~~~~~~~~~~~~~~~~~~~

Several operational factors affect the design choices for a
storage-focused cloud. Operations staff receive tasks regarding the
maintenance of cloud environments for larger installations, including:

Maintenance tasks
   The storage solution should take into account storage maintenance
   and the impact on underlying workloads.

Reliability and availability
   Reliability and availability depend on wide area network
   availability and on the level of precautions taken by the service
   provider.

Flexibility
   Organizations need to have the flexibility to choose between
   off-premise and on-premise cloud storage options. Relevant decision
   criteria, with potential cost savings, include continuity of
   operations, disaster recovery, security, and records retention laws,
   regulations, and policies.
Monitoring and alerting services are vital in cloud environments with
high demands on storage resources. These services provide a real-time
view into the health and performance of the storage systems. An
integrated management console, or other dashboards capable of
visualizing SNMP data, is helpful when discovering and resolving issues
that arise within the storage cluster.

A storage-focused cloud design should include:

* Monitoring of physical hardware resources.

* Monitoring of environmental resources such as temperature and
  humidity.

* Monitoring of storage resources such as available storage, memory,
  and CPU (see the sketch after this list).

* Monitoring of advanced storage performance data to ensure that
  storage systems are performing as expected.

* Monitoring of network resources for service disruptions which would
  affect access to storage.

* Centralized log collection.

* Log analytics capabilities.

* Ticketing system (or integration with a ticketing system) to track
  issues.

* Alerting and notification of responsible teams or automated systems
  which remediate problems with storage as they arise.

* Network Operations Center (NOC) staffed and always available to
  resolve issues.
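A minimal sketch of the "available storage" monitoring item, assuming
the mount points listed are hypothetical object storage data disks, is
shown below; a real deployment would feed this into its alerting
system rather than print to the console.

.. code-block:: python

   # Alert when a monitored mount point drops below a free-space floor.
   import shutil

   MOUNTS = ["/srv/node/sda", "/srv/node/sdb"]  # illustrative disks
   THRESHOLD = 0.10  # alert below 10% free space

   def check(mounts=MOUNTS):
       alerts = []
       for mount in mounts:
           usage = shutil.disk_usage(mount)
           free_ratio = usage.free / usage.total
           if free_ratio < THRESHOLD:
               alerts.append((mount, free_ratio))
       return alerts

   if __name__ == "__main__":
       for mount, ratio in check():
           print("ALERT: %s has only %.1f%% free" % (mount, ratio * 100))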
Application awareness
---------------------

Well-designed applications should be aware of underlying storage
subsystems in order to use cloud storage solutions effectively.

If native replication is not available, operations personnel must be
able to modify the application so that it provides its own replication
service. An application designed to detect underlying storage systems
can function in a wide variety of infrastructures, and still have the
same basic behavior regardless of the differences in the underlying
infrastructure.
Fault tolerance and availability
--------------------------------

Designing for fault tolerance and availability of storage systems in an
OpenStack cloud is vastly different when comparing the Block Storage and
Object Storage services.

Block Storage fault tolerance and availability
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Configure Block Storage resource nodes with advanced RAID controllers
and high performance disks to provide fault tolerance at the hardware
level.

Deploy high performing storage solutions such as SSD disk drives or
flash storage systems for applications requiring extreme performance out
of Block Storage devices.
In environments that place extreme demands on Block Storage, we
recommend using multiple storage pools. In this case, each pool of
devices should have a similar hardware design and disk configuration
across all hardware nodes in that pool. This allows for a design that
provides applications with access to a wide variety of Block Storage
pools, each with their own redundancy, availability, and performance
characteristics. When deploying multiple pools of storage it is also
important to consider the impact on the Block Storage scheduler which is
responsible for provisioning storage across resource nodes. Ensuring
that applications can schedule volumes in multiple regions, each with
their own network, power, and cooling infrastructure, can give projects
the ability to build fault tolerant applications that are distributed
across multiple availability zones.
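One common way to expose such pools, sketched below under the
assumption of two hypothetical back ends named ``ssd-pool`` and
``capacity-pool``, is to map each back end to its own volume type so
that the scheduler can place volumes accordingly.

.. code-block:: python

   # Hypothetical: create one volume type per Block Storage back end.
   # Assumes python-openstackclient and admin credentials.
   import subprocess

   POOLS = {"ssd": "ssd-pool", "capacity": "capacity-pool"}

   for type_name, backend in POOLS.items():
       subprocess.check_call([
           "openstack", "volume", "type", "create", type_name,
           "--property", "volume_backend_name=%s" % backend,
       ])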
In addition to the Block Storage resource nodes, it is important to
design for high availability and redundancy of the APIs, and related
services that are responsible for provisioning and providing access to
storage. We recommend designing a layer of hardware or software load
balancers in order to achieve high availability of the appropriate REST
API services to provide uninterrupted service. In some cases, it may
also be necessary to deploy an additional layer of load balancing to
provide access to back-end database services responsible for servicing
and storing the state of Block Storage volumes. We also recommend
designing a highly available database solution to store the Block
Storage databases. Leverage highly available database solutions such as
Galera and MariaDB to help keep database services online for
uninterrupted access, so that projects can manage Block Storage volumes.
In a cloud with extreme demands on Block Storage, the network
architecture should take into account the amount of East-West bandwidth
required for instances to make use of the available storage resources.
The selected network devices should support jumbo frames for
transferring large blocks of data. In some cases, it may be necessary to
create an additional back-end storage network dedicated to providing
connectivity between instances and Block Storage resources so that there
is no contention of network resources.
Object Storage fault tolerance and availability
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

While consistency and partition tolerance are both inherent features of
the Object Storage service, it is important to design the overall
storage architecture to ensure that the implemented system meets those
goals. The OpenStack Object Storage service places a specific number of
data replicas as objects on resource nodes. These replicas are
distributed throughout the cluster based on a consistent hash ring which
exists on all nodes in the cluster.

Design the Object Storage system with a sufficient number of zones to
provide quorum for the number of replicas defined. For example, with
three replicas configured in the Swift cluster, the recommended number
of zones to configure within the Object Storage cluster in order to
achieve quorum is five. While it is possible to deploy a solution with
fewer zones, the implied risk of doing so is that some data may not be
available and API requests to certain objects stored in the cluster
might fail. For this reason, ensure you properly account for the number
of zones in the Object Storage cluster.
Each Object Storage zone should be self-contained within its own
availability zone. Each availability zone should have independent access
to network, power and cooling infrastructure to ensure uninterrupted
access to data. In addition, a pool of Object Storage proxy servers
providing access to data stored on the object nodes should service each
availability zone. Object proxies in each region should leverage local
read and write affinity so that local storage resources facilitate
access to objects wherever possible. We recommend deploying upstream
load balancing to ensure that proxy services are distributed across the
multiple zones and, in some cases, it may be necessary to make use of
third-party solutions to aid with geographical distribution of services.
A zone within an Object Storage cluster is a logical division. Any of
the following may represent a zone:

* A disk within a single node

* One zone per node

* Zone per collection of nodes

* Multiple racks

* Multiple data centers (DCs)
Selecting the proper zone design is crucial for allowing the Object
Storage cluster to scale while providing an available and redundant
storage system. It may be necessary to configure storage policies that
have different requirements with regard to replicas, retention, and
other factors that could heavily affect the design of storage in a
specific zone.
Scaling storage services
------------------------

Adding storage capacity and bandwidth is a very different process when
comparing the Block and Object Storage services. While adding Block
Storage capacity is a relatively simple process, adding capacity and
bandwidth to the Object Storage systems is a complex task that requires
careful planning and consideration during the design phase.
Scaling Block Storage
^^^^^^^^^^^^^^^^^^^^^

You can upgrade Block Storage pools to add storage capacity without
interrupting the overall Block Storage service. Add nodes to the pool by
installing and configuring the appropriate hardware and software and
then allowing that node to report in to the proper storage pool via the
message bus. This is because Block Storage nodes report into the
scheduler service advertising their availability. After the node is
online and available, projects can make use of those storage resources
instantly.
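A quick way to confirm that a new node has reported in, sketched below
with an assumed ``python-openstackclient`` installation and admin
credentials, is to list the volume services and their states.

.. code-block:: python

   # List Block Storage services so a newly added node can be verified.
   import json
   import subprocess

   output = subprocess.check_output(
       ["openstack", "volume", "service", "list", "-f", "json"])

   for service in json.loads(output):
       # Each entry reports the binary, the host it runs on, and state.
       print(service["Binary"], service["Host"], service["State"])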
In some cases, the demand on Block Storage from instances may exhaust
the available network bandwidth. As a result, design network
infrastructure that services Block Storage resources in such a way that
you can add capacity and bandwidth easily. This often involves the use
of dynamic routing protocols or advanced networking solutions to add
capacity to downstream devices easily. Both the front-end and back-end
storage network designs should encompass the ability to quickly and
easily add capacity and bandwidth.
Scaling Object Storage
^^^^^^^^^^^^^^^^^^^^^^

Adding back-end storage capacity to an Object Storage cluster requires
careful planning and consideration. In the design phase, it is important
to determine the maximum partition power required by the Object Storage
service, which determines the maximum number of partitions which can
exist. Object Storage distributes data among all available storage, but
a partition cannot span more than one disk, although a disk can have
multiple partitions.

For example, a system that starts with a single disk and a partition
power of 3 can have 8 (2^3) partitions. Adding a second disk means that
each has 4 partitions. The one-disk-per-partition limit means that this
system can never have more than 8 partitions, limiting its scalability.
However, a system that starts with a single disk and a partition power
of 10 can have up to 1024 (2^10) partitions.
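The arithmetic above is simply powers of two, as the short sketch
below illustrates; the trailing comment shows an equivalent
``swift-ring-builder`` invocation, which fixes the partition power
when the ring is first created.

.. code-block:: python

   # An Object Storage ring with partition power P has 2**P partitions.
   def max_partitions(partition_power):
       return 2 ** partition_power

   for power in (3, 10):
       print("partition power %2d -> %4d partitions"
             % (power, max_partitions(power)))

   # The partition power is fixed when the ring is first built, e.g.:
   #   swift-ring-builder object.builder create 10 3 1
   # creates a ring with partition power 10, 3 replicas, and a one-hour
   # minimum between moves of a given partition.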
As you add back-end storage capacity to the system, the partition maps
redistribute data amongst the storage nodes. In some cases, this
replication consists of extremely large data sets. In these cases, we
recommend using back-end replication links that do not contend with
projects' access to data.
As more projects begin to access data within the cluster and their data
sets grow, it is necessary to add front-end bandwidth to service data
access requests. Adding front-end bandwidth to an Object Storage cluster
requires careful planning and design of the Object Storage proxies that
projects use to gain access to the data, along with the high availability
solutions that enable easy scaling of the proxy layer. We recommend
designing a front-end load balancing layer that projects and consumers
use to gain access to data stored within the cluster. This load
balancing layer may be distributed across zones, regions or even across
geographic boundaries, which may also require that the design encompass
geo-location solutions.

In some cases, you must add bandwidth and capacity to the network
resources servicing requests between proxy servers and storage nodes.
For this reason, the network architecture used for access to storage
nodes and proxy servers should use a scalable design.
@ -1,142 +0,0 @@
Prescriptive Examples
~~~~~~~~~~~~~~~~~~~~~

Storage-focused architecture depends on specific use cases. This section
discusses three example use cases:

* An object store with a RESTful interface

* Compute analytics with parallel file systems

* High performance database

The example below shows a REST interface without a high performance
requirement.

Swift is a highly scalable object store that is part of the OpenStack
project. The following diagram shows the example architecture:

.. figure:: figures/Storage_Object.png

The example REST interface, presented as a traditional Object store
running on traditional spindles, does not require a high performance
caching tier.

This example uses the following components:
Network:

* 10 GbE horizontally scalable spine leaf back-end storage and
  front-end network.

Storage hardware:

* 10 storage servers each with 12x4 TB disks equaling 480 TB total
  space with approximately 160 TB of usable space after replicas.

Proxy:

* 3x proxies

* 2x10 GbE bonded front end

* 2x10 GbE back-end bonds

* Approximately 60 Gb of total bandwidth to the back-end storage
  cluster
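The capacity and bandwidth figures above follow from simple
arithmetic, reproduced in the sketch below; the three-replica count is
an assumption consistent with the quoted usable-space figure.

.. code-block:: python

   # Sizing arithmetic for the example object store above.
   servers, disks_per_server, tb_per_disk = 10, 12, 4
   replicas = 3

   raw_tb = servers * disks_per_server * tb_per_disk  # 480 TB raw
   usable_tb = raw_tb / replicas                      # ~160 TB usable
   print("raw: %d TB, usable: %.0f TB" % (raw_tb, usable_tb))

   # Each of the 3 proxies has a 2x10 GbE bond toward the storage
   # network, giving the quoted aggregate back-end bandwidth.
   proxies, links_per_bond, gbe_per_link = 3, 2, 10
   print("back-end bandwidth: %d Gb"
         % (proxies * links_per_bond * gbe_per_link))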
.. note::

   It may be necessary to implement a third-party caching layer for some
   applications to achieve suitable performance.
Compute analytics with Data processing service
----------------------------------------------

Analytics of large data sets are dependent on the performance of the
storage system. Clouds using storage systems such as Hadoop Distributed
File System (HDFS) have inefficiencies which can cause performance
issues.

One potential solution to this problem is the implementation of storage
systems designed for performance. Parallel file systems have previously
filled this need in the HPC space and are suitable for large scale
performance-oriented systems.

OpenStack has integration with Hadoop to manage the Hadoop cluster
within the cloud. The following diagram shows an OpenStack store with a
high performance requirement:

.. figure:: figures/Storage_Hadoop3.png

The hardware requirements and configuration are similar to those of the
High Performance Database example below. In this case, the architecture
uses Ceph's Swift-compatible REST interface, with features that allow a
caching pool to be connected to accelerate the presented pool.
High performance database with Database service
-----------------------------------------------

Databases are a common workload that benefits from high performance
storage back ends. Although enterprise storage is not a requirement,
many environments have existing storage that an OpenStack cloud can use
as back ends. You can create a storage pool to provide block devices
with OpenStack Block Storage for instances as well as object interfaces.
In this example, the database I/O requirements are high and demand
storage presented from a fast SSD pool.

A storage system presents a LUN backed by a set of SSDs using a
traditional storage array with OpenStack Block Storage integration or a
storage platform such as Ceph or Gluster.

This system can provide additional performance. For example, in the
database example below, a portion of the SSD pool can act as a block
device to the Database server. In the high performance analytics
example, the inline SSD cache layer accelerates the REST interface.

.. figure:: figures/Storage_Database_+_Object5.png
In this example, Ceph presents a Swift-compatible REST interface, as
well as block level storage from a distributed storage cluster. It is
highly flexible and has features that enable reduced cost of operations
such as self healing and auto balancing. Using erasure coded pools is a
suitable way of maximizing the amount of usable space.

.. note::

   There are special considerations around erasure coded pools. For
   example, higher computational requirements and limitations on the
   operations allowed on an object; erasure coded pools do not support
   partial writes.
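The space savings follow from the erasure-coding layout: a k+m pool
stores k data chunks plus m coding chunks per object, so the usable
fraction is k/(k+m) rather than 1/replicas. A short illustration, with
the 8+3 profile chosen purely as an example:

.. code-block:: python

   # Usable-capacity comparison: replication versus erasure coding.
   def usable_fraction_replicated(replicas):
       return 1.0 / replicas

   def usable_fraction_ec(k, m):
       return float(k) / (k + m)

   raw_tb = 480
   print("3x replication: %.0f TB usable"
         % (raw_tb * usable_fraction_replicated(3)))
   print("EC 8+3:         %.0f TB usable"
         % (raw_tb * usable_fraction_ec(8, 3)))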
Using Ceph as an applicable example, a potential architecture would have
the following requirements:

Network:

* 10 GbE horizontally scalable spine leaf back-end storage and
  front-end network

Storage hardware:

* 5 storage servers for caching layer 24x1 TB SSD

* 10 storage servers each with 12x4 TB disks, equaling 480 TB total
  space with approximately 160 TB of usable space after 3 replicas

REST proxy:

* 3x proxies

* 2x10 GbE bonded front end

* 2x10 GbE back-end bonds

* Approximately 60 Gb of total bandwidth to the back-end storage
  cluster
Using an SSD cache layer, you can present block devices directly to
hypervisors or instances. The REST interface can also use the SSD cache
systems as an inline cache.
@ -1,62 +0,0 @@
Technical considerations
~~~~~~~~~~~~~~~~~~~~~~~~

Some of the key technical considerations that are critical to a
storage-focused OpenStack design architecture include:

Input-Output requirements
   Input-Output performance requirements require researching and
   modeling before deciding on a final storage framework. Running
   benchmarks for Input-Output performance provides a baseline for
   expected performance levels (see the sketch after this list). If
   these tests include details, then the resulting data can help model
   behavior and results during different workloads. Running scripted
   smaller benchmarks during the lifecycle of the architecture helps
   record the system health at different points in time. The data from
   these scripted benchmarks assists in future scoping and in gaining a
   deeper understanding of an organization's needs.

Scale
   Scaling storage solutions in a storage-focused OpenStack
   architecture design is driven by initial requirements, including
   :term:`IOPS <Input/output Operations Per Second (IOPS)>`, capacity,
   bandwidth, and future needs. Planning capacity based on projected
   needs over the course of a budget cycle is important for a design.
   The architecture should balance cost and capacity, while also
   allowing flexibility to implement new technologies and methods as
   they become available.

Security
   Designing security around data has multiple points of focus that
   vary depending on SLAs, legal requirements, industry regulations,
   and certifications needed for systems or people. Consider compliance
   with HIPAA, ISO 9000, and SOX based on the type of data. For certain
   organizations, multiple levels of access control are important.

OpenStack compatibility
   Interoperability and integration with OpenStack can be paramount in
   deciding on a storage hardware and storage management platform.
   Interoperability and integration includes factors such as OpenStack
   Block Storage interoperability, OpenStack Object Storage
   compatibility, and hypervisor compatibility (which affects the
   ability to use storage for ephemeral instance storage).

Storage management
   You must address a range of storage management-related
   considerations in the design of a storage-focused OpenStack cloud.
   These considerations include, but are not limited to, backup
   strategy (and restore strategy, since a backup that cannot be
   restored is useless), data valuation-hierarchical storage
   management, retention strategy, data placement, and workflow
   automation.

Data grids
   Data grids are helpful when answering questions around data
   valuation. Data grids improve decision making through correlation of
   access patterns, ownership, and business-unit revenue with other
   metadata values to deliver actionable information about data.
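As a minimal sketch of the Input-Output benchmarking mentioned in the
list above, the snippet below drives the ``fio`` tool (assumed to be
installed) with an illustrative random-read job; the job parameters
are assumptions to adapt, not recommended values.

.. code-block:: python

   # Run a small scripted fio benchmark and keep the output for trend
   # analysis over the life of the architecture.
   import subprocess

   result = subprocess.check_output([
       "fio",
       "--name=randread",   # job name
       "--rw=randread",     # random-read workload
       "--bs=4k",           # 4 KiB block size
       "--size=1G",         # per-job data set
       "--numjobs=4",       # concurrent workers
       "--runtime=60",      # seconds
       "--time_based",
   ])
   with open("fio-baseline.txt", "wb") as log:
       log.write(result)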
When building a storage-focused OpenStack architecture, strive to build
a flexible design based on an industry standard core. One way of
accomplishing this might be through the use of different back ends
serving different use cases.
@ -1,61 +0,0 @@
===============
Storage focused
===============

.. toctree::
   :maxdepth: 2

   storage-focus-technical-considerations.rst
   storage-focus-operational-considerations.rst
   storage-focus-architecture.rst
   storage-focus-prescriptive-examples.rst

Cloud storage is a model of data storage that stores digital data in
logical pools and physical storage that spans across multiple servers
and locations. Cloud storage commonly refers to a hosted object storage
service; however, the term also includes other types of data storage
that are available as a service, for example block storage.

Cloud storage runs on virtualized infrastructure and resembles broader
cloud computing in terms of accessible interfaces, elasticity,
scalability, multi-tenancy, and metered resources. You can use cloud
storage services from an off-premises service or deploy on-premises.

Cloud storage consists of many distributed, synonymous resources, which
are often referred to as integrated storage clouds. Cloud storage is
highly fault tolerant through redundancy and the distribution of data.
It is highly durable through the creation of versioned copies, and can
be consistent with regard to data replicas.

At large scale, management of data operations is a resource intensive
process for an organization. Hierarchical storage management (HSM)
systems and data grids help annotate and report a baseline data
valuation to make intelligent decisions and automate data decisions. HSM
enables automated tiering and movement, as well as orchestration of data
operations. A data grid is an architecture, or an evolving set of
services, that enables users to manage large data sets.

Example applications deployed with cloud storage characteristics:

* Active archive, backups, and hierarchical storage management.

* General content storage and synchronization. An example of this is a
  private dropbox.

* Data analytics with parallel file systems.

* Unstructured data store for services. For example, social media
  back-end storage.

* Persistent block storage.

* Operating system and application image store.

* Media streaming.

* Databases.

* Content distribution.

* Cloud storage peering.