[arch-design] Migrate cloud architecture examples
1. Migrate and tidy up cloud architecture examples from the current guide
2. Migrate figures
3. Add placeholder sections for new content

Change-Id: I290f555f6e0cd4200deccb4d705127d99e61c343
Partial-Bug: #1548176
Implements: blueprint archguide-mitaka-reorg

doc/arch-design-draft/source/arch-examples-compute.rst (new file)
@@ -0,0 +1,126 @@

=============================
Compute-focused cloud example
=============================

The Conseil Européen pour la Recherche Nucléaire (CERN), also known as
the European Organization for Nuclear Research, provides particle
accelerators and other infrastructure for high-energy physics research.

As of 2011, CERN operated these two compute centers in Europe with plans
to add a third.

+-----------------------+------------------------+
| Data center           | Approximate capacity   |
+=======================+========================+
| Geneva, Switzerland   | - 3.5 megawatts        |
|                       |                        |
|                       | - 91000 cores          |
|                       |                        |
|                       | - 120 PB HDD           |
|                       |                        |
|                       | - 100 PB tape          |
|                       |                        |
|                       | - 310 TB memory        |
+-----------------------+------------------------+
| Budapest, Hungary     | - 2.5 megawatts        |
|                       |                        |
|                       | - 20000 cores          |
|                       |                        |
|                       | - 6 PB HDD             |
+-----------------------+------------------------+

To support a growing number of compute-heavy users of experiments
related to the Large Hadron Collider (LHC), CERN ultimately elected to
deploy an OpenStack cloud using Scientific Linux and RDO. This effort
aimed to simplify the management of the center's compute resources with
a view to doubling compute capacity through the addition of a data
center in 2013 while maintaining the same levels of compute staff.

The CERN solution uses :term:`cells <cell>` for segregation of compute
resources and for transparently scaling between different data centers.
This decision meant trading off support for security groups and live
migration. In addition, some details, such as flavors, must be manually
replicated across cells. In spite of these drawbacks, cells provide the
required scale while exposing a single public API endpoint to users.

CERN created a compute cell for each of the two original data centers
and created a third when it added a new data center in 2013. Each cell
contains three availability zones to further segregate compute resources
and at least three RabbitMQ message brokers configured for clustering
with mirrored queues for high availability.

The API cell, which resides behind a HAProxy load balancer, is in the
data center in Switzerland and directs API calls to compute cells using
a customized variation of the cell scheduler. The customizations allow
certain workloads to route to a specific data center or to all data
centers, with cell RAM availability determining cell selection in the
latter case.

.. figure:: figures/Generic_CERN_Example.png

There is also some customization of the filter scheduler that handles
placement within the cells; a sketch of one such filter follows the
list:

ImagePropertiesFilter
    Provides special handling depending on the guest operating system in
    use (Linux-based or Windows-based).

ProjectsToAggregateFilter
    Provides special handling depending on which project the instance is
    associated with.

default_schedule_zones
    Allows the selection of multiple default availability zones, rather
    than a single default.
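
A minimal sketch of what a custom, project-aware filter of this kind
could look like is shown below. This is illustrative only and is not
CERN's actual code; the ``filter_tenant_id`` metadata key and the class
name are examples, and the exact base-class location and method
signature vary slightly between OpenStack releases.

.. code-block:: python

   from nova.scheduler import filters


   class ProjectsToAggregateFilter(filters.BaseHostFilter):
       """Only pass hosts whose aggregates allow the requesting project."""

       def host_passes(self, host_state, spec_obj):
           project_id = spec_obj.project_id
           for aggregate in host_state.aggregates:
               # Aggregates are tagged with a comma-separated list of
               # project IDs under an agreed metadata key.
               allowed = aggregate.metadata.get('filter_tenant_id', '')
               if project_id in allowed.split(','):
                   return True
           # No aggregate explicitly allows this project on this host.
           return False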

A central database team manages the MySQL database server in each cell
in an active/passive configuration with a NetApp storage back end.
Backups run every 6 hours.

Network architecture
~~~~~~~~~~~~~~~~~~~~

To integrate with existing networking infrastructure, CERN made
customizations to legacy networking (nova-network) in the form of a
driver that integrates with CERN's existing database for tracking MAC
and IP address assignments.

The driver facilitates selection of a MAC address and IP for new
instances based on the compute node where the scheduler places the
instance.

The driver considers the compute node where the scheduler placed an
instance and selects a MAC address and IP from the pre-registered list
associated with that node in the database. The database is then updated
to reflect the assignment of those addresses to that instance.
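
The sketch below illustrates the idea of the driver's allocation step.
It is not CERN's driver; the table layout and function name are invented
for the example, and SQLite stands in for CERN's network database.

.. code-block:: python

   import sqlite3


   def allocate_address(db_path, compute_node, instance_uuid):
       """Pick a free, pre-registered (MAC, IP) pair for the target node."""
       conn = sqlite3.connect(db_path)
       try:
           row = conn.execute(
               "SELECT mac, ip FROM registered_addresses "
               "WHERE compute_node = ? AND instance_uuid IS NULL LIMIT 1",
               (compute_node,)).fetchone()
           if row is None:
               raise LookupError("no free address for %s" % compute_node)
           mac, ip = row
           # Record the assignment so the pair is not handed out twice.
           conn.execute(
               "UPDATE registered_addresses SET instance_uuid = ? "
               "WHERE mac = ? AND ip = ?", (instance_uuid, mac, ip))
           conn.commit()
           return mac, ip
       finally:
           conn.close()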

Storage architecture
~~~~~~~~~~~~~~~~~~~~

CERN deploys the OpenStack Image service in the API cell and configures
it to expose version 1 (V1) of the API. This also requires the image
registry. The storage back end in use is a 3 PB Ceph cluster.

CERN maintains a small set of Scientific Linux 5 and 6 images onto which
orchestration tools can place applications. Puppet manages instance
configuration and customization.

Monitoring
~~~~~~~~~~

CERN does not require direct billing, but uses the Telemetry service to
perform metering for the purposes of adjusting project quotas. CERN uses
a sharded, replicated MongoDB back end. To spread API load, CERN
deploys instances of the nova-api service within the child cells for
Telemetry to query against. This also requires the configuration of
supporting services such as keystone, glance-api, and glance-registry in
the child cells.

.. figure:: figures/Generic_CERN_Architecture.png

Additional monitoring tools in use include
`Flume <http://flume.apache.org/>`_,
`Elasticsearch <http://www.elasticsearch.org/>`_,
`Kibana <http://www.elasticsearch.org/overview/kibana/>`_, and the
CERN-developed `Lemon <http://lemon.web.cern.ch/lemon/index.shtml>`_
project.

doc/arch-design-draft/source/arch-examples-general.rst (new file)
@@ -0,0 +1,85 @@

=====================
General cloud example
=====================

An online classified advertising company wants to run web applications
consisting of Tomcat, Nginx, and MariaDB in a private cloud. To meet
policy requirements, the cloud infrastructure will run in their own data
center. The company has predictable load requirements, but requires
scaling to cope with nightly increases in demand. Their current
environment does not have the flexibility to align with their goal of
running an open source API environment. The current environment consists
of the following:

* Between 120 and 140 installations of Nginx and Tomcat, each with 2
  vCPUs and 4 GB of RAM

* A three-node MariaDB and Galera cluster, each node with 4 vCPUs and
  8 GB RAM

The company runs hardware load balancers and multiple web applications
serving their websites, and orchestrates environments using combinations
of scripts and Puppet. The website generates large amounts of log data
daily that requires archiving.

The solution would consist of the following OpenStack components:

* A firewall, switches, and load balancers on the public facing network
  connections.

* OpenStack Controller services running Image, Identity, and Networking,
  combined with support services such as MariaDB and RabbitMQ,
  configured for high availability on at least three controller nodes.

* OpenStack Compute nodes running the KVM hypervisor.

* OpenStack Block Storage for use by compute instances that require
  persistent storage (such as databases for dynamic sites).

* OpenStack Object Storage for serving static objects (such as images).

.. figure:: figures/General_Architecture3.png

Running up to 140 web instances and the small number of MariaDB
instances requires 292 vCPUs available, as well as 584 GB RAM. On a
typical 1U server using dual-socket hex-core Intel CPUs with
Hyperthreading, and assuming 2:1 CPU overcommit ratio, this would
require 8 OpenStack Compute nodes.
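
The arithmetic behind these figures can be checked with a short
calculation. The per-node numbers below (12 physical cores doubled by
Hyperthreading, with the 2:1 overcommit applied to hardware threads) are
assumptions used to reproduce the estimate; the raw quotient is 7 nodes,
so the 8 quoted above presumably includes a node of headroom.

.. code-block:: python

   import math

   web_vcpus, web_ram = 140 * 2, 140 * 4    # Nginx/Tomcat instances
   db_vcpus, db_ram = 3 * 4, 3 * 8          # MariaDB/Galera nodes

   total_vcpus = web_vcpus + db_vcpus       # 292 vCPUs
   total_ram = web_ram + db_ram             # 584 GB

   threads_per_node = 2 * 6 * 2             # dual socket, hex core, HT
   vcpus_per_node = threads_per_node * 2    # 2:1 CPU overcommit

   print(total_vcpus, total_ram)                      # 292 584
   print(math.ceil(total_vcpus / vcpus_per_node))     # 7 (+1 for headroom)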

The web application instances run from local storage on each of the
OpenStack Compute nodes. The web application instances are stateless,
meaning that any of the instances can fail and the application will
continue to function.

MariaDB server instances store their data on shared enterprise storage,
such as NetApp or SolidFire devices. If a MariaDB instance fails, the
storage is expected to be re-attached to another instance and rejoined
to the Galera cluster.

Logs from the web application servers are shipped to OpenStack Object
Storage for processing and archiving.

Additional capabilities can be realized by moving static web content to
be served from OpenStack Object Storage containers, and backing the
OpenStack Image service with OpenStack Object Storage.

.. note::

   Increasing the use of OpenStack Object Storage means that network
   bandwidth needs to be taken into consideration. Running OpenStack
   Object Storage with network connections offering 10 GbE or better
   connectivity is advised.

Leveraging the Orchestration and Telemetry services is also worth
considering when providing auto-scaling, orchestrated web application
environments. Defining the web applications in a
:term:`Heat Orchestration Template (HOT)`
negates the reliance on the current scripted Puppet solution.

OpenStack Networking can be used to control hardware load balancers
through the use of plug-ins and the Networking API. This allows users to
control hardware load balancer pools and instances as members in these
pools, but their use in production environments must be carefully
weighed against current stability.

doc/arch-design-draft/source/arch-examples-hybrid.rst (new file)
@@ -0,0 +1,154 @@

=====================
Hybrid cloud examples
=====================

Hybrid cloud environments are designed for these use cases:

* Bursting workloads from private to public OpenStack clouds
* Bursting workloads from private to public non-OpenStack clouds
* High availability across clouds (for technical diversity)

This chapter provides examples of environments that address
each of these use cases.

Bursting to a public OpenStack cloud
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Company A's data center is running low on capacity.
It is not possible to expand the data center in the foreseeable future.
In order to accommodate the continuously growing need for
development resources in the organization,
Company A decides to use resources in the public cloud.

Company A has an established data center with a substantial amount
of hardware. Migrating the workloads to a public cloud is not feasible.

The company has an internal cloud management platform that directs
requests to the appropriate cloud, depending on the local capacity.
This is a custom in-house application written for this specific purpose.

This solution is depicted in the figure below:

.. figure:: figures/Multi-Cloud_Priv-Pub3.png
   :width: 100%

This example shows two clouds with a Cloud Management
Platform (CMP) connecting them. This guide does not
discuss a specific CMP, but describes how the Orchestration and
Telemetry services handle, manage, and control workloads.

The private OpenStack cloud has at least one controller and at least
one compute node. It includes metering using the Telemetry service.
The Telemetry service captures the load increase and the CMP
processes the information. If there is available capacity,
the CMP uses the OpenStack API to call the Orchestration service.
This creates instances on the private cloud in response to user requests.
When capacity is not available on the private cloud, the CMP issues
a request to the Orchestration service API of the public cloud.
This creates the instance on the public cloud.
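
A highly simplified sketch of that burst decision is shown below. The
helper names and the 80% threshold are hypothetical; a real CMP would
read the utilization figures from Telemetry and call the Orchestration
API of whichever cloud it selects.

.. code-block:: python

   PRIVATE, PUBLIC = "private-openstack", "public-openstack"


   def choose_cloud(private_cpu_util, threshold=0.80):
       """Burst to the public cloud once private utilization is too high."""
       return PRIVATE if private_cpu_util < threshold else PUBLIC


   def launch_stack(cloud, stack_name):
       # Placeholder for a heat stack-create call against the chosen
       # cloud's Orchestration API endpoint.
       print("creating stack %s on %s" % (stack_name, cloud))


   # Telemetry reports 92% average CPU use on the private cloud, so the
   # new workload is created on the public cloud instead.
   launch_stack(choose_cloud(0.92), "web-tier")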

In this example, Company A does not direct the deployments to an
external public cloud due to concerns regarding resource control,
security, and increased operational expense.

Bursting to a public non-OpenStack cloud
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The second example examines bursting workloads from the private cloud
into a non-OpenStack public cloud using Amazon Web Services (AWS)
to take advantage of additional capacity and to scale applications.

The following diagram demonstrates an OpenStack-to-AWS hybrid cloud:

.. figure:: figures/Multi-Cloud_Priv-AWS4.png
   :width: 100%

Company B states that its developers are already using AWS
and do not want to change to a different provider.

If the CMP is capable of connecting to an external cloud
provider with an appropriate API, the workflow process remains
the same as in the previous scenario.
The actions the CMP takes, such as monitoring loads and
creating new instances, stay the same.
However, the CMP performs these actions in the public cloud
using applicable API calls.

If the public cloud is AWS, the CMP would use the
EC2 API to create a new instance and assign an Elastic IP.
It can then add that IP to HAProxy in the private cloud.
The CMP can also reference AWS-specific
tools such as CloudWatch and CloudFormation.
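
A sketch of the AWS side of that workflow, using the boto3 library, is
shown below. The AMI ID, instance type, and region are placeholders, and
error handling plus the HAProxy update on the private-cloud side are
omitted.

.. code-block:: python

   import boto3

   ec2 = boto3.client("ec2", region_name="eu-west-1")

   # Launch one instance in the public cloud.
   run = ec2.run_instances(ImageId="ami-00000000", InstanceType="m4.large",
                           MinCount=1, MaxCount=1)
   instance_id = run["Instances"][0]["InstanceId"]
   ec2.get_waiter("instance_running").wait(InstanceIds=[instance_id])

   # Allocate an Elastic IP and attach it to the new instance; this is
   # the address the CMP would then add to the HAProxy pool.
   eip = ec2.allocate_address(Domain="vpc")
   ec2.associate_address(InstanceId=instance_id,
                         AllocationId=eip["AllocationId"])
   print("bursted instance reachable at", eip["PublicIp"])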

Several open source toolkits for building CMPs are
available and can handle this kind of translation.
Examples include ManageIQ, jClouds, and JumpGate.

High availability and disaster recovery
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Company C requires their local data center to be able to
recover from failure. Some of the workloads currently in
use are running on their private OpenStack cloud.
Protecting the data involves Block Storage, Object Storage,
and a database. The architecture supports the failure of
large components of the system while ensuring that the
system continues to deliver services.
While the services remain available to users, the failed
components are restored in the background based on standard
best practice data replication policies.
To achieve these objectives, Company C replicates data to
a second cloud in a geographically distant location.
The following diagram describes this system:

.. figure:: figures/Multi-Cloud_failover2.png
   :width: 100%

This example includes two private OpenStack clouds connected with a CMP.
The source cloud, OpenStack Cloud 1, includes a controller and
at least one instance running MySQL. It also includes at least
one Block Storage volume and one Object Storage volume.
This means that data is available to the users at all times.
The details of the method for protecting each of these sources
of data differ.

Object Storage relies on the replication capabilities of
the Object Storage provider.
Company C enables OpenStack Object Storage so that it creates
geographically separated replicas that take advantage of this feature.
The company configures storage so that at least one replica
exists in each cloud. In order to make this work, the company
configures a single array spanning both clouds with OpenStack Identity.
Using Federated Identity, the array talks to both clouds, communicating
with OpenStack Object Storage through the Swift proxy.

For Block Storage, the replication is a little more difficult,
and involves tools outside of OpenStack itself.
The OpenStack Block Storage volume is not set as the drive itself
but as a logical object that points to a physical back end.
Disaster recovery is configured for Block Storage for
synchronous backup for the highest level of data protection,
but asynchronous backup could have been set as an alternative
that is not as latency sensitive.
For asynchronous backup, the Block Storage API makes it possible
to export the data and also the metadata of a particular volume,
so that it can be moved and replicated elsewhere.
More information can be found here:
https://blueprints.launchpad.net/cinder/+spec/cinder-backup-volume-metadata-support.
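
A sketch of driving that export with python-cinderclient follows. The
authentication details and volume ID are placeholders and the exact
client setup depends on the deployment; the point is the
``backups.create`` / ``backups.export_record`` pair, which produces a
record that can be imported on the recovery cloud.

.. code-block:: python

   from cinderclient import client
   from keystoneauth1 import loading, session

   loader = loading.get_plugin_loader("password")
   auth = loader.load_from_options(
       auth_url="https://cloud1.example.com:5000/v3",
       username="admin", password="secret", project_name="admin",
       user_domain_id="default", project_domain_id="default")
   cinder = client.Client("3", session=session.Session(auth=auth))

   # Back up the database volume, then export the backup's metadata
   # record so it can be imported into the second cloud.
   backup = cinder.backups.create("VOLUME_UUID", name="db-volume-backup")
   record = cinder.backups.export_record(backup.id)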

The synchronous backups create an identical volume in both
clouds and choose the appropriate flavor so that each cloud
has an identical back end. This is done by creating volumes
through the CMP. After this is configured, a solution
involving DRBD synchronizes the physical drives.

The database component is backed up using synchronous backups.
MySQL does not support geographically diverse replication,
so disaster recovery is provided by replicating the file itself.
As it is not possible to use Object Storage as the back end of
a database like MySQL, Swift replication is not an option.
Company C decides not to store the data on another geo-tiered
storage system, such as Ceph, as Block Storage.
This would have given another layer of protection.
Another option would have been to store the database on an OpenStack
Block Storage volume and back it up like any other Block Storage
volume.

doc/arch-design-draft/source/arch-examples-multi-site.rst (new file)
@@ -0,0 +1,192 @@

=========================
Multi-site cloud examples
=========================

There are multiple ways to build a multi-site OpenStack installation,
based on the needs of the intended workloads. Below are example
architectures based on different requirements. These examples are meant
as a reference, and not a hard and fast rule for deployments. Use the
previous sections of this chapter to assist in selecting specific
components and implementations based on your needs.

A large content provider needs to deliver content to customers that are
geographically dispersed. The workload is very sensitive to latency and
needs a rapid response to end users. After reviewing the user, technical,
and operational considerations, it is determined beneficial to build a
number of regions local to the customer's edge. Rather than build a few
large, centralized data centers, the intent of the architecture is to
provide a pair of small data centers in locations that are closer to the
customer. In this use case, spreading the application out allows for a
different kind of horizontal scaling than a traditional compute workload
would use. The intent is to scale by creating more copies of the
application in closer proximity to the users that need it most, in order
to ensure faster response time to user requests. This provider deploys
two data centers at each of the four chosen regions.

The implications of this design are based around the method of placing
copies of resources in each of the remote regions. Swift objects, Glance
images, and block storage need to be manually replicated into each
region. This may be beneficial for some systems, such as the case of a
content service, where only some of the content needs to exist in some
but not all regions. A centralized Keystone is recommended to ensure
authentication and that access to the API endpoints is easily
manageable.

It is recommended that you install an automated DNS system such as
Designate. Application administrators need a way to manage the mapping
of which application copy exists in each region and how to reach it,
unless an external Dynamic DNS system is available. Designate assists by
making the process automatic and by populating the records in each
region's zone.
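
For illustration, the record population that Designate enables could be
driven through openstacksdk as sketched below; the cloud name, zone, and
address are placeholders.

.. code-block:: python

   import openstack

   conn = openstack.connect(cloud="region-east")   # entry from clouds.yaml

   zone = conn.dns.find_zone("app.example.com.")
   # Publish an A record for the application copy running in this region.
   conn.dns.create_recordset(zone, name="shop.app.example.com.",
                             type="A", ttl=300,
                             records=["203.0.113.10"])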

Telemetry for each region is also deployed, as each region may grow
differently or be used at a different rate. Ceilometer collects each
region's meters from each of the controllers and reports them back to a
central location. This is useful both to the end user and the
administrator of the OpenStack environment. The end user will find this
method useful, as it makes it possible to determine if certain locations
are experiencing higher load than others, and take appropriate action.
Administrators also benefit by possibly being able to forecast growth
per region, rather than expanding the capacity of all regions
simultaneously, therefore maximizing the cost-effectiveness of the
multi-site design.

One of the key decisions in running this infrastructure is whether or
not to provide a redundancy model. Two types of redundancy and high
availability models can be implemented in this configuration. The first
type is the availability of central OpenStack components. Keystone can
be made highly available in three central data centers that host the
centralized OpenStack components. This prevents a loss of any one of the
regions causing an outage in service. It also has the added benefit of
being able to run a central storage repository as a primary cache for
distributing content to each of the regions.

The second redundancy type is the edge data center itself. A second data
center in each of the edge regional locations houses a second region
near the first region. This ensures that the application does not suffer
degraded performance in terms of latency and availability.

:ref:`ms-customer-edge` depicts the solution designed to have both a
centralized set of core data centers for OpenStack services and paired edge
data centers:

.. _ms-customer-edge:

.. figure:: figures/Multi-Site_Customer_Edge.png

   **Multi-site architecture example**

Geo-redundant load balancing
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

A large-scale web application has been designed with cloud principles in
mind. The application is designed to provide service to an application
store, on a 24/7 basis. The company has a typical two-tier architecture
with a web front end servicing the customer requests, and a NoSQL
database back end storing the information.

Recently, there have been several outages at a number of major public
cloud providers that affected applications running out of a single
geographical location. The design therefore should mitigate the chance
of a single site causing an outage for the business.

The solution would consist of the following OpenStack components:

* A firewall, switches, and load balancers on the public facing network
  connections.

* OpenStack Controller services running Networking, dashboard, Block
  Storage, and Compute running locally in each of the three regions.
  The Identity service, Orchestration service, Telemetry service, Image
  service, and Object Storage service can be installed centrally, with
  nodes in each of the regions providing a redundant OpenStack
  Controller plane throughout the globe.

* OpenStack Compute nodes running the KVM hypervisor.

* OpenStack Object Storage for serving static objects such as images
  can be used to ensure that all images are standardized across all the
  regions, and replicated on a regular basis.

* A distributed DNS service available to all regions that allows for
  dynamic update of DNS records of deployed instances.

* A geo-redundant load balancing service can be used to service the
  requests from the customers based on their origin.

An autoscaling heat template can be used to deploy the application in
the three regions (see the sketch after this list). This template
includes:

* Web servers, running Apache.

* Appropriate ``user_data`` to populate the central DNS servers upon
  instance launch.

* Appropriate Telemetry alarms that maintain the state of the
  application and allow for handling of region or instance failure.
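
The sketch below shows one way to push the same template to all three
regions through openstacksdk. The region names, template URL, and
parameters are placeholders, not part of the reference architecture.

.. code-block:: python

   import openstack

   REGIONS = ["region-east", "region-west", "region-central"]
   TEMPLATE_URL = "http://config.example.com/webapp-autoscaling.yaml"

   for region in REGIONS:
       conn = openstack.connect(cloud="mycloud", region_name=region)
       # Create the autoscaling stack (web servers, user_data hooks, and
       # Telemetry alarms) in this region.
       conn.orchestration.create_stack(
           name="webapp-%s" % region,
           template_url=TEMPLATE_URL,
           parameters={"dns_zone": "app.example.com."})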

Another autoscaling Heat template can be used to deploy a distributed
MongoDB shard over the three locations, with the option of storing
required data on a globally available Swift container. According to the
usage and load on the database server, additional shards can be
provisioned according to the thresholds defined in Telemetry.

Two data centers would have been sufficient had the requirements been
met. However, three regions are selected here to avoid abnormal load on
a single region in the event of a failure.

Orchestration is used because of its built-in functionality for
autoscaling and auto-healing in the event of increased load. Additional
configuration management tools, such as Puppet or Chef, could also have
been used in this scenario, but were not chosen since Orchestration had
the appropriate built-in hooks into the OpenStack cloud, whereas the
other tools were external and not native to OpenStack. In addition,
external tools were not needed since this deployment scenario was
straightforward.

OpenStack Object Storage is used here to serve as a back end for the
Image service since it is the most suitable solution for a globally
distributed storage solution with its own replication mechanism.
Home-grown solutions could also have been used, including the handling
of replication, but were not chosen because Object Storage is already an
integral part of the infrastructure and a proven solution.

An external load balancing service was used rather than the LBaaS in
OpenStack because the solution in OpenStack is not redundant and does
not have any awareness of geo-location.

.. _ms-geo-redundant:

.. figure:: figures/Multi-site_Geo_Redundant_LB.png

   **Multi-site geo-redundant architecture**

Location-local service
~~~~~~~~~~~~~~~~~~~~~~

A common use for a multi-site OpenStack deployment is creating a Content
Delivery Network. An application that uses a location-local architecture
requires low network latency and proximity to the user to provide an
optimal user experience and reduce the cost of bandwidth and transit.
The content resides on sites closer to the customer, instead of a
centralized content store that requires utilizing higher-cost
cross-country links.

This architecture includes a geo-location component that directs user
requests to the closest possible node. In this scenario, 100% redundancy
of content across every site is a goal rather than a requirement, with
the intent to maximize the amount of content available within a minimum
number of network hops for end users. Despite these differences, the
storage replication configuration has significant overlap with that of a
geo-redundant load balancing use case.

In :ref:`ms-shared-keystone`, the location-aware application utilizing
this multi-site OpenStack installation launches web server or content
serving instances on the compute cluster in each site. Requests from
clients are first sent to a global services load balancer that
determines the location of the client, then routes the request to the
closest OpenStack site where the application completes the request.

.. _ms-shared-keystone:

.. figure:: figures/Multi-Site_shared_keystone1.png

   **Multi-site shared keystone architecture**

doc/arch-design-draft/source/arch-examples-network.rst (new file)
@@ -0,0 +1,166 @@

==============================
Network-focused cloud examples
==============================

An organization designs a large-scale web application with cloud
principles in mind. The application scales horizontally in a bursting
fashion and generates a high instance count. The application requires an
SSL connection to secure data and must not lose connection state to
individual servers.

The figure below depicts an example design for this workload. In this
example, a hardware load balancer provides SSL offload functionality and
connects to tenant networks in order to reduce address consumption. This
load balancer links to the routing architecture as it services the VIP
for the application. The router and load balancer use the GRE tunnel ID
of the application's tenant network and an IP address within the tenant
subnet but outside of the address pool. This is to ensure that the load
balancer can communicate with the application's HTTP servers without
requiring the consumption of a public IP address.

Because sessions persist until closed, the routing and switching
architecture provides high availability. Switches mesh to each
hypervisor and to each other, and also provide an MLAG implementation to
ensure that layer-2 connectivity does not fail. Routers use VRRP and
fully mesh with switches to ensure layer-3 connectivity. Since GRE
provides an overlay network, Networking is present and uses the Open
vSwitch agent in GRE tunnel mode. This ensures all devices can reach all
other devices and that you can create tenant networks for private
addressing links to the load balancer.

.. figure:: figures/Network_Web_Services1.png

A web service architecture has many options and optional components. Due
to this, it can fit into a large number of other OpenStack designs. A
few key components, however, need to be in place to handle the nature of
most web-scale workloads. You require the following components:

* OpenStack Controller services (Image, Identity, Networking, and
  supporting services such as MariaDB and RabbitMQ)

* OpenStack Compute running the KVM hypervisor

* OpenStack Object Storage

* Orchestration service

* Telemetry service

Beyond the normal Identity, Compute, Image service, and Object Storage
components, we recommend the Orchestration service component to handle
the proper scaling of workloads to adjust to demand. Due to the
requirement for auto-scaling, the design includes the Telemetry service.
Web services tend to be bursty in load, have very defined peak and
valley usage patterns and, as a result, benefit from automatic scaling
of instances based upon traffic. At a network level, a split network
configuration works well with databases residing on private tenant
networks since these do not emit a large quantity of broadcast traffic
and may need to interconnect to some databases for content.

Load balancing
~~~~~~~~~~~~~~

Load balancing spreads requests across multiple instances. This workload
scales well horizontally across large numbers of instances. This enables
instances to run without publicly routed IP addresses and instead to
rely on the load balancer to provide a globally reachable service. Many
of these services do not require direct server return. This aids in
address planning and utilization at scale since only the virtual IP
(VIP) must be public.

Overlay networks
~~~~~~~~~~~~~~~~

The overlay functionality design includes OpenStack Networking in Open
vSwitch GRE tunnel mode. In this case, the layer-3 external routers pair
with VRRP, and switches pair with an implementation of MLAG to ensure
that you do not lose connectivity with the upstream routing
infrastructure.

Performance tuning
~~~~~~~~~~~~~~~~~~

Network-level tuning for this workload is minimal. Quality-of-Service
(QoS) applies to these workloads for a middle-ground Class Selector
depending on existing policies. It is higher than a best effort queue
but lower than an Expedited Forwarding or Assured Forwarding queue.
Since this type of application generates larger packets with
longer-lived connections, you can optimize bandwidth utilization for
long-duration TCP. Normal bandwidth planning applies here with regards
to benchmarking a session's usage multiplied by the expected number of
concurrent sessions with overhead.

Network functions
~~~~~~~~~~~~~~~~~

Network functions is a broad category that encompasses workloads
supporting the rest of a system's network. These workloads tend to
consist of large amounts of small packets that are very short lived,
such as DNS queries or SNMP traps. These messages need to arrive quickly
and do not deal well with packet loss, as there can be a very large
volume of them. There are a few extra considerations to take into
account for this type of workload, and this can change a configuration
all the way down to the hypervisor level. For an application that
generates 10 TCP sessions per user with an average bandwidth of 512
kilobytes per second per flow and an expected user count of ten thousand
concurrent users, the expected bandwidth plan is approximately 4.88
gigabits per second.
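
The quoted figure can be reproduced as below, but only under an
interpretation that is not spelled out in the text: the 512 is read as
the aggregate per-user rate in kilobits per second (roughly 51 kilobits
per second per flow) and binary prefixes are used. Taken literally as
512 kilobytes per second on each of 10 flows, the total would be far
larger.

.. code-block:: python

   per_user_kbit = 512          # assumed aggregate kilobits/s per user
   users = 10000

   total_gbit = per_user_kbit * users / 1024.0 / 1024.0
   print(round(total_gbit, 2))  # 4.88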

The supporting network for this type of configuration needs to have
low latency and evenly distributed availability. This workload benefits
from having services local to the consumers of the service. Use a
multi-site approach as well as deploying many copies of the application
to handle load as close as possible to consumers. Since these
applications function independently, they do not warrant running
overlays to interconnect tenant networks. Overlays also have the
drawback of performing poorly with rapid flow setup and may incur too
much overhead with large quantities of small packets, and therefore we
do not recommend them.

QoS is desirable for some workloads to ensure delivery. DNS has a major
impact on the load times of other services and needs to be reliable and
provide rapid responses. Configure rules in upstream devices to apply a
higher Class Selector to DNS to ensure faster delivery or a better spot
in queuing algorithms.

Cloud storage
~~~~~~~~~~~~~

Another common use case for OpenStack environments is providing a
cloud-based file storage and sharing service. You might consider this a
storage-focused use case, but its network-side requirements make it a
network-focused use case.

For example, consider a cloud backup application. This workload has two
specific behaviors that impact the network. Because this workload is an
externally-facing service and an internally-replicating application, it
has both :term:`north-south<north-south traffic>` and
:term:`east-west<east-west traffic>` traffic considerations:

north-south traffic
    When a user uploads and stores content, that content moves into the
    OpenStack installation. When users download this content, the
    content moves out from the OpenStack installation. Because this
    service operates primarily as a backup, most of the traffic moves
    southbound into the environment. In this situation, it benefits you
    to configure a network to be asymmetrically downstream because the
    traffic that enters the OpenStack installation is greater than the
    traffic that leaves the installation.

east-west traffic
    Likely to be fully symmetric. Because replication originates from
    any node and might target multiple other nodes algorithmically, it
    is less likely for this traffic to have a larger volume in any
    specific direction. However, this traffic might interfere with
    north-south traffic.

.. figure:: figures/Network_Cloud_Storage2.png

This application prioritizes the north-south traffic over east-west
traffic: the north-south traffic involves customer-facing data.

The network design in this case is less dependent on availability and
more dependent on being able to handle high bandwidth. As a direct
result, it is beneficial to forgo redundant links in favor of bonding
those connections. This increases available bandwidth. It is also
beneficial to configure all devices in the path, including OpenStack, to
generate and pass jumbo frames.

doc/arch-design-draft/source/arch-examples-specialized.rst (new file)
@@ -0,0 +1,42 @@

=================
Specialized cases
=================

.. toctree::
   :maxdepth: 2

   specialized-multi-hypervisor.rst
   specialized-networking.rst
   specialized-software-defined-networking.rst
   specialized-desktop-as-a-service.rst
   specialized-openstack-on-openstack.rst
   specialized-hardware.rst
   specialized-single-site.rst
   specialized-add-region.rst
   specialized-scaling-multiple-cells.rst

Although OpenStack architecture designs have been described
in seven major scenarios outlined in other sections
(compute focused, network focused, storage focused, general
purpose, multi-site, hybrid cloud, and massively scalable),
there are a few use cases that do not fit into these categories.
This section discusses these specialized cases and provides some
additional details and design considerations for each use case:

* :doc:`Specialized networking <specialized-networking>`:
  describes running networking-oriented software that may involve reading
  packets directly from the wire or participating in routing protocols.
* :doc:`Software-defined networking (SDN)
  <specialized-software-defined-networking>`:
  describes both running an SDN controller from within OpenStack
  as well as participating in a software-defined network.
* :doc:`Desktop-as-a-Service <specialized-desktop-as-a-service>`:
  describes running a virtualized desktop environment in a cloud
  (:term:`Desktop-as-a-Service`).
  This applies to private and public clouds.
* :doc:`OpenStack on OpenStack <specialized-openstack-on-openstack>`:
  describes building a multi-tiered cloud by running OpenStack
  on top of an OpenStack installation.
* :doc:`Specialized hardware <specialized-hardware>`:
  describes the use of specialized hardware devices from within
  the OpenStack environment.

doc/arch-design-draft/source/arch-examples-storage.rst (new file)
@@ -0,0 +1,143 @@

==============================
Storage-focused cloud examples
==============================

Storage-focused architecture depends on specific use cases. This section
discusses three example use cases:

* An object store with a RESTful interface

* Compute analytics with parallel file systems

* High performance database

The example below shows a REST interface without a high performance
requirement.

Swift is a highly scalable object store that is part of the OpenStack
project. This diagram explains the example architecture:

.. figure:: figures/Storage_Object.png

The example REST interface, presented as a traditional object store
running on traditional spindles, does not require a high performance
caching tier.

This example uses the following components:

Network:

* 10 GbE horizontally scalable spine-leaf back-end storage and
  front-end network.

Storage hardware:

* 10 storage servers each with 12x4 TB disks equaling 480 TB total
  space with approximately 160 TB of usable space after replicas.

Proxy:

* 3x proxies

* 2x10 GbE bonded front end

* 2x10 GbE back-end bonds

* Approximately 60 Gb of total bandwidth to the back-end storage
  cluster

.. note::

   It may be necessary to implement a third-party caching layer for some
   applications to achieve suitable performance.

Compute analytics with Data processing service
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Analytics of large data sets are dependent on the performance of the
storage system. Clouds using storage systems such as the Hadoop
Distributed File System (HDFS) have inefficiencies which can cause
performance issues.

One potential solution to this problem is the implementation of storage
systems designed for performance. Parallel file systems have previously
filled this need in the HPC space and are suitable for large-scale
performance-oriented systems.

OpenStack has integration with Hadoop to manage the Hadoop cluster
within the cloud. The following diagram shows an OpenStack store with a
high performance requirement:

.. figure:: figures/Storage_Hadoop3.png

The hardware requirements and configuration are similar to those of the
high performance database example below. In this case, the architecture
uses Ceph's Swift-compatible REST interface, with features that allow
for connecting a caching pool to accelerate the presented pool.

High performance database with Database service
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Databases are a common workload that benefits from high performance
storage back ends. Although enterprise storage is not a requirement,
many environments have existing storage that an OpenStack cloud can use
as a back end. You can create a storage pool to provide block devices
with OpenStack Block Storage for instances as well as object interfaces.
In this example, the database I/O requirements are high and demand
storage presented from a fast SSD pool.

A storage system presents a LUN backed by a set of SSDs using a
traditional storage array with OpenStack Block Storage integration or a
storage platform such as Ceph or Gluster.

This system can provide additional performance. For example, in the
database example below, a portion of the SSD pool can act as a block
device to the database server. In the high performance analytics
example, the inline SSD cache layer accelerates the REST interface.

.. figure:: figures/Storage_Database_+_Object5.png

In this example, Ceph presents a Swift-compatible REST interface, as
well as block-level storage from a distributed storage cluster. It is
highly flexible and has features that enable reduced cost of operations,
such as self-healing and auto-balancing. Using erasure coded pools is a
suitable way of maximizing the amount of usable space.

.. note::

   There are special considerations around erasure coded pools. For
   example, they have higher computational requirements and limitations
   on the operations allowed on an object; erasure coded pools do not
   support partial writes.

Using Ceph as an applicable example, a potential architecture would have
the following requirements:

Network:

* 10 GbE horizontally scalable spine-leaf back-end storage and
  front-end network

Storage hardware:

* 5 storage servers for the caching layer, each with 24x1 TB SSDs

* 10 storage servers each with 12x4 TB disks, which equals 480 TB total
  space with approximately 160 TB of usable space after 3 replicas

REST proxy:

* 3x proxies

* 2x10 GbE bonded front end

* 2x10 GbE back-end bonds

* Approximately 60 Gb of total bandwidth to the back-end storage
  cluster

Using an SSD cache layer, you can present block devices directly to
hypervisors or instances. The REST interface can also use the SSD cache
systems as an inline cache.

doc/arch-design-draft/source/arch-examples.rst (new file)
@@ -0,0 +1,14 @@

===========================
Cloud architecture examples
===========================

.. toctree::
   :maxdepth: 2

   arch-examples-general.rst
   arch-examples-compute.rst
   arch-examples-storage.rst
   arch-examples-network.rst
   arch-examples-multi-site.rst
   arch-examples-hybrid.rst
   arch-examples-specialized.rst

@@ -1,9 +0,0 @@ (deleted file)
=====================
Example architectures
=====================

.. toctree::
   :maxdepth: 2

Binary files added:
doc/arch-design-draft/source/figures/Compute_NSX.png
doc/arch-design-draft/source/figures/General_Architecture3.png
doc/arch-design-draft/source/figures/Generic_CERN_Example.png
doc/arch-design-draft/source/figures/Multi-Cloud_Priv-AWS4.png
doc/arch-design-draft/source/figures/Multi-Cloud_Priv-Pub3.png
doc/arch-design-draft/source/figures/Multi-Cloud_failover2.png
doc/arch-design-draft/source/figures/Network_Cloud_Storage2.png
doc/arch-design-draft/source/figures/Network_Web_Services1.png
doc/arch-design-draft/source/figures/Specialized_Hardware2.png
doc/arch-design-draft/source/figures/Specialized_OOO.png
doc/arch-design-draft/source/figures/Specialized_SDN_hosted.png
doc/arch-design-draft/source/figures/Specialized_VDI1.png
doc/arch-design-draft/source/figures/Storage_Hadoop3.png
doc/arch-design-draft/source/figures/Storage_Object.png

@@ -32,7 +32,7 @@ Contents
    high-availability.rst
    security-requirements.rst
    legal-requirements.rst
-   example-architectures.rst
+   arch-examples.rst
    common/app_support.rst
    common/glossary.rst

doc/arch-design-draft/source/specialized-add-region.rst (new file)
@@ -0,0 +1,5 @@

=====================
Adding another region
=====================

.. TODO

@@ -0,0 +1,47 @@

====================
Desktop-as-a-Service
====================

Virtual Desktop Infrastructure (VDI) is a service that hosts
user desktop environments on remote servers. This application
is very sensitive to network latency and requires a high
performance compute environment. Traditionally these types of
services do not use cloud environments because few clouds
support such a demanding workload for user-facing applications.
As cloud environments become more robust, vendors are starting
to provide services that provide virtual desktops in the cloud.
OpenStack may soon provide the infrastructure for these types of
deployments.

Challenges
~~~~~~~~~~

Designing an infrastructure that is suitable to host virtual
desktops is a very different task from that of most virtual workloads.
For example, the design must consider:

* Boot storms, when a high volume of logins occur in a short period of time
* The performance of the applications running on virtual desktops
* Operating systems and their compatibility with the OpenStack hypervisor

Broker
~~~~~~

The connection broker determines which remote desktop host
users can access. Medium and large scale environments require a broker
since its service represents a central component of the architecture.
The broker is a complete management product, and enables automated
deployment and provisioning of remote desktop hosts.

Possible solutions
~~~~~~~~~~~~~~~~~~

There are a number of commercial products currently available that
provide a broker solution. However, no native OpenStack projects
provide broker services.
Not providing a broker is also an option, but managing this manually
would not suffice for a large scale, enterprise solution.

Diagram
~~~~~~~

.. figure:: figures/Specialized_VDI1.png
43
doc/arch-design-draft/source/specialized-hardware.rst
Normal file
@ -0,0 +1,43 @@
====================
Specialized hardware
====================

Certain workloads require specialized hardware devices that have
significant virtualization or sharing challenges. Applications such as
load balancers, highly parallel brute force computing, and
direct-to-wire networking may need capabilities that basic OpenStack
components do not provide.

Challenges
~~~~~~~~~~

Some applications need access to hardware devices to either improve
performance or provide capabilities beyond virtual CPU, RAM, network,
or storage. These can be a shared resource, such as a cryptography
processor, or a dedicated resource, such as a Graphics Processing Unit
(GPU). OpenStack can provide some of these, while others may need
extra work.

Solutions
~~~~~~~~~

To provide cryptography offloading to a set of instances, you can use
Image service configuration options. For example, assign the
cryptography chip to a device node in the guest. The OpenStack Command
Line Reference contains further information on configuring this
solution in the section `Image service property keys
<http://docs.openstack.org/cli-reference/glance.html#image-service-property-keys>`_.
A challenge, however, is that this option allows all guests using the
configured images to access the hypervisor cryptography device.
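As a minimal sketch, the following exposes a host entropy source to
guests through an image property. It assumes a hypothetical image named
``crypto-enabled-image`` and uses the documented ``hw_rng_model``
property; the property key for your particular cryptography device may
differ.

.. code-block:: console

   # Expose the host random number generator (one example of a host
   # crypto device) to all guests booted from this image.
   $ openstack image set --property hw_rng_model=virtio crypto-enabled-image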
If you require direct access to a specific device, PCI pass-through
enables you to dedicate the device to a single instance per
hypervisor. You must define a flavor that specifically requests the
PCI device so that the scheduler places instances properly. More
information regarding PCI pass-through, including instructions for
implementing and using it, is available at
`https://wiki.openstack.org/wiki/Pci_passthrough <https://wiki.openstack.org/wiki/Pci_passthrough#How_to_check_PCI_status_with_PCI_api_patches>`_.
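As a sketch, the following flavor requests one device matching a PCI
alias named ``gpu-k2``. The alias name is hypothetical and must match a
PCI alias that you have already configured in ``nova.conf`` on the
controller and compute nodes.

.. code-block:: console

   # Create a flavor and request one PCI device matching the "gpu-k2" alias.
   $ openstack flavor create --ram 8192 --disk 80 --vcpus 8 pci.large
   $ openstack flavor set pci.large --property "pci_passthrough:alias"="gpu-k2:1"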
.. figure:: figures/Specialized_Hardware2.png
   :width: 100%
@ -0,0 +1,78 @@
========================
Multi-hypervisor example
========================

A financial company needs to migrate its applications from a
traditional, virtualized environment to an API-driven, orchestrated
environment. The new environment needs multiple hypervisors since many
of the company's applications have strict hypervisor requirements.

Currently, the company's vSphere environment runs 20 VMware ESXi
hypervisors. These hypervisors support 300 instances of various sizes.
Approximately 50 of these instances must run on ESXi. The remaining
250 or so have more flexible requirements.

The financial company decides to manage the overall system with a
common OpenStack platform.

.. figure:: figures/Compute_NSX.png
   :width: 100%

Architecture planning teams decided to run a host aggregate containing
KVM hypervisors for the general purpose instances. A separate host
aggregate targets instances requiring ESXi.

Images in the OpenStack Image service have particular hypervisor
metadata attached. When a user requests a certain image, the instance
spawns on the relevant aggregate.
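The following is a minimal sketch of that arrangement, assuming the
``AggregateImagePropertiesIsolation`` scheduler filter is enabled and
using hypothetical aggregate and image names:

.. code-block:: console

   # Group the ESXi compute hosts into their own aggregate.
   $ openstack aggregate create --zone nova vmware-hosts
   $ openstack aggregate set --property hypervisor_type=vmware vmware-hosts

   # Tag the image so instances booted from it land on that aggregate.
   $ openstack image set --property hypervisor_type=vmware esxi-only-image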
Images for ESXi use the VMDK format. You can convert QEMU disk images
to VMDK (VMFS flat disks), which can be thin, thick, zeroed-thick, or
eager-zeroed-thick. After you export a VMFS thin disk from VMFS to the
OpenStack Image service (a non-VMFS location), it becomes a
preallocated flat disk. This impacts the transfer time from the
OpenStack Image service to the data store since transfers require
moving the full preallocated flat disk rather than the thin disk.
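As an illustration, a QCOW2 source image can be converted to VMDK with
``qemu-img``; the file names here are placeholders:

.. code-block:: console

   # Convert a QCOW2 source image to VMDK for use with ESXi.
   $ qemu-img convert -f qcow2 -O vmdk ubuntu-trusty.qcow2 ubuntu-trusty.vmdk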
The VMware host aggregate compute nodes communicate with vCenter
rather than spawning directly on a hypervisor. The vCenter then
requests scheduling for the instance to run on an ESXi hypervisor.

This functionality requires that VMware Distributed Resource Scheduler
(DRS) is enabled on a cluster and set to **Fully Automated**. vSphere
requires shared storage because DRS uses vMotion, which is a service
that relies on shared storage.

This solution to the company's migration uses shared storage to
provide Block Storage capabilities to the KVM instances while also
providing vSphere storage. The new environment provides this storage
functionality using a dedicated data network. The compute hosts should
have dedicated NICs to support the dedicated data network. vSphere
supports OpenStack Block Storage, which presents storage from a VMFS
datastore to an instance. For the financial company, Block Storage in
their new architecture supports both hypervisors.

OpenStack Networking provides network connectivity in this new
architecture, with the VMware NSX plug-in driver configured. Legacy
networking (nova-network) supports both hypervisors in this new
architecture example, but has limitations. Specifically, vSphere with
legacy networking does not support security groups. The new
architecture uses VMware NSX as a part of the design. When users
launch an instance within either of the host aggregates, VMware NSX
ensures the instance attaches to the appropriate overlay-based logical
networks.

The architecture planning teams also consider OpenStack Compute
integration. When running vSphere in an OpenStack environment, the
nova-compute service communicates with vCenter, which appears as a
single large hypervisor representing the entire ESXi cluster. Multiple
nova-compute instances can represent multiple ESXi clusters and can
connect to multiple vCenter servers. If the process running
nova-compute crashes, the connection to the corresponding vCenter
server, and to the ESXi clusters behind it, is severed and you cannot
provision further instances on that vCenter, even if you enable high
availability. You must monitor the nova-compute service connected to
vSphere carefully for any disruptions as a result of this failure
point.
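A minimal sketch of that integration on a proxy compute node might look
like the following, assuming the ``crudini`` utility is available (you
can also edit ``nova.conf`` directly); the vCenter address, credentials,
and cluster name are placeholders:

.. code-block:: console

   # Point a nova-compute service at a vCenter-managed ESXi cluster.
   $ crudini --set /etc/nova/nova.conf DEFAULT compute_driver vmwareapi.VMwareVCDriver
   $ crudini --set /etc/nova/nova.conf vmware host_ip 192.0.2.30
   $ crudini --set /etc/nova/nova.conf vmware host_username administrator@vsphere.local
   $ crudini --set /etc/nova/nova.conf vmware host_password VMWARE_PASSWORD
   $ crudini --set /etc/nova/nova.conf vmware cluster_name production-cluster-1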
32
doc/arch-design-draft/source/specialized-networking.rst
Normal file
@ -0,0 +1,32 @@
==============================
Specialized networking example
==============================

Some applications that interact with a network require specialized
connectivity. For example, applications such as a looking glass
require the ability to connect to a BGP peer, and route participant
applications may need to join a network at layer 2.

Challenges
~~~~~~~~~~

Connecting specialized network applications to their required
resources alters the design of an OpenStack installation.
Installations that rely on overlay networks are unable to support a
routing participant, and may also block layer-2 listeners.

Possible solutions
~~~~~~~~~~~~~~~~~~

Deploying an OpenStack installation using OpenStack Networking with a
provider network allows direct layer-2 connectivity to an upstream
networking device. This design provides the layer-2 connectivity
required to communicate via the Intermediate System to Intermediate
System (IS-IS) protocol or to pass packets controlled by an OpenFlow
controller. Using the multiple layer-2 plug-in with an agent such as
:term:`Open vSwitch` allows a private connection through a VLAN
directly to a specific port in a layer-3 device. This allows a BGP
point-to-point link to join the autonomous system. Avoid using
layer-3 plug-ins as they divide the broadcast domain and prevent
router adjacencies from forming.
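A minimal sketch of such a provider network, using the neutron
command-line client and hypothetical physical network and VLAN values,
might look like this:

.. code-block:: console

   # Create a VLAN provider network mapped directly to the upstream switch.
   $ neutron net-create peering-net \
     --provider:network_type vlan \
     --provider:physical_network physnet1 \
     --provider:segmentation_id 201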
@ -0,0 +1,70 @@
======================
OpenStack on OpenStack
======================

In some cases, users may run OpenStack nested on top of another
OpenStack cloud. This scenario describes how to manage and provision
complete OpenStack environments on instances running on hypervisors
and servers that an underlying OpenStack environment controls.

Public cloud providers can use this technique to manage the upgrade
and maintenance process on complete OpenStack environments. Developers
and those testing OpenStack can also use this technique to provision
their own OpenStack environments on available OpenStack Compute
resources, whether public or private.

Challenges
~~~~~~~~~~

The network aspect of deploying a nested cloud is the most complicated
aspect of this architecture. Because the bare metal cloud owns all the
hardware, you must expose VLANs to the physical ports on which the
underlying cloud runs, and you must also expose them to the nested
levels. Alternatively, you can use the network overlay technologies on
the OpenStack environment running on the host OpenStack environment to
provide the required software defined networking for the deployment.

Hypervisor
~~~~~~~~~~

In this example architecture, consider which approach you should take
to provide a nested hypervisor in OpenStack. This decision influences
which operating systems you can use for the nested OpenStack
deployments.

Possible solutions: deployment
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Deployment of a full stack can be challenging, but you can mitigate
this difficulty by creating a Heat template that deploys the entire
stack or by using a configuration management system. After creating
the Heat template, you can automate the deployment of additional
stacks.
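As a sketch, launching another copy of the nested cloud from such a
template could then be as simple as the following, where the template
file and stack name are hypothetical:

.. code-block:: console

   # Deploy one more nested OpenStack environment from an existing template.
   $ openstack stack create -t nested-openstack.yaml nested-cloud-02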
The OpenStack-on-OpenStack project (:term:`TripleO`) addresses this
issue. Currently, however, the project does not completely cover
nested stacks. For more information, see
https://wiki.openstack.org/wiki/TripleO.

Possible solutions: hypervisor
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In the case of running TripleO, the underlying OpenStack cloud deploys
the Compute nodes as bare-metal servers. You then deploy OpenStack on
these bare-metal Compute servers with the appropriate hypervisor, such
as KVM.

In the case of running smaller OpenStack clouds for testing purposes,
where performance is not a critical factor, you can use QEMU instead.
It is also possible to run a KVM hypervisor in an instance (see
http://davejingtian.org/2014/03/30/nested-kvm-just-for-fun/), though
this is not a supported configuration and could be a complex solution
for such a use case.
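If you experiment with nested KVM, you can check whether the physical
host exposes nested virtualization before building on it. The commands
below assume an Intel host; use ``kvm_amd`` on AMD hardware:

.. code-block:: console

   # "Y" (or "1") means nested virtualization is enabled on the host.
   $ cat /sys/module/kvm_intel/parameters/nested

   # Enable it persistently, then reload the module.
   $ echo "options kvm_intel nested=1" | sudo tee /etc/modprobe.d/kvm_intel.conf
   $ sudo modprobe -r kvm_intel && sudo modprobe kvm_intel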
Diagram
~~~~~~~

.. figure:: figures/Specialized_OOO.png
   :width: 100%
@ -0,0 +1,5 @@
======================
Scaling multiple cells
======================

.. TODO
5
doc/arch-design-draft/source/specialized-single-site.rst
Normal file
@ -0,0 +1,5 @@
==================================================
Single site architecture with OpenStack Networking
==================================================

.. TODO
@ -0,0 +1,46 @@
===========================
Software-defined networking
===========================

Software-defined networking (SDN) is the separation of the data plane
and the control plane. SDN is a popular method of managing and
controlling packet flows within networks. SDN uses overlays or
directly controlled layer-2 devices to determine flow paths, and as
such presents challenges to a cloud environment. Some designers may
wish to run their controllers within an OpenStack installation. Others
may wish to have their installations participate in an SDN-controlled
network.

Challenges
~~~~~~~~~~

SDN is a relatively new concept that is not yet standardized, so SDN
systems come in a variety of different implementations. Because of
this, a truly prescriptive architecture is not feasible. Instead,
examine the differences between an existing and a planned OpenStack
design and determine where potential conflicts and gaps exist.

Possible solutions
~~~~~~~~~~~~~~~~~~

If an SDN implementation requires layer-2 access because it directly
manipulates switches, we do not recommend running an overlay network
or a layer-3 agent. If the controller resides within an OpenStack
installation, it may be necessary to build an ML2 plug-in and schedule
the controller instances to connect to tenant VLANs so that they can
talk directly to the switch hardware. Alternatively, depending on the
external device support, use a tunnel that terminates at the switch
hardware itself.
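For the hosted-controller case, a minimal sketch is to attach the
controller instances to the VLAN network that reaches the switches.
The network, image, and flavor names below are hypothetical:

.. code-block:: console

   # Boot an SDN controller instance on the VLAN that faces the switch fabric.
   $ net_id=$(openstack network show controller-vlan -f value -c id)
   $ openstack server create --image sdn-controller --flavor m1.large \
     --nic net-id=$net_id sdn-controller-1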
Diagram
~~~~~~~

OpenStack hosted SDN controller:

.. figure:: figures/Specialized_SDN_hosted.png

OpenStack participating in an SDN controller network:

.. figure:: figures/Specialized_SDN_external.png