swift/doc/source/howto_installmultinode.rst
2011-01-17 21:49:48 +00:00

16 KiB

Instructions for a Multiple Server Swift Installation (Ubuntu)

Prerequisites

  • Ubuntu Server 10.04 LTS installation media

Basic architecture and terms

  • node - a host machine running one or more Swift services
  • Proxy node - node that runs Proxy services; can also run Swauth
  • Auth node - node that runs the Auth service; only required for DevAuth
  • Storage node - node that runs Account, Container, and Object services
  • ring - a set of mappings of Swift data to physical devices

This document shows a cluster using the following types of nodes:

  • one Proxy node
    • Runs the swift-proxy-server processes which proxy requests to the appropriate Storage nodes. For Swauth, the proxy server will also contain the Swauth service as WSGI middleware.
  • one Auth node
    • Runs the swift-auth-server which controls authentication and authorization for all requests. This can be on the same node as a Proxy node. This is only required for DevAuth.
  • five Storage nodes
    • Runs the swift-account-server, swift-container-server, and swift-object-server processes which control storage of the account databases, the container databases, as well as the actual stored objects.

Note

Fewer Storage nodes can be used initially, but a minimum of 5 is recommended for a production cluster.

This document describes each Storage node as a separate zone in the ring. It is recommended to have a minimum of 5 zones. A zone is a group of nodes that is as isolated as possible from other nodes (separate servers, network, power, even geography). The ring guarantees that every replica is stored in a separate zone. For more information about the ring and zones, see: The Rings <overview_ring>.

To increase reliability, you may want to add additional Proxy servers for performance which is described in add-proxy-server.

Network Setup Notes

This document refers to two networks. An external network for connecting to the Proxy server, and a storage network that is not accessibile from outside the cluster, to which all of the nodes are connected. All of the Swift services, as well as the rsync daemon on the Storage nodes are configured to listen on their STORAGE_LOCAL_NET IP addresses.

General OS configuration and partitioning for each node

  1. Install the baseline Ubuntu Server 10.04 LTS on all nodes.

  2. Install common Swift software prereqs:

    apt-get install python-software-properties
    add-apt-repository ppa:swift-core/ppa
    apt-get update
    apt-get install swift openssh-server
  3. Create and populate configuration directories:

    mkdir -p /etc/swift
    chown -R swift:swift /etc/swift/
  4. Create /etc/swift/swift.conf:

    [swift-hash]
    # random unique string that can never change (DO NOT LOSE)
    swift_hash_path_suffix = changeme

Note

/etc/swift/swift.conf should be set to some random string of text to be used as a salt when hashing to determine mappings in the ring. This file should be the same on every node in the cluster!

Configure the Proxy node

Note

It is assumed that all commands are run as the root user

  1. Install swift-proxy service:

    apt-get install swift-proxy memcached
  2. Create self-signed cert for SSL:

    cd /etc/swift
    openssl req -new -x509 -nodes -out cert.crt -keyout cert.key
  3. Modify memcached to listen on the default interfaces. Preferably this should be on a local, non-public network. Edit the following line in /etc/memcached.conf, changing:

    -l 127.0.0.1
    to
    -l <PROXY_LOCAL_NET_IP>
  4. Restart the memcached server:

    service memcached restart
  5. Create /etc/swift/proxy-server.conf:

    [DEFAULT]
    cert_file = /etc/swift/cert.crt
    key_file = /etc/swift/cert.key
    bind_port = 8080
    workers = 8
    user = swift
    
    [pipeline:main]
    # For DevAuth:
    pipeline = healthcheck cache auth proxy-server
    # For Swauth:
    # pipeline = healthcheck cache swauth proxy-server
    
    [app:proxy-server]
    use = egg:swift#proxy
    allow_account_management = true
    
    # Only needed for DevAuth
    [filter:auth]
    use = egg:swift#auth
    ssl = true
    
    # Only needed for Swauth
    [filter:swauth]
    use = egg:swift#swauth
    default_swift_cluster = local#https://<PROXY_LOCAL_NET_IP>:8080/v1
    # Highly recommended to change this key to something else!
    super_admin_key = swauthkey
    
    [filter:healthcheck]
    use = egg:swift#healthcheck
    
    [filter:cache]
    use = egg:swift#memcache
    memcache_servers = <PROXY_LOCAL_NET_IP>:11211

    Note

    If you run multiple memcache servers, put the multiple IP:port listings in the [filter:cache] section of the proxy-server.conf file like: 10.1.2.3:11211,10.1.2.4:11211. Only the proxy server uses memcache.

  6. Create the account, container and object rings:

    cd /etc/swift
    swift-ring-builder account.builder create 18 3 1
    swift-ring-builder container.builder create 18 3 1
    swift-ring-builder object.builder create 18 3 1

    Note

    For more information on building rings, see overview_ring.

  7. For every storage device on each node add entries to each ring:

    swift-ring-builder account.builder add z<ZONE>-<STORAGE_LOCAL_NET_IP>:6002/<DEVICE> 100
    swift-ring-builder container.builder add z<ZONE>-<STORAGE_LOCAL_NET_IP_1>:6001/<DEVICE> 100
    swift-ring-builder object.builder add z<ZONE>-<STORAGE_LOCAL_NET_IP_1>:6000/<DEVICE> 100

    Note

    Assuming there are 5 zones with 1 node per zone, ZONE should start at 1 and increment by one for each additional node.

  8. Verify the ring contents for each ring:

    swift-ring-builder account.builder
    swift-ring-builder container.builder
    swift-ring-builder object.builder
  9. Rebalance the rings:

    swift-ring-builder account.builder rebalance
    swift-ring-builder container.builder rebalance
    swift-ring-builder object.builder rebalance

    Note

    Rebalancing rings can take some time.

  10. Copy the account.ring.gz, container.ring.gz, and object.ring.gz files to each of the Proxy and Storage nodes in /etc/swift.

  11. Make sure all the config files are owned by the swift user:

    chown -R swift:swift /etc/swift
  12. Start Proxy services:

    swift-init proxy start

Configure the Auth node

Note

Only required for DevAuth; you can skip this section for Swauth.

  1. If this node is not running on the same node as a proxy, create a self-signed cert as you did for the Proxy node

  2. Install swift-auth service:

    apt-get install swift-auth
  3. Create /etc/swift/auth-server.conf:

    [DEFAULT]
    cert_file = /etc/swift/cert.crt
    key_file = /etc/swift/cert.key
    user = swift
    
    [pipeline:main]
    pipeline = auth-server
    
    [app:auth-server]
    use = egg:swift#auth
    default_cluster_url = https://<PROXY_HOSTNAME>:8080/v1
    # Highly recommended to change this key to something else!
    super_admin_key = devauth
  4. Start Auth services:

    swift-init auth start
    chown swift:swift /etc/swift/auth.db
    swift-init auth restart            # 1.1.0 workaround because swift creates auth.db owned as root

Configure the Storage nodes

Note

Swift should work on any modern filesystem that supports Extended Attributes (XATTRS). We currently recommend XFS as it demonstrated the best overall performance for the swift use case after considerable testing and benchmarking at Rackspace. It is also the only filesystem that has been thoroughly tested.

  1. Install Storage node packages:

    apt-get install swift-account swift-container swift-object xfsprogs
  2. For every device on the node, setup the XFS volume (/dev/sdb is used as an example):

    fdisk /dev/sdb  (set up a single partition)
    mkfs.xfs -i size=1024 /dev/sdb1
    echo "/dev/sdb1 /srv/node/sdb1 xfs noatime,nodiratime,nobarrier,logbufs=8 0 0" >> /etc/fstab
    mkdir -p /srv/node/sdb1
    mount /srv/node/sdb1
    chown -R swift:swift /srv/node
  3. Create /etc/rsyncd.conf:

    uid = swift
    gid = swift
    log file = /var/log/rsyncd.log
    pid file = /var/run/rsyncd.pid
    address = <STORAGE_LOCAL_NET_IP>
    
    [account]
    max connections = 2
    path = /srv/node/
    read only = false
    lock file = /var/lock/account.lock
    
    [container]
    max connections = 2
    path = /srv/node/
    read only = false
    lock file = /var/lock/container.lock
    
    [object]
    max connections = 2
    path = /srv/node/
    read only = false
    lock file = /var/lock/object.lock
  4. Edit the following line in /etc/default/rsync:

    RSYNC_ENABLE=true
  5. Start rsync daemon:

    service rsync start

    Note

    The rsync daemon requires no authentication, so it should be run on a local, private network.

  6. Create /etc/swift/account-server.conf:

    [DEFAULT]
    bind_ip = <STORAGE_LOCAL_NET_IP>
    workers = 2
    
    [pipeline:main]
    pipeline = account-server
    
    [app:account-server]
    use = egg:swift#account
    
    [account-replicator]
    
    [account-auditor]
    
    [account-reaper]
  7. Create /etc/swift/container-server.conf:

    [DEFAULT]
    bind_ip = <STORAGE_LOCAL_NET_IP>
    workers = 2
    
    [pipeline:main]
    pipeline = container-server
    
    [app:container-server]
    use = egg:swift#container
    
    [container-replicator]
    
    [container-updater]
    
    [container-auditor]
  8. Create /etc/swift/object-server.conf:

    [DEFAULT]
    bind_ip = <STORAGE_LOCAL_NET_IP>
    workers = 2
    
    [pipeline:main]
    pipeline = object-server
    
    [app:object-server]
    use = egg:swift#object
    
    [object-replicator]
    
    [object-updater]
    
    [object-auditor]
  9. Start the storage services:

    swift-init object-server start
    swift-init object-replicator start
    swift-init object-updater start
    swift-init object-auditor start
    swift-init container-server start
    swift-init container-replicator start
    swift-init container-updater start
    swift-init container-auditor start
    swift-init account-server start
    swift-init account-replicator start
    swift-init account-auditor start

Create Swift admin account and test

You run these commands from the Auth node.

Note

For Swauth, replace the https://<AUTH_HOSTNAME>:11000/v1.0 with https://<PROXY_HOSTNAME>:8080/auth/v1.0

  1. Create a user with administrative privileges (account = system, username = root, password = testpass). Make sure to replace devauth (or swauthkey) with whatever super_admin key you assigned in the auth-server.conf file (or proxy-server.conf file in the case of Swauth) above. Note: None of the values of account, username, or password are special - they can be anything.:

    # For DevAuth:
    swift-auth-add-user -K devauth -a system root testpass
    # For Swauth:
    swauth-add-user -K swauthkey -a system root testpass
  2. Get an X-Storage-Url and X-Auth-Token:

    curl -k -v -H 'X-Storage-User: system:root' -H 'X-Storage-Pass: testpass' https://<AUTH_HOSTNAME>:11000/v1.0
  3. Check that you can HEAD the account:

    curl -k -v -H 'X-Auth-Token: <token-from-x-auth-token-above>' <url-from-x-storage-url-above>
  4. Check that st works:

    st -A https://<AUTH_HOSTNAME>:11000/v1.0 -U system:root -K testpass stat
  5. Use st to upload a few files named 'bigfile[1-2].tgz' to a container named 'myfiles':

    st -A https://<AUTH_HOSTNAME>:11000/v1.0 -U system:root -K testpass upload myfiles bigfile1.tgz
    st -A https://<AUTH_HOSTNAME>:11000/v1.0 -U system:root -K testpass upload myfiles bigfile2.tgz
  6. Use st to download all files from the 'myfiles' container:

    st -A https://<AUTH_HOSTNAME>:11000/v1.0 -U system:root -K testpass download myfiles

Adding a Proxy Server

For reliability's sake you may want to have more than one proxy server. You can set up the additional proxy node in the same manner that you set up the first proxy node but with additional configuration steps.

Once you have more than two proxies, you also want to load balance between the two, which means your storage endpoint also changes. You can select from different strategies for load balancing. For example, you could use round robin dns, or an actual load balancer (like pound) in front of the two proxies, and point your storage url to the load balancer.

See config-proxy for the initial setup, and then follow these additional steps.

  1. Update the list of memcache servers in /etc/swift/proxy-server.conf for all the added proxy servers. If you run multiple memcache servers, use this pattern for the multiple IP:port listings: 10.1.2.3:11211,10.1.2.4:11211 in each proxy server's conf file.:

    [filter:cache]
    use = egg:swift#memcache
    memcache_servers = <PROXY_LOCAL_NET_IP>:11211
  2. Change the default_cluster_url to point to the load balanced url, rather than the first proxy server you created in /etc/swift/auth-server.conf (for DevAuth) or in /etc/swift/proxy-server.conf (for Swauth):

    # For DevAuth, in /etc/swift/auth-server.conf
    [app:auth-server]
    use = egg:swift#auth
    default_cluster_url = https://<LOAD_BALANCER_HOSTNAME>/v1
    # Highly recommended to change this key to something else!
    super_admin_key = devauth
    
    # For Swauth, in /etc/swift/proxy-server.conf
    [filter:swauth]
    use = egg:swift#swauth
    default_swift_cluster = local#http://<LOAD_BALANCER_HOSTNAME>/v1
    # Highly recommended to change this key to something else!
    super_admin_key = swauthkey
  3. For DevAuth, after you change the default_cluster_url setting, you have to delete the auth database and recreate the Swift users, or manually update the auth database with the correct URL for each account.

    For Swauth, you can change a service URL with:

    swauth-set-account-service -K swauthkey <account> storage local <new_url_for_the_account>

    You can obtain old service URLs with:

    swauth-list -K swauthkey <account>
  4. Next, copy all the ring information to all the nodes, including your new proxy nodes, and ensure the ring info gets to all the storage nodes as well.

  5. After you sync all the nodes, make sure the admin has the keys in /etc/swift and the ownership for the ring file is correct.

Additional Cleanup Script for Swauth

If you decide to use Swauth, you'll want to install a cronjob to clean up any orphaned expired tokens. These orphaned tokens can occur when a "stampede" occurs where a single user authenticates several times concurrently. Generally, these orphaned tokens don't pose much of an issue, but it's good to clean them up once a "token life" period (default: 1 day or 86400 seconds).

This should be as simple as adding swauth-cleanup-tokens -K swauthkey > /dev/null to a crontab entry on one of the proxies that is running Swauth; but run swauth-cleanup-tokens with no arguments for detailed help on the options available.

Troubleshooting Notes

If you see problems, look in var/log/syslog (or messages on some distros).

Also, at Rackspace we have seen hints at drive failures by looking at error messages in /var/log/kern.log.

There are more debugging hints and tips in the admin_guide.