1100 Commits

Author SHA1 Message Date
Jenkins
096c6f21a4 Merge "Increase max scheduling attempts to 10" 2016-04-07 20:56:16 +00:00
Jenkins
a69977ce32 Merge "Fix the Neutron reconfigure" 2016-04-07 15:18:35 +00:00
Jenkins
a7dbbcf952 Merge "Configure Neutron to use internalURL for contacting Nova" 2016-04-07 11:28:22 +00:00
Jenkins
3ce80f9357 Merge "Fix manila keystone v3 issues for all clients" 2016-04-07 06:51:40 +00:00
Dave McCowan
afc2bd83dc Configure Neutron to use internalURL for contacting Nova
The Kolla design is for services to use the internalURL for
service to service communication.  In Mitaka, Neutron added
a new config parameter specifying which URL to use to to
contact Nova, making the default 'public'.  This patch sets
the value to 'internal'.

Change-Id: I2d36f3b4a860af9e9034ebfb2b5cea56450e5e4e
Closes-Bug: #1565624
2016-04-07 06:39:00 +00:00
Jenkins
d3f7eee9c1 Merge "Start the haproxy by using haproxy-system-wrapper" 2016-04-06 19:23:38 +00:00
Steven Dake
8cdd8c4ec3 Increase max scheduling attempts to 10
At high scale, such as 64 nodes with 13TB ram and 2600 cores, nova
seems to struggle when scheduling 100+ VMs at the same time.  The
issue is unrelated to the database, as the error printed indicates
the max_scheduling_attempts have been reached.  Increase that value
to something more fitting of a 100 node cluster.

Change-Id: I8982d77c7c66db8f7c95b9fd73f58ceb66dbd723
Closes-Bug: #1563664
2016-04-06 01:58:37 -04:00
Steven Dake
ba4bcdbbe8 Change waiting on mariadb.pid
Previously the code looked at mariadb.pid, but this seemed
flakey in function.  It seemed racey and prone to failure on
slower connections to a registry.  The original task was
extremely complex and it didnt really verify that the MariaDB
was ready to serve connections.  Use wait_for with a regex
instead.

Change-Id: I3aafac04f03639b08e0ef4d6a9c9e1a4499f000c
Closes-Bug: #1564278
2016-04-05 12:02:01 -04:00
Mauricio Lima
76ec9cbc5a Fix manila keystone v3 issues for all clients
Change-Id: I58472bd429accb7f0efa0fcbc7e8e2d3b76de9c4
Closes-bug: #1563972
2016-04-05 11:40:52 -04:00
Jeffrey Zhang
64ab5b0664 Fix the Neutron reconfigure
Closes-Bug: #1566360
Change-Id: I8eaab1df0c85c6a0bd60a13851f8716cc5dcfa59
2016-04-05 22:49:02 +08:00
Jenkins
05e92db2ad Merge "Playbook for rabbitmq upgrade" 2016-04-05 13:48:34 +00:00
Jenkins
3161202bd9 Merge "Enable the kibana access from the external network" 2016-04-05 11:57:47 +00:00
Jenkins
4e62c583b5 Merge "Check empty passwords in $CONFIG_DIR/passwords.yml" 2016-04-05 11:46:39 +00:00
Jenkins
5f6d4d10d4 Merge "Make Heka write Keepalived logs to dedicated directory" 2016-04-05 05:39:54 +00:00
Vikram Hosakote
8595707a25 Check empty passwords in $CONFIG_DIR/passwords.yml
Co-Authored-By: Carlos Cesario <carloscesario@gmail.com>

Change-Id: Ic2ea835cf38be64a590d3f6b022683081aeb363b
Closes-Bug: #1566150
2016-04-05 04:48:35 +00:00
Jenkins
073c9c4af9 Merge "Add comments on site.yml neutron hosts reordering" 2016-04-04 20:50:18 +00:00
Ganesh Maharaj Mahalingam
509573153e Add comments on site.yml neutron hosts reordering
TrivialFix

Change-Id: Ia5ee3a836f35c4223f9149574cbdd29b8a61af84
2016-04-04 08:59:01 -07:00
Jenkins
bd6be1a4dd Merge "Correct typo" 2016-04-04 15:03:42 +00:00
Jenkins
dc9990e744 Merge "Add general_log to mariadb container and make heka collect it" 2016-04-04 10:51:19 +00:00
Swapnil Kulkarni (coolsvap)
2a182ac741 Correct typo
The typo was missed and the code got
merged. Added commit to correct it

TrivialFix

Change-Id: Ice33bcb5fb2f8826054feeae41370038f86bcf6b
2016-04-04 15:39:16 +05:30
Jenkins
ac228c370b Merge "Implement nova-ssh container" 2016-04-04 09:50:02 +00:00
Vikram Hosakote
c7e261c8c2 Check empty passwords in /etc/kolla/passwords.yml in prechecks
This patch set makes "kolla-ansible prechecks" flag an error if
any password is empty in /etc/kolla/passwords.yml.

Change-Id: I87dee25b79c97be64ca49a5638c7f5a30d4cf464
Closes-Bug: #1563506
2016-04-04 04:56:24 +00:00
Vikram Hosakote
86bafeee7a Add general_log to mariadb container and make heka collect it
Added general_log to ansible/roles/mariadb/templates/galera.cnf.j2
to improve mariadb logging.

This will be helpful to debug mariadb issues especially when
mariadb is scaled.

Test results of this patch set are at:
http://paste.openstack.org/show/492852/

Change-Id: I80438d1bbdd1ed2a1f47489c6f9c45b8107340a0
Closes-Bug: #1563668
2016-04-04 02:52:30 +00:00
Carlos Cesario
1e8abe3592 Make Heka write Keepalived logs to dedicated directory
Currently Heka writes the keepalived logs in
/var/log/kolla/haproxy/keepalived.log.
This commit changes this to /var/log/kolla/keepalived/keepalived.log.

Closes-Bug: #1565499
Change-Id: I3033097bd77ddbf72948697b34a6a499ea903083
2016-04-03 18:55:00 -03:00
Jeffrey Zhang
42420830f6 Implement nova-ssh container
Add a nova-ssh container to handle the `nova migrate` and
`nova resize` case, in which the nova will use ssh to copy
files between machines.

Change-Id: Ie6675943f3aeabfbba8589d308d55b9c89d732db
Closes-Bug: #1562141
2016-04-03 07:21:17 +00:00
Jenkins
808d6baa44 Merge "Change keystone log dir" 2016-04-03 00:48:09 +00:00
Jenkins
73800ef8ed Merge "Make keepalived virtual_router_id param configurable" 2016-04-01 21:38:19 +00:00
Jenkins
ad05c1846d Merge "Fix the nova reconfigure in the multinode env" 2016-04-01 20:42:51 +00:00
Carlos Cesario
676998a124 Make keepalived virtual_router_id param configurable
To be kolla deploy multiple clouds, we need to be able to configure
virtual_router_id other wise haproxy will fail setup the VIP for the
second cloud.

Partially-Implements: blueprint multiple-cloud
Closes-Bug: #1564547
Change-Id: I9eb27dd6fba61205841eadafc96601e235d2fe6d
2016-04-01 17:35:35 -03:00
Jenkins
19d325093d Merge "Workaround ansible bug related to delegate_to" 2016-04-01 19:27:07 +00:00
Jenkins
d544a4f3cd Merge "When two interfaces are used for two VIPs, tie them together" 2016-04-01 19:11:34 +00:00
Jenkins
17781afa52 Merge "Fix IP resolve check for RabbitMQ on AIO" 2016-04-01 17:19:18 +00:00
Steven Dake
0bba5fe000 Workaround ansible bug related to delegate_to
Currently the delegate_to doesnt happen and the neutron role creation is
attempted once on the first server and is skipped. The re-ordering of hosts in
site.yml seems to make the first host to be one inside neutron-server group
yielding the expected results. This patch needs to be re-visited as soon as a
version of ansible is chosen that fixes the issues with delegate_to

Co-Authored-By: Steven Dake <stdake@cisco.com>
Co-Authored-By: Vikram Hosakote <vhosakot@cisco.com>
Co-Authored-By: Nate Potter <nathaniel.potter@intel.com>
Co-Authored-By: Ganesh Mahalingam <ganesh.mahalingam@intel.com>
Change-Id: Ia712b323aa9d750d470a11ee899ab1b3054a903f
Partial-Bug: #1546789
2016-04-01 10:13:31 -07:00
Dave McCowan
bbbe84667d When two interfaces are used for two VIPs, tie them together
When a node uses two physical interfaces for its two VIPs, these
physical interfaces should be tied together, so both VIPs will
be taken out of scheduling if either one fails.  Without this change,
if a request comes into one interface that needs access to the
second interface to process the request, the original request
unnecessarily fails.  Repeating this results in a black hole where
a failing server keeps getting new requests.

Change-Id: Ic51e6584c1fbda3eb7821cb47f759c77e562cc65
Closes-Bug: #1550455
2016-04-01 12:04:22 -04:00
Michal Rostecki
5aa2dcce30 Fix IP resolve check for RabbitMQ on AIO
On AIO installation we cannot assume that the public IP address
will be the first entry in "getent ahostsv4" result, because
it may be also a localhost address. To make this check positive
in AIO, we should look for the public IP in the whole output.

Change-Id: I1da7b95d7f00c7f87ff68ead46bf55fdea812599
Closes-Bug: 1564564
2016-04-01 07:57:14 +02:00
Jeffrey Zhang
5680bf0f24 Enable the kibana access from the external network
Closes-Bug: #1564440
Change-Id: I8b9ff303d131fab18e51e9f99c64f4e4004f41b4
2016-03-31 22:34:14 +08:00
Steven Dake
1fac190cec Increase database connections to 10000
Scale limit testing on a 64 node cluster with 13 TB RAM and 2600
cores showed that atleast 1800 database connections are required
to appropriately start 2000 virtual machines simulateounsly.  Other
documentation on the internet recommends larger values such as 8000,
so we set to a larger value that should be able to handle a maxed out
512GB ram per compute node cluster with all services enabled.

Change-Id: I8767cf3fb04e066cc22e796c647e944b4e4a1742
Closes-Bug: #1564275
2016-03-31 00:42:23 -07:00
Jenkins
3a028e4334 Merge "Revert libvirt change in heka-openstack.toml.j2 in master" 2016-03-31 06:43:48 +00:00
Jeffrey Zhang
de6650874d Start the haproxy by using haproxy-system-wrapper
haproxy-system-wrapper is a solution for systemd from upstream. it can
handle the reload graceful.

Change-Id: I6a3d141af065e429bd1be1b7252f5c6df1fda3bb
Closes-Bug: #1559238
2016-03-30 18:07:40 +08:00
Jenkins
94e5616464 Merge "Fix the Ceph reconfigure" 2016-03-30 09:49:07 +00:00
Jenkins
a8e788819b Merge "Added precheck to check kernel version 4.2 on Ubuntu 14.04." 2016-03-30 09:34:35 +00:00
Jeffrey Zhang
792e2c1f26 Fix the nova reconfigure in the multinode env
Checking the fix condition first in the when clause rather than the
dynamic condition. Otherwise it will raise issue, especially in
multinode env.

For example use

  when:
    - nova_console == 'novnc'
    - inventory_hostname in groups['nova-novncproxy']

rather than

  when:
    - inventory_hostname in groups['nova-novncproxy']
    - nova_console == 'novnc'

Because the nova-novncproxy groups may do not exist.

Change-Id: I1e76ee239908a17a4c2ffd9a18c570fbc485172e
Closes-Bug: #1563643
2016-03-30 16:57:42 +08:00
Jenkins
981d8206ab Merge "Starting heka bootstrap container failed in deployment" 2016-03-30 07:47:26 +00:00
Jenkins
251b498c72 Merge "Fix the wrong refer self in kolla_docker.py" 2016-03-30 05:22:21 +00:00
Carlos Cesario
833c1269c7 Added precheck to check kernel version 4.2 on Ubuntu 14.04.
Kolla recommends LTS wily kernel version 4.2 on Ubuntu 14.04.

http://docs.openstack.org/developer/kolla/quickstart.html#installing-dependencies

The precheck check the running kernel version, because checking
the linux-image-generic-lts-wily cannot be a valid check.

The package (linux-image-generic-lts-wily) can be installed, but
the running kernel can be other version.

Closes-Bug: #1563500
Co-Authored-By: Vikram Hosakote <vhosakot@cisco.com>
Change-Id: I4e08bca95493b2dbd0ad5f8d1ea4e07b18c4959d
2016-03-30 03:12:19 +00:00
Jenkins
882636c9dd Merge "Restrict the rabbitmq only start on rabbitmq group" 2016-03-29 19:47:10 +00:00
Mauricio Lima
9a910d6b1e Starting heka bootstrap container failed in deployment
Change-Id: I7efa3f5df3a5743d264864070967cf995900b147
Closes-Bug: #1561111
2016-03-29 15:24:47 -04:00
Jeffrey Zhang
f7066db4f0 Fix the Ceph reconfigure
Closes-Bug: #1563251
Change-Id: I0072c92d6c8160f868b4ab0ec08b11541d3f3401
2016-03-29 18:05:48 +08:00
Jenkins
0523c8fc90 Merge "Refactor the glance service group mapping" 2016-03-29 09:27:02 +00:00
Jenkins
48fcd382ec Merge "Only copy the necessary config files for Glance" 2016-03-29 09:26:40 +00:00