Cluster stays in waiting state and then goes into error

Bug #1477530 reported by Tatiana Kholkina
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Sahara
Fix Released
High
Sergey Reshetnyak

Bug Description

I create a cluster using this guide http://docs.openstack.org/developer/sahara/devref/quickstart.html but it stays in waiting state and after some time goes into error. In sahara logs I see the following:

2015-07-17 09:31:20.510 DEBUG sahara.service.engine [req-c4feb0e8-8f35-4634-b939-5a51d340f2a2 demo demo] [instance: d3571fd3-0f20-4c39-bfbc-239c9b5f1413, cluster: b992
81a7-d38e-4d4e-adf3-3774261fd8ee] Can't login to node, IP: 172.24.4.6, reason error: [Errno 110] Connection timed out

Seems it can not ssh to the instances since port 22 isn't opened. 'nova list' and 'nova secgroup-list' show 3 active instances and 3 security groups. But these groups are not assigned to the instances. After I assign them manually the cluster turns to active state.

I tested it on devstack (master branch) using nova-net.

Please let me know if I can provide any further information.

Revision history for this message
Michael McCune (mimccune) wrote :

we have discussed this issue and we are wondering if the automatic security groups were turned on for this test?

we are having some difficulty assessing the proper response as this could be seen as an operator/configuration issue, assuming that the ports were not opened. if, however, the automatic security groups are not working with nova networking then this would be a bug.

also, there is another possibility here, and that is for sahara to check whether the proper security groups are in place for ssh to occur.

regardless, we would like to know a little more about the setup. in specific, were automatic security groups enabled?

Changed in sahara:
status: New → Incomplete
Changed in sahara:
assignee: nobody → Sergey Reshetnyak (sreshetniak)
milestone: none → liberty-3
Changed in sahara:
importance: Undecided → High
status: Incomplete → Triaged
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to sahara (master)

Fix proposed to branch: master
Review: https://review.openstack.org/217791

Changed in sahara:
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to sahara (master)

Reviewed: https://review.openstack.org/217791
Committed: https://git.openstack.org/cgit/openstack/sahara/commit/?id=92a1b7052fb51156488ac7cff68d18d1d6822c8c
Submitter: Jenkins
Branch: master

commit 92a1b7052fb51156488ac7cff68d18d1d6822c8c
Author: Sergey Reshetnyak <email address hidden>
Date: Thu Aug 27 19:34:18 2015 +0300

    Fix problem with using auto security groups in Heat

    Change-Id: I95744f71770486ea2ccdc20db456702d53ae5c42
    Closes-bug: #1477530

Changed in sahara:
status: In Progress → Fix Committed
Changed in sahara:
status: Fix Committed → Fix Released
Thierry Carrez (ttx)
Changed in sahara:
milestone: liberty-3 → 3.0.0
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.