DescribeAvailabilityZones: services alive status is not consistent (if clocks are not synced)

Bug #708822 reported by Tushar Patil
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
OpenStack Compute (nova)
Invalid
Low
Unassigned

Bug Description

Tested on Revision No : 624

If I run describe-availability-zones repeatedly for 4-5 times, each time it shows service status as different.

I am using default "service_down_time" flag which is set to 60.

root@ubuntu-network-api-server:/home/openstack/nova# euca-describe-availability-zones verbose
AVAILABILITYZONE nova available
AVAILABILITYZONE |- ubuntu-compute-01
AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:59
AVAILABILITYZONE |- ubuntu-compute-03
AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:58
AVAILABILITYZONE |- ubuntu-compute-02
AVAILABILITYZONE | |- nova-compute enabled :-) 2011-01-27 19:19:54
AVAILABILITYZONE |- ubuntu-network-api-server
AVAILABILITYZONE | |- nova-scheduler enabled :-) 2011-01-27 19:19:50
AVAILABILITYZONE | |- nova-network enabled :-) 2011-01-27 19:19:52

root@ubuntu-network-api-server:/home/openstack/nova# euca-describe-availability-zones verbose
AVAILABILITYZONE nova available
AVAILABILITYZONE |- ubuntu-compute-01
AVAILABILITYZONE | |- nova-compute enabled :-) 2011-01-27 19:24:00
AVAILABILITYZONE |- ubuntu-compute-03
AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:24:09
AVAILABILITYZONE |- ubuntu-compute-02
AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:24:05
AVAILABILITYZONE |- ubuntu-network-api-server
AVAILABILITYZONE | |- nova-scheduler enabled :-) 2011-01-27 19:24:01
AVAILABILITYZONE | |- nova-network enabled :-) 2011-01-27 19:23:54

Related branches

Revision history for this message
Vish Ishaya (vishvananda) wrote : Re: [Bug 708822] [NEW] DescribeAvailabilityZones: services alive status is not consistent
Download full text (4.0 KiB)

This can happen if you have clock skew. Try installing ntp on all of your boxes so the clocks are in sync.

On Jan 27, 2011, at 11:24 AM, Tushar Patil wrote:

> Public bug reported:
>
> Tested on Revision No : 624
>
> If I run describe-availability-zones repeatedly for 4-5 times, each time
> it shows service status as different.
>
> I am using default "service_down_time" flag which is set to 60.
>
> root@ubuntu-network-api-server:/home/openstack/nova# euca-describe-availability-zones verbose
> AVAILABILITYZONE nova available
> AVAILABILITYZONE |- ubuntu-compute-01
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:59
> AVAILABILITYZONE |- ubuntu-compute-03
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:58
> AVAILABILITYZONE |- ubuntu-compute-02
> AVAILABILITYZONE | |- nova-compute enabled :-) 2011-01-27 19:19:54
> AVAILABILITYZONE |- ubuntu-network-api-server
> AVAILABILITYZONE | |- nova-scheduler enabled :-) 2011-01-27 19:19:50
> AVAILABILITYZONE | |- nova-network enabled :-) 2011-01-27 19:19:52
>
> root@ubuntu-network-api-server:/home/openstack/nova# euca-describe-availability-zones verbose
> AVAILABILITYZONE nova available
> AVAILABILITYZONE |- ubuntu-compute-01
> AVAILABILITYZONE | |- nova-compute enabled :-) 2011-01-27 19:24:00
> AVAILABILITYZONE |- ubuntu-compute-03
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:24:09
> AVAILABILITYZONE |- ubuntu-compute-02
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:24:05
> AVAILABILITYZONE |- ubuntu-network-api-server
> AVAILABILITYZONE | |- nova-scheduler enabled :-) 2011-01-27 19:24:01
> AVAILABILITYZONE | |- nova-network enabled :-) 2011-01-27 19:23:54
>
> ** Affects: nova
> Importance: Undecided
> Status: New
>
> --
> You received this bug notification because you are a member of Nova Bug
> Team, which is subscribed to OpenStack Compute (nova).
> https://bugs.launchpad.net/bugs/708822
>
> Title:
> DescribeAvailabilityZones: services alive status is not consistent
>
> Status in OpenStack Compute (Nova):
> New
>
> Bug description:
> Tested on Revision No : 624
>
> If I run describe-availability-zones repeatedly for 4-5 times, each
> time it shows service status as different.
>
> I am using default "service_down_time" flag which is set to 60.
>
> root@ubuntu-network-api-server:/home/openstack/nova# euca-describe-availability-zones verbose
> AVAILABILITYZONE nova available
> AVAILABILITYZONE |- ubuntu-compute-01
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:59
> AVAILABILITYZONE |- ubuntu-compute-03
> AVAILABILITYZONE | |- nova-compute enabled XXX 2011-01-27 19:19:58
> AVAILABILITYZONE |- ubuntu-compute-02
> AVAILABILITYZONE | |- nova-compute enabled :-) 2011-01-27 19:19:54
> AVAILABILITYZONE |- ubuntu-network-api-server
> AVAILABILITYZONE | |- nova-scheduler enabled :-) 2011-01-27 19:19...

Read more...

Tushar Patil (tpatil)
Changed in nova:
assignee: nobody → Tushar Patil (tpatil)
status: New → In Progress
Revision history for this message
Tushar Patil (tpatil) wrote : Re: DescribeAvailabilityZones: services alive status is not consistent

If the time are sync on all the nodes including api server, I don't think we need to find this issue.

Revision history for this message
Tushar Patil (tpatil) wrote :

In my previous comment, I meant to say we don't need to fix the issue

Thierry Carrez (ttx)
Changed in nova:
importance: Undecided → Low
summary: - DescribeAvailabilityZones: services alive status is not consistent
+ DescribeAvailabilityZones: services alive status is not consistent (if
+ clocks are not synced)
Revision history for this message
Thierry Carrez (ttx) wrote :

@tushar: any chance you could merge with trunk and resubmit fix for Cactus ?

Revision history for this message
Thierry Carrez (ttx) wrote :

Unassigning since Tushar seems to have gone AWOL, to give someone else a chance to work on this.
@Tushar: feel free to reassign yourself and propose again.

Changed in nova:
assignee: Tushar Patil (tpatil) → nobody
status: In Progress → Triaged
Revision history for this message
Vladimir Popovski (vladimir.p) wrote :

the fundamental problem here is that comparison is performed between time on machine where API server running vs time where SQL is running. We just experienced exactly the same issue with scheduler, where node was off a bit...

We either need to have a strict requirement to have clocks on all nodes synchronized or to implement a fix on sqlalchemy.api level - updated/created times wil be set and verified always on the same machine.

Revision history for this message
Tom Fifield (fifieldt) wrote :

Is this bug still valid?

Jim Jiang (jiangwt100)
Changed in nova:
assignee: nobody → 蒋闻天 (jiangwt100)
assignee: 蒋闻天 (jiangwt100) → nobody
Revision history for this message
Michael Still (mikal) wrote :

@Tom -- I don't think so. As long as the docs say you should have NTP deployed, which everyone should anyways, this bug is invalid.

Changed in nova:
status: Triaged → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.