pacemaker pengine crashes ofter

Bug #715751 reported by Ricardo Sousa
28
This bug affects 5 people
Affects Status Importance Assigned to Milestone
pacemaker (Ubuntu)
Won't Fix
Undecided
Andres Rodriguez

Bug Description

Binary package hint: pacemaker

on certain cluster operations (resource migration for instance) pacemaker appears to crash with the following message or similar.

 pengine[13685]: segfault at 10 ip 00007f47676e18ec sp 00007fff54f13690 error 4 in libpengine.so.3.0.0[7f47676d5000+36000]

This error resembles the issue reported and fixed on this thread http://<email address hidden>/msg07193.html

Thank you

Revision history for this message
Andres Rodriguez (andreserl) wrote :

Hi Ricardo,

Thank you for taking the time to report bugs and trying to make Ubuntu better.

Now, could you please provide the following information to continue to determine the cause of your issue.

1. Pacemaker and Ubuntu version in use.
2. Sample configuration
3. Step by Step to be able reproduce this error/bug.

I'm marking this bug as incomplete until the information requested is provided. Thank you again!

Changed in pacemaker (Ubuntu):
status: New → Incomplete
Revision history for this message
Ricardo Sousa (rsousa-servismart-deactivatedaccount) wrote :

We are running Lucid:
Distributor ID: Ubuntu
Description: Ubuntu 10.04.2 LTS
Release: 10.04
Codename: lucid

and the involved packages are:

ii cluster-agents 1:1.0.3-2ubuntu1 The reusable cluster components for Linux HA
ii cluster-glue 1.0.5-1 The reusable cluster components for Linux HA
ii heartbeat 1:3.0.3-1ubuntu1 Subsystem for High-Availability Linux
ii libcluster-glue 1.0.5-1 The reusable cluster components for Linux HA
ii libcorosync4 1.2.0-0ubuntu1 Standards-based cluster framework (libraries
ii libheartbeat2 1:3.0.3-1ubuntu1 Subsystem for High-Availability Linux (libra
ii pacemaker 1.0.8+hg15494-2ubuntu2 HA cluster resource manager

Revision history for this message
Ricardo Sousa (rsousa-servismart-deactivatedaccount) wrote :

To reproduce the problem we only need to run:
crm resource move nfs

It isn't every time but happens often enough. (we have about 150 instances in the last day or so)

Revision history for this message
nunogt (nunogt) wrote :

This problem also affects my production systems running Lucid. The following patch http://<email address hidden>/msg07525.html should fix this. Any chances of an updated package?

Revision history for this message
Andres Rodriguez (andreserl) wrote :

I'll look into this in the next couple days as soon as I can set up a test environment.

Cheers,

Changed in pacemaker (Ubuntu):
assignee: nobody → Andres Rodriguez (andreserl)
Revision history for this message
Mike Forbes (mike.forbes) wrote :

Any movement here?
This is also affecting us in the same way.

Revision history for this message
Ante Karamatić (ivoks) wrote :

Have you tried packages from the PPA?

https://launchpad.net/~ubuntu-ha-maintainers/+archive/ppa/

This PPA includes new version of pacemaker and a fix for glib. It also includes support for upstart OCF.

Revision history for this message
Mike Forbes (mike.forbes) wrote :

Thanks Ante,

Any Idea of potential issues upgrading from pacemaker 1.0.8+hg15494-2ubuntu2 (lucid standard) to this PPA's version?

Revision history for this message
Ante Karamatić (ivoks) wrote : Re: [Ubuntu-ha] [Bug 715751] Re: pacemaker pengine crashes ofter

Dana 05.03.2012 22:47, Mike Forbes je napisao:

> Any Idea of potential issues upgrading from pacemaker
> 1.0.8+hg15494-2ubuntu2 (lucid standard) to this PPA's version?

I haven't seen one, but of course, test it on a non-production system.

Changed in pacemaker (Ubuntu):
status: Incomplete → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.