hadoop teragen generates - fatal error: caught unhandled signal 11 with zero JVM

Bug #919137 reported by James Page
This bug affects 1 person
Affects: openjdk-6 (Ubuntu)
Status: Fix Released
Importance: Undecided
Assigned to: Unassigned

Bug Description

I'm testing hadoop packages for armhf from http://launchpad.net/~hadoop-ubuntu/+archive/dev:

sudo add-apt-repository ppa:hadoop-ubuntu/dev
sudo apt-get update
sudo apt-get install hadoop-conf-pseudo

I've reconfigured my install to use zero (-server) by default by changing the order of JVMs in /etc/java-6-openjdk/jvm-armhf.cfg so that -server KNOWN is first in the list. This is important as JamVM currently borks the data in the install.
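A minimal sketch of that reordering, for anyone reproducing the setup. The function name `promote_server_jvm` is hypothetical, and it assumes the cfg file holds one JVM entry per line starting with `-`, with `#` comment lines; the exact contents of the file are not shown in this report. It prints the reordered file to stdout rather than editing in place.

```shell
# Print a jvm.cfg-style file with the "-server KNOWN" entry moved ahead of
# all other JVM entries. Comment lines keep their position.
# Assumption: entries start with "-" and comments with "#".
promote_server_jvm() {
    awk '
        # Pass 1 (first read of the file): remember the -server KNOWN line.
        NR == FNR { if ($0 ~ /^-server[ \t]+KNOWN/) server = $0; next }
        # Pass 2: drop the entry from its old position...
        $0 ~ /^-server[ \t]+KNOWN/ { next }
        # ...and emit it just before the first remaining JVM entry.
        /^-/ && !done { if (server != "") print server; done = 1 }
        { print }
        # If -server KNOWN was the only entry, emit it at the end.
        END { if (!done && server != "") print server }
    ' "$1" "$1"
}
```

Something like `promote_server_jvm /etc/java-6-openjdk/jvm-armhf.cfg > /tmp/jvm-armhf.cfg` would let you review the result before copying it back with sudo; `java -version` should then report the Zero VM.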

Then:

for i in namenode datanode tasktracker jobtracker secondarynamenode; do sudo start hadoop-$i; done

I'm following the instructions for running the terasort benchmark here - http://www.michael-noll.com/blog/2011/04/09/benchmarking-and-stress-testing-an-hadoop-cluster-with-terasort-testdfsio-nnbench-mrbench/#teragen-generate-the-terasort-input-data-if-needed (albeit with a 100G data set rather than 1TB).
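As a rough sanity check on data set size: teragen takes a row count, not a byte count. Assuming TeraGen's usual 100-byte rows (an assumption, not something stated in this report), the size for a given row count can be sketched as below. The function name `teragen_bytes` is hypothetical.

```shell
# Estimate the teragen output size in bytes for a given row count.
# Assumption: each generated row is 100 bytes.
teragen_bytes() {
    echo $(( $1 * 100 ))
}

# Row count used in the teragen command in this report.
teragen_bytes 100000000
```

If the 100-byte assumption holds, a row count of 100000000 works out to about 10 GB, so the size quoted above may be approximate.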

When running:

hadoop jar /usr/share/hadoop/hadoop-examples-0.20.205.0.jar teragen 100000000 terasort-input

I get the following error sporadically from the JVM (tasks are re-run so it kind of recovers).

12/01/20 09:47:06 INFO mapred.JobClient: Task Id : attempt_201201200846_0004_m_000000_1, Status : FAILED
java.lang.Throwable: Child Error
        at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271)
Caused by: java.io.IOException: Task process exit with nonzero status of 134.
        at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258)

attempt_201201200846_0004_m_000000_1: #
attempt_201201200846_0004_m_000000_1: # A fatal error has been detected by the Java Runtime Environment:
attempt_201201200846_0004_m_000000_1: #
attempt_201201200846_0004_m_000000_1: # Internal Error (os_linux_zero.cpp:270), pid=19845, tid=2739610736
attempt_201201200846_0004_m_000000_1: # fatal error: caught unhandled signal 11
attempt_201201200846_0004_m_000000_1: #
attempt_201201200846_0004_m_000000_1: # JRE version: 6.0_24-b24
attempt_201201200846_0004_m_000000_1: # Java VM: OpenJDK Zero VM (20.0-b12 mixed mode linux-arm )
attempt_201201200846_0004_m_000000_1: # Derivative: IcedTea6 1.11pre
attempt_201201200846_0004_m_000000_1: # Distribution: Ubuntu precise (development branch), package 6b24~pre3-0ubuntu1
attempt_201201200846_0004_m_000000_1: # An error report file with more information is saved as:
attempt_201201200846_0004_m_000000_1: # /var/lib/hadoop/cache/mapred/mapred/local/taskTracker/jamespage/jobcache/job_201201200846_0004/attempt_201201200846_0004_m_000000_1/work/hs_err_pid19845.log
attempt_201201200846_0004_m_000000_1: #
attempt_201201200846_0004_m_000000_1: # If you would like to submit a bug report, please include
attempt_201201200846_0004_m_000000_1: # instructions how to reproduce the bug and visit:
attempt_201201200846_0004_m_000000_1: # https://bugs.launchpad.net/ubuntu/+source/openjdk-6/
attempt_201201200846_0004_m_000000_1: #
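For reading the "nonzero status of 134" line: assuming the status follows the common shell convention of 128 plus the terminating signal number, it can be decoded as sketched below. The function name `signal_from_exit_status` is hypothetical.

```shell
# Map an exit status to the name of the signal that ended the process.
# Assumption: statuses above 128 follow the shell's 128+N convention.
signal_from_exit_status() {
    status="$1"
    if [ "$status" -gt 128 ]; then
        # kill -l <number> prints the signal name without the SIG prefix.
        kill -l $(( status - 128 ))
    else
        echo "none"
    fi
}

signal_from_exit_status 134
```

Under that convention 134 is 128 + 6, i.e. SIGABRT, which would fit the JVM aborting itself after reporting the unhandled signal 11 (SIGSEGV).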

Unfortunately the hs_err_pid*.log is removed before I can grab it.
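One possible way to catch the error report before the task directory is cleaned up is to copy matching files out as they appear. This is only a sketch: the function name `collect_hs_err` and the destination directory are hypothetical, and a polling loop can still lose the race with the cleanup.

```shell
# Copy any hs_err_pid*.log files found under a directory tree to a safe place.
collect_hs_err() {
    src="$1"
    dest="$2"
    mkdir -p "$dest"
    # -p preserves timestamps; a file with the same name is overwritten.
    find "$src" -type f -name 'hs_err_pid*.log' -exec cp -p {} "$dest"/ \;
}

# Possible usage while the job runs (likely needs sudo to read the task dirs):
#   while sleep 1; do
#       collect_hs_err /var/lib/hadoop/cache/mapred /tmp/hs_err
#   done
```

The source directory in the usage comment is taken from the error report path in the log above.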

ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: openjdk-6-jre 6b24~pre3-0ubuntu1
ProcVersionSignature: Ubuntu 3.2.0-1403.5-omap4 3.2.0-rc5
Uname: Linux 3.2.0-1403-omap4 armv7l
ApportVersion: 1.91-0ubuntu1
Architecture: armhf
Date: Fri Jan 20 10:48:29 2012
ProcEnviron:
 LANGUAGE=en_GB:
 LANG=en_GB.UTF-8
 SHELL=/bin/bash
SourcePackage: openjdk-6
UpgradeStatus: No upgrade log present (probably fresh install)

James Page (james-page) wrote :

Managed to capture a crash - see bug 920528 (marked as duplicate of this one).

Xerxes Rånby (xranby) wrote :

Please retry using IcedTea6 1.11.1:

xranby@ac100-armel:~$ java -version
java version "1.6.0_24"
OpenJDK Runtime Environment (IcedTea6 1.11.1) (6b24-1.11.1-3ubuntu3)
OpenJDK Zero VM (build 20.0-b12, mixed mode)
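To confirm which IcedTea6 build is installed before retrying, the version can be pulled out of that output. A minimal sketch, assuming the "(IcedTea6 <version>)" format shown above; the function name `icedtea_version` is hypothetical.

```shell
# Extract the IcedTea6 version from `java -version` output read on stdin.
# Assumption: the runtime line contains "(IcedTea6 <version>)".
icedtea_version() {
    sed -n 's/.*(IcedTea6 \([^)]*\)).*/\1/p'
}

# java -version writes to stderr, so redirect it into the pipe:
#   java -version 2>&1 | icedtea_version
```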

James Page (james-page) wrote :

hadoop jar /usr/share/hadoop/hadoop-examples-*.jar teragen 100000000 terasort-input now successfully completes.

trying the corresponding terasort example now.

Xerxes Rånby (xranby)
Changed in openjdk-6 (Ubuntu):
status: New → Fix Committed
Xerxes Rånby (xranby) wrote :

(16:26:57) jamespage: xranby, w00t - teragen completed!
(10:27:27) xranby: jamespage: mind if i close https://bugs.launchpad.net/ubuntu/+source/openjdk-6/+bug/919137 ?
(10:27:37) jamespage: xranby, +1

Changed in openjdk-6 (Ubuntu):
status: Fix Committed → Fix Released