------- Comment From <email address hidden> 2015-10-27 15:41 EDT------- I just verified that issue is fixed in Ubuntu-3.19.0-32.37 kernel version
------------------------------------------------------------------------------------ Ubuntu 14.04.3 LTS ltc-fire14 hvc0
ltc-fire14 login: root Password: Last login: Tue Oct 27 10:11:22 CDT 2015 on hvc0 Welcome to Ubuntu 14.04.3 LTS (GNU/Linux 3.19.0-32-generic ppc64le)
* Documentation: https://help.ubuntu.com/ root@ltc-fire14:~# uname -a Linux ltc-fire14 3.19.0-32-generic #37-Ubuntu SMP Wed Oct 21 10:22:35 UTC 2015 ppc64le ppc64le ppc64le GNU/Linux root@ltc-fire14:~# cd /home/workload_scripts/ root@ltc-fire14:/home/workload_scripts# ls find_work.sh run_workload.sh root@ltc-fire14:/home/workload_scripts# ./run_workload.sh root@ltc-fire14:/home/workload_scripts# getscom -l Chip ID | Rev | Chip type ---------|-------|-------- 80000085 | DD2.0 | Centaur memory buffer 80000084 | DD2.0 | Centaur memory buffer 80000005 | DD2.0 | Centaur memory buffer 80000004 | DD2.0 | Centaur memory buffer 00000008 | DD2.0 | P8 (Venice) processor 00000000 | DD2.0 | P8 (Venice) processor root@ltc-fire14:/home/workload_scripts# getscom -c 0x0 11013100 0 root@ltc-fire14:/home/workload_scripts# getscom -c 0x0 11013106 15a20c688a448b01 root@ltc-fire14:/home/workload_scripts# getscom -c 0x0 11013107 ea5c139705980000 root@ltc-fire14:/home/workload_scripts# putscom -c 0x0 11013107 fa5c139705980000 fa5c139705980000 root@ltc-fire14:/home/workload_scripts# getscom -c 0x0 11013107 fa5c139705980000 root@ltc-fire14:/home/workload_scripts# putscom -c 0x0 11013100 1000000000000000 [ 333.045651] Fatal Hypervisor Maintenance interrupt [Not recovered] [ 333.045916] Error detail: Malfunction Alert [ 333.046288] HMER: 8040000000000000 [ 333.046543] CPU PIR: 00000000 [ 333.046601] [Unit: IFU] RegFile core check stop [ 333.046778] [Unit: PC ] Debug Trigger Error inject 1000000000000008[ 333.046883] F [194049345926,0] OPAL: Reboot requested due to Platform error.at[194049767279,3] OPAL: Reboot requested due to Platform error.al 1.69405|ERRL|Dumping errors reported prior to registration 3.46924|Ignoring boot flags, incorrect version 0x0 3.70396|ISTEP 6. 3 4.14478|ISTEP 6. 4 4.14531|ISTEP 6. 5 10.54385|HWAS|PRESENT> DIMM[03]=00000000AAAAAAAA 10.54386|HWAS|PRESENT> Membuf[04]=0C0C000000000000 10.54387|HWAS|PRESENT> Proc[05]=C000000000000000 23.49515|ISTEP 6. 6 [...] ------------------------------------------------------------------------------------
------- Comment From <email address hidden> 2015-10-27 15:41 EDT-------
I just verified that issue is fixed in Ubuntu-3.19.0-32.37 kernel version
------- ------- ------- ------- ------- ------- ------- ------- ------- ------- ------- -------
Ubuntu 14.04.3 LTS ltc-fire14 hvc0
ltc-fire14 login: root
Password:
Last login: Tue Oct 27 10:11:22 CDT 2015 on hvc0
Welcome to Ubuntu 14.04.3 LTS (GNU/Linux 3.19.0-32-generic ppc64le)
* Documentation: https:/ /help.ubuntu. com/ scripts/ fire14: /home/workload_ scripts# ls fire14: /home/workload_ scripts# ./run_workload.sh fire14: /home/workload_ scripts# getscom -l --|---- ---|--- ----- fire14: /home/workload_ scripts# getscom -c 0x0 11013100 fire14: /home/workload_ scripts# getscom -c 0x0 11013106 fire14: /home/workload_ scripts# getscom -c 0x0 11013107 fire14: /home/workload_ scripts# putscom -c 0x0 11013107 fa5c139705980000 fire14: /home/workload_ scripts# getscom -c 0x0 11013107 fire14: /home/workload_ scripts# putscom -c 0x0 11013100 1000000000000000 194049767279, 3] OPAL: Reboot requested due to Platform error.al 1.69405| ERRL|Dumping errors reported prior to registration HWAS|PRESENT> DIMM[03] =00000000AAAAAA AA HWAS|PRESENT> Membuf[ 04]=0C0C0000000 00000 HWAS|PRESENT> Proc[05] =C0000000000000 00 ------- ------- ------- ------- ------- ------- ------- ------- ------- ------- -------
root@ltc-fire14:~# uname -a
Linux ltc-fire14 3.19.0-32-generic #37-Ubuntu SMP Wed Oct 21 10:22:35 UTC 2015 ppc64le ppc64le ppc64le GNU/Linux
root@ltc-fire14:~# cd /home/workload_
root@ltc-
find_work.sh run_workload.sh
root@ltc-
root@ltc-
Chip ID | Rev | Chip type
-------
80000085 | DD2.0 | Centaur memory buffer
80000084 | DD2.0 | Centaur memory buffer
80000005 | DD2.0 | Centaur memory buffer
80000004 | DD2.0 | Centaur memory buffer
00000008 | DD2.0 | P8 (Venice) processor
00000000 | DD2.0 | P8 (Venice) processor
root@ltc-
0
root@ltc-
15a20c688a448b01
root@ltc-
ea5c139705980000
root@ltc-
fa5c139705980000
root@ltc-
fa5c139705980000
root@ltc-
[ 333.045651] Fatal Hypervisor Maintenance interrupt [Not recovered]
[ 333.045916] Error detail: Malfunction Alert
[ 333.046288] HMER: 8040000000000000
[ 333.046543] CPU PIR: 00000000
[ 333.046601] [Unit: IFU] RegFile core check stop
[ 333.046778] [Unit: PC ] Debug Trigger Error inject
1000000000000008[ 333.046883] F
[194049345926,0] OPAL: Reboot requested due to Platform error.at[
3.46924|Ignoring boot flags, incorrect version 0x0
3.70396|ISTEP 6. 3
4.14478|ISTEP 6. 4
4.14531|ISTEP 6. 5
10.54385|
10.54386|
10.54387|
23.49515|ISTEP 6. 6
[...]
-------