New Solaris features: Notifications from the Fault Management Architecture
The notification framework in Solaris is not only able to notify you of problems with services. As you may know, the fault management architecture monitors the hardware of a Solaris system. With the fmdump
command you can retrieve the information. For example you get this informatoin when you unplug an USB stick with an active ZFS filesystem:
root@solaris:/home/jmoekamp# fmdump -v<br />
TIME UUID SUNW-MSG-ID EVENT
Nov 22 12:59:48.7489 22dffa96-bf27-4f41-ab1f-dfedcdfb11e7 ZFS-8000-HC Diagnosed<br />
100% fault.fs.zfs.io_failure_wait
Problem in: zfs://pool=usbstick1<br />
Affects: zfs://pool=usbstick1<br />
FRU: -<br />
Location: -
Wouldn’t it be nice to receive this and similar information about the failure of components. It’s really simple to configure. However i assume , that you’ve already installed smtp-notify
package with
root@solaris:/# pkg install service/fault-management/smtp-notify
To configure the notification, you just have to configure the target mail adress for the mails.
root@solaris:/# svccfg setnotify problem-diagnosed mailto:root@localhost
To check the configuration you can use the listnotify
command:
root@solaris:/# svccfg listnotify problem-diagnosed
Event: problem-diagnosed (source: svc:/system/fm/notify-params:default)
Notification Type: smtp
Active: true
reply-to: root@localhost
to: root@localhost
Notification Type: snmp
Active: true
Notification Type: syslog
Active: true
When you unplug the USB-stick with while the ZFS pool is imported, you will find a mail like this one in your mailbox:
root@solaris:/# mail
From noaccess@solaris.local Mon Nov 22 12:59:48 2010
Date: Mon, 22 Nov 2010 12:59:48 +0100 (CET)
From: No Access User <noaccess@solaris.local>
Message-Id: <201011221159.oAMBxm5o002968@solaris.local>
Subject: Fault Management Event: solaris:ZFS-8000-HC
To: root@solaris.local
Content-Length: 560
SUNW-MSG-ID: ZFS-8000-HC, TYPE: Error, VER: 1, SEVERITY: Major
EVENT-TIME: Mon Nov 22 12:59:44 CET 2010
PLATFORM: VirtualBox, CSN: 0, HOSTNAME: solaris
SOURCE: zfs-diagnosis, REV: 1.0
EVENT-ID: 22dffa96-bf27-4f41-ab1f-dfedcdfb11e7
DESC: The ZFS pool has experienced currently unrecoverable I/O
failures. Refer to http://sun.com/msg/ZFS-8000-HC for more information.
AUTO-RESPONSE: No automated response will be taken.
IMPACT: Read and write I/Os cannot be serviced.
REC-ACTION: Make sure the affected devices are connected, then run
'zpool clear'.
? q
Neat, isn’t it?