System Administration Commands                                        fmadm(8)



NAME
       fmadm - fault management configuration tool

SYNOPSIS
       fmadm [-q] [subcommand [arguments]]

DESCRIPTION
       The  fmadm  utility can be used by administrators and service personnel
       to view and modify system configuration parameters  maintained  by  the
       Solaris Fault Manager, fmd(8). fmd receives symptomatic telemetry asso‐
       ciated with conditions detected by the system software,  diagnoses  the
       telemetry  into  faults,  defects,  or  alerts, and initiates proactive
       self-healing activities such as disabling faulty components.


       fmadm can be used to do the following:

           o      View the set of diagnosis engines and agents that  are  cur‐
                  rently participating in fault management.


           o      View  the list of system components that have been diagnosed
                  as associated with a fault, defect, or alert.


           o      Perform administrative tasks related to these entities.



       The Fault Manager attempts to automate as many activities as  possible,
       so use of fmadm is typically not required. When the Fault Manager needs
       help from an administrator, service repair technician,  or  Oracle,  it
       produces a message indicating its needs. It also refers you to a knowl‐
       edge article on the Oracle website. The website might ask  you  to  use
       fmadm  or  one  of  the other fault management utilities to gather more
       information or perform additional tasks. The fmd(8), fmdump(8), and
       fmstat(8) man pages and the Securing Systems and Attached Devices in
       Oracle Solaris 11.4 guide describe other tools for observing fault
       management activities.


       One responsibility of the Fault Manager is to keep track of the
       location of components. At the chassis level, the fmadm *-alias
       subcommands manage the mapping from a chassis-name.chassis-serial
       identifier to an alias-id. The administered alias-id is intended to
       describe the physical location of a chassis.


       The  fmadm utility requires the user to be assigned the solaris.fm.read
       RBAC authorization ("Fault Management" or "Fault Information" RBAC pro‐
       file)  for read operations, or the solaris.fm.modify RBAC authorization
       ("Fault Management" RBAC profile) for modify operations. The fmadm load
       subcommand requires that the user possess all privileges.
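
       As a minimal sketch of checking these authorizations before running
       fmadm, the shell function below tests whether a list of authorizations
       grants a given one. It assumes the list comes from auths(1) as comma-
       or newline-separated names and that an entry ending in ".*" acts as a
       wildcard. has_auth is a hypothetical helper, not part of fmadm.

```shell
#!/bin/sh
# has_auth AUTH_LIST WANTED
# Hypothetical helper: succeeds (exit 0) when WANTED is granted by AUTH_LIST.
# AUTH_LIST is assumed to be comma- or newline-separated, as printed by
# auths(1); an entry ending in ".*" is assumed to be a wildcard.
has_auth() {
    printf '%s\n' "$1" | tr ',' '\n' | awk -v want="$2" '
        { gsub(/^[ \t]+|[ \t]+$/, "") }    # trim surrounding blanks
        $0 == want { found = 1 }            # exact match
        /\.\*$/ {                           # wildcard entry, e.g. solaris.fm.*
            prefix = substr($0, 1, length($0) - 1)
            if (index(want, prefix) == 1) found = 1
        }
        END { exit !found }
    '
}

# Example use on a Solaris system (not run here):
#   has_auth "$(auths)" solaris.fm.read || echo "need solaris.fm.read" >&2
```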

   SUBCOMMANDS
       The  fmadm  command accepts the following subcommands. Some of the sub‐
       commands accept or require additional options and operands. The acquit,
       load,  unload,  repaired,  replaced,  reset, and rotate subcommands are
       intended for trained technical  personnel.  Use  of  these  subcommands
       without the specific guidance of, for example, a Knowledge Base article
       is not recommended.


       fmadm acquit fmri | label [uuid]

           Notify the Fault Manager that the specified resource should not be
           considered a suspect in the event identified by uuid or, if no uuid
           is specified, in any faults, defects, or alerts that have been
           detected. The fmadm acquit command should be used only at the
           direction of a documented Oracle repair procedure. Administrators
           might need to apply additional commands to re-enable a previously
           faulted resource.


       fmadm acquit uuid

           Notify the Fault Manager that the event identified by uuid  can  be
           safely ignored. The fmadm acquit command should be used only at the
           direction of a documented Oracle repair  procedure.  Administrators
           might need to apply additional commands to re-enable any previously
           faulted resources.


       fmadm add-alias chassis-name.chassis-serial alias-id ['comment']

           The add-alias subcommand is used to establish alias-id as a managed
           alias  for  the chassis-name.chassis-serial chassis. When a managed
           alias is defined, the /dev/chassis devchassis(4FS) name space  rep‐
           resentation  of  the  chassis will use the more meaningful alias-id
           instead of the chassis-name.chassis-serial.


             # fmadm add-alias SUN-Storage-J4410.1039QAQ007 RACK29.U25-28

           The command shown above will verify that the new mapping  does  not
           conflict  with  existing mappings. In the case of conflict, no map‐
           ping change occurs. This command completes when the associated name
           space  updates are complete. If the updated name space does not use
           the new alias-id, a warning is printed, but the mapping is updated.
           If the name space update takes too long, a warning is printed.

           The add-alias subcommand also accepts "SYS" as an identifier for the
           main chassis, in place of the chassis-name.chassis-serial.

           If an optional comment is provided, the comment  is  preserved  and
           will  be  displayed by a subsequent lookup-alias or list-alias com‐
           mand. See also remove-alias and sync-alias.
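
           The "SYS" identifier and the optional comment can be combined as in
           the following sketch. The wrapper name alias_main_chassis and the
           alias-id and comment in the usage line are hypothetical. Only the
           add-alias and lookup-alias subcommands described above are used.

```shell
#!/bin/sh
# alias_main_chassis ALIAS_ID [COMMENT]
# Hypothetical wrapper: map the main chassis ("SYS") to ALIAS_ID, then print
# the resulting mapping. add-alias needs the solaris.fm.modify authorization.
alias_main_chassis() {
    if [ "$#" -ge 2 ]; then
        # Pass the comment only when one was supplied.
        fmadm add-alias SYS "$1" "$2" || return 1
    else
        fmadm add-alias SYS "$1" || return 1
    fi
    fmadm lookup-alias "$1"
}

# Example (hypothetical alias-id and comment):
#   alias_main_chassis RACK29.U30 'head node'
```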


       fmadm clear label | uuid | class@resource

           Notify the Fault Manager that any alert events associated with  the
           specified  location  label  or uuid or identified by class@resource
           should be cleared. This command can only be applied  to  an  alert,
           not to a defect or fault.


       fmadm config

           Display  the  configuration  of the Fault Manager itself, including
           the module name, version, and description of each component module.
           Fault Manager modules provide services such as automated diagnosis,
           self-healing, and messaging for hardware and  software  present  on
           the system.


       fmadm faulty [-afprsv] [-u uuid]

           This command is an alias for the fmadm list command.


       fmadm flush fmri | label

           Flush the information cached by the Fault Manager for the specified
           resource, for any faults, defects, or alerts for which the resource
           has already been repaired, acquitted or replaced.


       fmadm list [-afprsv] [-u uuid]

           Display  status  information  for  resources that the Fault Manager
           currently believes to be associated with a fault, defect, or alert.
           See also list-alert, list-defect, and list-fault subcommands.

           The following options are supported:



           -a         Display  all  resources  that might be associated with a
                      fault, defect, or alert. By default, the fmadm list com‐
                      mand  only lists output for resources for which a fault,
                      defect, or alert is still active. If you specify the  -a
                      option,  all  resource  information  cached by the Fault
                      Manager is listed, including information  for  resources
                      that  might  have already been acquitted or repaired, or
                      might no longer be present in the system.


           -f         Display FRUs (Field Replaceable Units) that are  associ‐
                      ated with a fault, defect, or alert.


           -p         Pipe  output through a pager with form feed between each
                      event.


           -r         Display affected Solaris resources with their Identifier
                      (FMRI) and their fault management state.


           -s         Display a one-line summary for each event.


           -u uuid    Only display the event with the given uuid.


           -v         Display full output.


           The percentage certainty is displayed if an event has multiple sus‐
           pects, either of different classes or on different  FRUs.  If  more
           than  one  resource  is  on the same FRU and it is not 100% certain
           that the event is associated with the FRU, the  maximum  percentage
           certainty of the possible suspects on the FRU is displayed.
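
           The per-FRU rule above can be illustrated with a small sketch. The
           input format ("FRU certainty" pairs, one suspect per line) and the
           sample values are made up for illustration and are not fmadm
           output.

```shell
#!/bin/sh
# max_certainty_per_fru
# Reads hypothetical "FRU certainty%" pairs on stdin (one suspect per line)
# and prints, for each FRU, the maximum certainty among its suspects.
max_certainty_per_fru() {
    awk '
        # "$2 + 0" converts "67%" to the number 67.
        !($1 in max) || $2 + 0 > max[$1] { max[$1] = $2 + 0 }
        END { for (fru in max) print fru, max[fru] "%" }
    ' | sort
}

# Example with made-up suspects: two on HDD11, one on HDD12.
#   printf '%s\n' 'HDD11 33%' 'HDD11 67%' 'HDD12 50%' | max_certainty_per_fru
```

           With those made-up suspects, HDD11 is reported at 67% (the larger
           of its two suspects) and HDD12 at 50%.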

           The Fault Manager associates the following states with every FRU
           that has been diagnosed as having a fault, defect, or alert.



           faulty                The FRU has been diagnosed by the Fault  Man‐
                                 ager   as  being  associated  with  a  fault,
                                 defect, or alert, and is still present in the
                                 system.


           faulty/not_present    The  FRU has been diagnosed by the Fault Man‐
                                 ager  as  being  associated  with  a   fault,
                                 defect,  or alert and is no longer present in
                                 the system.


           replaced              The FRU has been removed from the system  and
                                 a replacement FRU has taken its place.


           repaired              The  command  fmadm repaired has been used to
                                 notify the Fault Manager  that  the  FRU  has
                                 been repaired.


           acquitted             The  command  fmadm  acquit  has been used to
                                 notify the Fault Manager  that  the  FRU  has
                                 been acquitted.


           The  state of any affected Solaris resources is also displayed. For
           a faulty FRU, the state of the associated resources can be one of:


               o      Faulted and taken out of service


               o      Faulted but still in service


               o      Faulted but still providing degraded service

           For a FRU that has been repaired, replaced or acquitted, the  state
           of the associated resources can be one of:

               o      Okay and in service


               o      Out  of  service,  but  associated  components no longer
                      faulty


               o      Service degraded, but associated  components  no  longer
                      faulty



       fmadm list-alert [-afprsv] [-u uuid]

           This  subcommand behaves like list, but only displays suspect lists
           that contain an alert event.


       fmadm list-alias

           The list-alias command is used to display  all  comments  and  map‐
           pings.  See  also  add-alias, lookup-alias, remove-alias, and sync-
           alias subcommands.


       fmadm list-defect [-afprsv] [-u uuid]

           This subcommand behaves like list, but only displays suspect  lists
           that contain a defect event.


       fmadm list-fault [-afprsv] [-u uuid]

           This  subcommand behaves like list, but only displays suspect lists
           that contain a fault event.


       fmadm load path

           Load the specified Fault Manager module. The path must be an  abso‐
           lute  path and must refer to a module present in one of the defined
           directories for modules. Typically, the use of this command is  not
           necessary  because  the  Fault  Manager loads modules automatically
           when Solaris initially boots or as needed. See also fmadm unload.


       fmadm lookup-alias alias-id | chassis-name.chassis-serial

           The lookup-alias subcommand can be used to determine what the  cur‐
           rent mapping is. The following is an example command.


             # fmadm lookup-alias SUN-Storage-J4410.1039QAQ007

           See also add-alias, list-alias, remove-alias, and sync-alias.


       fmadm remove-alias alias-id | chassis-name.chassis-serial

           The  fmadm  remove-alias  subcommand  is  used to remove a chassis-
           name.chassis-serial to alias-id mapping.


             # fmadm remove-alias RACK29.U25-28

           The command shown above completes when the  associated  name  space
           updates are complete. See also add-alias, list-alias, lookup-alias,
           and sync-alias.


       fmadm repaired fmri | label

           Notify the Fault Manager that a repair procedure has  been  carried
           out on the specified resource. The fmadm repaired command should be
           used only at the direction of a documented Oracle repair procedure.
           Administrators might need to apply additional commands to re-enable
           a previously faulted resource.


       fmadm replaced fmri | label

           Notify the Fault Manager  that  the  specified  resource  has  been
           replaced.  This  command should be used in those cases in which the
           Fault Manager is unable to automatically  detect  the  replacement.
           The  fmadm replaced command should be used only at the direction of
           a documented Oracle repair procedure. Administrators might need  to
           apply   additional  commands  to  re-enable  a  previously  faulted
           resource.


       fmadm reset [-s serd] module

           Reset the specified Fault Manager module or module subcomponent. If
           the -s option is present, the specified Soft Error Rate Discrimina‐
           tion (SERD) engine is reset within the module. If the -s option  is
           not  present,  the  entire module is reset and all persistent state
           associated with the module is  deleted.  The  fmadm  reset  command
           should  only be used at the direction of a documented Oracle repair
           procedure. The use of  this  command  is  typically  not  necessary
           because the Fault Manager manages its modules automatically.


       fmadm rotate errlog | fltlog | infolog | infolog_hival

           The fmadm rotate command is a helper command for the logadm(8) com‐
           mand, so that logadm can rotate live log files correctly. The fmadm
           rotate command is not intended to be invoked directly (and invoking
           it directly is likely to lose log history). Use one of the  follow‐
           ing commands to cause the appropriate logfile to be rotated, if the
           current logfile is not zero in size:

             # logadm -p now -s 1b /var/fm/fmd/errlog
             # logadm -p now -s 1b /var/fm/fmd/fltlog
             # logadm -p now -s 1b /var/fm/fmd/infolog
             # logadm -p now -s 1b /var/fm/fmd/infolog_hival



       fmadm sync-alias

           The sync-alias subcommand is used to hand-import a set of  mappings
           in bulk. Two copies of the current mappings are maintained:


               o      /etc/dev/chassis_aliases


               o      /etc/dev/.chassis_aliases

           To   import  a  set  of  mappings  in  bulk,  you  can  update  the
           /etc/dev/chassis_aliases file and then run  fmadm  sync-alias.  See
           also add-alias, list-alias, lookup-alias, and remove-alias.


       fmadm unload module

           Unload the specified Fault Manager module. Specify module using the
           basename listed in the fmadm config output. Typically, the  use  of
           this  command  is not necessary because the Fault Manager loads and
           unloads modules automatically based on  the  system  configuration.
           See also fmadm load.



OPTIONS
       The following options are supported:


       -q    Set  quiet  mode.  fmadm does not produce messages indicating the
             result of successful operations to standard output.



OPERANDS
       The following operands are supported:


       subcommand    The name of a subcommand listed in SUBCOMMANDS.


       arguments     One or more options  or  arguments  appropriate  for  the
                     selected  subcommand,  as described in SUBCOMMANDS. Among
                     these arguments are fmri, uuid, and label. These identify
                     resources  that are the objects of fmadm subcommands. Use
                     fmadm list to obtain the fmri, uuid, and label for a tar‐
                     geted  resource.  In  general,  label  is  the most user-
                     friendly of these  operands.  See  the  Examples  section
                     below.



EXIT STATUS
       The following exit values are returned:


       0    Successful completion


       1    An  error  occurred.  Errors include a failure to communicate with
            fmd or insufficient RBAC authorization to  perform  the  requested
            operation


       2    Invalid command-line options were specified
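
       A script that runs fmadm in quiet mode can branch on these exit values.
       The following is a sketch only: describe_fmadm_status is a hypothetical
       helper, and its messages paraphrase the table above.

```shell
#!/bin/sh
# describe_fmadm_status STATUS
# Hypothetical helper: translate an fmadm exit value into a short message.
describe_fmadm_status() {
    case "$1" in
        0) echo "success" ;;
        1) echo "error (for example, fmd unreachable or insufficient RBAC authorization)" ;;
        2) echo "invalid command-line options" ;;
        *) echo "unexpected exit status: $1" ;;
    esac
}

# Example use on a Solaris system (not run here):
#   fmadm -q config >/dev/null 2>&1
#   describe_fmadm_status "$?"
```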



EXAMPLES
       Example 1 Invoking the list Subcommand



       The  following  command invokes the list subcommand, which displays the
       uuid, label, and fmri for a component.


         # fmadm list
         ------------ ------------------------------------  ------------ ---------
         TIME         EVENT-ID                              MSG-ID       SEVERITY
         ------------ ------------------------------------  ------------ ---------
         Sep 09 16:15 96609fae-113c-e48c-b1cf-ebf4b0902d72  DISK-8000-3E Critical

         Problem Status  : open
         Diag Engine     : eft / 1.16
         System
            Manufacturer : Oracle-Corp.
            Name         : SUN-FIRE-X4170-SERVER
            Part Number  : unknown
            Serial Number: 0920XF508B

         ----------------------------------------
         Suspect 1 of 1:
           Fault class: fault.io.scsi.cmd.disk.dev.rqs.derr
           Certainty  : 100%
           Affects    : dev:///:devid=id1,sd@n5000c5000940edbb//scsi_vhci/disk@g\
                          5000c5000940edbb
           Status     : faulted and taken out of service

            FRU
               Status            : faulty
               Location         : "/SUN-Storage-J4410.1037QAQ052/HDD11"
               Location Alias   : "/RACK29.U25-28/HDD11"
               Manufacturer      : SEAGATE
               Name              : ST330057SSUN300G
               Part Number       : SEAGATE-ST330057SSUN300G
               Revision          : 0205
               Serial Number     : 000930G01CN4----3SJ01CN4
               Chassis
                  Manufacturer   : Oracle-Corp.
                  Name           : SUN-Storage-J4410
                  Part Number    : 594-5329
                  Serial Number  : 1037QAQ052
               ...
               ...




       In the preceding output, the uuid is the first  item  in  the  EVENT-ID
       column,  96609fae-113c-e48c-b1cf-ebf4b0902d72.  The label is in the FRU
       section in the Location line, "/SUN-Storage-J4410.1037QAQ052/HDD11". In
       this  example,  an  alias for the chassis has been set, and the aliased
       location    is    displayed    in    the    Location    Alias     line,
       "/RACK29.U25-28/HDD11".



       The fmris are available with fmdump  -v:


         # fmdump -v
         Sep 09 16:15:36.9252 96609fae-113c-e48c-b1cf-ebf4b0902d72 DISK-8000-3E \
         Diagnosed 100%  fault.io.scsi.cmd.disk.dev.rqs.derr

         Problem in: hc://:scheme=:chassis-mfg=Oracle-Corp.:chassis-name=SUN-\
         Storage-J4410:chassis-part=594-5329:chassis-serial=1037QAQ052/ses-\
         enclosure=0/bay=11/disk=0

         Affects: dev:///:devid=id1,sd@n5000c5000940edbb//\
         scsi_vhci/disk@g5000c5000940edbb
         FRU: hc://chassis-mfg=Oracle-Corp.:chassis-name=SUN-Storage-J4410\
         :chassis-part=594-5329:chassis-serial=1037QAQ052:fru-mfg=SEAGATE\
         :fru-name=SEAGATE-ST330057SSUN300G:fru-part=SEAGATE-ST330057SSUN300G\
         :fru-revision=0205:fru-serial=000930G01CN4--------3SJ01CN4/\
         ses-enclosure=0/bay=11/disk=0

         FRU Location: "/SUN-Storage-J4410.1037QAQ052/HDD11"




       Note that label is the easiest-to-use identifier. Either the aliased or
       the non-aliased form of the Location may be used.
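
       The uuid and label described in Example 1 can be picked out of saved
       fmadm list output with a short awk filter. This is a sketch that
       assumes the layout shown in the example. The human-readable output is
       not a stable interface, so the patterns might need adjusting, and
       labels that contain spaces are not handled. fm_ids is a hypothetical
       name.

```shell
#!/bin/sh
# fm_ids
# Reads fmadm list output on stdin and prints uuid=, label=, and alias= lines.
# Assumes the layout shown in Example 1.
fm_ids() {
    awk '
        # Event row: "Mon DD HH:MM <uuid> <msg-id> <severity>"
        $3 ~ /^[0-9]+:[0-9]+$/ && split($4, part, "-") == 5 { print "uuid=" $4 }
        # FRU location line: Location : "/chassis/label"
        $1 == "Location" && $2 == ":" { gsub(/"/, "", $3); print "label=" $3 }
        # Aliased location line: Location Alias : "/alias/label"
        $1 == "Location" && $2 == "Alias" && $3 == ":" {
            gsub(/"/, "", $4); print "alias=" $4
        }
    '
}

# Example use on a Solaris system (not run here):
#   fmadm list | fm_ids
```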


       Example 2 Obtaining the Module Name



       The following command displays the module name for each component.  The
       module name is specified as input to the fmadm unload command.


         # fmadm config
         MODULE                   VERSION STATUS  DESCRIPTION
         cpumem-retire            1.1     active  CPU/Memory Retire Agent
         disk-transport           1.0     active  Disk Transport Agent
         eft                      1.16    active  eft diagnosis engine
         ..
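
       To collect just the module names from this listing, for example before
       passing one to fmadm unload, a filter such as the following might be
       used. It is a sketch that assumes the four-column layout shown above.
       fm_module_names is a hypothetical name.

```shell
#!/bin/sh
# fm_module_names
# Reads fmadm config output on stdin and prints the MODULE column,
# skipping the header row and any short or elided lines.
fm_module_names() {
    awk '$1 != "MODULE" && NF >= 4 { print $1 }'
}

# Example use on a Solaris system (not run here):
#   fmadm config | fm_module_names
```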



ATTRIBUTES
       See attributes(7) for descriptions of the following attributes:


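       +-----------------------------+-----------------------------+
       |      ATTRIBUTE TYPE         |      ATTRIBUTE VALUE        |
       +-----------------------------+-----------------------------+
       |Availability                 |system/fault-management      |
       +-----------------------------+-----------------------------+
       |Interface Stability          |See below                    |
       +-----------------------------+-----------------------------+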



       The  command-line  options  are Committed. The human-readable output is
       not-an-interface.

SEE ALSO
       fmd(8), fmdump(8),  fmstat(8),  logadm(8),  syslogd(8),  attributes(7),
       devchassis(4FS)


       Securing Systems and Attached Devices in Oracle Solaris 11.4

NOTES
       Oracle Solaris FMA does not perform, and is not actively involved in,
       SMART failure analysis or prediction. It only reads the SMART data
       reported by the disk.


       A third-party utility such as SMARTCTL can be used to view more SMART
       information, which is read-only. The thresholds are determined by the
       disk manufacturer and can vary from one disk make, model, or firmware
       level to another. HDD manufacturers do not make SMART threshold
       information available. SMARTCTL might still be able to show these
       values, but they are read-only.


       Once a disk asserts a SMART failure prediction or warning, it must be
       replaced. You cannot turn off SMART failure prediction. It is only a
       matter of time before the disk fails completely.


       While it is possible to tell Oracle Solaris FMA to ignore a specific
       SMART failure event by using the fmadm acquit command, doing so is not
       recommended. Again, once a disk asserts a SMART failure, that state
       cannot be changed and the disk must be replaced.


       Keep your system software and firmware (including SAS controller and
       disk firmware) up to date so that the system has the best available
       capabilities.



Oracle Solaris 11.4               11 May 2021                         fmadm(8)