I need a solution
Hi,
We are trying to solve a problem with tape drives that go down after every unmount request.
Most of the drives are shared. The robot is controlled by one media server running RH 6.4 with NBU 7.5.0.4. The other media servers run NBU 6.5 through NBU 7.5.0.4.
The master server runs NBU 7.5.0.4 on RedHat 5.
The tape library is an ADIC Scalar 10K, and we have configured SDLC 2.7 as the SCSI/FC bridge. The robot type is configured as TLD.
In the bptm log on the media server with shared drives I have:
14:26:41.551 [12386416] <2> nbjm_media_request: Job control returned to BPTM
14:26:41.551 [12386416] <2> drivename_open: Called with Create 1, file s10k6_12
14:26:41.551 [12386416] <2> drivename_lock: lock established
14:26:41.552 [12386416] <4> create_tpreq_file: symlink to path /dev/rmt2.1
14:26:41.553 [12386416] <2> drivename_write: Called with mode 2
14:26:41.555 [12386416] <2> process_tapealert: TapeAlert returned 0x00000000 0x00000000 (from tapealert_and_release)
14:26:41.556 [12386416] <2> really_tpunmount: tpunmount'ing /usr/openv/netbackup/db/media/tpreq/drive_s10k6_12
14:26:42.750 [12386416] <2> tapelib: wait_for_ltid, UnMount, timeout 240
and then:
14:27:20.781 [12386416] <2> db_end: Need to collect reply
14:27:20.781 [12386416] <16> really_tpunmount: error unloading media, TpErrno = Robot operation failed
14:27:20.781 [12386416] <4> create_tpreq_file: symlink to path /dev/rmt2.1
14:27:20.783 [12386416] <2> process_tapealert: TapeAlert returned 0x00000000 0x00000000 (from tapealert_and_release)
14:27:20.784 [12386416] <2> tapealert_and_release: report_attr, fl1 0x00000001, fl2 0x00000000
14:27:20.786 [12386416] <2> tapealert_and_release: SCSI RELEASE
14:27:20.786 [12386416] <2> drivename_unlock: unlocked
14:27:20.786 [12386416] <2> drivename_close: Called for file s10k6_12
14:27:20.786 [12386416] <2> main: Sending [EXIT STATUS 174] to NBJM
14:27:20.786 [12386416] <2> bptm: EXITING with status 174 <----------
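To see how long the unmount waited before failing, the timestamps in the bptm excerpt can be compared directly. Below is a minimal Python sketch, assuming the line layout visible in the excerpt (time, [pid], <level>, function: message). The helper names `parse_line` and `seconds_between` are made up for illustration and are not part of NetBackup.

```python
import re
from datetime import datetime

# Assumed layout, taken from the excerpt: HH:MM:SS.mmm [pid] <level> function: message
LINE_RE = re.compile(r"^(\d{2}:\d{2}:\d{2}\.\d{3}) \[(\d+)\] <(\d+)> (\w+): (.*)$")


def parse_line(line):
    """Split one log line into its fields; return None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    stamp, pid, level, func, msg = match.groups()
    return {
        "time": datetime.strptime(stamp, "%H:%M:%S.%f"),
        "pid": int(pid),
        "level": int(level),
        "func": func,
        "msg": msg,
    }


def seconds_between(first_line, second_line):
    """Elapsed seconds between two log lines from the same day."""
    first = parse_line(first_line)
    second = parse_line(second_line)
    return (second["time"] - first["time"]).total_seconds()


# The two lines are copied from the bptm excerpt above.
wait_line = "14:26:42.750 [12386416] <2> tapelib: wait_for_ltid, UnMount, timeout 240"
error_line = ("14:27:20.781 [12386416] <16> really_tpunmount: "
              "error unloading media, TpErrno = Robot operation failed")

elapsed = seconds_between(wait_line, error_line)
print(f"unmount failed after {elapsed:.3f} s")
```

For these two lines the gap is about 38 seconds, well inside the 240 second timeout shown in the wait_for_ltid line. That suggests the unmount request was answered with an error rather than timing out, although this is only a reading of the timestamps.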
In the /usr/openv/volmgr/debug/ltid log on the same media server:
14:27:31.758 [14745792] <6> DownDrivePath: DOWNing path /dev/rmt2.1 for s10k6_12 on host backup-p795-1
14:27:31.758 [14745792] <6> WriteEntry: Updating drive s10k6_12 at path /dev/rmt2.1 on attach host
In the /usr/openv/volmgr/debug/daemon log on the same media server:
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:40.977 [10747970] <16> emmlib_GetDriveByName: (0) The parameter DriveName can not be null or empty, emmError = 4000002, nbError = 0
14:27:40.977 [10747970] <16> pending_to_mount_req: (-) Translating EMM_ERROR_ArgInvalid(4000002) to 191 in the Media context
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:40.977 [10747970] <16> emmlib_GetDriveByName: (0) The parameter DriveName can not be null or empty, emmError = 4000002, nbError = 0
14:27:40.977 [10747970] <16> pending_to_mount_req: (-) Translating EMM_ERROR_ArgInvalid(4000002) to 191 in the Media context
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:40.977 [10747970] <16> emmlib_GetDriveByName: (0) The parameter DriveName can not be null or empty, emmError = 4000002, nbError = 0
14:27:40.977 [10747970] <16> pending_to_mount_req: (-) Translating EMM_ERROR_ArgInvalid(4000002) to 191 in the Media context
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:40.977 [10747970] <16> emmlib_GetDriveByName: (0) The parameter DriveName can not be null or empty, emmError = 4000002, nbError = 0
14:27:40.977 [10747970] <16> pending_to_mount_req: (-) Translating EMM_ERROR_ArgInvalid(4000002) to 191 in the Media context
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:40.977 [10747970] <16> emmlib_GetDriveByName: (0) The parameter DriveName can not be null or empty, emmError = 4000002, nbError = 0
14:27:40.977 [10747970] <16> pending_to_mount_req: (-) Translating EMM_ERROR_ArgInvalid(4000002) to 191 in the Media context
14:27:40.977 [10747970] <16> pending_to_mount_req: Unable to get drive details from EMM server
14:27:41.756 [10747970] <2> process_requests: TERMINATE
14:27:41.756 [10747970] <2> process_requests: received TERMINATE request
The tape is unmounted after about 10-15 minutes.
The robot control media server also controls 4 other tape libraries without any issues. The problem affects only the ADIC Scalar 10K.
Thanks for any ideas.
Regards
Madej