Hello we are having our production databases fail nightly. Prior to this we were in a good spot where we may have had 2 failures or less in a period of three days. Currently we have 2-4 fail a night.
We are getting a few "Cannot connect on socket" errors. The machines can talk to each other because rerunning the backup works.
We are getting a slew of "the backup has failed to back up the requested files". This may be a RMAN sided error. However we would like a second opinion. Here is an example of what that looks like on the RMAN side.
requested files
validation failed for archived log
archived log file name=/u01/app/oracle/admin/DB2/arch/arch_1_741300_582858846.arc RECID=85578 STAMP=853620093
validation failed for archived log
archived log file name=/u01/app/oracle/admin/DB2/arch/arch_1_741301_582858846.arc RECID=85579 STAMP=853620358
We are getting network read failed, however that is in the morning when we do not do network maintenance.
We are running all of our media and master servers on windows server 2008 r2. We have Quantum tape drives, and are running Netbackup 7.1.0.4.