We have an application-aware backup of a SQL Server failing intermittently. The SQL Server is a secondary node in an AlwaysOn cluster with a total of 9 databases, of which 7 are synced and 2 are standalone.
Error reported on backup job:
21/12/2016 1:06:15 AM :: Unable to release guest. Details: Failed to call RPC function 'Vss.Unfreeze': Error code: 0x80004005. Failed to invoke func [Unfreeze]: Unspecified error. Unfreeze error (over VIX): [Backup job failed. Cannot create a shadow copy of the volumes containing writer's data. A VSS critical writer has failed. Writer name: [SqlServerWriter]. Class ID: [{a65faa63-5ea8-4ebc-9dbd-a0c4db26912a}]. Instance ID: [{1860ad42-e7fa-4b97-8331-806abfee0c4e}]. Writer's state: [VSS_WS_FAILED_AT_POST_SNAPSHOT]. Error code: [0x800
Windows Event logs:
Source: SQLVDI
Date: 21/12/2016 1:06:07 AM
Event ID: 1
Level: Error
Description:
SQLVDI: Loc=TriggerAbort. Desc=invoked. ErrorCode=(0). Process=1380. Thread=572. Server. Instance=DB01. VD=Global\{F8269FBE-DC09-4853-9C78-E03CF39A8FEA}1_SQLVDIMemoryName_0.
Starting up database 'SecurityAnalyticsEngine'.
Source: SQLWRITER
Date: 21/12/2016 1:06:07 AM
Event ID: 24583
Level: Error
Description:
Sqllib error: OLEDB Error encountered calling ICommandText::Execute. hr = 0x80040e14. SQLSTATE: 42000, Native Error: 3013
Error state: 1, Severity: 16
Source: Microsoft SQL Server Native Client 11.0
Error message: BACKUP DATABASE is terminating abnormally.
SQLSTATE: 42000, Native Error: 3271
Error state: 1, Severity: 16
Source: Microsoft SQL Server Native Client 11.0
Error message: A nonrecoverable I/O error occurred on file "{F8269FBE-DC09-4853-9C78-E03CF39A8FEA}1:" 995(The I/O operation has been aborted because of either a thread exit or an application request.).
SQLSTATE: 01000, Native Error: 4035
Error state: 1, Severity: 0
Source: Microsoft SQL Server Native Client 11.0
Error message: Processed 0 pages for database 'AHP', file 'AHP_mdb' on file 1.
Source: MSSQL$DB01
Date: 21/12/2016 1:06:07 AM
Event ID: 18210
Level: Error
Description:
BackupVirtualDeviceFile::SendFileInfoBegin: failure on backup device '{F8269FBE-DC09-4853-9C78-E03CF39A8FEA}1'. Operating system error 995(The I/O operation has been aborted because of either a thread exit or an application request.).
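For anyone comparing their own logs against the SQLWRITER event above, the native error numbers can be pulled out of the event text with a small script. This is only a rough sketch in Python, assuming the event description has been copied out as plain text in the same layout as above; the names `EVENT_TEXT`, `RECORD` and `extract_native_errors` are made up for illustration and are not part of any Veeam or Microsoft tool.

```python
import re

# Description of the SQLWRITER event (ID 24583), copied from the log above.
EVENT_TEXT = """Sqllib error: OLEDB Error encountered calling ICommandText::Execute. hr = 0x80040e14. SQLSTATE: 42000, Native Error: 3013
Error state: 1, Severity: 16
Source: Microsoft SQL Server Native Client 11.0
Error message: BACKUP DATABASE is terminating abnormally.
SQLSTATE: 42000, Native Error: 3271
Error state: 1, Severity: 16
Source: Microsoft SQL Server Native Client 11.0
Error message: A nonrecoverable I/O error occurred on file "{F8269FBE-DC09-4853-9C78-E03CF39A8FEA}1:" 995(The I/O operation has been aborted because of either a thread exit or an application request.).
SQLSTATE: 01000, Native Error: 4035
Error state: 1, Severity: 0
Source: Microsoft SQL Server Native Client 11.0
Error message: Processed 0 pages for database 'AHP', file 'AHP_mdb' on file 1.
"""

# One record = SQLSTATE/Native Error line, Error state/Severity line,
# Source line, Error message line (the layout assumed from the event above).
RECORD = re.compile(
    r"SQLSTATE: (?P<sqlstate>\w+), Native Error: (?P<native>\d+)\s*\n"
    r"Error state: (?P<state>\d+), Severity: (?P<severity>\d+)\s*\n"
    r"Source: [^\n]*\n"
    r"Error message: (?P<message>[^\n]*)"
)


def extract_native_errors(text):
    """Return (native_error, severity, message) for each record found in text."""
    return [
        (int(m["native"]), int(m["severity"]), m["message"].strip())
        for m in RECORD.finditer(text)
    ]


if __name__ == "__main__":
    for native, severity, message in extract_native_errors(EVENT_TEXT):
        print(native, severity, message)
```

In this event the records with severity 16 are 3013 and 3271, and 4035 is informational (severity 0).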
I opened a case with support (# 01998583). The support engineer referred me to a Microsoft KB article - https://msdn.microsoft.com/en-us/librar ... 30%29.aspx. We already have max worker threads set to 512, so this wasn't applicable. He has now asked me to contact Microsoft, as the issue is not with the Veeam components but with the VSS writer. I'm in the process of contacting Microsoft.
However, in the meantime I want to ask: has anybody come across this before? Is there any SQL guru out there who can advise me on what to look at in order to troubleshoot this?
Another thing I noticed in the Windows logs: after the SQL writer and VDI errors, there is an event for starting up a database.
Starting up database 'SecurityAnalyticsEngine'.
I understand this is reported because auto-close is on for that database. Could this be contributing in any way to the SQL writer failure?
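One way to test the auto-close theory would be to list the databases that have AUTO_CLOSE on and switch it off for them. The Python below is only a sketch that builds the T-SQL text and does not connect to SQL Server. The `is_auto_close_on` column of `sys.databases` and `ALTER DATABASE ... SET AUTO_CLOSE OFF` are standard SQL Server, but the helper names `quote_name` and `auto_close_off_statements` are hypothetical, and whether auto-close has anything to do with the writer failure is unconfirmed.

```python
# Query to run on the instance to find databases with AUTO_CLOSE enabled.
LIST_AUTO_CLOSE_SQL = "SELECT name FROM sys.databases WHERE is_auto_close_on = 1;"


def quote_name(name):
    """Bracket-quote a database name, doubling any closing bracket inside it."""
    return "[" + name.replace("]", "]]") + "]"


def auto_close_off_statements(database_names):
    """Build one ALTER DATABASE statement per database name."""
    return [
        f"ALTER DATABASE {quote_name(name)} SET AUTO_CLOSE OFF;"
        for name in database_names
    ]


if __name__ == "__main__":
    print(LIST_AUTO_CLOSE_SQL)
    # 'SecurityAnalyticsEngine' is the database named in the startup event above.
    for statement in auto_close_off_statements(["SecurityAnalyticsEngine"]):
        print(statement)
```

The generated statements would then be reviewed and run by hand, for example in SSMS, before the next backup window to see whether the failure still occurs.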
Thanks
Sushil