Groups | Blog | Home
all groups > sql server clustering > february 2004 >

sql server clustering : Cluster fails


Jonathan.Smith NO[at]SPAM MoneySupermarket.com
2/23/2004 3:58:08 AM
Hi

I have SQL2K SP3 on W2k SP4 in an active passive cluster. Recently I
have had problems with the cluster either failing over or attempting
to and then hanging in an inaccessible state. The following errors
appear in the W2K Application log...

[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed

[sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message
= [Microsoft][ODBC SQL Server Driver]Timeout expired

[sqsrvres] OnlineThread: QP is not online.

The above are repeated for a while and then...

[sqsrvres] ODBC sqldriverconnect failed

[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = b;
message = [Microsoft][ODBC SQL Server Driver][DBNETLIB]General network
error. Check your network documentation.

The above are repeated for a while and then various other errors
including...

[sqsrvres] CheckServiceAlive: Service is dead

I am also seeing the following repeated many times in the run up to
the shutdown/failover in the SQL Error log...

2004-02-22 11:53:58.56 spid73 WARNING: EC 1aefd588, 0 waited 300
sec. on latch 1a1426c0. Not a BUF latch.
2004-02-22 11:53:58.56 spid73 Waiting for type 0x3, current count
0xa, current owning EC 0x1A197588.

Obviously with different SPIDs and ECs. The errors occur at different
times of the day and there is no common scheduled jobs or maintenance
that I have noticed.

Having searched, I have found many posts from people with similar
problems but have not managed to find a solution. I would really
appreciate any advice anyone can offer because this has been going on
for a while and its really starting to bug me.

Thanks


jeff_zavada NO[at]SPAM lord.com
2/24/2004 10:16:06 AM
I am seeing the same thing Running W2kSP3 and SQL 2kSP3a in an
active/passive cluster for SAP. I see the same errors in the event
log, but i see no errors in the SQL error log. The closest event in
the SQL error log was a log backup that completed successfully 36
minutes prior to the failover. I see no DB corruption from the DBCC
checks that ran earlier in the morning. Right now i am stumped. If
anyone has any info that would be great.


[quoted text, click to view]
cgross NO[at]SPAM online.microsoft.com
2/25/2004 9:27:56 PM
Timeout expired indicates that the SQL Server is too busy to respond to the
cluster admin's looksalive/isalive requests. This could be the result of a
legitimately busy instance, or SQL Server could be "hung" or otherwise
unable to respond.

There are many potential root causes, to narrow it down you need to find
out if other local connections succeed, if existing connections succeed but
new connections fail, if TCP or named pipes connections succeed, if Windows
itself is not responding, etc. You can also do some general performance
testing/tracking/tuning and use SQL Profiler and/or PerfMon to track what
is going on in Windows and in SQL Server.

Cindy Gross, MCDBA, MCSE
http://cindygross.tripod.com
This posting is provided "AS IS" with no warranties, and confers no rights.
JonathanS
3/2/2004 3:26:07 AM
cgross NO[at]SPAM online.microsoft.com
3/11/2004 5:07:30 PM
You need to do some basic performance troubleshooting to determine where
the problem lies. The problem is highly unlikely to be cluster-specific.

Inside SQL Server 2000/7.0 by Kalen Delaney, http://www.insidesqlserver.com
The Guru's Guide to Transact-SQL, http://www.khen.com/books.htm
SQL Server Performance and Scalability References,
http://www.microsoft.com/sql/evaluation/compare/performance.asp
Understanding Performance Testing,
http://msdn.microsoft.com/library/techart/d5dplyover.htm
224587 INF: Troubleshooting Application Performance with SQL Server
http://support.microsoft.com/?id=224587
243586 INF: Troubleshooting Stored Procedure Recompilation
http://support.microsoft.com/?id=243586
298475 INF: Information Required to Successfully Troubleshoot Application
http://support.microsoft.com/?id=298475
175658 How To Gather Information for Effective Troubleshooting of
Performance http://support.microsoft.com/?id=175658
319942 HOW TO: Determine Proper SQL Server Configuration Settings
http://support.microsoft.com/?id=319942
297864 INF: Performance Considerations for an Upgrade from SQL Server 6.5
http://support.microsoft.com/?id=297864
314648 INF: Differences in STATISTICS IO, SQL Profiler and Sysprocesses IO
http://support.microsoft.com/?id=314648
311826 INF: Index Tuning Wizard Best Practices
http://support.microsoft.com/?id=311826
Index Tuning Wizard 2000,
http://msdn.microsoft.com/library/techart/itwforsql.htm
Index Tuning Wizard 7.0,
http://msdn.microsoft.com/library/techart/msdn_sqlindex.htm
Troubleshooting the Index Tuning Wizard,
http://msdn.microsoft.com/library/psdk/sql/tr_servtools_48ro.htm
195565 INF: How SQL Server 7.0 and SQL Server 2000 Autostats Work
http://support.microsoft.com/?id=195565
197297 INF: Comparison of Join Techniques
http://support.microsoft.com/?id=197297
139609 PerfMon: High Number of Pages/Sec Not Necessarily Low Memory
http://support.microsoft.com/?id=139609
268343 Umdhtools.exe: How to Use Umdh.exe to Find Memory Leaks
http://support.microsoft.com/?id=268343
286350 HOWTO: Use Autodump+ to Troubleshoot "Hangs" and "Crashes"
http://support.microsoft.com/?id=286350
316365 INFO: ROADMAP for How to Use the .NET Performance Counters
http://support.microsoft.com/?id=316365
150934 How to Create a Performance Monitor Log for NT Troubleshooting
http://support.microsoft.com/?id=150934
130926 Using Performance Monitor to Identify a Pool Leak
http://support.microsoft.com/?id=130926
146005 Optimizing Windows NT for Performance
http://support.microsoft.com/?id=146005
248345 How to Create a Log Using System Monitor in Windows 2000
http://support.microsoft.com/?id=248345
275685 INF: Importing Performance Monitor Logs into SQL Server
http://support.microsoft.com/?id=275685

Cindy Gross, MCDBA, MCSE
http://cindygross.tripod.com
This posting is provided "AS IS" with no warranties, and confers no rights.
AddThis Social Bookmark Button