Forum Discussion
wtwagon_99154
Nimbostratus
Feb 10, 2010Health Check Issues
The Scenario:
We are performing some load testing of an application and are using a health check as follows:
GET /blah/blah.asmx HTTP/1.1\r\nUser-Agent:Mozilla\r\nhost:blahblah.blah.net\r\n
We look for a 200 OK
Monitor interval: 5 seconds
Timeout: 16 seconds
We have 2 servers in the pool and they are all green at the moment.
The Problem:
As soon as we start throwing LoadRunner load tests at this VIP, all the servers go to red almost immediately(we are throwing a small amount of load, around 40 concurrent sessions). What really gets me is that I can run the same exact request as above during the load test and I am able to successfully get there and receive a 200 OK.
I also tried swapping the interval to 15 seconds with a timeout of 46 seconds and received the same problem (although this time, it happened after 5 minutes instead of almost immediately).
What steps can be taken to try to diagnose why the F5 is thinking these nodes are down, when they are clearly not?
- hoolio
Cirrostratus
Hi, - wtwagon_99154
Nimbostratus
Posted By hoolio on 02/10/2010 8:35 AM
- hoolio
Cirrostratus
Does the peer unit also mark the pool members down at the same time? Do you see anything interesting in the web server logs? - wtwagon_99154
Nimbostratus
What's strange is that the peer unit has not marked the servers down during that same time. - wtwagon_99154
Nimbostratus
Just wanted to see if anyone else had some thoughts about monitor interval and timeout. - hoolio
Cirrostratus
The timeout = 3 x interval + 1 is a best practice as it allows the server three chances to respond successfully before being marked down. A 5 second interval has been a good place to start as a test request this often shouldn't overload the servers. And 16 seconds is not a horribly long time to wait to mark a failed server down. - wtwagon_99154
Nimbostratus
also, what do you think about the inband monitoring? We obviously aren't doing much application level monitoring here (we use another system for that type of monitoring). The inband monitoring + active monitoring looks pretty cool - just wanted to see if someone has some experience with it and their thoughts on the performance of it. - hoolio
Cirrostratus
The main purpose of a monitor is to ensure that any pool member LTM sends traffic to can handle the traffic. So it's ideal if you can configure the monitor to check a page which hits the database and checks the full state of the application. Inband monitoring is useful for ensuring the network layer is working. As you suggest, the two in combination work well. - wtwagon_99154
Nimbostratus
Got a nice little update.. - hoolio
Cirrostratus
That's interesting. Does top show the monitoring daemon, bigd, using a lot of CPU? Does CPU0 show high usage from other processes?
Recent Discussions
Related Content
DevCentral Quicklinks
* Getting Started on DevCentral
* Community Guidelines
* Community Terms of Use / EULA
* Community Ranking Explained
* Community Resources
* Contact the DevCentral Team
* Update MFA on account.f5.com
Discover DevCentral Connects