Tracking triggers in an iRule

Question

Hi All,&nbsp;
&nbsp;I have the following iRule which checks a data group to see if the server is marked as online or off-line. If it is marked as online, traffic passes as normal, if it is off-line,  it sends back a 503. If the proxy is online, but the back end pool is unavailable (using LB_FAILED), it sends back a 502. The pool is actually a single node, so there is no need for LB::reselect (which I don't think would work anyway). I have a tcp profile assigned to the virtual server that sets max syn retry to 1 so that LB_FAILED is immediate. &nbsp;
&nbsp;This has worked fairly well so far, except that LB_FAILED is being triggered intermittently, and I don't know why.  One request will get a 502, while another request, received within milliseconds, goes through. If I were using a built-in health check, there would be logging on member up/down, and failures to select pools. But since I am doing passive checking, there isn't much info that I can find. Is there a way to see what is causing the failures from the LTMs point of view?&nbsp;
&nbsp;Thanks,
&nbsp;Kenny&nbsp;
&nbsp;when RULE_INIT {&nbsp;
&nbsp;   log local0.info "proxystatushttp v1.0  $static::tcl_platform(os) $static::tcl_platform(osVersion)"&nbsp;
&nbsp;   set static::DEBUG 0
&nbsp;   
&nbsp;   set static::offlineFlag "offline"
&nbsp;   set static::proxyStatus proxystatus&nbsp;
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::proxyStatus:\n[class get $static::proxyStatus]" }
&nbsp;   
&nbsp;   set static::privateNetworkAddresses private_net
&nbsp;   set static::externalMonitoringAddresses external_monitoring_addresses&nbsp;
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::privateNetworkAddresses:\n[class get $static::privateNetworkAddresses]" }
&nbsp;   if { $static::DEBUG } { log local0.debug "$static::externalMonitoringAddresses:\n[class get $static::externalMonitoringAddresses]" }&nbsp;
&nbsp;}&nbsp;
&nbsp;when HTTP_REQUEST {&nbsp;
&nbsp;   if { [class lookup $static::offlineFlag $static::proxyStatus] } {&nbsp;
&nbsp;      if { (not [class match [IP::client_addr] equals $static::externalMonitoringAddresses]) &amp;&amp; 
&nbsp;           (not [class match [IP::client_addr] equals $static::privateNetworkAddresses]) } {&nbsp;
&nbsp;     set response "ForbiddenNOTICE: Service unavailable at this time."&nbsp;
&nbsp;     HTTP::respond 503 content $response noserver "Connection" "close" "Content-Length" [string length $response]
&nbsp;         if { $static::DEBUG } { log local0.debug "Sent HTTP Status Code 503 due to proxy status offline to [IP::client_addr]" }
&nbsp;     log -noname local0. "[virtual name] MyIP=[IP::local_addr] SrcIP=[IP::client_addr] - - $$[clock format [clock seconds] -format "%d/%b/%Y:%H:%M:%S %z"]$$ - \"[HTTP::method] [HTTP::uri] HTTP/[HTTP::version]\" 503 [HTTP::payload length]"
&nbsp;     return
&nbsp;      }
&nbsp;      else {
&nbsp;         if { $static::DEBUG } { log local0.debug "Processing HTTP request with proxy status offline from [IP::client_addr]" }
&nbsp;      }
&nbsp;   }
&nbsp;}&nbsp;
&nbsp;when LB_FAILED {
&nbsp;   set response "Server ErrorNOTICE: Site has experienced an error."
&nbsp;   HTTP::respond 502 content $response noserver "Connection" "close"
&nbsp;   log -noname local0. "[virtual name] MyIP=[IP::local_addr] SrcIP=[IP::client_addr] - - $$[clock format [clock seconds] -format "%d/%b/%Y:%H:%M:%S %z"]$$ - \"[HTTP::method] [HTTP::uri] HTTP/[HTTP::version]\" 502 [HTTP::payload length]"
&nbsp;}

the_bhattman · Answer

HI Kenny, 
&nbsp;   You might need to apply a TCP profile where you can fine tune the "Maximum Syn Retransmissions" . Here is a link to describe the various settings behind it. 
&nbsp;  
&nbsp; http://devcentral.f5.com/wiki/default.aspx/iRules/LB_FAILED.html 
&nbsp;  
&nbsp; I hope this helps 
&nbsp; Bhattman &nbsp;

hooleylist · Answer

Hi Kenny, 
&nbsp;  
&nbsp; I don't think there is anything special you're doing in the iRule which would trigger a load balancing failure and the LB_FAILED event to run.  You can check the LB_FAILED wiki page for details on when this event is triggered: 
&nbsp;  
&nbsp; http://devcentral.f5.com/wiki/default.aspx/iRules/lb_failed 
&nbsp;  
&nbsp; If this happens frequently, you could try capturing a tcpdump of the client and serverside traffic to see if the pool member is in fact not responding to LTM SYNs.  For details on using tcpdump, check SOL411: 
&nbsp;  
&nbsp; sol411: Overview of packet tracing with the tcpdump utility 
&nbsp; http://support.f5.com/content/kb/en-us/solutions/public/0000/400/sol411.html 
&nbsp;  
&nbsp; Aaron

kenny_lussier_5 · Answer

Thanks for the pointers. Tracking connections is a little tough, since there are thousands of connections to the front end, and the pool/node is a load balancer. finding the one SYN that isn't ACKd is like finding a needle in a needle stack :-)  
&nbsp;  
&nbsp; One thing to note is that I replaced some old Linux servers running Apache using mod_proxy with the LTM. We never had this issue until we went to the LTM. I am trying to figure out which of the hundreds of differences is causing the problem, and if there is a way to adjust the LTM so that it doesn't behave this way. I suppose increasing the SYN retry is an option. Would using an LB::reselect work if there is only one node in a pool? 
&nbsp;  
&nbsp; Thanks, 
&nbsp; Kenny &nbsp;

colin_walker_12 · Answer

I don't think you'd need a LB::reselect, if you just want to try again to the same server, you could use HTTP::retry.  
&nbsp;  
&nbsp; Colin

hooleylist · Answer

With a default TCP profile, TMM tries 5 times over 45 seconds to establish a TCP connection.  If that's not enough attempts you could increase the Maximum Syn Retransmissions" option in the TCP profile.   
&nbsp;  
&nbsp; However, it would probably be more effective to try to capture the failure happening in a tcpdump so you can see exactly what's failing.  I realize that's not easy when the virtual server is in production.  However, you might be able to create a test VS with a custom SNAT pool and point a test client (or front end server) at it to isolate the traffic. 
&nbsp;  
&nbsp; Aaron

Forum Discussion

Tracking triggers in an iRule

Recent Discussions

F5 looses the token for the first call

iRule - Url rewrite and header replace and pool selection not working

Switch ssl profile based on weak cipher detection via IRULE

full-proxy HTTP2

Decode ObjectSID from Base64-encode string

Related Content

Irule Trigger two times

Tracking MAC in vCMP

iRULE regsub error

owa iapp assign irule

iRule to trigger email for captured logs

ABOUT DEVCENTRAL

RESOURCES

SUPPORT

PARTNERS