For more information regarding the security incident at F5, the actions we are taking to address it, and our ongoing efforts to protect our customers, click here.

Forum Discussion

JG's avatar
JG
Icon for Cumulonimbus rankCumulonimbus
Mar 10, 2014

Workaround for tmm SNAT memory leak?

We are well on track to come up against the issue detailed in "sol15010: TMM may stop responding due to a memory leak caused by the source address translation feature ( https://support.f5.com/kb/en-us/solutions/public/15000/000/sol15010.html )".

 

My question is: Would restarting tmm using bigstart clear the SNAT memory usage? I am thinking of doing this on the standby only. Certainly a reboot should get the device a new lease of life until we can apply a hotfix release that fixes it, right?

 

7 Replies

  • sol15010 says nomworkaround, however everything says (to me) that "bigstart restart tmm" should work here - why don't you try it on the standby box and check before and after using tmsh show /sys memory | egrep 'Max\ Alloc|source\ addr'? The worst it can do is restart without fixing the memory issue.......

     

  • Hamish's avatar
    Hamish
    Icon for Cirrocumulus rankCirrocumulus

    Yes, the restart of tmm DOES clear the leaked memory. However it's an outage for that unit, which is probably why it's not listed as a 'workaround' :) [The restart is exactly what causes an outage when it restarts itself. You'll see an error in the tmm log when this happens and alarms go off, services failover, tmm restarts and we do the whole thing again]

     

    FWIW I had the same issues in 11.2.1HF6 (Although it wasn't identified as a leak due to SNAT, it was a leak. The TMM would restart after about 4 days, so the temporary work-around until we dropped the ENG HF on the units was to schedule a tmm restart overnight on a rotating basis).

     

    H

     

  • JG's avatar
    JG
    Icon for Cumulonimbus rankCumulonimbus

    What I was thinking was to get the standby in a prepared state (memory cleared) so that when failover takes place (the sol article did mention this possibility) it'll still work. We can then do the same on the new standby.

     

    We can't apply the standard HF release now, and we are waiting for an EngHF that will contain fixes for other issues that we have currently.

     

  • Hamish's avatar
    Hamish
    Icon for Cirrocumulus rankCirrocumulus

    Possibly better if you initiate the failover yourself...

     

    H

     

  • JG's avatar
    JG
    Icon for Cumulonimbus rankCumulonimbus

    Just to give everybody a heads-up: There is a bug with "bigstart restart":

     

    Bug 432260 - "bigstart restart" removes tmm route entry for 127.7.0.0 address for AAA server pool

     

    After running the command, an AAA (LDAP) server could not be reached, resulting in a service outage.

     

    I got bitten by this twice now. :-(

     

    I am on v11.3.0.

     

    But this workaround did get all memory back.