Forum Discussion

Roee_140824's avatar
Roee_140824
Icon for Nimbostratus rankNimbostratus
Jan 07, 2014

constant Kernel bug at AWS ami-95a845e2

Hi. I have tried to use ami-95a845e2 from amazon's AWS marketplace. After launching and configuring everything, and after everything is working, the instance has a reachability check failed. I have done this process several times. Terminated the old instance, and launched a new one. Same behavior appears I have contacted AWS support, and this is the exact reply :

 

Hello Roee, The instance right now looks to be responding to icmp and a telnet to port 22 is working. It was showing good checks and responding to telnet 22 and then went to failure, just while I was typing this. I also see the cpu running 100% and this in the console log. "BUG: soft lockup - CPU3 stuck for 67s! [csyncd:6067]" This indicates a kernel bug. As an overall action You'll want to go back to the vendor to let them know about this. To get you working I have two courses of action you can pursue. The first is a stop start action. You may have to do this multiple times. Each start stop will move you to new host hardware. I suspect a change in the underlying host hardware from a previous stop start has triggered the kernel bug. The alternative course of action that may be more effective would be to try an instance size increase or decrease as the underlying host hardware can be different. As a final solution though, you will need to seek vendor input as to the kernel version in use. The root cause is a kernel bug. Hope this helps. Please let me know if you need any further assistance. Best regards, Mike P. Amazon Web Services

 

After several reboots, I can see this : BIG-IP 11.5.0 Build 0.0.158 Kernel 2.6.32-279.19.1.el6.f5.x86_64 on an x86_64 ip-10-0-0-184.eu-west-1.compute.internal login: BUG: soft lockup - CPU1 stuck for 67s! [chmand:6483] Jan 6 16:47:36 ip-10-0-0-184 emerg kernel: BUG: soft lockup - CPU1 stuck for 67s! [chmand:6483]

 

Can you please help me solve the problem ?

 

Thank you,

 

Roee.

 

  • you'll need to open a case with support on this, they'll need core dumps to analyze. BTW..is that a beta version of TMOS? 11.5 has yet to be released.

     

  • Thank you for you reply. I have contacted F5 support before posting here. this the the reply :

     

    Hi Roee

     

    Currently, support requests for AWS Marketplace are only handled by our DevCentral team. Please refer your request to https://devcentral.f5.com/amazon/getting-started

     

    Sincerely,

     

    Brad

     

    F5 Networks Support | www.f5.com International: +800-27536735 | USA: 1-888-882-7535

     

    • JRahm's avatar
      JRahm
      Icon for Admin rankAdmin
      interesting. I'll look into that and see what I can do to help.
  • Matthew_Quill_3's avatar
    Matthew_Quill_3
    Historic F5 Account
    Roee, what version of the BIG-IP are you attempting to run? Is this the BYOL offering?
  • Hi Roee, this AMI is outdated, it was our very first release and has been replaced. Please see the "Select a Version" drop-down from the "Launch on EC2" page for the "F5 BIG-IP 200mbps Lab License" appliance, and choose 11.5.0.0.203 (released 12/31/2013).

     

    If that gives you any trouble please let us know here. We are considering our AWS offering to be 'community supported' much like iRules and iApps, for the time being.

     

  • Matthew_Quill_3's avatar
    Matthew_Quill_3
    Historic F5 Account

    The AMI with the Fix for 11.4 BYOL will be posted to the marketplace shortly. Please stand by.