BIG-IP High Availability with Azure Route Server

Failover via BGP and Azure Route Server is a fast and reliable way to achieve High Availability in Azure.

Broadly speaking, there’s a few common ways to achieve High Availability in Azure, where 2 BIG-IP devices may be running Active/Active or Active/Standby. I will categorize them as follows:

  1. Azure Load Balancer
  2. Cloud Failover Extension (CFE)
  3. DNS-based HA

I will offer an additional method: BGP and Azure Route Server

(There are further options. The Azure GWLB is an option, but I’ll consider this another load balancer option in category 1. And the CFE approach has some further sub-options too. You can move public or private IP addresses between devices (just like ARP does on-prem), or update UDR in Azure with a CIDR block that covers the entire VIP range.)

This article covers the concept and how-to of using BGP for High Availability in Azure. Using BGP with F5 BIG-IP is nothing new, but it is almost never seen in public cloud environments. 

 

Azure Route Server as depicted in Microsoft documentation

 

 

HA using Azure Route Server

I’ll cover some concepts that are related but discussed separately: Route Health Injection for Virtual Servers, and more generic BGP route advertising. I’ll also talk about Active/Standby vs Active/Active options, and ECMP.

I will then outline multiple methods and a “how-to” for achieving HA in Azure using BGP routing:

  • Active/Standby (Routing to Active device only)
  • Active/Active (Routes to multiple BIG-IP devices, using the network command or static routes)

Concepts: Active/Standby vs Active/Active

Route Health Injection and Active/Standby mode

Route Health Injection allows the BIG-IP to advertise VIPs via routing protocols. As a basic example, a VirtualServer may have an IP address of 192.168.100.100/32. BIG-IP can advertise this /32 route to peers based on the health of the Virtual Server. Another example: I could have a VIP with a destination IP of 192.168.100.0/24, and then this route would also be advertised with a next hop of BIG-IP.

In an Active/Standby device group, both devices are sharing routes via BGP with the Active device as the next hop. In a graceful failover, both devices will immediately update their neighbors with the next hop of the (newly) Active BIG-IP. If the failover is not graceful and the Active device is suddenly lost, it’s BGP peering will be broken and its neighbors will remove the routes that were learned from the device that was Active but is now offline.

When considering an Active/Standby device cluster, let’s highlight words from F5’s guidance on dynamic routing:

Note: When you configure RHI in a device group configuration, only devices with active traffic groups attempt to advertise routes to virtual addresses.

BGP configurations and Active/Active mode

In an Active/Active scenario, multiple devices can advertise a route with equal weight. To ensure traffic flow is symmetric, neighbor routers must support ECMP (this ensures that a connection from a client is persisted to a single BIG-IP only). Azure Route Server uses ECMP when multiple devices advertise the same route with the same AS path length.

There are several methods by which you can share routes via BGP from BIG-IP, whether the device is Active or Standby. Using these methods will result in the same route and AS path with multiple next hops: ie., Active/Active routes on BGP peers.

How-to: HA with BIG-IP using Azure Route Server

 

Demo environment showing Azure Route Server and BIG-IP

 

Whether we will choose an Active/Standby or Active/Active approach, we must first set up BGP peering between F5 BIG-IP and Azure Route Server. These instructions assume we have a single BIG-IP HA pair in Azure and are configuring iBGP peering between our BIG-IP devices and eBGP peering with Azure Route Server.

  1. Follow this tutorial from Microsoft on configuring Azure Route Server. Configure only Azure Route Server. Stop when you reach this section for configuring a network virtual appliance (NVA). We’ll use F5’s instructions to configure BIG-IP devices instead.

 

Screenshot of successful deployment of Azure Route Server

Take note of the ASN and Peer IPs after the Route Server is created. The ASN will always be 65515. We will use this, and the Peer IP’s, when configuring BGP on BIG-IP.


Now, let's configure F5 BIG-IP

  1. Follow this tutorial from F5 on configuring BGP on BIG-IP.
    • allow TCP/179 on appropriate Self IPs.
    • enable BGP on your Route Domain: tmsh modify /net route-domain 0 routing-protocol add { BGP }
    • configure iBGP between your BIG-IP devices. This script will be slightly different on each device.
      # run these commands on both devices. The "neighbor" commands will be unique on each device. 
      imish 
      enable 
      config terminal 
      router bgp 200 
      neighbor 10.0.1.12 remote-as 200 
      # neighbor 10.1.1.11 remote-as 200 
      neighbor 10.0.1.12 activate 
      # neighbor 10.0.1.11 activate 
      end 
      write
    • configure eBGP between the devices and the Azure Route Server. This script will be identical on each device.
      • notice that a route map is created in order to filter which routes we share via BGP (only 192.168.100.0/24)
      • notice the “redistribute kernel” command, without which, BGP would not share Kernel routes (and would share routes that we could configure manually using imish commands)
        imish 
        enable 
        config terminal 
        ip prefix-list PFX_ALLOW_VIPS seq 5 permit 192.168.100.0/24 
        route-map RESTRICT_ADVERTISE permit 10 
        match ip address prefix-list PFX_ALLOW_VIPS 
        router bgp 200 
        redistribute kernel 
        neighbor 10.0.3.4 remote-as 65515 
        neighbor 10.0.3.5 remote-as 65515 
        neighbor 10.0.3.4 route-map RESTRICT_ADVERTISE out 
        neighbor 10.0.3.5 route-map RESTRICT_ADVERTISE out 
        neighbor 10.0.3.4 activate 
        neighbor 10.0.3.5 activate 
        end 
        write

Now let's finish configuration of Azure Route Server

  1. Return to Microsoft’s tutorial at this location to configure Route Server peering.
    • create a peering to BIG-IP device 1
    • create a peering to BIG-IP device 2

Ideally, at this point the Status of the Peering will show completed. Lastly, don’t forget to check the box “enable IP Forwarding” on your NIC in Azure if you want to use a VIP range that is not native to your VNET.

 

Screenshot of successful configuration of BGP peers

We can check learned routes from the Azure Route Server. The Microsoft tutorial uses PowerShell and the Get-AzRouteServerPeerLearnedRoute cmdlet. I prefer az cli so I’ll use:

$ az network routeserver peering list-learned-routes -g oleary-bgp-rg --routeserver oleary-bgp-rs -n bigip1 
{ 
"RouteServiceRole_IN_0": [],
"RouteServiceRole_IN_1": [] 
}

Important
We should ensure that we only share routes with Azure Route Server that we intend to share. Typically we do not want to advertise the 0.0.0.0/0 route from the BIG-IP, because it will cause the entire VNET’s default route to be BIG-IP (and likely make your VNET unreachable). For that reason, we’ve created a route map that will filter any shared routes and only allow our desired route, 192.168.100.0/24, to be advertised via BGP.

At this point, Azure Route Server has not received any routes from BIG-IP, but the BGP relationship is Established.


1. Active/Standby: Advertising a VIP range from the Active device only

In this example, I’ll use a range (192.168.0.0/24) for my VIPs. In the cloud, I’ll often call this an “alien range” because it will be a CIDR block that does not truly exist in the VNET or VPC, but that is routed toward the BIG-IP.

  1. Create a “dummy” VIP where the Destination IP is 192.168.100.0/24. We won’t actually target this VIP, so it can be any type (eg IP forwarding).
  2. Set the Virtual Address to have “Route Advertisement” set to “Enabled”.

 

BIG-IP GUI config setting routing of Virtual Address

This VIP will be “redistributed” by BGP because it matches our allowed route for BGP sharing. It will be advertised with a next hop of the Active BIG-IP device. Let’s check this from both our Active and Standby device, and see that

 

Verifying routes using imish command line

We can also check the Azure Portal (eg, the effective routes of interfaces in the VNET) or use the CLI commands from earlier to see the learned routes by Azure Route Server.

Verifying routes using Azure Route Server

2. Active/Active: using BGP to advertise routes from both devices

Let’s return to the section of our F5 tutorial titled “Configuring and verifying the BIG-IP system to exchange routing prefixes”.

You can exchange network prefixes with BGP on the BIG-IP system with the network command or through redistribution. Using these methods will likely result in both your Active and Standby devices advertising a route with their own Self IP as the next hop.

Network command
  1. To use the network command to manually configure a route to advertise, you would use imish configuration like this:
imish 
enable 
configure terminal 
router bgp 200 
network 192.168.100.0/24 
end 
write
You also would not need to advertise kernel routes in this scenario, so you could add no redistribute kernel if so desired.

Use the same methods as above to verify learned routes in Azure. You can also use show ip bgp neighbor 10.0.3.4 advertised-routes to verify which routes are advertised to Azure from the perspective of BIG-IP.

Static Routes

Another option is to create static routes on BIG-IP itself - with GUI or tmsh create net route - and share those. However, network routes in BIG-IP cannot have a Self IP as the next hop, so sharing these routes will not result in routes in Azure’s VNET that point to BIG-IP. In this case, I don’t see a good reason to explore this option further.

Other Considerations

BGP offers some advantages over other methods for HA:

  • No Azure load balancer is required in this design.
  • Unlike CFE, there are no Managed Identities to configure, no RBAC permissions, and no API calls needed from BIG-IP to Azure’s management plane.
  • BGP is well known by network engineers

However, BGP also brings other considerations so it may not be preferable:

  • Peering with BGP requires the Advanced Routing module (included with Better and Best licenses)
  • Peering with BGP requires Azure Route Server (and the associated costs)
  • BGP also requires planning, skills, and perhaps team cooperation beyond what may have been planned.

Some BIG-IP admins have strong network skills and feel comfortable with BGP; others may fear the potential to cause major outages if misconfigured.

Conclusion

Failover via BGP and Azure Route Server is fast and reliable, although personally I see Azure Load Balancer and CFE implementations much more commonly. Thanks for reading and please reach out with any questions!

Related Articles

Thank you to my colleague Brandon_ for this article on BGP configuration:

BIG-IP BGP Routing Protocol Configuration And Use Cases

Published Nov 05, 2024
Version 1.0
No CommentsBe the first to comment