Author Topic: New Mitel Customer - Issue with Application Reloading  (Read 1513 times)

Offline Phluxed

  • New Member
  • *
  • Posts: 4
  • Country: ca
  • Karma: +0/-0
New Mitel Customer - Issue with Application Reloading
« on: February 23, 2017, 11:13:19 PM »
Hi All,

Thanks for taking the time to take a quick read here - any feedback or insight is appreciated.

We are seeing an issue where a random subset of our 5330 and 5320 phones, which are connected to a MiCloud provider, reboot at the :01 minute mark of every hour.

In the office in question, let's say there are 200 phones. This appears to happen at multiple offices, but I will focus on the biggest site that is experiencing it. It is also the most frequently impacted, as it is where the call centers are located and the users are regularly on the phones. The phones are connected to a set of stacked Aruba MAS3500 switches and reach the provider over the public internet (in this case, Rogers in Canada). The bandwidth is 250/20.

We've done some port mirroring and found that the controller polls the phones regularly for a heartbeat. The heartbeat sent on the hour sometimes fails to make it to the MBG; sometimes this affects 1 phone, sometimes 10 phones. The behaviour is ALWAYS seen at the :01 minute and 12 second mark. This is because the heartbeat times out after 24 seconds and 3 missed heartbeats are needed before the phone decides it is in DR mode (3 x 24 = 72 seconds). The UDP traffic keeps flowing out of the network the whole time and only stops the instant the phone's TCP stream fails over from the primary site to the secondary site that is programmed into the phone.
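To make the timing explicit, here is a minimal Python sketch of the arithmetic. It assumes the three 24-second timeouts run back to back; the function name and the example date are made up for illustration and are not anything from Mitel.

```python
from datetime import datetime, timedelta

HEARTBEAT_TIMEOUT_S = 24      # from the post: a heartbeat times out after 24 s
MISSED_BEFORE_FAILOVER = 3    # from the post: 3 missed heartbeats means DR mode

# Assumption: the timeouts are consecutive, so the total window is 3 * 24 = 72 s.
FAILOVER_WINDOW = timedelta(seconds=HEARTBEAT_TIMEOUT_S * MISSED_BEFORE_FAILOVER)


def first_lost_heartbeat(failover_at: datetime) -> datetime:
    """Work backwards from the observed failover to the first lost heartbeat."""
    return failover_at - FAILOVER_WINDOW


# Hypothetical failover observed at 14:01:12 on an arbitrary day.
observed = datetime(2017, 2, 23, 14, 1, 12)
print(first_lost_heartbeat(observed).time())
```

Under that assumption a failover at :01:12 points to a first lost heartbeat sent right at the top of the hour, so whatever breaks the TCP stream would be happening at :00:00 rather than at :01.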

We have done a number of packet captures here (which I unfortunately cannot entirely share) on our WAN interface and see that the heartbeat TCP packet is sent outbound but never gets an ACK, so the phone retransmits it a few times and eventually gives up and fails over (after 72 seconds).
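For anyone wanting to pull the same pattern out of their own capture, a rough Python sketch follows. It assumes the capture has already been exported into simple tuples; the record layout and function name are hypothetical, and sequence number wraparound is ignored.

```python
from collections import Counter


def unacked_retransmits(outbound, inbound_acks):
    """Return {seq: times_sent} for segments sent more than once and never ACKed.

    outbound:     iterable of (time_s, seq, payload_len) for phone -> MBG segments
    inbound_acks: iterable of ACK numbers seen in the MBG -> phone direction
    (Hypothetical export format; sequence wraparound is not handled.)
    """
    highest_ack = max(inbound_acks, default=-1)
    sent = Counter()
    for _time_s, seq, payload_len in outbound:
        # A segment counts as acknowledged once an ACK covers its last byte.
        if seq + payload_len > highest_ack:
            sent[seq] += 1
    return {seq: n for seq, n in sent.items() if n > 1}


# Made-up example: segment 1000 is sent three times with no ACK covering it.
outbound = [(0.0, 500, 10), (1.0, 1000, 20), (1.3, 1000, 20), (1.9, 1000, 20)]
print(unacked_retransmits(outbound, inbound_acks=[510]))
```

Running this per phone and per hour would show whether the unanswered segments always start at the same second.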

Is anyone aware of any sort of automated job that may cause this? Is there something in the switching world that may cause it?


Offline ralph

  • Mitel Forums Admin
  • Hero Member
  • *****
  • Posts: 5767
  • Country: us
  • Karma: +469/-0
  • Published Author: http://amzn.to/2dcYSY5
Re: New Mitel Customer - Issue with Application Reloading
« Reply #1 on: February 24, 2017, 07:27:54 AM »
I can't think of anything in the Mitel that would cause this.
I'd be suspicious of the network.
Have you tried putting a continuous ping against the server to see if it starts dropping packets at that time?
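Once the ping has been running for a few hours, the results can be bucketed by minute of the hour to see whether the drops line up with the top of the hour. A minimal Python sketch, assuming the ping output has already been parsed into (timestamp, ok) pairs; that format and the function name are assumptions, not the output of any particular tool.

```python
from collections import Counter
from datetime import datetime


def failures_by_minute(samples):
    """Count failed probes per minute-of-hour (0-59).

    samples: iterable of (datetime, bool) pairs, True meaning a reply came back.
    """
    counts = Counter()
    for timestamp, ok in samples:
        if not ok:
            counts[timestamp.minute] += 1
    return counts


# Made-up samples: every failure falls in minute :00 of some hour.
samples = [
    (datetime(2017, 2, 24, 9, 59, 30), True),
    (datetime(2017, 2, 24, 10, 0, 5), False),
    (datetime(2017, 2, 24, 10, 0, 35), False),
    (datetime(2017, 2, 24, 10, 30, 0), True),
    (datetime(2017, 2, 24, 11, 0, 10), False),
]
print(failures_by_minute(samples).most_common(3))
```

If the losses cluster in one minute across many hours, that points at something scheduled rather than random congestion.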

Ralph

Offline ralph

Re: New Mitel Customer - Issue with Application Reloading
« Reply #2 on: February 24, 2017, 07:34:07 AM »
Something just popped to mind here.
If I understood you correctly, you're using MBGs.
I'm betting that you are using a cluster of MBGs.
What might be happening is that the MBGs are load balancing but the software doesn't match.
So when it moves phones over to a different controller it has to reload the applications.

If a phone loses the connection to its MBG and fails over to another one, it would do the same thing if the software doesn't match.

It wouldn't be a complete reboot of the phone but just the applications.

Just a thought.

Ralph





Offline Phluxed

Re: New Mitel Customer - Issue with Application Reloading
« Reply #3 on: February 24, 2017, 08:36:54 PM »
Hey Ralph - interesting thought on the config being different per site.

If it's failing to the secondary site, should the phone be dropping a call if the software is the same on both sides? That would explain why the phone doesn't seem all that resilient to the heartbeat being missed and immediately dropping the call.

We're not sure if they are clustered in each datacenter that it's hosted in.

Offline ralph

Re: New Mitel Customer - Issue with Application Reloading
« Reply #4 on: February 25, 2017, 06:24:42 AM »
I don't think it should fail over mid call unless it actually lost comms to the primary MBG.

Ralph

Offline Phluxed

Re: New Mitel Customer - Issue with Application Reloading
« Reply #5 on: February 25, 2017, 08:57:08 AM »
The heartbeat to the main MBG is lost. 3 beats are missed. We see the applications loading because the phone fails over to the secondary site; we can see that in the packet captures. The whole time the phone is retransmitting the packet (we cannot figure out why this is happening, and it is the original issue at hand), the UDP traffic is still solid. So once the TCP stream is broken, the phone doesn't start a new TCP stream to the same MBG, even though all the while the 2 devices are talking a-o-k over the network and internet.

I guess what I'm wondering is, even in the case it does fail over in this instance, if the configurations were the same at each datacenter, would the phone drop the call from the original site MBG? It doesn't seem to drop it while the heartbeats aren't getting through, and only does so the instant the phone flips to the second site.

As for the network issue - we appear to have it happen at probably 10 other sites with different internet and router devices. Aruba switches are a constant, all flavours S1500, S2500, S3500.

Offline VinceWhirlwind

  • Hero Member
  • *****
  • Posts: 899
  • Country: au
  • Karma: +31/-0
Re: New Mitel Customer - Issue with Application Reloading
« Reply #6 on: February 26, 2017, 06:09:42 PM »
Can you look at your QoS, on the various devices between phone and MBG, especially wherever there is a bandwidth bottleneck - ensure phone signalling is being tagged with something like AF41/34, and then ensure everywhere (especially your firewalls and WAN routers) is prioritising this above standard Data traffic.

Offline Phluxed

Re: New Mitel Customer - Issue with Application Reloading
« Reply #7 on: February 27, 2017, 09:28:39 AM »
Hi Vince - thanks for the idea. We're looking at that now, however, the challenge is that it is VOIP traffic over the WAN. We also are having no call quality issues whatsoever, and it just instantly drops the call when it flips over.

We can't really apply QoS beyond our modem.

Offline VinceWhirlwind

Re: New Mitel Customer - Issue with Application Reloading
« Reply #8 on: February 27, 2017, 09:56:59 PM »
So the bottleneck you need to look at is wherever you are putting traffic on the WAN - clearly it is prioritising EF/46 properly (RTP) but this is where maybe congestion is occurring and AF41/34 (signalling) is being dropped.
 
Just as a test, you could try changing the values in DHCP Option 125 as well as on the MCD so that signalling is tagged the same as RTP?
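For reference, the markings mentioned above (AF41 is DSCP 34, EF is DSCP 46) map onto the IP header as in this small Python sketch. The helper names are made up for illustration. The AF formula and the 2-bit shift come from the DiffServ RFCs, and `mark_socket` only applies to a socket you open yourself for testing, not to the phones, which take their values from DHCP and the MCD.

```python
import socket

EF = 46  # Expedited Forwarding, what RTP is tagged with in this thread


def af_dscp(af_class: int, drop_precedence: int) -> int:
    """DSCP value for Assured Forwarding class AFxy: 8*x + 2*y."""
    return 8 * af_class + 2 * drop_precedence


def dscp_to_tos(dscp: int) -> int:
    """DSCP occupies the top 6 bits of the old TOS byte, so shift left by 2."""
    return dscp << 2


def mark_socket(sock: socket.socket, dscp: int) -> None:
    """Ask the OS to mark outgoing packets on a test socket (may be ignored on some platforms)."""
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_TOS, dscp_to_tos(dscp))


AF41 = af_dscp(4, 1)
print(AF41, hex(dscp_to_tos(AF41)), hex(dscp_to_tos(EF)))
```

The hex TOS values are what tends to show up in a packet capture, so they are handy for checking whether the signalling marking survives to the WAN interface.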


 
