Friday, June 29, 2018
Adventures in Normalization
Adventures in Normalization
One of the things that differentiate Lync from its competitors is its reliance on 3rd party peripherals like deskphones. This is often touted as a Good Thing, since youre not locked into a single vendor for phones. Phones from single-vendor solutions are often more expensive than an equivalent Lync phone simply because you cant get them from anywhere else, so competition is low and prices are higher.
Since Lync phones are provided by different vendors, there is naturally a lot of variation in the specific details on how things work. Microsoft has alleviated this somewhat by instituting a Lync-qualified devices program. This is a fairly thorough program that puts devices through their paces to ensure they function properly within Lync.
However rigorous the program, little details slip through the cracks. One of the details that Ive noticed is how different phones (and even the Lync client) handle regular expressions in different ways. For anyone familiar with the minutiae (thanks Word of the Day Calendar) of Lync Enterprise Voice, regular expressions are used for phone number normalization and routing.
All Lync-qualified devices handle basic regex in the same manner. For most people, this is not an issue, but sometimes you need to use lesser-known regex patterns to accomplish certain goals. When you try to get fancy with regex, things sometimes break down with disparate results between clients and phones.
Over the years, Ive heard about and personally experienced situations where Lync Optimizer-generated regex doesnt work on specific phones. The affected vendors have eventually fixed these issues, but I figured that I should do some hardcore 60 Minutes style investigation into just how well different devices handle some of the more esoteric regex patterns out there.
To do this, I used my wide variety of Lync deskphones I have available for testing in my home lab. (You should hear what it sounds like when someone phones me. Everything is ringing and flashing, and I usually go "AAAAhhhhh" and run out of the room.) I used a Polycom VVX 600, a Polycom CX600, a Snom 760, and an old LG-Nortel "Tanjay" series phone. My AudioCodes 440HD wasnt connecting to our Lync server at the time, so it was left out of testing. Every phone was updated with the latest firmware available at testing time.
For the first round of the Lync Normalization Olympics, I decided to keep it simple, using some of the rule formats commonly used by the Lync Optimizer. I input the normalization rules into Lync via the Control Panel, and I used the "Create voice routing test case information" to see how Lync Control Panel would interpret the regex, and included that in the results too. For all the rulesets, I used a unique "identifier" for each rule to allow me to setup a series that I could test all at once.
Commas
First off, I used a regex rule that included a comma, which I had heard caused normalization issues with one particular vendor. In the below example, Im looking for a pattern of numbers that starts with 999 followed by 10 or 11 digits. The 999 gets stripped and puts a + in its place.Pattern: ^999(d{10,11})$
Translation: +$1
Input number: 99915552223333 then 9995552223333
Expected output:+15552223333 then +5552223333
Input number: 99915552223333 then 9995552223333
Expected output:
Control Panel | Lync Client | Tanjay Phone | Polycom CX | Polycom VVX | Snom |
+15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 |
+5552223333 | +5552223333 | +5552223333 | +5552223333 | +5552223333 | +5552223333 |
Dealing with Extensions, Part I
Another common rule used in the Optimizer is a rule to accept any dialed number, but strip out any extensions and leave it as an externally routable E.164 number. This rule was meant mostly for click-to-dial from the Lync client, but I did hear about a particular vendors phone not being able to parse that particular rule. In the next example, Im looking for a pattern of numbers that starts with 998 followed by 11 digits, followed by some regex that would strip anything else beyond the 11 digits. The 998 gets stripped and puts a + in its place.Pattern: ^998(d{11})d*(D+d+)?$
Translation: +$1
Input number: 99815552223333
Expected output:+15552223333
Input number: 99815552223333
Expected output:
Control Panel | Lync Client | Tanjay Phone | Polycom CX | Polycom VVX | Snom |
+15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 |
Dealing with Extensions, Part II
The next rule test is a variation on the first example above, using a slightly different regex pattern to strip any extension. This format was used a long time ago in the Lync Optimizer, but was replaced with the "cleaner" looking version above. One person told me he used this format to get around an issue with a particular phone vendors issue parsing the previous examples format.Pattern: ^997(d{11})(s*S*)*?$
Translation: +$1
Input number: 99715552223333
Expected output:+15552223333
Input number: 99715552223333
Expected output:
Control Panel | Lync Client | Tanjay Phone | Polycom CX | Polycom VVX | Snom |
+15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 |
Extensions with Optional Site Code
In companies with multiple sites that provide its users with extension dialing, its often necessary to use site codes to properly differentiate one site from another. This is especially true if there are overlapping extension ranges originating from legacy PBXs. So, for a company using 4-digit extensions, they may prepend a site code to ensure uniqueness. So, Site #101 could use 1015xxx for their extension range, and Site #102 could use 1025xxx for their extension range.One particular company wanted to ensure that users within any given site could dial their own users by extension without the prepended site code, but still allow them to enter the site code if they wanted to. I did this in one rule by using a bit of fancy regex. In the below example, a user should be able to enter the extension with or without the 996 "site code".
Pattern: ^(?:996)?(5d{3})$$
Translation: +1555222$1
Input number: 9965000 then 5000
Expected output:+15552225000
Sigh, once again, no issues on any platform.
OK, boooooring. All the phones passed the basic regex tests without issue. Not at all "exciting". I use "exciting" in quotes, because how exciting can the topic of normalization be?Input number: 9965000 then 5000
Expected output:
Control Panel | Lync Client | Tanjay Phone | Polycom CX | Polycom VVX | Snom |
+15552225000 | +15552225000 | +15552225000 | +15552225000 | +15552225000 | +15552225000 |
+15552225000 | +15552225000 | +15552225000 | +15552225000 | +15552225000 | +15552225000 |
Multiple Replacement Strings
Sometimes you need to use multiple replacement strings in a rule (ie $1, $2 etc). Rumour has it some phones have issues with this.Pattern: ^996(d{3})000(d{7})$
Translation: +1$1$2
Input number: 9965550002223333
Expected output:+15552223333
Input number: 9965550002223333
Expected output:
Control Panel | Lync Client | Tanjay Phone | Polycom CX | Polycom VVX | Snom |
+15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 | +15552223333 |
Advanced Normalization Tests
To really gauge how well a phones regex engine functions, we have to throw at it some regex that would only be used in very specific circumstances.
For example, lets say I have a need to translate a set of digits into a pattern where those digits are buried within other numbers. For example, I want to take 92xx and translate it to +155592xx999. So, 9201translates to +15559201999. If you try to create a pattern like ^(92d{2})$ and translate it to +1555$1999, there is ambiguity on whether you mean +1555 $1 999 or +1555 $19 99.
Normally, I would follow the .NET regex reference which has different ways of dealing with this, but none of them work in the Lync client. I did stumble upon one that did work, even though it isn�t a valid .NET expression. Not only that, but testing with other phones gave different results most of the time.
I tried various patterns, and plugged in 9100, 9200, 9300 etc to see the results in both Control Panel and Lync softclient and desk phones. The results proved interesting.
Pattern: ^(91d{2})$
Translation: +1555$1999
Input number: 9100
Expected output: Expect failure, but want to see how various devices handle this situation.
Input number: 9100
Expected output: Expect failure, but want to see how various devices handle this situation.
link download
Subscribe to:
Post Comments (Atom)
|
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.