Addition of leniency parameter in predefined PhoneRecognizer #1311

VMD7 · 2024-02-23T19:54:50Z

Addition of leniency parameter in predefined PhoneRecognizer

Change Description

-> The PhoneRecognizer failed to detect phone numbers that directly follow text with no separating space, as in testdata630-596-1111.

-> Internally, the PhoneRecognizer uses the phonenumbers library for phone number detection. The library exposes a leniency parameter that controls how strictly a candidate must match a phone number format. It ranges from 0 to 3, with higher values requiring a stricter match.

-> Previously, the leniency in PhoneRecognizer was fixed at 1, which caused it to miss phone numbers with no space between the text and the digits. PhoneRecognizer now accepts a leniency parameter that users can adjust to their requirements; the default remains 1.

-> This change fixes detection for inputs such as the example above and lets users tune the recognizer's strictness to their needs.
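
To illustrate the idea of a leniency threshold, here is a minimal standard-library sketch. It is a toy stand-in, not the phonenumbers implementation and not the PhoneRecognizer code: the function name `find_phone_numbers`, the regular expression, and the rule that levels 1 and above reject a number glued to a preceding letter are all assumptions made for illustration.

```python
import re
from typing import List

# Hypothetical, deliberately narrow pattern: only NNN-NNN-NNNN is recognized.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")


def find_phone_numbers(text: str, leniency: int = 1) -> List[str]:
    """Return phone-like substrings of text, filtered by a toy leniency level.

    Mirrors the 0-3 scale described above: 0 is the most permissive,
    3 the strictest. Only the boundary between 0 and 1 is modelled here.
    """
    if not 0 <= leniency <= 3:
        raise ValueError("leniency must be between 0 and 3")

    results = []
    for match in PHONE_PATTERN.finditer(text):
        start = match.start()
        glued_to_letter = start > 0 and text[start - 1].isalpha()
        # Assumption for illustration: from level 1 upward, a candidate
        # that directly follows a letter is rejected.
        if leniency >= 1 and glued_to_letter:
            continue
        results.append(match.group())
    return results


sample = "testdata630-596-1111"
default_matches = find_phone_numbers(sample)                  # default leniency=1
permissive_matches = find_phone_numbers(sample, leniency=0)   # most permissive
spaced_matches = find_phone_numbers("call 630-596-1111")      # space before digits
```

In this sketch the default level skips the number in `sample`, level 0 returns it, and a number preceded by a space is returned at the default level, which matches the behavior described in the bullets above.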

Issue reference

This PR fixes issue #1301

Checklist

I have reviewed the contribution guidelines
I have signed the CLA (if required)
My code includes unit tests
All unit tests and lint checks pass locally
My PR contains documentation updates / additions if required

omri374

Thanks!

omri374 · 2024-02-24T10:44:34Z

/azp run

azure-pipelines · 2024-02-24T10:44:47Z

Azure Pipelines successfully started running 1 pipeline(s).

omri374 · 2024-02-24T10:46:05Z

Could be relevant to revisit #772. The phone number recognizer makes some assumptions, such as which regions it searches for phone numbers (searching all regions is slower) and other hyperparameters such as leniency.

VMD7 · 2024-02-24T13:57:14Z

Yes sure, will go through it.

VMD7 and others added 3 commits February 23, 2024 22:19

Added leniency parameter

c3f92a1

Added test cases for phone recognizer leniency

bdb3e3d

Merge branch 'microsoft:main' into testing-predefined-recognizers

944ce76

omri374 approved these changes Feb 24, 2024

omri374 merged commit 4c48b92 into microsoft:main Feb 24, 2024
31 checks passed