presidio sometimes misses very crucial information such telephone number #1474
Replies: 2 comments 1 reply
-
Hi, for telephone numbers specifically, Presidio is configured to search for numbers belonging to some countries but not all. See this issue, where the For addresses and others, I would suggest to look into transformers based models. See this example (still in PR) for a full presidio customization: https://github.com/microsoft/presidio-research/blob/omri/evaluator_updates/notebooks/5_Evaluate_Custom_Presidio_Analyzer.ipynb |
Beta Was this translation helpful? Give feedback.
-
How does it know about the region ? Through the phone number itself ? |
Beta Was this translation helpful? Give feedback.
-
Given that I used this tutorial https://github.com/microsoft/presidio/blob/main/docs/samples/python/example_pdf_annotation.ipynb
and this pdf example
personalpdf.pdf
The outputted pdf was
output_annotated.pdf
We could notice telephone number was missed to be annotated .
Is there any extra parameter or something I could do to avoid such mistakes ?
Beta Was this translation helpful? Give feedback.
All reactions