OpenAI releases open-source model OpenAI Privacy Filter, capable of detecting and de-identifying personal privacy information in text

robot
Abstract generation in progress

Odaily Planet Daily News reports that OpenAI today released the open-source model OpenAI Privacy Filter, designed to detect and redline personal identifiable information (PII) in text. The model has 1.5 billion total parameters and 50 million active parameters, supporting a context window of up to 128k tokens. OpenAI Privacy Filter uses a bidirectional token classification architecture capable of identifying eight types of information, including personal names, addresses, emails, phone numbers, URLs, dates, accounts, and keys, achieving a 96% F1 score on the PII-Masking-300k benchmark. Currently, the model is available under the Apache 2.0 license on Hugging Face and GitHub, allowing developers to deploy locally and fine-tune.

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin