OpenAI open-sources Privacy Filter, a model that automatically detects and masks private information in text locally

ME News, April 23 (UTC+8): According to Beating's monitoring, OpenAI has open-sourced Privacy Filter, a locally deployed text de-identification model, under the Apache 2.0 license. Given input text, the model automatically identifies eight types of personally identifiable information (PII), namely names, emails, phone numbers, addresses, accounts, URLs, dates, and keys, and then labels or masks them. The entire process runs locally, so no data needs to be sent to the cloud. The model has 1.5B total parameters but uses a sparse mixture-of-experts architecture, so only 50M parameters are activated per inference, which allows it to run on a laptop or even in a browser. The context window is 128K tokens, and all private information in the input can be annotated in a single forward pass. Users can adjust the trade-off between precision and recall through preset operating points, or fine-tune the model on their own data to adapt it to specific scenarios. The model is primarily English-oriented and has limited multilingual capabilities. (Source: BlockBeats)
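To illustrate how masking and a precision/recall operating point might fit together, here is a minimal Python sketch. It does not use or reproduce Privacy Filter's actual interface: the `Span` type, the `mask_text` function, the label strings, the operating-point names, and the threshold values are all hypothetical, and the detected spans are hard-coded rather than produced by a model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """A detected PII span (hypothetical structure, not the model's real output)."""
    start: int    # index of the first character of the span
    end: int      # index one past the last character of the span
    label: str    # e.g. "NAME" or "EMAIL" (illustrative label strings)
    score: float  # detector confidence in [0, 1]


# Assumed presets: a higher threshold favors precision, a lower one favors recall.
OPERATING_POINTS = {"high_precision": 0.9, "balanced": 0.5, "high_recall": 0.2}


def mask_text(text: str, spans: list[Span], operating_point: str = "balanced") -> str:
    """Replace every span scoring at or above the preset threshold with [LABEL]."""
    threshold = OPERATING_POINTS[operating_point]
    kept = sorted((s for s in spans if s.score >= threshold), key=lambda s: s.start)

    pieces = []
    cursor = 0
    for span in kept:
        if span.start < cursor:
            continue  # skip a span that overlaps one already masked
        pieces.append(text[cursor:span.start])
        pieces.append(f"[{span.label}]")
        cursor = span.end
    pieces.append(text[cursor:])
    return "".join(pieces)


if __name__ == "__main__":
    text = "Email Ann at ann@example.com today."
    name_start = text.index("Ann")
    mail_start = text.index("ann@example.com")
    spans = [
        Span(name_start, name_start + len("Ann"), "NAME", 0.6),
        Span(mail_start, mail_start + len("ann@example.com"), "EMAIL", 0.95),
    ]
    print(mask_text(text, spans, "balanced"))
    print(mask_text(text, spans, "high_precision"))
```

In this sketch, the stricter preset leaves the lower-confidence name unmasked while still masking the email address, which is the kind of trade-off an operating point controls: fewer false positives at the cost of missing some PII.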