license: apache-2.0 | |
tags: | |
- token-classification | |
- pii-detection | |
- privacy | |
- named-entity-recognition | |
language: | |
- en | |
pipeline_tag: token-classification | |
# PII Detection Model | |
This model is fine-tuned for detecting Personally Identifiable Information (PII) in text. | |
## Detected PII Types | |
- Names | |
- Email addresses | |
- Phone numbers | |
- Aadhaar numbers | |
- PAN cards | |
- Credit card numbers | |
## Usage | |
```python | |
from transformers import pipeline | |
# Load the model | |
pii_pipe = pipeline( | |
"token-classification", | |
model="Dombara/pii-detection-model", | |
aggregation_strategy="simple" | |
) | |
# Detect PII | |
text = "Contact John Doe at john@example.com or 9876543210" | |
results = pii_pipe(text) | |
print(results) | |
``` | |
## Model Details | |
- Base model: DeBERTa | |
- Fine-tuned for PII detection | |
- Supports English text | |