iplc-t5-model / README.md
pdarleyjr's picture
Add model files with LFS tracking
b507028
---
language: en
tags:
- clinical-nlp
- summarization
- speech-language-pathology
- t5
license: mit
datasets:
- custom
metrics:
- rouge
---
# IPLC T5 Clinical Report Generator
This is a fine-tuned T5 model specialized in generating clinical report summaries for speech-language pathology evaluations. The model has been trained on a custom dataset of clinical reports and evaluation forms.
## Model Description
- **Model Type:** T5 (Text-to-Text Transfer Transformer)
- **Base Model:** t5-small
- **Task:** Clinical Report Summarization
- **Domain:** Speech-Language Pathology
- **Language:** English
## Intended Use
This model is designed to assist speech-language pathologists in generating clinical report summaries from structured evaluation data. It can process information about:
- Patient demographics
- Diagnostic information
- Language assessments
- Clinical observations
- Evaluation results
## Training Data
The model was fine-tuned on a custom dataset of speech-language pathology evaluation reports and clinical documentation.
## Usage
```python
from transformers import T5ForConditionalGeneration, T5Tokenizer
model = T5ForConditionalGeneration.from_pretrained("pdarleyjr/iplc-t5-model")
tokenizer = T5Tokenizer.from_pretrained("pdarleyjr/iplc-t5-model")
text = "summarize: evaluation type: initial. primary diagnosis: F84.0. severity: mild. primary language: english"
input_ids = tokenizer.encode(text, return_tensors="pt", max_length=512, truncation=True)
outputs = model.generate(
input_ids,
max_length=256,
num_beams=4,
no_repeat_ngram_size=3,
length_penalty=2.0,
early_stopping=True
)
summary = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(summary)
```
## Limitations
- The model is specifically trained for speech-language pathology evaluations
- Input should follow the expected format for optimal results
- Clinical judgment should always be used to verify generated summaries