Tada-tot
/

BrainVision

Image Segmentation

Model card Files Files and versions

BrainVision / README.md

Tada-tot's picture

Update README.md

bdadd37 verified 11 days ago

|

history blame contribute delete

2.78 kB

	---
	license: mit
	tags:
	- vision
	- image-segmentation
	- unet
	- resnet
	- medical
	- pytorch
	model-index:
	- name: BrainVision
	results:
	- task:
	type: image-segmentation
	name: Image Segmentation
	dataset:
	name: LGG MRI Segmentation Dataset
	type: lgg-mri-segmentation-dataset
	split: test
	metrics:
	- type: dice_score
	value: 0.8898
	name: Dice Score
	- type: iou_score
	value: 0.8015
	name: IoU Score
	- type: accuracy
	value: 0.9963
	name: Accuracy
	- type: precision
	value: 0.882
	name: Precision
	- type: recall
	value: 0.8978
	name: Recall
	---
	# Brain Tumor Segmentation
	#### Model Description
	This model is a U-Net based deep learning model for the segmentation of brain tumors from 2D MRI slices. It is an end-to-end segmentation model designed to assist radiologists by automatically identifying and outlining tumorous regions. The model leverages transfer learning by using a pre-trained ResNet50 backbone as its encoder.

	#### Intended Use & Limitations
	This model is intended for research and educational purposes. It is a tool for exploring deep learning for medical image segmentation.

	Limitations:
	- The model was trained on a specific dataset and may not generalize well to other datasets from different scanners, protocols, or patient populations.
	- It should not be used for clinical diagnosis without validation by a qualified medical professional.

	#### Training Data
	The model was trained on the [LGG MRI Segmentation Dataset](https://www.kaggle.com/datasets/mateuszbuda/lgg-mri-segmentation) from Kaggle.
	- Volume: 3,929 MRI images from 110 patients.
	- Annotation: Expert-annotated binary masks for tumor regions.
	- Image Size: 256x256 pixels with 3 channels (converted to RGB).

	#### Training Procedure
	The model was trained using a two-stage fine-tuning approach with PyTorch.

	Model Architecture:
	- Architecture: U-Net
	- Encoder: ResNet50 with imagenet pre-trained weights.
	- Decoder: A symmetric expanding path with skip connections.
	- Output: A single-channel probability map.

	Hyperparameters:
	- Optimizer: AdamW
	- Loss Function: A composite loss function combining Dice Loss and Binary Cross-Entropy (BCE).

	$$L_{\text{combo}} = L_{\text{dice}} + L_{\text{BCE}}$$

	- Epochs:
	- Stage 1 (Frozen Encoder): 5 epochs at a learning rate of 1e−4.
	- Stage 2 (Fine-tuning): 15 epochs at a learning rate of 1e−5.
	- Learning Rate Scheduler: ReduceLROnPlateau with patience=3, factor=0.1.
	- Batch Size: 16

	#### Evaluation Results
	The model was evaluated on an unseen test set composed of 10% of the patient data.

	\| Metric \| Value \|
	\| :--- \| :--- \|
	\| Dice Score \| 0.8898 \|
	\| IoU Score \| 0.8015 \|
	\| Accuracy \| 0.9963 \|
	\| Precision \| 0.8820 \|
	\| Recall \| 0.8978\|