Updating description
app.py CHANGED
@@ -10,14 +10,14 @@ warnings.filterwarnings('ignore')
 
 # Define problem statement
 problem_statement = """
-###
-
+### Overview
+This project aims to generate descriptive spoken captions for images, leveraging CNNs and RNNs for feature extraction and sequence generation, respectively. The model is trained on the Flickr8K dataset and extended with an attention mechanism for enhanced accessibility.
 """
 
 # Define solution overview
 solution_overview = """
 ### Solution Overview
-The basic model, trained for a limited duration without extensive hyperparameter tuning, primarily focuses on exploring
+The basic model, trained for a limited duration without extensive hyperparameter tuning, primarily focuses on exploring the integration of the attention mechanism with the Encoder-Decoder architecture for image processing. To improve inference quality, Vit-GPT2 architecture is integrated. [Visit the Kaggle notebook](https://www.kaggle.com/code/krishna2308/eye-for-blind) for implementation details.
 """
 
 # Define real-life scenario application
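
The updated solution overview points to a ViT-GPT2 encoder-decoder for higher-quality captions. Below is a minimal sketch of what that inference path could look like with the Hugging Face transformers library; the checkpoint name, image path, and gTTS speech step are illustrative assumptions and are not taken from app.py or the linked notebook.

```python
# Minimal sketch (not from this commit): ViT-GPT2 captioning with transformers.
# The checkpoint, image path, and gTTS step are assumptions for illustration.
from transformers import VisionEncoderDecoderModel, ViTImageProcessor, AutoTokenizer
from PIL import Image

checkpoint = "nlpconnect/vit-gpt2-image-captioning"  # assumed public ViT-GPT2 checkpoint
model = VisionEncoderDecoderModel.from_pretrained(checkpoint)
processor = ViTImageProcessor.from_pretrained(checkpoint)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)

image = Image.open("example.jpg").convert("RGB")  # hypothetical input image
pixel_values = processor(images=image, return_tensors="pt").pixel_values
output_ids = model.generate(pixel_values, max_length=32, num_beams=4)
caption = tokenizer.decode(output_ids[0], skip_special_tokens=True)
print(caption)

# The app presumably reads the caption aloud; gTTS is one way that could be done.
from gtts import gTTS
gTTS(caption).save("caption.mp3")
```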