leynessa's picture
Upload 8 files
ff52cdd verified
raw
history blame contribute delete
442 Bytes
---
### `Data_Processing.md`
```markdown
# Data Processing Steps
## 1. Cleaning
- Lowercase all text
- Remove punctuation and numbers
- Remove stopwords (e.g., "the", "and", "is")
- Lemmatize words to base form (e.g., "running" β†’ "run")
## 2. Example
**Raw:** "Paid monthly apartment rent - confirmed"
**Cleaned:** "paid monthly apartment rent confirmed"
## 3. Output
Processed dataset saved as `cleaned_data.csv`