streamlit
docx2txt
PyPDF2
python-dotenv
openai
sentence-transformers
transformers
nltk
requests
torch
torchvision