torch pytesseract streamlit layoutparser pdf2image git+https://github.com/facebookresearch/detectron2.git@v0.4#egg=detectron2 poppler-utils opencv-python-headless numpy Pillow wheel