Commit

dep: add google vision ocr dependencies to extra-pdf-image dependencies
christinestraub committed Apr 23, 2024
1 parent 6a1b10b commit b50e4b4
Showing 5 changed files with 40 additions and 77 deletions.
6 changes: 1 addition & 5 deletions Makefile
@@ -103,7 +103,7 @@ install-xlsx:
	python3 -m pip install -r requirements/extra-xlsx.txt

.PHONY: install-all-docs
-install-all-docs: install-base install-csv install-docx install-epub install-odt install-pypandoc install-markdown install-msg install-pdf-image install-pptx install-xlsx install-googlevisionocr
+install-all-docs: install-base install-csv install-docx install-epub install-odt install-pypandoc install-markdown install-msg install-pdf-image install-pptx install-xlsx

.PHONY: install-all-ingest
install-all-ingest:
@@ -275,10 +275,6 @@ install-pandoc:
install-paddleocr:
	ARCH=${ARCH} ./scripts/install-paddleocr.sh

-.PHONY: install-googlevisionocr
-install-googlevisionocr:
-	python3 -m pip install -r requirements/extra-googlevisionocr.txt
-
## pip-compile: compiles all base/dev/test requirements
.PHONY: pip-compile
pip-compile:
4 changes: 0 additions & 4 deletions requirements/extra-googlevisionocr.in

This file was deleted.

68 changes: 0 additions & 68 deletions requirements/extra-googlevisionocr.txt

This file was deleted.

1 change: 1 addition & 0 deletions requirements/extra-pdf-image.in
@@ -13,3 +13,4 @@ unstructured-inference==0.7.27
# unstructured fork of pytesseract that provides an interface to allow for multiple output formats
# from one tesseract call
unstructured.pytesseract>=0.3.12
+google-cloud-vision
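Because `google-cloud-vision` now ships with the pdf-image extra instead of a separate `install-googlevisionocr` target, a quick import check can confirm that the package arrived with that extra. The following is a minimal sketch, not part of this commit; the helper name `is_module_available` is hypothetical, and `google.cloud.vision` is assumed to be the import path provided by the `google-cloud-vision` distribution.

```python
import importlib.util


def is_module_available(dotted_name: str) -> bool:
    """Return True if the module can be located without fully importing it."""
    try:
        return importlib.util.find_spec(dotted_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (for example "google.cloud") is missing.
        return False


if __name__ == "__main__":
    # Assumed import path for the google-cloud-vision distribution.
    if is_module_available("google.cloud.vision"):
        print("google-cloud-vision is installed")
    else:
        print("google-cloud-vision is missing; try `make install-pdf-image`")
```

The check uses `find_spec` rather than a plain `import` so that it does not trigger client initialization or credential lookups.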
38 changes: 38 additions & 0 deletions requirements/extra-pdf-image.txt
@@ -6,6 +6,8 @@
#
antlr4-python3-runtime==4.9.3
# via omegaconf
+cachetools==5.3.3
+# via google-auth
certifi==2024.2.2
# via
# -c ././deps/constraints.txt
@@ -43,6 +45,24 @@ fsspec==2024.3.1
# via
# huggingface-hub
# torch
+google-api-core[grpc]==2.18.0
+# via google-cloud-vision
+google-auth==2.29.0
+# via
+# google-api-core
+# google-cloud-vision
+google-cloud-vision==3.7.2
+# via -r ./extra-pdf-image.in
+googleapis-common-protos==1.63.0
+# via
+# google-api-core
+# grpcio-status
+grpcio==1.62.2
+# via
+# google-api-core
+# grpcio-status
+grpcio-status==1.62.2
+# via google-api-core
huggingface-hub==0.22.2
# via
# timm
@@ -147,11 +167,26 @@ pillow-heif==0.16.0
# via -r ./extra-pdf-image.in
portalocker==2.8.2
# via iopath
+proto-plus==1.23.0
+# via
+# google-api-core
+# google-cloud-vision
protobuf==4.23.4
# via
# -c ././deps/constraints.txt
+# google-api-core
+# google-cloud-vision
+# googleapis-common-protos
+# grpcio-status
# onnx
# onnxruntime
+# proto-plus
+pyasn1==0.6.0
+# via
+# pyasn1-modules
+# rsa
+pyasn1-modules==0.4.0
+# via google-auth
pycocotools==2.0.7
# via
# -c ././deps/constraints.txt
@@ -195,8 +230,11 @@ regex==2024.4.16
requests==2.31.0
# via
# -c ./base.txt
+# google-api-core
# huggingface-hub
# transformers
+rsa==4.9
+# via google-auth
safetensors==0.4.3
# via
# timm
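
The `# via` annotations in the compiled requirements record which requirement pulled in each pin. As a rough illustration of how to read them, the sketch below parses pip-compile style output into a mapping from package to its requesters. The function name `parse_via` and the `SAMPLE` text are hypothetical; the sample only copies a few pins from the diff above, and the parser assumes the simple `name==version` plus `# via` layout shown there.

```python
def parse_via(text: str) -> dict[str, list[str]]:
    """Map each pinned package to the requirements listed under its '# via'."""
    via: dict[str, list[str]] = {}
    current = None   # package whose annotations are being read
    in_via = False   # True once a '# via' line has been seen for `current`
    for raw in text.splitlines():
        line = raw.strip()
        if not line.startswith("#"):
            in_via = False
            if "==" in line:
                # Keep the bare name: drop the version and extras like "[grpc]".
                current = line.split("==")[0].split("[")[0]
                via[current] = []
            else:
                current = None
            continue
        body = line[1:].strip()
        if body == "via" or body.startswith("via "):
            in_via = True
            body = body[3:].strip()
        # Skip constraint (-c) and requirement-file (-r) references.
        if in_via and current and body and not body.startswith(("-c ", "-r ")):
            via[current].append(body)
    return via


SAMPLE = """\
cachetools==5.3.3
    # via google-auth
google-api-core[grpc]==2.18.0
    # via google-cloud-vision
google-auth==2.29.0
    # via
    #   google-api-core
    #   google-cloud-vision
google-cloud-vision==3.7.2
    # via -r ./extra-pdf-image.in
"""

if __name__ == "__main__":
    for package, requesters in parse_via(SAMPLE).items():
        print(package, "<-", requesters)
```

Run against the full `requirements/extra-pdf-image.txt`, a mapping like this makes it easy to see that pins such as `cachetools`, `pyasn1-modules`, and `rsa` are present only because `google-auth` requires them.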
