request to add gcp-implementation-demos to main branch #315

tgaillard1 · 2023-02-03T19:57:44Z

Holt Skinner asked if I could add my repo to the main branch of document-ai-samples.

google-cla · 2023-02-03T19:57:48Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

holtskinner · 2023-02-03T20:18:50Z

Ok, the linter checks are failing. Looks like most are format-related.

You can use the contributing guide for information on the structure and the linters used:
https://github.com/GoogleCloudPlatform/document-ai-samples/blob/main/.github/CONTRIBUTING.md#code-quality-checks

tgaillard1 · 2023-02-03T20:27:24Z

Updated after running the linter to resolve the formatting errors.

holtskinner

The main question I have is why these two separate applications are needed when most of the codebase seems to be the same?

This also follows a very similar structure to these existing projects.

I think it would make more sense to create a more general system that could use a schema file or create new fields based on the entities that are extracted.

Then this could be simplified into a single codebase with other applications using the examples.
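
One way to picture the "create new fields based on the entities that are extracted" idea is the minimal sketch below. It is not code from this PR: `column_name` and `schema_from_entities` are hypothetical helpers, entities are assumed to arrive as simple `(type, text)` pairs, and every derived column is assumed to be a STRING.

```python
import re

# Hypothetical helpers: build BigQuery-style field definitions from extracted
# entities instead of a hard-coded list. Entities are assumed to be
# (type, text) pairs; a real Document AI response would need to be mapped
# into this shape first.


def column_name(entity_type: str) -> str:
    """Turn an entity type into a name made of letters, digits and underscores."""
    name = re.sub(r"[^0-9A-Za-z_]", "_", entity_type.strip()).lower()
    # Column names may not start with a digit, so prefix those with "_".
    return f"_{name}" if name[:1].isdigit() else name


def schema_from_entities(entities):
    """Return one STRING field per distinct entity type, in first-seen order."""
    fields = [{"name": "document_name", "type": "STRING", "mode": "NULLABLE"}]
    seen = {"document_name"}
    for entity_type, _text in entities:
        name = column_name(entity_type)
        if name and name not in seen:
            seen.add(name)
            fields.append({"name": name, "type": "STRING", "mode": "NULLABLE"})
    return fields


entities = [
    ("agreement_date", "2021-01-05"),
    ("governing-law", "Delaware"),
    ("agreement_date", "2021-01-05"),
]
schema = schema_from_entities(entities)
print([field["name"] for field in schema])
# prints ['document_name', 'agreement_date', 'governing_law']
```

With something like this, both demos could share one code path and differ only in the processor they call. Typed columns such as DATE would still need a schema file or a per-field override.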

holtskinner · 2023-02-03T20:52:06Z

gcp-implementation-demos/docai-contract-demo/.env.local

+export PROJECT_ID=shared-vpc-host-254614
+export BUCKET_LOCATION=us-central1
+export CLOUD_FUNCTION_LOCATION=us-central1
+export INGRESS_SETTINGS=internal-and-gclb
+export CLOUD_FUNCTION_SERVICE_ACCOUNT=tg-docai
+export BQ_DATASET_NAME=acme_contract_parser_results
+export BQ_TABLE_NAME=doc_ai_extracted_entities
+export COMPANY_NAME=acme


Don't commit project IDs or other global identifiers to repositories; replace them with a placeholder like YOUR_PROJECT_ID.
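
If the checked-in file only carries placeholders, the function code could refuse to run until a real value is supplied. The sketch below assumes the values reach the process as environment variables; `require_env` and the `PLACEHOLDERS` set are hypothetical names, not part of this PR.

```python
import os

# Placeholder values are assumptions for this sketch; use whatever the
# checked-in .env.local template actually contains.
PLACEHOLDERS = {"", "YOUR_PROJECT_ID"}


def require_env(name: str, environ=None) -> str:
    """Return the variable's value, or raise if it is unset or still a placeholder."""
    environ = os.environ if environ is None else environ
    value = environ.get(name, "").strip()
    if value in PLACEHOLDERS:
        raise RuntimeError(f"Set {name} in your environment before running the demo.")
    return value


# Example with an explicit mapping so the sketch does not depend on the real environment.
print(require_env("PROJECT_ID", {"PROJECT_ID": "my-test-project"}))
# prints my-test-project
```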

holtskinner · 2023-02-03T20:53:03Z

gcp-implementation-demos/docai-contract-demo/cloud-functions/main.py

+        source_format=bigquery.SourceFormat.NEWLINE_DELIMITED_JSON,
+        ignore_unknown_values=True,
+        schema=[
+            bigquery.SchemaField("document_name", "STRING"),
+            bigquery.SchemaField("agreement_date", "DATE"),
+            bigquery.SchemaField("arbitration_venue", "STRING"),
+            bigquery.SchemaField("confidentiality_clause", "STRING"),
+            bigquery.SchemaField("effective_date", "DATE"),
+            bigquery.SchemaField("expiration_date", "DATE"),
+            bigquery.SchemaField("governing_law", "STRING"),


Is there any way to change this to use a schema.json file or something similar? Hard-coding the fields here makes this less reusable.
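
A rough sketch of what the schema.json approach could look like, assuming the file holds a JSON list of objects with `name`, `type`, and optional `mode` keys (the same shape BigQuery uses for field definitions). `load_schema_fields` and `to_bigquery_schema` are hypothetical helpers, and the temporary file only stands in for a real checked-in schema.json.

```python
import json
import tempfile
from pathlib import Path


def load_schema_fields(path):
    """Read a list of {"name", "type", "mode"} field definitions from a JSON file."""
    fields = json.loads(Path(path).read_text(encoding="utf-8"))
    for field in fields:
        if "name" not in field or "type" not in field:
            raise ValueError(f"Schema field needs 'name' and 'type': {field!r}")
        field.setdefault("mode", "NULLABLE")
    return fields


def to_bigquery_schema(fields):
    """Convert field dicts to SchemaField objects (needs google-cloud-bigquery)."""
    from google.cloud import bigquery  # imported lazily so the loader works without it

    return [bigquery.SchemaField.from_api_repr(field) for field in fields]


# Demo with a temporary file standing in for a checked-in schema.json.
example = [
    {"name": "document_name", "type": "STRING"},
    {"name": "agreement_date", "type": "DATE"},
    {"name": "governing_law", "type": "STRING"},
]
with tempfile.TemporaryDirectory() as tmp:
    schema_path = Path(tmp) / "schema.json"
    schema_path.write_text(json.dumps(example), encoding="utf-8")
    fields = load_schema_fields(schema_path)

print([(f["name"], f["type"], f["mode"]) for f in fields])
```

The result of `to_bigquery_schema(fields)` could then be passed as `schema=` in place of the inline list, so each demo would only need its own schema.json. The BigQuery client also offers `Client.schema_from_json`, which may make the custom loader unnecessary.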

tgaillard1 · 2023-03-03T22:47:30Z

Just checking whether anything else is needed from my end to get this PR merged.

holtskinner · 2023-09-21T15:26:34Z

Sorry, I didn't notice that you made an additional comment. I'd recommend combining these two demos into a single demo, because most of the codebase is exactly the same.

tgaillard1 added 2 commits February 3, 2023 12:49

adding gcp-implementation-demos to document-ai-samples repo

a2eb0db

changed owners back

6e3d94d

tgaillard1 requested a review from a team as a code owner February 3, 2023 19:57

ran linter on repo and updated errors

cb63ff8

tgaillard1 requested review from holtskinner, HSbedi87 and dojowahi as code owners February 3, 2023 20:22

Merge branch 'main' into main

4882ae5

holtskinner reviewed Feb 3, 2023

Merge branch 'main' into main

c5d2929

holtskinner requested a review from yoshi-approver as a code owner September 21, 2023 15:26

holtskinner closed this Feb 27, 2024
