[Issue #1745] Setup transformation process structure & transform an opportunity #1794
Conversation
logger = logging.getLogger(__name__)

class TransformOracleData(Task):
@jamesbursa - FYI I'm imagining that the script we'll have for the copy + transform will just look like:
def whatever_entrypoint_func(db_session):
ExtractLoadOracleData(db_session).run()
TransformOracleData(db_session).run()
Looks good, thanks
raise ValueError("Cannot delete opportunity as it does not exist")

self.increment(self.Metrics.TOTAL_RECORDS_DELETED)
self.db_session.delete(target_opportunity)
We might need to review this once we have foreign keys pointing to it, and change it to setting is_deleted or similar.
We have the cascades set up on the relationships, so it's safe to delete with the foreign keys.
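The cascade behavior being discussed can be sketched in SQLAlchemy roughly like this. This is an illustrative sketch only: the model and column names below are hypothetical stand-ins, not the project's actual schema.

```python
from sqlalchemy import Column, ForeignKey, Integer, create_engine
from sqlalchemy.orm import Session, declarative_base, relationship

Base = declarative_base()

class Opportunity(Base):
    __tablename__ = "opportunity"
    opportunity_id = Column(Integer, primary_key=True)
    # "all, delete-orphan" makes db_session.delete(opportunity) also delete
    # every child row, so no dangling foreign keys are left behind.
    assistance_listings = relationship(
        "AssistanceListing", back_populates="opportunity", cascade="all, delete-orphan"
    )

class AssistanceListing(Base):
    __tablename__ = "assistance_listing"
    assistance_listing_id = Column(Integer, primary_key=True)
    opportunity_id = Column(Integer, ForeignKey("opportunity.opportunity_id"))
    opportunity = relationship("Opportunity", back_populates="assistance_listings")
```

With a cascade configured this way, a hard delete of the parent cleans up child rows through the ORM rather than leaving it to the database.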
updated_timestamp = convert_est_timestamp_to_utc(source.last_upd_date)  # type: ignore[attr-defined]

if created_timestamp is not None:
    target.created_at = created_timestamp
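For context, a helper like convert_est_timestamp_to_utc can be approximated with the standard library. This is a sketch of the conversion only; the project's actual implementation may differ:

```python
from datetime import datetime, timezone
from typing import Optional
from zoneinfo import ZoneInfo

US_EASTERN = ZoneInfo("America/New_York")

def convert_est_timestamp_to_utc(timestamp: Optional[datetime]) -> Optional[datetime]:
    """Interpret a naive legacy timestamp as US/Eastern and convert it to UTC."""
    if timestamp is None:
        return None
    # Legacy Oracle values are naive, so attach the Eastern zone first,
    # then convert to UTC (DST is handled by the zone database).
    return timestamp.replace(tzinfo=US_EASTERN).astimezone(timezone.utc)
```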
I'd suggest we have two pairs of columns. Keep the ones in TimestampMixin managed by SQLAlchemy with that meaning, and add two new columns for these upstream datetimes.
How would you suggest we set this up and use it? I set them this way so that when someone fetches these records from our API, they have the same values as the legacy system.
How would we want to map these for the API if we add two more columns? A user shouldn't need to be aware that we technically updated everything when we imported it (that's irrelevant to them).
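One way to read the two-pairs-of-columns suggestion is sketched below. Everything here is hypothetical (the legacy_* column names and the api_created_at helper are made up for illustration, not from the codebase):

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

@dataclass
class OpportunityTimestamps:
    # Pair managed by the TimestampMixin: when *our* system
    # inserted/updated the row.
    created_at: Optional[datetime] = None
    updated_at: Optional[datetime] = None
    # Hypothetical second pair mirroring the upstream legacy values.
    legacy_created_at: Optional[datetime] = None
    legacy_updated_at: Optional[datetime] = None

def api_created_at(record: OpportunityTimestamps) -> Optional[datetime]:
    # The API layer could prefer the legacy value when one exists, so a user
    # still sees the same timestamps as the legacy system and never learns
    # that we technically rewrote everything during the import.
    return record.legacy_created_at or record.created_at
```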
This is a follow-up to #1794, which it builds upon.
Summary
Fixes #1746
Time to review: 10 mins
Changes proposed
Adds transformation logic for the assistance listing (formerly CFDA) tables.
Context for reviewers
The transformations are pretty uneventful; the only complexity is that the legacy Oracle database doesn't have a foreign key between the `TopportunityCfda` table and the `Topportunity` table, and there are ~2300 orphaned CFDA records that we wouldn't be able to import. So we additionally need to validate that the opportunity exists when we try to transform the data; if it doesn't, we just mark the record as "transformed" and do nothing with it.
There is some basic work on relationships between the staging tables + more factories for setting up the data, which will be ongoing / @jamesbursa is also looking into.
Co-authored-by: nava-platform-bot <[email protected]>
Summary
Fixes #1745
Time to review: 10 mins
Changes proposed
Setup the transformation process script structure
Implement some shared utilities that subsequent PRs will use
Implement the transformation logic for opportunities
Context for reviewers
A lot of setup in this PR, much of which can be reused in the subsequent PRs that add transformations for the other sets of tables. Tried to make sure those won't require refactoring or pulling out implementation details, by setting up utils like the timestamp conversions + the initial query to the DB to fetch the records to transform.
As far as the implementation goes, determining what needs to be transformed is pretty simple: the transformed_at column is null. There is then a second column that says whether the record should be deleted. When we query the staging tables, we also join with the relevant table we'll transform to (opportunity in this case); that way we already have the "source" and "destination" records and just need to modify the destination record (or create it if it's an insert).
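A toy, in-memory version of that fetch logic is below. The real task runs a SQL join against the staging tables; the function and field names here are illustrative only:

```python
from typing import Any, Dict, List, Optional, Tuple

def records_to_transform(
    staging_rows: List[Dict[str, Any]],
    destination_by_id: Dict[int, Any],
) -> List[Tuple[Dict[str, Any], Optional[Any]]]:
    # A staging row needs processing when transformed_at is still null.
    # Pair it with its destination record; None means this is an insert
    # rather than an update.
    return [
        (row, destination_by_id.get(row["opportunity_id"]))
        for row in staging_rows
        if row["transformed_at"] is None
    ]
```

A row that was already transformed is skipped entirely, and a row whose destination lookup comes back empty falls through to the insert path.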