Feature/agent checking #104

SeanScripts · 2022-12-29T17:14:14Z

Creates AgentCheckerUtil class, with functions for finding reachable/unreachable pages and intents. Some other checking functions could be added in the future. Also adds a function to TestCases to get a dataframe of test case results for an agent, rerunning tests without results, or optionally rerunning all test cases. I had put this in the agent checking class, but decided it made more sense to include it in TestCases directly, since we have similar functions to get dataframes in the other core classes.

kmaphoenix · 2023-01-04T18:13:34Z

@MRyderOC please review PR and make optimization suggestions, particularly around the following:

Method length (Should aim for ~40 lines per style guide)
Nested statements (can these be condensed? Refactored?)
Arg Inputs (can these be condensed?)

MRyderOC

Some general points:

Most of the functions don't have proper documentation. Please follow Google Python Style Guide: Comments and Docstrings and make the changes accordingly.
To condense the input parameters, use either id or name for page and flow and be consistent with your choice. e.g. only use name (flow_name, page_name). If you want the user to have the ability to pass either name or id, define an internal helper method to identify which one is passed as an argument and return the one we're using in our code. e.g.

def helper_identify_flow_name_or_id(self, flow_name_or_id):
    # identify whether "flow_name_or_id" is a "flow_name" or a "flow_id"
    # Assuming that we use "flow_name" as an argument for methods like 'find_reachable_pages'
    if flow_name_or_id is like flow_name:
        return flow_name_or_id
    else:
        return self.flow_map[flow_name_or_id]

Most of the if statements could be more brief. I addressed some of them as "Brief if statement" to explain how we can achieve the same result with less code. I noticed that most of them use the hasattr built-in function.
Note that hasattr(obj, name) returns True if obj has the name attribute even though the value of the obj.name is None and for all the proto objects that we're working with the result of hasattr will be True.
For verbose=True change the print statements to logging.info and provide more readable information for the user.

src/dfcx_scrapi/tools/agent_checker_util.py

MRyderOC · 2023-01-19T22:36:18Z

src/dfcx_scrapi/tools/agent_checker_util.py

+          flow_id OR flow_name: The ID or name of the flow
+          from_page: (Optional) The page to start from. If left blank, it will
+            start on the Start Page
+          intent_route_limit: (Optional) Default None
+          include_groups: (Optional) If true, intents from transition route
+            groups will be included, but only if they are actually referenced
+            on some page
+          include_start_page_routes: (Optional) Default true
+          limit_intent_to_initial: (Optional) Default False. If true, only
+            take intent routes on the initial page, rather than on any page
+            in the traversal.
+          is_initial: (Optional) Default True
+          include_meta: (Optional) Default False. If true, includes special
+            transition targets like End Session, End Flow, etc.
+          verbose: (Optional) If true, print debug information about
+            route traversal


Could you please provide more information on intent_route_limit and is_initial?

It seems include_start_page_routes only affects the flow's Start Page (Line 519). That means if we pass some random page other than the Start page to from_page and pass include_start_page_routes it will return all reachable pages from both from_page and Start Page.

I added some documentation about intent_route_limit. is_initial should have been internal here. I realized that I broke the functionality earlier by trying to put is_initial into the params dict, but I think I managed to fix it.

Yes, that's the intended behavior. If include_start_page_routes is true, then start page intent routes will be treated as if they are in scope on every page, which is the way they actually work in the agent. This isn't very useful if you're starting from a page other than the start page with no limit on the number of intent routes taken. But with intent_route_limit = 1 for example, you can answer the question: "Starting from from_page, which pages can be reached in one conversation turn? (assuming all conditional routes are possible)". Choosing whether or not to allow start page routes to be in scope is pretty useful here.

MRyderOC · 2023-01-19T22:39:09Z

@kmaphoenix Would you please take a quick look at the comments? I'll appreciate your input.

kmaphoenix · 2023-01-31T16:18:10Z

src/dfcx_scrapi/core/test_cases.py

+        return "Default Start Flow"
+
+    # Note that flow id includes agent, normally...
+    def _convert_page(self, page_id, flow_id, pages_map):


@SeanScripts one way to optimize this section would be to create dictionary of your page_id mappings.

Then you can call .get on the dictionary directly and return INVALID if nothing matches.
This should significantly cut down on the number of if/else checks you need to do.

page_dict = { 'END_SESSION': 'End Session', 'END_FLOW': 'End Flow', etc. } display_name = page_dict.get(page_id, 'INVALID')

I added this dictionary and returned the mapping through it if the page ID was one of these special cases. However, we still need to look up the page name from the main dictionary of page names if it's not one of these special cases. I simplified the part after this a little bit by using .get.

Also, I know the latest version of Scrapi added the special cases to the list of pages, but I'm not sure what the format is, compared to what is returned in the test cases. In particular, whether or not the "pages" like END_SESSION include the flow ID. I might need to check that.

Looks like with the 1.6 update to get_pages_map, this check for the special pages isn't needed at all. Thanks! This function has been simplified.

kmaphoenix · 2023-01-31T16:20:32Z

src/dfcx_scrapi/core/test_cases.py

+        passed = []
+
+        for response in test_case_results:
+            # Collect untested cases to be retested


@SeanScripts If you have a commented section in a method where you are describing what that section does, you can just make that entire section a new method.

This will clean up your main method and provide better debug tracking later.

I moved this section to a new function and had it return a single-row dataframe for each test case response, which avoided initializing all the empty lists above. I kept the check for test cases that needed to be retested since it was just one condition.

src/dfcx_scrapi/core/test_cases.py

SeanScripts · 2023-07-10T18:35:55Z

@kmaphoenix I wrote some tests, though I wasn't able to run them using pytest in my environment. However, running the content of the tests with a demo agent, they're still working as expected.

…iscrepancies

kmaphoenix

Linted and tested locally.

SeanScripts requested a review from kmaphoenix December 29, 2022 17:14

SeanScripts self-assigned this Dec 29, 2022

kmaphoenix marked this pull request as draft January 4, 2023 18:09

kmaphoenix requested a review from MRyderOC January 4, 2023 18:09

MRyderOC reviewed Jan 19, 2023

View reviewed changes

kmaphoenix reviewed Jan 31, 2023

View reviewed changes

src/dfcx_scrapi/core/test_cases.py Show resolved Hide resolved

SeanScripts force-pushed the feature/agent_checking branch from 39ba9b3 to 612c1f9 Compare July 10, 2023 18:22

SeanScripts added 18 commits August 18, 2023 19:51

Create agent_checker_util.py

5ace003

Update notes

91688f4

Add utility ID conversion functions

332517d

Add function get_test_case_results

824007d

Add maps

444ad71

Simplify conversions

df4b671

Add docstrings

c6b3271

Include imports

da3a4f1

Fix references and missing imports

8a4d730

Create functions for finding reachable pages

d70e7c4

Fix tabbing and imports

9c76cdf

Implement get_page

5ac69a8

Get flow and page data

a8efb12

Fix page data

8414538

Fix find_unreachable_pages function and dependencies

e158d61

Clean up agent_checker_util.py

0bda9bb

Lint fixes

5ea1851

Optimize agent data loading

bdfa66f

kmaphoenix added 24 commits August 18, 2023 22:43

fix: Lint fixes

9a36968

fix: make agent_id required init arg

a6a27d2

feat: Implement additional export_agent options

47e68d5

feat: add get_flow_page_map method

83fcd48

chore: update gitignore

1fcf35c

feat: adding agent_extract feature for offline processing

f6bffdd

feat: add test case parsing

dbbf9fe

fix: linting

96f98e2

feat: Implement graph structure

3e4934c

feat: refactor graph recursion into extract class for finding graph d…

5a66d8b

…iscrepancies

fix: re sort AgentData class; add new fields and types

8d26316

fix: lint fixes

cc75794

fix: formatting

63e1356

feat: add processing for intents; cleanup old code

c117219

fix: add logging; handle lros; add lang_code support

81a7859

fix: added dir cleanup to avoid local file conflicts

be7fd26

fix: fixed display_name parsing for Intents/Entity Types

5f035d7

fix: add lang_code support; fix class type outputs

c739920

feat: implement recursion method for finding reachable pages in graph

66f7578

fix: lint fixes

f676476

chore: reverting tests due to testing refactor coming soon

66361c7

fix: modify active_intents type for downstream processing

a750daf

feat: refactor df code; implement get_unreachable_intents

85aa288

fix: lint fixes

67cf893

kmaphoenix marked this pull request as ready for review August 25, 2023 22:26

kmaphoenix added 2 commits August 27, 2023 12:58

chore: comment cleanup; unused code cleanup

da42187

fix: linting

c4edde8

kmaphoenix approved these changes Aug 27, 2023

View reviewed changes

kmaphoenix merged commit 77187bd into main Aug 28, 2023

kmaphoenix deleted the feature/agent_checking branch August 28, 2023 09:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/agent checking #104

Feature/agent checking #104

SeanScripts commented Dec 29, 2022

kmaphoenix commented Jan 4, 2023

MRyderOC left a comment

MRyderOC Jan 19, 2023

SeanScripts Jan 31, 2023

MRyderOC commented Jan 19, 2023

kmaphoenix Jan 31, 2023

SeanScripts Feb 9, 2023

SeanScripts Feb 9, 2023

kmaphoenix Jan 31, 2023

SeanScripts Feb 9, 2023

SeanScripts commented Jul 10, 2023

kmaphoenix left a comment

Feature/agent checking #104

Feature/agent checking #104

Conversation

SeanScripts commented Dec 29, 2022

kmaphoenix commented Jan 4, 2023

MRyderOC left a comment

Choose a reason for hiding this comment

MRyderOC Jan 19, 2023

Choose a reason for hiding this comment

SeanScripts Jan 31, 2023

Choose a reason for hiding this comment

MRyderOC commented Jan 19, 2023

kmaphoenix Jan 31, 2023

Choose a reason for hiding this comment

SeanScripts Feb 9, 2023

Choose a reason for hiding this comment

SeanScripts Feb 9, 2023

Choose a reason for hiding this comment

kmaphoenix Jan 31, 2023

Choose a reason for hiding this comment

SeanScripts Feb 9, 2023

Choose a reason for hiding this comment

SeanScripts commented Jul 10, 2023

kmaphoenix left a comment

Choose a reason for hiding this comment