Design

updated: 26.10.2022

General

';' seperates different components of LHS/RHS

Library functions:

class TransTypes(Enum):
  TRANSFORMER = 1
  VISITOR = 2

  class Match:
    def __init__(self, graph: DiGraph, nodes: Dict[str, List], edges: Dict[str, List], mapping: Dict[str, Hashable]):
      pass
  
  class ResultSet:
    def __init__(self, matches: List[Match]):
        pass

def rewrite(lhs: Union[str, List[str]], p: str=None, rhs: Union[str, Template]=None, condition: Callable[Match, bool]=None, render_rhs: Dict[str, Callable[Match, str]]=None, 
            type: TransTypes=TransTypes.TRANSFORMER) -> ResultSet:
  """
    lhs: a string / list of strings.
    p: string, in order to specify vertex duplications.
    rhs: a string / a formatted string to allow future rendering (inside implementation) according to each match's actual values.
    condition: function: Match -> bool, 
    render_rhs: dictionary from strings to (Match -> str) functions
    type: TransTypes object
    return value: result set
  """
  pass

Constant transforms no attribute change (only structure)

the override flag (transformer/visitor) are specified as part of the rewriting command, so a structure can be used as both LHS or RHS later.
'_' for anonymous node.
Need to specify the parent to which the RHS is connected, the connection might be not trivial (ambiguous).

L = "a -> b -> _"
R = "a -> c"
rewrite(lhs=L, rhs=R, type=TransTypes.TRANSFORMER)

No structural change to graph

this can be easily implemented using type = TransTypes.VISITOR
TransTypes.TRANSFORMER - replaces the LHS with the RHS.

L = "a -> b -> c -[weight: int]-> d -> e; \
     c -> d; \
      \
     b [ \
       value: str = \"hello\", \
       id: int \
     ]"

P = "b -> b1; b -> b2"

R = "a -> b[value: str = \"hello 2\"] -> c -[weight: int]-> d -> e; \
     c -> d; \
     b [id: int]"

rewrite(lhs=L, rhs=R, type=TransTypes.TRANSFORMER)

Complex constraints

complex conditions can be a function or explicit, as long as it is boolean.

def f(x: str):
  ...
  return True # or some boolean
  
L = "a -> b[attr: int] -> c"
R = "a -> c"
rewrite(lhs=L, rhs=R ,condition=lambda match: f(match["b"]["attr"]), type=TransTypes.TRANSFORMER)

Multiple generic connections

allow duplications (one or more) similar to regex. One or more instances of the entire specified sub-graph only legal in the LHS (because the RHS should not be ambigous.)
in the resultSet, 'c' is now multiplied. the result set contains "d<0>"...
numbering of instances is enforced as part of the matching.
recursive duplications are allowed, d<0,1,5>...
enforcing an explicit number of connections is done by: -num->
enforcing an explicit minimal number of connections is done by: -num+->

l = """  _ -> b -7[weight:int]-> c -> d[value:int]
    d<0> -> e
    d<5> -> e
"""
res = rewrite(lhs=l, rhs=None, type=TransTypes.VISITOR)
sum = 0
for match in results_set.matches: #is there a defined order of graph iteration?
  sum += match["d<0>"]["value"]

#another example
l2 = """  _ -> b -5+-> d[value:int]
    d<0> -7-> e
    e<0,5> -> _
"""

Macros and templates

macros == regular patterns as used before, can be enforced on vertices.
macros with anonymus entities:

pattern1 = f"""{a}-> _[value: int]""" #actually the constraint is inside condition...
L = f"""_ -> e -> c -> d;
    {pattern1.render(a='d')}; {pattern1.render(a='c')}"""

macros with named entities: only used for shortening of strings for the user.
in this case the result set will contain numbered strings(?)

pattern1 = f"""{a}-5+->e{num}[value: int = 6]"""
l = f"""_ -> e -> c -> d;
    {pattern1.render(a='d',num=1)} ; {pattern1.render(a='c',num=2)}"""

results_set = rewrite(lhs=l, rhs=None, type=TransTypes.VISITOR)
sum = 0
for match in results_set.matches: #is there a defined order of graph iteration?
  sum += match["e2<3>"]["value"]

this is necessary to prevent bad translation such as:

macros can be enforced twice in the same vertex, with different numbering.

Visiting - intead of replacing the entire LHS

demonstration of using type = TransTypes.VISITOR

L = "a[token: Node, label = \"EXP\", val: Node] \
        ->b1[token: ExpNode, label = \"EXP\", var: str, code: str] \
        ->op[token: Node, label = \"AND\"] \
        ->b2[token: ExpNode, label = \"EXP\", var: str, code: str]"
R = f'''a[token: ExpNode, label = \"EXP\", val: ExpNode = {x}]'''

def f(match): 
  return ExpNode(BOOL_T)

rewrite(L,rhs=R,render_rhs={x=f})

imperative side effect

ResultSet is a view to the matches found, and allows accessing and changing matched attributes.
allows modifications of local parameters and retrieving direct values, which is not possible inside a function.
condition allows filtering matches.
rhs is not mandatory!

# Example: for every matched a->b[my_value: int] in graph, if my_value > 5, add my_value to local variable "sum". Graph won't be changed because rhs is None by default!
sum = 0
def f(num: int):
  return num > 5

L = "a->b[my_value: int]"
result_set = rewrite(lhs=L, condition=lambda match: f(match["b"]["my_value"]), type=TransTypes.TRANSFORMER)

for match in results_set.matches: #is there a defined order of graph iteration?
  sum += match["b"]["my_value"]

example for 'render_rhs'

def f(match):
  return str(match["b"]["value"] + match["c"]["value"])

LHS = "a->b[value]; \
    a->c[value]"
RHS = f"""a[value ={x}]"""
rewrite(g, rhs=RHS, lhs=LHS,render_rhs={x: f}, type=TransType.TRANSFORMER)

choosing one of several LHS - 10%

to allow the user to select one of several constraints, LHS is supplied as a list of formatted-strings. the matching algorithm finds graph components that match one or more of the constraints. (logical 'OR')

naming edges - 10%

therefore the resultset is a tuple of lists (one for vertices and one for edges).

l = """ a->b->c 
b->c [
  $var_name = x
]
"""

allowing collapse of structural recursion

Relevant mainly for parsing purposes. Convert a graph of sequence format to the fully evaluated version of it.
i.e., a list can be represented by the user as a tree with depth equal to the length of the list. the tree can be converted to a 2-layered tree with the elements of the list as direct children of the root.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

design.md

design.md

Design

General

Constant transforms no attribute change (only structure)

No structural change to graph

Complex constraints

Multiple generic connections

Macros and templates

Visiting - intead of replacing the entire LHS

imperative side effect

example for 'render_rhs'

choosing one of several LHS - 10%

naming edges - 10%

allowing collapse of structural recursion

Files

design.md

Latest commit

History

design.md

File metadata and controls

Design

General

Constant transforms no attribute change (only structure)

No structural change to graph

Complex constraints

Multiple generic connections

Macros and templates

Visiting - intead of replacing the entire LHS

imperative side effect

example for 'render_rhs'

choosing one of several LHS - 10%

naming edges - 10%

allowing collapse of structural recursion