Enable query caching #756

Draft · wants to merge 11 commits into main
Conversation

FlorianBracq
Collaborator

Hello,

This is a work-in-progress implementation of the caching mechanism, heavily inspired by @rcobb-scwx's work!

To test it, you can add the parameter "cache_path" to your query:
import msticpy as mp

prov: mp.QueryProvider = mp.QueryProvider("LogAnalytics")
prov.connect()
data = prov.Azure.list_aad_signins_for_account(cache_path=<PATH_TO_CACHE>)
If it is executed from a notebook, and PATH_TO_CACHE is the path to that notebook, the cell's output will contain:

  • the HTML representation of the first few rows of the DataFrame generated from the query result
  • metadata of the query result:
    • the timestamp when the cache was generated
    • a string representation of the executed query
    • the name of the function called
    • a dictionary representation of the parameters provided to the query's function
    • a hash of the parameters provided to the query's function (required to return the right cached value)
    • the compressed query result

If it is executed outside a notebook, the same data will be stored in the file provided in cache_path.
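
For illustration, a rough sketch of what one such cached record could look like is shown below. The field names (timestamp, query, function, parameters, parameters_hash, result) and the use of pickle + zlib + base64 for the compressed result are assumptions made for this example, not necessarily what this PR implements.

# Illustrative sketch only -- field names and serialization choices are assumptions.
import base64
import datetime
import hashlib
import json
import pickle
import zlib

import pandas as pd


def build_cache_record(query: str, func_name: str, params: dict, result: pd.DataFrame) -> dict:
    """Bundle a query result and its metadata into a single serializable record."""
    params_repr = json.dumps(params, sort_keys=True, default=str)
    return {
        # timestamp when the cache was generated
        "timestamp": datetime.datetime.utcnow().isoformat(),
        # string representation of the executed query
        "query": query,
        # name of the function called
        "function": func_name,
        # dictionary representation of the parameters provided to the query's function
        "parameters": params,
        # hash of the parameters, used to match a call to the right cached value
        "parameters_hash": hashlib.sha256(params_repr.encode()).hexdigest(),
        # compressed query result
        "result": base64.b64encode(zlib.compress(pickle.dumps(result))).decode(),
    }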

The path to the notebook is required because the kernel does not know which file it is receiving input from, and therefore cannot know which cell output to read to find the cached data.
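
To show why the parameter hash matters for retrieval, here is a hypothetical lookup that picks the matching record from a list of cached records. It mirrors the record sketch above and, like it, is an assumption about the mechanism rather than the PR's actual code.

# Hypothetical cache lookup -- illustrative only; assumes records shaped like build_cache_record() above.
import base64
import hashlib
import json
import pickle
import zlib
from typing import Optional

import pandas as pd


def load_cached_result(records: list, params: dict) -> Optional[pd.DataFrame]:
    """Return the cached DataFrame whose parameter hash matches the current call, if any."""
    params_hash = hashlib.sha256(
        json.dumps(params, sort_keys=True, default=str).encode()
    ).hexdigest()
    for record in records:
        if record.get("parameters_hash") == params_hash:
            # Reverse the compression used when the record was written.
            return pickle.loads(zlib.decompress(base64.b64decode(record["result"])))
    return None  # cache miss: execute the query against the provider instead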

Things to be done:

  • Handle split queries
  • Proper handling of optional parameters
  • Add tests
