Skip to content

A transactional in-memory SQL-like object store for long running processes, games, analytics, realtime processing and other applications.

License

Notifications You must be signed in to change notification settings

gigaquads/store

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Store

This library provides a Store datatype for Python. Each store looks and feels like an ORM, but unlike an ORM, there is no database on the other end. Instead, all data lives in memory, in the form of plain Python dicts and B-tree indices. Stores support SQL-like select statements in the style of SQLAlchemy, atomic transactions and multithreading.

The source code aims to be rebustly documented, as we encourage open-source collaboration on this Project.

Use-cases

  • Long-running interactive applications, like games.
  • Automated trading systems with complex internal state management requirements.
  • Stream-processing applications that perform fast ad hoc queries on stream buffers.

An Example

Imagine a system that generates user input events, like mouse click and key press. In the following example, we delete click events created after a specified time and capitalize the character asssociated with each key press within a transaction.

from store import Store

events = Store()

# insert fictitious "event" records
events.create_many([
    {'event_type': 'press', 'char': 'x', 'time': 1},
    {'event_type': 'click', 'button': 'L', 'position': (5, 8), 'time': 2},
    {'event_type': 'click', 'button': 'R', 'position': (3, 4), 'time': 3},
    {'event_type': 'press', 'char': 'y', 'time': 4},
    {'event_type': 'press', 'char': 'p', 'time': 5},
])

with events.transaction() as transaction:
    # delete "click" events after specified time
    transaction.select().where(
        events.row.event_type == 'click',
        events.row.time > 2
    ).delete()

    # capitalize the "char" for each selected "press" event
    get_press_events = transaction.select().where(
        x.event_type == 'press',
        x.char.one_of(['x', 'y', 'z'])
    )
    for event in get_press_events(dtype=list):
        event['char'] = event['char'].upper()

State Dicts

Store methods, like create and update, return state dicts. Unlike regular dicts, any change to the keys or values of a state dict results in an update to the store. For example, suppose that user is a state dict. As such, user['name'] ='John' generates a call to store.update under the hood. When this happens, any existing reference to the same user immediately reflect this change. There is no need to refresh each reference manually (as they are all actually the same object). The same is true for other methods, like update, setdefault, etc.

Let's illustrate with an example:

frank_1 = store.create({'id': 1, 'name': 'frank'})
frank_2 = store.get(1)

# the store manages a singleton reference to frank's StateDict
# in its internal so-called identity set.
assert frank_1 is frank_2

# frank_1 and frank_2 are references to the same object,
# so they should both reflect the same change.
frank_1['name'] = 'Franklin'

assert frank_2['name'] == 'Franklin'

# likewise, any subsequent reference should reflect the same change
frank_3 = store.get(1)

assert frank_3['name'] == 'Franklin'

Stateful Methods

Here is a list of each dict method that has been extended to result in an update to store as a side-effect. On the lefthand side of each arrow is the dict method. On the righthand side is the corresponding store call.

  • state.update(mapping)store.update(state, mapping.keys())
  • state.setdefault(key, default)store.update(state, {key})
  • state[key] = valuestore.update(state, {key})
  • del state[key]store.delete(state, {key})

Indexes

By default, all StateDict keys are indexed, including those with non-scalar values -- like lists, sets, dicts, etc. This means that that queries are fast.

Queries

You can query a store like a SQL database, using select, where, order_by, limit and offset constraints.

Symbols

Select statements are written with the help of a class called Symbol. A symbol is a variable used to express what you want to select and how. Suppose you had a store of user records. Then, using a symbol, You could write a query to selects all users, created after a certain cut-off date.

user = user_store.symbol()

get_users = user_store.select(
    user.first_name,
    user.email
).where(
    user.created_at > cutoff_date
)

for user in get_users(dtype=list):
    send(message=f'Hello, {user["first_name"]}!', email=user['email'])

An alternative to instantiating a new symbol for each query is to use a built-in property, store.row. The following query is identical to the one above:

get_users = user_store.select(
    user_store.row.first_name,
    user_store.row.email
).where(
    user_store.row.created_at > cutoff_date
)

Select

By default, an empty select will select everything, like select * from... in SQL; however, if you're only interested in a subset of fields, you can explicitly enumerate them.

Selecting Everything

query = store.select()

Selecting Specific Fields

query = store.select(store.row.name, store.row.email)

Where (Filtering)

You can constrain queries to select only records whose values match a given logical predicate. Predicates can be arbitrarily nested in compound boolean expressions. This is similar to the "where" clause in SQL select statements.

Filtering Non-scalars Values

Unlike a SQL database, with a store, you can apply predicate logic not only to scalar values, like numbers and strings, but also non-scalar types, like dicts, lists, and sets.

For example, this is possible:

# imagine you have a store with user dicts, and each user dict
# has a nested dog dict with an "age" value.

get_users = store.select().where(store.row.dog <= {'age': 10})

for user in get_users():
    assert user['dog']['age'] <= 10

Using a symbol, here are some example:

Conditional Expressions

user = store.symbol()

# equality
predicate = (user.email == '[email protected]')
predicate = (user.email != '[email protected]')

# inequality
predicate = (user.age >= 50)

# containment
predicate = (user.favorite_color.in(['red', 'blue'])

# logical conjunction (AND)
predicate = (user.scent == 'smelly') & (user.income <= 20000)

# logical disjunction (OR)
predicate = (user.scent == 'smelly') | (user.income <= 20000)

# logical conjunction and disjunction combined
predicate = (
    ((user.scent == 'smelly') | (user.age <= 20)) & (user.name == 'Bob')
)

Moreover, predicates can be built up gradually, like so:

predicate = (user.age <= 20)

if some_condition:
    predicate &= (user.income > 100000)   # |= also works

Once you have your predicate, you can pass it into a query's where method:

query = store.select().where(
    (user.age <= 20) | (user.is_member == True)
)

Order By

Query results can be sorted by one or more values using the order_by query method. For example:

# sort results by age (in ascending order) first
# created_at date (in descending order) second.
query = store.select().order_by(
    user.age.asc,
    user.created_at.desc
)

Ordering By Non-scalar Values

Unlike SQL, the store can sort non-scalar datatypes, like dicts, lists, and sets -- in addition to plain ints and strings. This means that you can do things like -- this:

store.create_many([
    {'owner': 'Mohammed', 'dog': {'age': 10}},
    {'owner': 'Kang Bo', 'dog': {'age': 6}},
])

get_users = store.select().order_by(store.row.dog.asc)
users = get_users(dtype=list)

for u1, u2 in zip(users, users[1:]):
    assert u1.dog['age'] <= u2.dog['age']

Note that, when sorting a dict, the dict's items are sorted and compared in the resulting order.

Limit & Offset

Queries support pagination via limit and offset parameters. The limit parameter is an int that determines the maximum number of records returned by the query while the offset parameter determines the starting index of the returned slice. When using limit and offset, it is important to specify an order, using order_by.

query = store.select(
    user.email
).order_by(
    user.age.desc
).offset(
    20
).limit(
    10
)

Transactions

Stores support transactions as well. If, for some reason you don't already know, a database transaction is a mechanism that allows you to perform multiple operations as if they were all performed int a single step. This way, if one operation fails, then they all fail, and the state of the store remains intact. The syntax for creating transactions is straight forward:

with user_store.transaction() as user_trans:
    # update the name of one user and delete another
    users = user_trans.get_many([1, 2])
    users[1]['name'] = 'Updated Name'
    users[2].delete()

At the end of the with block, the transaction commits; otherwise, if an exception is raised, the transaction rolls back, clearing its internal state.

Alternate to using the with statement, commit and rollback methods can be called explicitly.

user_trans = user_store.transaction()

try:
    users = user_trans.get_many([1, 2])
    users[1]['name'] = 'Updated Name'
    users[2].delete()
    user_trans.commit()
except Exception:
    user_trans.rollback()

About

A transactional in-memory SQL-like object store for long running processes, games, analytics, realtime processing and other applications.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages