
Try to handle changes to type definitions #43

Merged (11 commits) on Sep 12, 2015

Conversation

Changaco (Member)

Fixes #26 (partially, because we can't detect all schema changes).

assert one.biz == 'x'
assert not hasattr(one, 'bar')

@mark.xfail
Collaborator

Why are we using an xfail here instead of a raises?

Member Author

Because ideally it shouldn't raise, but it's a problem we can't fix.

Collaborator

But xfail will silently swallow all exceptions, no? Seems like we should test for the specific exception we're expecting.

Member Author

I've added the appropriate raises parameter. From pytest's documentation:

If you want to be more specific as to why the test is failing, you can specify a single exception, or a list of exceptions, in the raises argument. Then the test will be reported as a regular failure if it fails with an exception not mentioned in raises.
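In other words, the marker in question would look something like this (a minimal sketch; the test body and reason string are illustrative, not the PR's actual test):

```python
from pytest import mark

# With raises=ValueError, the test is reported as xfail only when it
# fails with a ValueError; any other exception is reported as a regular
# failure instead of being silently swallowed.
@mark.xfail(raises=ValueError, reason="stale cached type definition")
def test_stale_type_definition():
    raise ValueError("illustrative failure")
```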

class EmptyModel(Model): pass
self.db.register_model(EmptyModel, 'grok')
# Add a new column then drop the original one
self.db.run("ALTER TABLE grok ADD COLUMN biz text NOT NULL DEFAULT 'x'")
Collaborator

Isn't biz the same type as bar? Why is this test "different_type"?

Collaborator

Sorry, this is grok here, not foo.

@chadwhitacre (Collaborator)

IRC

@Changaco (Member Author)

Changaco commented Mar 5, 2015

Re IRC: a silent failure is when something bad happens but the problem is ignored. That's not the case here: the ValueError is only masked if it can be remedied; if not, we re-raise it.
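The pattern being described can be sketched like this (hypothetical names and a toy cache; this is not the PR's actual code):

```python
class CasterSketch:
    """Toy model of the mask-or-reraise pattern: a ValueError from a
    stale cached type definition is masked only when the definition can
    be refreshed; otherwise it propagates, so nothing fails silently."""

    def __init__(self, catalog=None):
        self.types = ["text", "text"]   # stale cache: expects two columns
        self.catalog = catalog          # fresh definition, if available

    def _parse(self, tokens):
        if len(tokens) != len(self.types):
            raise ValueError("token count does not match cached types")
        return tokens

    def parse(self, tokens):
        try:
            return self._parse(tokens)
        except ValueError:
            if self.catalog is None:
                raise  # can't remedy: re-raise rather than swallow
            self.types = self.catalog  # refresh the cached definition
            return self._parse(tokens)
```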

tokens = self.tokenize(s)
if len(tokens) != len(self.atttypes):
# The number of columns has changed, re-fetch the type info
self.__dict__.update(self._from_db(self.name, curs).__dict__)
Collaborator

Where did we get this line from, and why do we not call it in the normal course of operation? Why are we doing something in this case that we don't otherwise do? Don't we need to configure the class when it's first instantiated? Why am I not seeing that in our code? Is it in psycopg2?

Collaborator

I see, we're calling _from_db, which is a constructor classmethod, and then we're overwriting all possible attributes on ourself with the corresponding attributes from the new instance.
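The idiom reduces to a runnable toy (names are hypothetical; the registry stands in for the database catalog):

```python
class TypeInfoSketch:
    # Toy stand-in for the caster class: _from_db is a constructor
    # classmethod, and refresh() overwrites every attribute on self with
    # the corresponding attribute of a freshly built instance -- the
    # __dict__.update idiom discussed above.
    registry = {"grok": ("biz",)}  # hypothetical stand-in for the catalog

    def __init__(self, name, attnames):
        self.name = name
        self.attnames = attnames

    @classmethod
    def _from_db(cls, name):
        return cls(name, cls.registry[name])

    def refresh(self):
        self.__dict__.update(self._from_db(self.name).__dict__)
```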

Collaborator

Clarified in 0f422eb.

@chadwhitacre (Collaborator)

I'm partway through a commit to update the documentation.

@chadwhitacre (Collaborator)

And I still haven't satisfied myself on the ways that our parse override could fail.

@chadwhitacre (Collaborator)

I'm dusting off my postgres.py dev env. I've got envs rebuilt for Python 2 and 3. Hitting an import error though ...

@chadwhitacre (Collaborator)

Imports cleaned up in #45.

@chadwhitacre (Collaborator)

I've got docs again locally.

@chadwhitacre (Collaborator)

I'm unraveling what the parse method is about to understand exactly what we're doing with it. I want to understand it and in particular how our implementation of it can fail, so that I can document it properly.

In the CompositeCaster constructor, parse is used as the adapter function, aka, typecaster, for new_type. It's this typecaster that is then passed to register_type underneath register_composite.

@chadwhitacre (Collaborator)

register_type ends up adding the type adapter to a dictionary that is ... somehow referenced when hydrating result sets.
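A miniature of that dispatch, as I understand it (plain Python, with psycopg2's C internals stubbed out; the real dictionary lives in the extension module):

```python
# Toy model of psycopg2's typecasting dispatch: register_type stores
# {oid: caster} in a dictionary like string_types, and the fetch* path
# looks the caster up by the column's type OID before applying it to
# the raw string coming off the wire.
string_types = {}

def register_type(oid, caster):
    string_types[oid] = caster

def cast(oid, raw):
    caster = string_types.get(oid, lambda s: s)  # unknown OID: passthrough
    return caster(raw)

register_type(23, int)  # 23 is the OID of int4
```

So `cast(23, "42")` gives the integer `42`, while a value for an unregistered OID like 25 (text) falls through unchanged.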

@chadwhitacre (Collaborator)

>>> psycopg2._ext.string_types.keys()
[3904, 1028, 3905, 1005, 16, 17, 1042, 1043, 20, 21, 3909, 23, 25, 26, 3802, 1182, 1183, 1184, 1185, 1186, 1187, 1700, 3911, 114, 1082, 1083, 700, 701, 704, 705, 3906, 3907, 3908, 1270, 3910, 199, 3912, 3913, 1231, 3926, 3927, 1114, 1115, 3807, 1000, 1001, 1002, 1003, 18, 1006, 1007, 1009, 1266, 19, 1013, 1014, 1015, 1016, 1017, 1021, 1022]
>>> psycopg2._ext.binary_types.keys()
[]
>>>

@chadwhitacre (Collaborator)

I'm trying to find where the adapter function parse is called.

@chadwhitacre (Collaborator)

Looks like curs_get_cast dereferences the string_types dict (I'm not sure where binary_types gets looked at) and returns to psyco_curs_cast, which is where the caster is actually called.

@chadwhitacre (Collaborator)

psyco_curs_cast ends up at Cursor.cast, but I'm not seeing where that's actually called outside of CompositeCaster.parse! Unless what we're getting is the result of curs_get_cast in pqpath? Looks likely, that's where DB-API 2.0's fetch seems to be implemented. Actually, fetch* is implemented in cursor_type.c, and pq_fetch is called under there. Okay!

@chadwhitacre (Collaborator)

So here's the call chain:

  • one/all (us)
  • fetch* (psycopg2)
  • pq_fetch
  • curs_get_cast
  • parse

@chadwhitacre (Collaborator)

Yeah, binary casts appear to be broken:

FIXME: what the hell am I trying to do here? This just can't work..

:-)

@chadwhitacre (Collaborator)

psycopg/psycopg2#297 💃

@chadwhitacre (Collaborator)

Hmmm ... a little more complicated. Long story short, parse is called inside typecast_cast.

@chadwhitacre (Collaborator)

Alright. I'm back into parse, understanding how it works.

@chadwhitacre (Collaborator)

We have a list of tokens and a list of types. We also have a list of field names.

@chadwhitacre (Collaborator)

Here are the basic ways in which a type can change underneath us:

  • a field can be removed
  • a field can be added
  • a field can change type

An arbitrary number of fields can be simultaneously changed in one of these three ways.
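Those three cases can be told apart by diffing the old and new attribute lists (a hedged sketch; the function name and data shapes are illustrative, not the library's API):

```python
def diff_attrs(old, new):
    # old/new: lists of (field_name, type_name) pairs describing the
    # composite type before and after the change.
    old_d, new_d = dict(old), dict(new)
    removed = [k for k in old_d if k not in new_d]
    added = [k for k in new_d if k not in old_d]
    retyped = [k for k in old_d if k in new_d and old_d[k] != new_d[k]]
    return removed, added, retyped
```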

@chadwhitacre (Collaborator)

Changes to the type definition affect both reads and writes.

@chadwhitacre (Collaborator)

@Changaco Can you remind me why we need to support this? Why don't we just always restart the web app when the schema changes? Convenience?

@chadwhitacre (Collaborator)

What if we have an object with an update method and we pass data to it that gets stored in the wrong field because of a schema change between when the object was created and when we updated it? Is that possible?
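A toy illustration of that hazard (entirely hypothetical; nothing here is postgres.py's API): if an UPDATE were built positionally from a cached column list, a reordering schema change could route values to the wrong columns.

```python
cached_columns = ["amount", "currency"]   # captured at object creation
actual_columns = ["currency", "amount"]   # after a hypothetical reorder

def build_update(columns, values):
    # Map values to columns positionally, as a naive cache-based
    # update might.
    return dict(zip(columns, values))

intended = build_update(cached_columns, [10, "EUR"])  # what we meant
landed = build_update(actual_columns, [10, "EUR"])    # what would happen
```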

@chadwhitacre (Collaborator)

More tomorrow ...

@chadwhitacre (Collaborator)

I mean, now that I revisit this, it strikes me as really weird to expect the Python layer to seamlessly handle an underlying schema change.

@Changaco (Member Author)

It's not weird to expect postgres.py to keep working when the schema changes, because it's not even supposed to know what the schema is.

@Changaco (Member Author)

I've added a commit to simplify our parse method: it occurred to me that we don't need to duplicate the code of psycopg's parse method, as was suggested in #26 (comment).

@chadwhitacre (Collaborator)

It's not weird to expect postgres.py to keep working when the schema changes, because it's not even supposed to know what the schema is.

Postgres.py itself isn't supposed to know what the schema is, but:

  • Model is intended to be subclassed, and the subclass probably knows what the schema is, especially if it implements update methods.
  • Instances of Model subclasses will have their attributes consumed by other code, which will expect a certain schema.

Any schema update is expected to result in other code changes, in the Model subclass as well as in consumers of subclass instances. If there is no code change, then what was the point of making the schema change? If there is a code change, then of course we need to restart the app.

I hate to say it, but I think we should close this PR. :-(

@chadwhitacre (Collaborator)

... though a documentation update could be in order. I can open a new PR for that.

@Changaco (Member Author)

Firstly, this PR is useful. It will reduce the downtime of the apps that use postgres.py, because it will no longer be necessary to take them down every time a backwards-compatible change to the DB schema is made, like dropping an unused column, or adding a new column with a default.

Secondly, what this PR (partially) fixes is a bug; I don't think it needs any other justification: bugs should be fixed whenever possible.

@chadwhitacre (Collaborator)

Secondly, what this PR (partially) fixes is a bug

Is #26 a bug? The fact that we can only partially fix it seems to be due to the fundamental constraints of a networked database, suggesting that, while it's certainly a constraint, it's not a bug. Can you explain why you see #26 strictly as a bug?

Very early on (I just spent some time looking for a relevant commit, but didn't find one) I worked around something like #26 on Gittip (at the time) with a three-step deploy process:

  • deploy code that supports both the old and new schemas
  • deploy the new schema
  • deploy code that drops support for the old schema

I believe, given Python and Postgres, that's the only possible way to avoid downtime when we want to make a non-backwards-compatible schema change to the database.

It will reduce the downtime of the apps that use postgres.py, because it will no longer be necessary to take them down every time a backwards-compatible change to the DB schema is made

I can see the value in that. I still need to satisfy myself that supporting this use case doesn't introduce too powerful a footgun.

For the record, I'm being more conservative with this PR than I would for one on Gratipay or Aspen, because postgres.py has already achieved the status of a thoroughly-documented software product in a way that Gratipay and Aspen haven't yet. I want to change postgres.py deliberately and carefully.

@Changaco (Member Author)

Is #26 a bug? The fact that we can only partially fix it seems to be due to the fundamental constraints of a networked database, suggesting that, while it's certainly a constraint, it's not a bug. Can you explain why you see #26 strictly as a bug?

#26 is a caching bug: psycopg caches type definitions but doesn't refresh them when they become stale. Does it seem normal to you for a caching system to fail instead of refetching the data from upstream?

The fact that we can't fully fix this is not a "fundamental constraint of a networked database"; it's just that PostgreSQL wasn't designed to be used the way postgres.py uses it. It might be possible to modify PostgreSQL to make the problem disappear entirely, but even if it is, I don't want to wait for it.

I believe, given Python and Postgres, that's the only possible way to avoid downtime when we want to make a non-backwards-compatible schema change to the database.

You can't avoid downtime without this PR unless the changes only add or drop tables but don't modify any.

@chadwhitacre (Collaborator)

Alright, @Changaco, I'm merging this without further review, because a) I'm sure you'll want this for Liberapay, and b) I'm sure you'll quickly fix any bugs we're not seeing with it right now. :-)

chadwhitacre added a commit that referenced this pull request Sep 12, 2015
Try to handle changes to type definitions
@chadwhitacre chadwhitacre merged commit efc33d7 into master Sep 12, 2015
@chadwhitacre chadwhitacre deleted the fix-race-condition branch September 12, 2015 13:53
@Changaco (Member Author)

Thanks, better late than never. :-)

@Changaco (Member Author)

FTR I hit a corner case that this PR couldn't fix while deploying Liberapay schema changes today. It was a type change of columns from bool to int.

Successfully merging this pull request may close these issues.

eliminate race condition when updating types
2 participants