Hashing of nodes #119

aiida-bot · 2014-11-06T17:40:58Z

Originally reported by: Andrea Cepellotti (Bitbucket: acepellotti, GitHub: cepellotti)

Implement node hashing. With a hash stored for each node, it would be possible to recognise immediately whether a node has already been stored in the DB, or whether a calculation has already been run, in which case its result could be returned immediately.
Note also that this would massively speed up the debugging of workflows, which often repeat the same steps many times: we could tell immediately (and add a method for it) whether a workflow step still has calculations to execute.
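
To make the idea concrete, here is a minimal sketch of content-based hashing with a lookup before storing. It is only an illustration under stated assumptions: `compute_node_hash` and `HashIndex` are hypothetical names, the in-memory dictionary stands in for a DB query, and the choice of SHA-256 over a JSON serialization is an assumption rather than anything decided in this issue.

```python
import hashlib
import json


def compute_node_hash(node_type, attributes):
    """Return a hex digest that depends only on the node's type and attributes.

    Hypothetical helper: the attributes must be JSON-serializable.
    """
    # Sort keys so that dictionary ordering does not change the hash.
    payload = json.dumps(
        {"type": node_type, "attributes": attributes},
        sort_keys=True,
        separators=(",", ":"),
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class HashIndex:
    """Toy in-memory stand-in for a database lookup keyed by hash."""

    def __init__(self):
        self._by_hash = {}

    def store(self, node_type, attributes):
        """Store a node unless an identical one exists; return (pk, created)."""
        digest = compute_node_hash(node_type, attributes)
        if digest in self._by_hash:
            # An equivalent node is already stored: reuse it.
            return self._by_hash[digest], False
        pk = len(self._by_hash) + 1
        self._by_hash[digest] = pk
        return pk, True
```

Storing the same attributes twice, even with a different key order, would return the same pk with `created` set to `False`, which is the signal a workflow step could use to skip work.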

Bitbucket: https://bitbucket.org/aiida_team/aiida_core/issue/119

aiida-bot · 2014-11-06T17:45:52Z

Original comment by Andrea Cepellotti (Bitbucket: acepellotti, GitHub: cepellotti):

Moreover, to be sure that the results are the same, we should make sure that neither the code nor the parser has changed. For the code, we should probably store an md5sum; we should also find some way to check whether the parser has changed.
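
A rough sketch of how the md5sum of the code could enter the picture is given below. The function names are hypothetical, and using a `parser_version` string to detect parser changes is purely an assumption; the issue leaves open how parser changes should be detected.

```python
import hashlib


def file_md5(path, chunk_size=65536):
    """Return the MD5 hex digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def calculation_key(input_hash, executable_md5, parser_version):
    """Combine everything that must be unchanged for a result to be reusable.

    Hypothetical helper: parser_version is an assumed identifier for the
    parser, for example a version string.
    """
    parts = "\n".join([input_hash, executable_md5, parser_version])
    return hashlib.sha256(parts.encode("utf-8")).hexdigest()
```

With a key of this kind, a change in the executable or in the parser identifier yields a different key, so a previously stored result would no longer match.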

aiida-bot · 2014-12-27T17:34:43Z

Original comment by Andrea Cepellotti (Bitbucket: acepellotti, GitHub: cepellotti):

As noted by Andrius:
... I would propose reusing existing nodes instead of creating them anew on each request, to cope with node duplication. I have implemented a measure to control this issue in verdi data {upf,cif} import {upf,cif} by using the get_or_create() classmethod, and would suggest moving get_or_create() to the Data class. I am not quite sure whether get_or_create() should return only parentless nodes or all nodes, and would like to invite discussion on this. If I understand it correctly, reusing data nodes with parents might introduce connections between otherwise unrelated calculations, which is why I would at first reuse only parentless nodes.
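
The parentless-only behaviour discussed here can be sketched as follows. This is a toy model, not the existing get_or_create() implementation: `DataRegistry`, its list of dictionaries, and the `parentless_only` flag are all assumptions made for illustration.

```python
import hashlib


class DataRegistry:
    """Toy stand-in for the set of stored Data nodes."""

    def __init__(self):
        self._nodes = []

    def add(self, content, has_parents=False):
        """Unconditionally store a new node for the given bytes."""
        node = {
            "pk": len(self._nodes) + 1,
            "md5": hashlib.md5(content).hexdigest(),
            "has_parents": has_parents,
        }
        self._nodes.append(node)
        return node

    def get_or_create(self, content, parentless_only=True):
        """Return (node, created), reusing a stored node with equal content."""
        md5 = hashlib.md5(content).hexdigest()
        for node in self._nodes:
            if node["md5"] != md5:
                continue
            if parentless_only and node["has_parents"]:
                # Skip nodes produced by a calculation, so that unrelated
                # calculations do not become linked through a shared node.
                continue
            return node, False
        return self.add(content), True
```

With `parentless_only=True`, a node that was the output of a calculation is never handed back, so an import creates a fresh parentless node the first time and reuses that one afterwards.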

aiida-bot added major labels Dec 23, 2016

aiida-bot assigned muhrin Dec 23, 2016

giovannipizzi removed 0.5.0 labels Jan 19, 2017

giovannipizzi assigned lekah Jan 19, 2017

giovannipizzi added the topic/caching label Jan 19, 2017

lekah added a commit that referenced this issue Jun 1, 2017

#119 added first small test to check for node hashing

ce539d8

lekah added a commit that referenced this issue Jun 1, 2017

#119 Added small functions get_hash and get_same_node to Node

7589afd

lekah added a commit that referenced this issue Jun 1, 2017

#119 Added functionality to store method (only for Django backend) that replaces the _dbnode member if a similar node already exists

13caff2

lekah added a commit that referenced this issue Jun 2, 2017

#119 Try-except clause if hashing fails, and also check for None for hash, which obviously should not be checked in the DB

ca5f047

lekah added a commit that referenced this issue Jun 2, 2017

#119 Fix in test that was failing because there is now an additional extra (hash)

87e546f

greschd mentioned this issue Oct 19, 2017

Hashing, caching and fast-forwarding #652

Merged

18 tasks

waychal added coding-day/wip and removed type/enhancement [deprecated label] labels Dec 18, 2017

sphuber added coding-day/done type/accepted feature approved feature request and removed coding-day/wip labels Dec 18, 2017

giovannipizzi closed this as completed in #652 Feb 9, 2018
