-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Page observation pipeline subject by graph
The subject-cache for the observation pipeline has become too big. We need to handle 30m subjects and more. Downloading these with limit/offset was impossible as we can't sort/ retreive later pages quickly enough. Even a single download exceeded the 15 minute stardog timeout and caused drafter to hang in GC (presumably because stasher would hold the whole 3G+ result in memory). This patch introduces an optional `graph-query` parameter that's applied on the `observation-pipeline`. We use this to page the subject-query by graph (then into page-size partitions). Hopefully this fixes #103.
- Loading branch information
1 parent
57dc5a9
commit 150c73a
Showing
8 changed files
with
96,725 additions
and
90,675 deletions.
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
SELECT ?graph WHERE { | ||
?dataset <http://publishmydata.com/pmdcat#graph> ?graph . | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,7 +1,7 @@ | ||
PREFIX qb: <http://purl.org/linked-data/cube#> | ||
PREFIX pmdcat: <http://publishmydata.com/pmdcat#> | ||
|
||
SELECT ?observation WHERE { | ||
?dataset pmdcat:datasetContents ?cube . | ||
?observation qb:dataSet ?cube . | ||
GRAPH ?graph { | ||
?observation a qb:Observation . | ||
} | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters