Decentralized discovery for object store instances #105

iramiller · 2021-02-24T20:33:48Z

Summary

A decentralized discovery method that lets object stores connect to each other using the blockchain.

Problem Definition

Existing P8e object stores use a centralized system for connection and discovery. With the transition to a decentralized public network this central relay point will no longer exist. A method for an address to publish a list of endpoint(s) where partner object stores can authoritatively discover and connect is required.

Proposal

Create a record on the blockchain that is controlled by an address and allows it to publish records indicating endpoints available for partners to connect to.

Add a new structure and store it on chain under the owner address.
Add appropriate read/write methods that allow the owner address to maintain a given entry.
Provide a method that returns a list of records for a scope by linking against the "data_access" list of addresses, which controls the accounts the off-chain information should be provisioned for.
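The scope lookup in the last item can be sketched as follows. This is a minimal illustration in Python, not the module's implementation: the `locators_for_scope` helper, the in-memory dictionary standing in for chain state, and the example addresses and URIs are all assumptions. The record shape mirrors the `ObjectStoreEndpoint` message under Implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectStoreEndpoint:
    owner: str        # account address that owns the record
    locator_uri: str  # endpoint partners can use to reach the owner


def locators_for_scope(data_access, endpoints_by_owner):
    """Return the records published by addresses on a scope's data_access list.

    Addresses that have not published a record are skipped.
    """
    return [endpoints_by_owner[a] for a in data_access if a in endpoints_by_owner]


# Hypothetical addresses and URIs, for illustration only.
published = {
    "addr_a": ObjectStoreEndpoint("addr_a", "grpc://os-a.example.com:8080"),
    "addr_b": ObjectStoreEndpoint("addr_b", "grpc://os-b.example.com:8080"),
}
scope_data_access = ["addr_b", "addr_c"]  # addr_c has published nothing
found = locators_for_scope(scope_data_access, published)
```

Only addresses that are both on the scope's data_access list and have a published record come back, so a partner with no record simply yields no endpoint.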

Implementation

The following simple structure will be used to hold a reference for a locator endpoint for a given account. The identified account will be the one that owns/controls the record and must sign requests to modify it.

message ObjectStoreEndpoint {
  // account address the endpoint is owned by
  string owner = 1;
  // locator endpoint uri
  string locator_uri = 2;
}
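The owner-only write rule described above might look roughly like this. It is a sketch under stated assumptions: `EndpointStore` is a hypothetical in-memory stand-in for on-chain state, and `signer` is assumed to be an address whose signature has already been verified elsewhere.

```python
class EndpointStore:
    """In-memory stand-in for on-chain state, keyed by owner address."""

    def __init__(self):
        self._records = {}

    def set_endpoint(self, signer, owner, locator_uri):
        # Only the owning address may create or modify its own record.
        if signer != owner:
            raise PermissionError(f"{signer} cannot modify the record owned by {owner}")
        self._records[owner] = locator_uri

    def get_endpoint(self, owner):
        # Reads are open to everyone; returns None when nothing is published.
        return self._records.get(owner)

    def delete_endpoint(self, signer, owner):
        # Removal follows the same owner-only rule as writes.
        if signer != owner:
            raise PermissionError(f"{signer} cannot delete the record owned by {owner}")
        self._records.pop(owner, None)


store = EndpointStore()
store.set_endpoint("addr_a", "addr_a", "grpc://os-a.example.com:8080")
```

Reads stay unrestricted because the point of the record is public discovery; only mutation is tied to the owner.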

For Admin Use

Not duplicate issue
Appropriate labels applied
Appropriate contributors tagged
Contributor assigned/self-assigned

iramiller · 2021-02-24T20:34:52Z

This issue/proposal should be reviewed by @scirner22 for object store implementation requirements as well as @rjmarkel for security sign-off.

scirner22 · 2021-02-24T20:50:41Z

@mlatimer-figure can you look at this as well? The background here is that object-store mailbox can go away if public third party object stores can communicate directly with each other. We're planning on using the blockchain to broadcast a public_key (address) that is owned by a given object-store instance.

A summary of my comments from Slack, carried over to this issue:

In my mind there are two things that will be reaching out to fetch items from a given object-store:

another object-store
any machine that's owned by the same org as another object-store

As the owner of an object-store, there's ample security in place to make the instance completely public and let application-level security and encryption protect the data. That said, it's better practice not to open these instances to the public and to leverage a firewall instead. As an example, let's say object-store A is sharing data with object-store B. Object-store B learns this when it reads scope events from chain and notices that key A attached key B to the data_access list, or that key B is a member of the party list. B knows object-store A's address because it has seen a block like the one Ira outlined above. Based on the two bullet points above, imo B will reach out to A either from object-store B directly or from a machine inside B's public or private network. A can then share with B by reading B's object-store block and whitelisting object-store B's IP, along with any NATs that are listed.

So in my mind

repeated string outbound_cidr = 4;

should become

repeated string addr = 4; // where addr is the object-store's IP address and possibly all NAT addresses coming out of this object-store's infrastructure.

Because of this I don't see any value in allowing CIDR blocks, only more complications around validating that a bad actor isn't trying to get you to whitelist a range that is far too large or even whitelisting everything (is 0.0.0.0/1 the largest valid range?).
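To make the range-size concern concrete, here is a small sketch using Python's standard `ipaddress` module to count the addresses a given block covers. The `range_size` helper is a name chosen for illustration, and the sample blocks use documentation addresses. For reference, 0.0.0.0/1 covers half of the IPv4 space, while 0.0.0.0/0 covers all of it.

```python
import ipaddress


def range_size(cidr):
    """Return how many addresses a CIDR block covers."""
    return ipaddress.ip_network(cidr).num_addresses


# A single host, a small subnet, half of IPv4, and all of IPv4.
samples = ["203.0.113.7/32", "203.0.113.0/24", "0.0.0.0/1", "0.0.0.0/0"]
sizes = {cidr: range_size(cidr) for cidr in samples}
for cidr, size in sizes.items():
    print(f"{cidr:>16} covers {size} addresses")
```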

iramiller · 2021-02-24T20:57:37Z

addr = 4;

We want to avoid this abbreviation if possible due to existing use within other areas of the blockchain.

bad actor isn't trying to get you to whitelist a range

All ranges are suggestions; no object store should accept them without verification and compliance with local constraints and configuration.

The use of the CIDR suffix really only makes sense with IPv6 addressing where its use as a /64 is strongly encouraged.
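One way to treat published ranges as suggestions is to filter them through a local policy before anything reaches a firewall. The sketch below is an assumption about what such a check could look like: the `acceptable_range` name and the default prefix floors (/32 for IPv4, /64 for IPv6) are illustrative, not values taken from the object store.

```python
import ipaddress


def acceptable_range(cidr, min_prefix_v4=32, min_prefix_v6=64):
    """Return True if a suggested range is well formed and no broader than local policy allows."""
    try:
        # strict=True rejects blocks with host bits set, such as 203.0.113.7/24.
        net = ipaddress.ip_network(cidr, strict=True)
    except ValueError:
        return False
    floor = min_prefix_v4 if net.version == 4 else min_prefix_v6
    # A larger prefix length means a narrower range.
    return net.prefixlen >= floor


suggested = ["203.0.113.7", "203.0.113.0/24", "0.0.0.0/0", "2001:db8::/64", "not-an-ip"]
accepted = [c for c in suggested if acceptable_range(c)]
```

With these defaults only the single IPv4 host and the /64 IPv6 block pass; an operator who trusts a partner more could relax the floors locally without any change on chain.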

Getting a read from @rjmarkel on these aspects specifically is why he was tagged.

scirner22 · 2021-02-24T21:38:12Z

@iramiller after discussing this with Latimer he doesn't think it's good practice to ever broadcast a range or a NAT address for that matter. Should we keep this strictly to object-store's public ip for now and think more about how to expand that in the future?

iramiller · 2021-02-24T21:59:11Z

Should we keep this strictly to object-store's public ip for now and think more about how to expand that in the future?

That might be the most prudent approach. Another one to consider is that the endpoint we publish here maybe shouldn't even be the object store itself. It could be just a service to ask for the connection details, one that would return results signed by the key associated with the address.

iramiller · 2021-02-25T19:17:28Z

Based on the service locator endpoint idea, the on-chain record could be streamlined extensively:

message ObjectStoreLocator {
  // account address the endpoint is owned by
  string owner = 1;
  // locator endpoint uri
  string locator_uri = 2;
}
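With the record reduced to two fields, basic validation before a write becomes simple. The following is a hedged sketch only: `validate_locator` and its rules (non-empty owner, URI with a scheme and a host) are assumptions for illustration, not checks confirmed by this issue.

```python
from urllib.parse import urlsplit


def validate_locator(owner, locator_uri):
    """Return a list of problems found in a candidate ObjectStoreLocator; empty means valid."""
    errors = []
    if not owner:
        errors.append("owner must not be empty")
    parts = urlsplit(locator_uri)
    if not parts.scheme:
        errors.append("locator_uri needs a scheme")
    if not parts.hostname:
        errors.append("locator_uri needs a host")
    return errors


# Hypothetical owner address and URI, for illustration only.
problems = validate_locator("addr_a", "grpc://locator.example.com:8080")
```

Returning a list instead of raising on the first failure lets a caller report every problem with a submitted record at once.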

arnabmitra · 2021-03-04T21:07:21Z

I don't think we will be making this in time for 0.2.0, moving to 0.3.0 :(

iramiller added the security (Security related request/issue) and metadata (Metadata Module) labels Feb 24, 2021

iramiller added this to the 0.1.5 milestone Feb 24, 2021

arnabmitra self-assigned this Mar 1, 2021

arnabmitra modified the milestones: 0.2.0, 0.3.0 Mar 4, 2021

iramiller linked a pull request Mar 17, 2021 that will close this issue

Feature/dec obj store #150

Merged

8 tasks

dwedul-figure mentioned this issue Mar 17, 2021

Feature/dec obj store #150

Merged

8 tasks

arnabmitra closed this as completed in #150 Mar 19, 2021

arnabmitra added a commit that referenced this issue Mar 19, 2021

Feature/dec obj store (#150)

f0f0356

* This PR add's "A decentralized discovery method for object stores to connect to each other using the blockchain" See #105 for more details. Co-authored-by: Ira Miller <[email protected]>

iramiller added this to Provenance Core Protocol Team Jun 1, 2023

github-project-automation bot moved this to Todo in Provenance Core Protocol Team Jun 1, 2023

iramiller moved this from Todo to Done in Provenance Core Protocol Team Jun 1, 2023
