[BUG] externally hosted model can not have a private ip address #2142

JohnUiterwyk · 2024-02-20T11:27:43Z

I have use case as well that involves using an "externally hosted model" that is self hosted and located within a private network (or more simply another use cases is if i'm using an api gateway that has a private ip address ), however it seems there is a hard coded requirement that externally hosted models can not have a private ip address:

ml-commons/ml-algorithms/src/main/java/org/opensearch/ml/engine/httpclient/MLHttpClientFactory.java

Lines 77 to 84 in 0903d5d

    
           protected static InetAddress[] validateIp(String hostName) throws UnknownHostException { 
        
               InetAddress[] addresses = InetAddress.getAllByName(hostName); 
        
               if (hasPrivateIpAddress(addresses)) { 
        
                   log.error("Remote inference host name has private ip address: " + hostName); 
        
                   throw new IllegalArgumentException(hostName); 
        
               } 
        
               return addresses; 
        
           }

This seems like an arbitrary restriction, which i think should either be removed or only used when a config flag is provided.

Zhangxunmt · 2024-02-27T18:54:38Z

Need to verify with security guardians.

dhrubo-os · 2024-02-28T22:10:59Z

@JohnUiterwyk Thank you for raising this issue. I have removed the bug label, as blocking any private IP addresses was a deliberate choice made after discussions with our security engineers. However, since there is now a request from the community, we will consult with our security engineers to explore how we can accommodate this for our community.

JohnUiterwyk · 2024-02-29T05:51:28Z

Thanks @dhrubo-os . My motivation raising the issue to enable private ip addresses is specifically driven by security and data control considerations.

JohnUiterwyk · 2024-03-07T03:22:55Z

hi @dhrubo-os, i was wondering if there is any progress on this. i would love to see this included in 2.13 as it looks like a very small change; This private ip restriction is currently a blocker in certain enterprise environments for using some of the amazing capabilities available via the ml-commons open search plugin. Thanks for your effort and attention on this.

dhrubo-os · 2024-03-12T01:31:55Z

Hi @JohnUiterwyk , sorry for the late response. I think 2.13 will be bit tight as we are still in conversation with the security team. But we can definitely target for 2.14. Thanks.

JohnUiterwyk · 2024-03-14T10:34:58Z

thanks @dhrubo-os, great to hear there is progress on this! Also just wanted to say thanks for all you and your teams hard work. This project is incredibly valuable and having a huge impact!

whittssg · 2024-05-01T17:30:37Z

Did this get updated yet? I was researching this error for a while (while trying to configure a local llm connector):

{
  "error": {
    "root_cause": [
      {
        "type": "illegal_argument_exception",
        "reason": "localhost"
      }
    ],
    "type": "illegal_argument_exception",
    "reason": "localhost"
  },
  "status": 400
}

and finally tied that response with this issue.

Thanks,

ylwu-amzn · 2024-05-01T19:50:23Z

@whittssg The private local ip blocked now for security concern (to block creating connector to bypass security layer to call your local service directly) https://github.com/opensearch-project/ml-commons/blob/main/ml-algorithms/src/main/java/org/opensearch/ml/engine/httpclient/MLHttpClientFactory.java#L76

Will consult with security guys first.

faileon · 2024-05-12T10:17:13Z

So how can we communicate with self hosted embedding inference endpoints? Why can't I communicate within my docker network freely? Is there a workaround for now? Why does opensearch take on the responsibility to decide what is and isnt secure here?

ylwu-amzn · 2024-05-12T15:12:03Z

Replied on another Github issue #2126 (comment)

We had a discussion with security guys, they are ok to add a setting for allowing private IP. So user can control whether enable it or not. The setting should be disabled by default. User can enable it if they need. That can solve the problem.

reuschling · 2024-05-15T13:30:04Z

I am really interested what the reason is that an externally hosted LLM should be more secure than a self-hosted one reachable over a private IP.
We currently work with a hack that we open the private IP with an externally reachable redirection. This is really ugly in terms of security.

manzke · 2024-05-22T11:07:43Z

Has it been solved and is it part of 2.14.?

Even if you would like to protect from using private, the implementation has too many flaws.
I just use a different internal ip which is not 127.,192.,168.,172. and it will work.
Can't think of a security requirement it should fulfill.

There are better ways to solve this.

faileon · 2024-05-22T12:16:33Z

Has it been solved and is it part of 2.14.?

Even if you would like to protect from using private, the implementation has too many flaws. I just use a different internal ip which is not 127.,192.,168.,172. and it will work. Can't think of a security requirement it should fulfill.

There are better ways to solve this.

It is planned for 2.15

manzke · 2024-05-22T12:18:53Z

Let me know how you want it to be solved and we open a PR.
It was labeled for 2.14 already.

ylwu-amzn · 2024-06-11T21:35:15Z

PR #2534

hadoopdk · 2024-07-03T11:09:30Z

I still see error in 2.15

{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "Remote inference host name has private ip address:"

reuschling · 2024-07-03T13:22:39Z

Did you set the new opensearch setting 'connector.private_ip_enabled: true' ? With this it works in my setting.

holdenma · 2024-11-05T17:28:43Z

Seems this setting not allowed on AWS OpenSearch. @ylwu-amzn Can you confirm ? This blocks us

ylwu-amzn · 2024-11-05T18:32:58Z

@holdenma , sorry that this setting not supported on AWS. Suggest to deploy your model somewhere else like Sagemaker, EC2 etc. You can create load balancer and use that URL in connector.

JohnUiterwyk added bug Something isn't working untriaged labels Feb 20, 2024

JohnUiterwyk changed the title ~~[BUG]~~ [BUG] externally hosted model can not have a private ip address Feb 20, 2024

JohnUiterwyk mentioned this issue Feb 21, 2024

[FEATURE] Support the deployment of Small Language Models #2126

Open

Zhangxunmt added this to ml-commons projects Feb 27, 2024

Zhangxunmt removed the untriaged label Feb 27, 2024

Zhangxunmt self-assigned this Feb 27, 2024

ylwu-amzn assigned dhrubo-os and unassigned Zhangxunmt Feb 28, 2024

dhrubo-os added enhancement New feature or request and removed bug Something isn't working labels Feb 28, 2024

mingshl added the v2.14.0 label Mar 12, 2024

mingshl moved this to In Progress in ml-commons projects Mar 12, 2024

bbarani added this to Test roadmap format Apr 9, 2024

bbarani moved this to Features in Test roadmap format Apr 9, 2024

ylwu-amzn added v2.15.0 and removed v2.14.0 labels Jun 11, 2024

ylwu-amzn mentioned this issue Jun 11, 2024

add setting to allow private IP #2534

Merged

5 tasks

ylwu-amzn closed this as completed in #2534 Jun 11, 2024

github-project-automation bot moved this from In Progress to Done in ml-commons projects Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] externally hosted model can not have a private ip address #2142

[BUG] externally hosted model can not have a private ip address #2142

JohnUiterwyk commented Feb 20, 2024

Zhangxunmt commented Feb 27, 2024

dhrubo-os commented Feb 28, 2024

JohnUiterwyk commented Feb 29, 2024

JohnUiterwyk commented Mar 7, 2024

dhrubo-os commented Mar 12, 2024

JohnUiterwyk commented Mar 14, 2024

whittssg commented May 1, 2024

ylwu-amzn commented May 1, 2024

faileon commented May 12, 2024 •

edited

Loading

ylwu-amzn commented May 12, 2024

reuschling commented May 15, 2024

manzke commented May 22, 2024

faileon commented May 22, 2024

manzke commented May 22, 2024

ylwu-amzn commented Jun 11, 2024

hadoopdk commented Jul 3, 2024

reuschling commented Jul 3, 2024

holdenma commented Nov 5, 2024

ylwu-amzn commented Nov 5, 2024

[BUG] externally hosted model can not have a private ip address #2142

[BUG] externally hosted model can not have a private ip address #2142

Comments

JohnUiterwyk commented Feb 20, 2024

Zhangxunmt commented Feb 27, 2024

dhrubo-os commented Feb 28, 2024

JohnUiterwyk commented Feb 29, 2024

JohnUiterwyk commented Mar 7, 2024

dhrubo-os commented Mar 12, 2024

JohnUiterwyk commented Mar 14, 2024

whittssg commented May 1, 2024

ylwu-amzn commented May 1, 2024

faileon commented May 12, 2024 • edited Loading

ylwu-amzn commented May 12, 2024

reuschling commented May 15, 2024

manzke commented May 22, 2024

faileon commented May 22, 2024

manzke commented May 22, 2024

ylwu-amzn commented Jun 11, 2024

hadoopdk commented Jul 3, 2024

reuschling commented Jul 3, 2024

holdenma commented Nov 5, 2024

ylwu-amzn commented Nov 5, 2024

faileon commented May 12, 2024 •

edited

Loading