Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] flytesnacks Error 124 (timeout) #758

Closed
mneilly-et opened this issue Feb 22, 2021 · 6 comments
Closed

[BUG] flytesnacks Error 124 (timeout) #758

mneilly-et opened this issue Feb 22, 2021 · 6 comments
Assignees
Labels
bug Something isn't working
Milestone

Comments

@mneilly-et
Copy link

Describe the bug
Following the tutorial at https://flytecookbook.readthedocs.io/en/latest/tutorial.html and running "make start" results in an "Error 124" due to the following timeout command in the Makefile:

timeout 30 sh -c "until kubectl explain deployment &> /dev/null; do sleep 1; done"

A 30 second timeout does not appears to be long enough.

$ make start                                                                                                              
Preparing dependencies                                                   
[+] Building 1.3s (11/11) FINISHED                                       
 => [internal] load build definition from Dockerfile                                                                      0.0s                     
 => => transferring dockerfile: 38B                                                                                       0.0s                     
 => [internal] load .dockerignore                                                                                         0.0s                     
 => => transferring context: 2B                                                                                           0.0s                     
 => [internal] load metadata for docker.io/library/docker:20.10.3-dind                                                    1.0s                     
 => [1/5] FROM docker.io/library/docker:20.10.3-dind@sha256:43df6c4dbc32ccb306412fcce5b1443310cf9d91507409d7258cea1c9ee7  0.0s                     
 => [internal] load build context                                                                                         0.0s                     
 => => transferring context: 39B                                                                                          0.0s                     
 => https://github.com/k3s-io/k3s/releases/download/v1.20.2%2Bk3s1/k3s                                                    0.2s                     
 => CACHED [2/5] RUN apk add --no-cache bash git make rsync tini                                                          0.0s                     
 => CACHED [3/5] ADD https://github.com/k3s-io/k3s/releases/download/v1.20.2%2Bk3s1/k3s /usr/local/bin/k3s                0.0s                     
 => CACHED [4/5] RUN chmod +x /usr/local/bin/k3s                                                                          0.0s                     
 => CACHED [5/5] COPY k3s-entrypoint.sh /usr/local/bin                                                                    0.0s                     
 => exporting to image                                                                                                    0.0s                     
 => => exporting layers                                                                                                   0.0s                     
 => => writing image sha256:834b8db9baf1ad4e898bad678986da2d1b295afcb484a1203f90004f372ea5c9                              0.0s                     
 => => naming to docker.io/library/flyte-sandbox:latest                                                                   0.0s                     
Starting sandboxed Kubernetes cluster                                                                                                              
make: *** [Makefile:59: start] Error 124   

Expected behavior
No timeout error.

Additional context
To Reproduce
Steps to reproduce the behavior:

  1. pip install --pre flytekit
  2. git clone [email protected]:flyteorg/flytesnacks.git flytesnacks
  3. cd flytesnacks
  4. make start
@mneilly-et mneilly-et added bug Something isn't working untriaged This issues has not yet been looked at by the Maintainers labels Feb 22, 2021
@katrogan katrogan self-assigned this Feb 22, 2021
@kumare3
Copy link
Contributor

kumare3 commented Feb 22, 2021

cc @jeevb / @EngHabu ^^

@kumare3
Copy link
Contributor

kumare3 commented Feb 22, 2021

@mneilly-et I agree, 30 seconds here would rely on network speed as well, so that images can be downloaded within that time-period. @jeevb / @EngHabu can we raise it to much higher like 10 minutes? And in the case of failure, probably write a clear message that reasons why its taking more than 10 minutes - that, fail to start deployments - status of the deploymen?

@mneilly-et on the other hand, we are trying to optimize the deployment to have a fast startup time - thank you for bearing with us

@kumare3 kumare3 removed the untriaged This issues has not yet been looked at by the Maintainers label Feb 22, 2021
@kumare3 kumare3 added this to the 0.11.0 milestone Feb 22, 2021
@mneilly-et
Copy link
Author

FYI - as a secondary issue, when the timeout occurs the container keeps running and issuing "make start" again results in a naming conflict. Maybe stopping the container on failure by default and having a variable to request not stopping if debug is needed would be good. FWIW.

@EngHabu EngHabu assigned jeevb and unassigned katrogan Feb 22, 2021
@EngHabu
Copy link
Contributor

EngHabu commented Feb 22, 2021

@jeevb will migrate this to a newer image to avoid image build and potential timeout.. I think bumping timeout is also a good idea...

@kumare3
Copy link
Contributor

kumare3 commented Mar 1, 2021

@EngHabu / @jeevb what do you think is required more to close this one out?

@jeevb
Copy link
Contributor

jeevb commented Mar 1, 2021

The timeouts were increased until we can start using the flyte-sandbox image for this: flyteorg/flytesnacks@1364536

@EngHabu EngHabu closed this as completed Mar 3, 2021
palchicz pushed a commit to palchicz/flyte that referenced this issue Dec 23, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

5 participants