RFE: Option in pbench-move-results command to output results direct link #3120
Comments
It's worth noting here that with "ownership" and "curation" this becomes a bit more nuanced than it was for 0.69. Yes, if one knew the benchmark name, the config string, and the run timestamp, plus the hostname and port of the Pbench Server, one could generate a URL to pull the tarball or any artifact from the tarball via unauthenticated Apache file sharing. We're moving away from all that. Unless the dataset is So given But, again, unless you use |
Agree. |
Hello. I'm uploading with:
and data seems to be publicly available |
@jhutar The new server is not deployed yet. With the new implementation of the user notion, a user has to generate a token and use that token to upload the data. |
As Pravin said, the Also, note that |
@pravins, I'm not sure that this is a direction that we want to take. My understanding of the model is that Now, if the issue is that a user is having trouble locating their result locally, that seems like something we might consider creating a command for. (Such a task should be simple, well-bounded, and not subject to implementation details of the Pbench Server.) |
A little more nuanced. First, the default behavior of We also don't currently have a dashboard mechanism to return dataset inventory. (Either individual files, which might be displayed inline, or the full tarball, which would presumably be downloaded to the browser's local filesystem.) There's no reason that anyone can't just use So ultimately, whether or not we do this, and however we prioritize it, I don't think there's anything inherently wrong with the request. |
Diving into the nuance, the RFE requests the ability for a user to "do further analysis on the uploaded results to find required information." There are two ways for the user to engage with that: either locally or on some other system. If the user wants to work locally, then there is no reason to involve However, if the user wants to work on another system, then having the result's URL made available on the system where they ran Thus, I think it would be better to refer the user to the Pbench Dashboard:
That is, I think that using the Pbench Dashboard (in collaboration with the Pbench Server) should (and will eventually) be better than accessing the result directly...and so I don't see any value to trying to help the user to avoid using the Dashboard. |
Yes, obviously one needs an authorization header; but once we adopt the "API token" mechanism that's not onerous. We don't want to position our dashboard as the only gateway into the server APIs, for lots of reasons, including the fact that we don't expect our dashboard to be able to do everything someone might want. Making it easy for them to get at the raw archived data provides a hook for all sorts of expansions. This is all way beyond "exposing the URI in |
I concur -- we want to facilitate the development of other client programs, as well. But, equally, I don't think we should expect our (human) end-users to be using the Pbench Server APIs directly (e.g., via
No, we don't...but wouldn't it be cool if it did?! And, if the Dashboard is the first point of contact, then we're much more likely to find out when it doesn't do something which a user needs. (If the default is that we give users a URL for their result, then, instead of getting RFE's for the Dashboard, we'll be getting requests for help making I would really like to see "access to the result" be a Pbench Server API request, e.g., for timeseries data, possibly facilitated by the Dashboard, and not a process where the user pulls the tarball, unpacks it, and then does something like manually ingest the guts into Elasticsearch.... 😢 |
Hello @webbnh. I need that URL because my use case is "run the test, upload results, and clean up", and I need to add that URL to the metadata I store. |
Also, reading the discussion: note that I do not know the difference between the Pbench Server and the Pbench Dashboard (and I do not care too much). I just want a direct link to the results I just uploaded so that I can record it together with my metadata and have it available long-term. Whether it is a tarball, an unpacked tarball directory, or some UI dashboard full of fancy graphs, I do not care at the moment. If this is not possible or does not fit your stack, feel free to close this - I can deal with it. |
Hi @jhutar, The Pbench Dashboard is a web-based GUI for accessing uploaded Pbench results; the Pbench Server provides the RESTful API on which the Dashboard (and, someday, other clients) is built. In v0.69, the Pbench Server deployment includes an Apache file service which is used to serve the results as a tarball unpacked into a file tree which mirrors the results tree on the system where the workload was run. As Pbench evolves, we would like to move in a direction which is more data-focused and less tied to the representation of that data -- e.g., to replace the tarball and original filesystem view with APIs which allow the user to query the results and receive configuration, summary, and timeseries data as, e.g., JSON blobs which are more amenable to visualization and data science investigation. I don't know if such a transition is actually possible, since people seem to have a long and continuing tradition of treating the Pbench back end as a file server from which they pull bits of their results in order to do custom processing. But, the idea that That said, if what you are looking for is actually some sort of ID which would be used later to identify and retrieve a result, we could probably do something useful there. The Pbench Server has a notion of a "result ID" -- we could make that available from @jhutar, the concept of having |
Hello. I agree, except that I would want not just the ID but the whole URL (containing that ID, I assume). Thank you! |
But...what exactly do you expect that URL to produce for you? Should it produce a web page for your browser (which will enable you to explore your data interactively and possibly visualize it)? Should it produce some sort of CSV file? Or, are you expecting it to produce a tarfile? And, if it's the last of those, what are you expecting to find in it? That is, what do you conceive of "a Pbench result" as being?? And, how/when/where do you expect to access it? And, what do you hope to do with it? |
I use pbench as a tool that lets me record some metrics and system info, store it, and forget about it. I just need a URL that allows me to start digging into the result 3 months later in case I decide I have to. So all of what you named is most probably OK - any of it will give me a starting point. By "a Pbench result" I mean: whatever was uploaded there. I assume (given what I have seen on e.g. http://pbench.perf.lab.eng.bos.redhat.com/results/f09-h26-b02-5039ms.rdu2.scalelab.redhat.com/spc/pbench-user-benchmark__2023.01.04T20.02.19/) it contains some monitoring data and some system info. I hope that sounds like a reasonable use case. |
Is there an existing issue for this?
Description
As a user, I want a direct link from which I can download the results uploaded to the server, so that I can do further analysis on them to find the information I need.
As of now, there is no such option, and the end user needs to construct the direct link by hand from the config file + results + user + benchmark directory.
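For illustration only, here is a rough Python sketch of that manual construction. It assumes the 0.69-style layout that appears in the example URL quoted in the comments (`/results/<controller>/<prefix>/<result-name>/`) and a result directory named `<benchmark>_<config>_<timestamp>`. Both are inferred from that one URL, not taken from Pbench documentation, and the function and parameter names are hypothetical.

```python
from urllib.parse import quote


def result_name(benchmark: str, config: str, timestamp: str) -> str:
    """Join the pieces of a result directory name.

    Assumption: the name is "<benchmark>_<config>_<timestamp>", as the
    example URL in this thread suggests (with an empty config string).
    """
    return f"{benchmark}_{config}_{timestamp}"


def result_url(server: str, controller: str, name: str, prefix: str = "") -> str:
    """Build a direct link to an unpacked result on a 0.69-style server.

    Assumption: the path is /results/<controller>/[<prefix>/]<name>/.
    This is a hypothetical helper, not part of any Pbench command.
    """
    parts = ["results", controller]
    if prefix:
        parts.append(prefix)
    parts.append(name)
    # Percent-encode each path segment separately so "/" never leaks in.
    path = "/".join(quote(part, safe="") for part in parts)
    return f"{server.rstrip('/')}/{path}/"


if __name__ == "__main__":
    name = result_name("pbench-user-benchmark", "", "2023.01.04T20.02.19")
    print(
        result_url(
            "http://pbench.perf.lab.eng.bos.redhat.com",
            "f09-h26-b02-5039ms.rdu2.scalelab.redhat.com",
            name,
            prefix="spc",
        )
    )
```

As the comments point out, a link built this way depends on unauthenticated Apache file sharing and on server implementation details, so it should not be expected to work against the newer server.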
To Reproduce
Run pbench-move-results
Actual Results
It does not output a direct link to the results.
Expected Results
An option to get a direct link to the uploaded results.
Additional information
None