
Add "save a trace" functionality. #1093

Closed
mjbryant opened this issue Apr 19, 2016 · 17 comments

@mjbryant

mjbryant commented Apr 19, 2016

It'd be really nice if individual traces could be tagged through the UI so they don't age out of Cassandra.

@codefromthecrypt
Member

codefromthecrypt commented Apr 22, 2016 via email

@yurishkuro
Contributor

Since the UI has been decoupled from the server side, another possible solution is to have a Save As, so that the user can save the trace json as a file on local disk and later load it into the UI. It's especially useful considering that tracing ultimately is used to troubleshoot perf issues, so one could save a trace, attach it to a ticket, and someone else can later load the trace and see it in the UI.

We already have the JSON button, so Save As is there, but we don't have a Load function in the UI.
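A minimal sketch of the "Save As" half, assuming a local zipkin-server exposing the v1 read API; the host, trace id, and file name are placeholders:

```python
import json
import urllib.request

ZIPKIN = "http://localhost:9411"   # assumed local zipkin-server
TRACE_ID = "48485a3953bb6124"      # placeholder trace id

# "Save As": fetch the same JSON the UI's JSON button exposes and keep it on
# disk, e.g. to attach to a ticket for someone else to load later.
with urllib.request.urlopen(f"{ZIPKIN}/api/v1/trace/{TRACE_ID}") as resp:
    spans = json.load(resp)        # a trace is a JSON array of spans

with open(f"trace-{TRACE_ID}.json", "w") as f:
    json.dump(spans, f, indent=2)
```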

@codefromthecrypt
Member

> We already have the JSON button, so Save As is there, but we don't have a Load function in the UI.

"We" are a privileged minority until #1060 :P

@yurishkuro
Contributor

huh, I didn't realize; I thought it was in already.

@codefromthecrypt
Member

PS there's now a download button on a trace.

@codefromthecrypt
Member

We had a note in #1222 about just making a separate non-bucketed index for elasticsearch and moving docs to it. There's no TTL support in mysql anyway, so it's a noop there. Cassandra might take some thinking.

We can address this by making it possible to query the "infinite" index routinely, and we could also make a "request save" api, which either moves the trace there or returns a message if unsupported.

cc @openzipkin/elasticsearch @openzipkin/cassandra
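A minimal sketch of what moving a trace to an "infinite" Elasticsearch index could look like, using the _reindex API; the index names, the traceId field, and the ES endpoint are assumptions based on zipkin's daily-index layout, not a confirmed scheme:

```python
import json
import urllib.request

ES = "http://localhost:9200"       # assumed Elasticsearch endpoint
TRACE_ID = "48485a3953bb6124"      # placeholder trace id

# Copy one trace's documents from the daily (aged-out) indices into a single
# non-bucketed index that is never dropped.
body = json.dumps({
    "source": {
        "index": "zipkin-*",                       # assumed daily index pattern
        "query": {"term": {"traceId": TRACE_ID}},  # assumed span field name
    },
    "dest": {"index": "zipkin-infinite"},          # hypothetical "forever" index
}).encode()

req = urllib.request.Request(
    f"{ES}/_reindex", data=body,
    headers={"Content-Type": "application/json"}, method="POST")
with urllib.request.urlopen(req) as resp:
    print(json.load(resp))
```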

@codefromthecrypt
Member

also in mysql I suppose we could double the table-count to provide an "infinite" index separate from the routine one (cc @jcarres-mdsol)

@michaelsembwever
Member

michaelsembwever commented Oct 7, 2016

Cassandra
The simplest approach is to rewrite the data with a new TTL=-1 (i.e. no expiration).

Technical: This will leave some old sstables around and prevent them from being wiped off disk in one go (tombstone compactions will instead be required to clean them out). But I can't see this being a visible issue to the zipkin operator.
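A minimal sketch of that rewrite using the Python cassandra-driver, where re-inserting with TTL 0 disables expiration in CQL; the keyspace, table, and column names below are placeholders rather than zipkin's real schema:

```python
from cassandra.cluster import Cluster  # pip install cassandra-driver

cluster = Cluster(["127.0.0.1"])
session = cluster.connect("zipkin")    # assumed keyspace name

TRACE_ID = 0x48485a3953bb6124          # placeholder trace id

# Read the rows belonging to the trace, then write the same values back with
# TTL 0 so they no longer expire.
rows = session.execute(
    "SELECT trace_id, ts, span FROM traces WHERE trace_id = %s", (TRACE_ID,))

insert = session.prepare(
    "INSERT INTO traces (trace_id, ts, span) VALUES (?, ?, ?) USING TTL 0")

for row in rows:
    # The previously written cells stay in existing sstables until compaction
    # removes them, as noted above.
    session.execute(insert, (row.trace_id, row.ts, row.span))
```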

@codefromthecrypt
Member

thx @michaelsembwever

We'd likely want some signal to imply that the trace is special. One way is to add a binary annotation (tag) to the saved trace, like:

"representative" -> "fastestest"
"representative" -> "discovery name failure"

We wouldn't care what the values are, but it allows a zipkin query on "representative" to include them, and would also allow any UI to distinguish them from something else (cc @rogeralsing)

This would also permit those doing modeling or analysis research to ask folks for representative traces in some easy-to-grab fashion (cc @adrianco @rfonseca)
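A minimal sketch of what such a marker could look like on a downloaded trace, shown with the v2 span format's tags map (in the v1 format of the time this would be a binaryAnnotation); the file name, root-span heuristic, and tag value are only examples:

```python
import json

# Stamp the root span of a saved trace with a "representative" tag so queries
# and UIs can tell it apart from routinely collected traces.
with open("trace-48485a3953bb6124.json") as f:
    spans = json.load(f)

for span in spans:
    if "parentId" not in span:   # assume the root span is the one without a parent
        span.setdefault("tags", {})["representative"] = "discovery name failure"

with open("trace-48485a3953bb6124.json", "w") as f:
    json.dump(spans, f, indent=2)
```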

@dangets
Contributor

dangets commented Oct 7, 2016

Moving discussion from #1222 - Briefly, I'm trying to design a way for generic saving and eviction of traces, and I proposed adding some methods - SpanStore.setTraceExpiration(long traceId, Date date) & SpanConsumer.setDefaultTraceExpiration(int amount, TimeUnit unit). It would be up to the implementations whether they could actually handle ttl eviction and how granular the units could be. It would be valid for these to be noops if the store couldn't support it - though this might be confusing in the UI.

I could see MySql having a ttl column in the Spans table that could be used, Elasticsearch could just drop daily indexes, etc...

I'm not a huge fan of this as it is backwards incompatible with existing implementations, but I'm throwing it out there.

@codefromthecrypt
Member

I thought about this a bit over the weekend. Here's what that ended up as.

  • once we start worrying about retention policy for "saved traces" we approach the complexity of the span store (again)
  • this is compounded by the need for an api change, which would break folks to support a feature that hasn't been requested widely
  • the more complexity we add to our component apis, the harder future changes get

I've an alternative proposal: do it all on the client

Instead of creating secondary-tier storage in our api, simply deploy twice. E.g. use one keyspace/DB for the transient trace depot and another for the "permanent" one.

Ex. index=zipkin (for transient) and index=zipkin-4ever for permanent.

The second is fronted by vanilla zipkin-servers that don't run any collectors except http. The act of "saving a trace" is just taking the json from the transient one and POSTing it to the permanent one.

Someone could later change the zipkin-ui (or some sort of plugin) to query across both and/or create an automatic flow (such as a button which, when clicked, posts to the "permanent" zipkin). cc @rogeralsing @eirslett

This automatically solves any future needs around retention, as the same mechanics can be used. The only difference is that in the case of cassandra, the keyspace should be adjusted prior to use, notably to remove the TTL (or set it to a very long value). The best win is that there's no code impact on server components. They remain simple and probably more "microservice" as a result.
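A minimal sketch of that client-side flow, assuming hypothetical transient and permanent deployments and the v1 endpoints of the time; the hostnames and trace id are placeholders:

```python
import urllib.request

TRANSIENT = "http://zipkin:9411"         # assumed routine deployment
PERMANENT = "http://zipkin-4ever:9411"   # assumed long-retention deployment
TRACE_ID = "48485a3953bb6124"            # placeholder trace id

# "Saving a trace" is just a copy: read the JSON from the transient zipkin and
# POST it to the permanent one, which accepts it like any other span payload.
with urllib.request.urlopen(f"{TRANSIENT}/api/v1/trace/{TRACE_ID}") as resp:
    trace_json = resp.read()             # a JSON array of spans

req = urllib.request.Request(
    f"{PERMANENT}/api/v1/spans", data=trace_json,
    headers={"Content-Type": "application/json"}, method="POST")
with urllib.request.urlopen(req) as resp:
    print(resp.status)                   # expect a 2xx status on success
```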

thoughts?

@basvanbeek
Member

Sounds good to me if we take the plugin approach for zipkin-ui. That way people have the most flexibility to tie this together with their usage concerns and make their own infrastructure choices with the greatest ease of use.

@mansu

mansu commented Oct 10, 2016

@adriancole making favoriting/saving a trace part of the client, and then having the client copy the traces between the 2 stores, is my preferred approach for the following reasons:
(a) the spans will remain immutable
(b) in addition to the UI, other applications can also write to this endpoint to save traces forever
(c) with an additional annotation on the root span we can create a save or favorite feature

However, I prefer making this a feature of the current backend instead of having separate clusters. I think that way the backend would be easier to operate. Multiple clusters increase operational overhead in large organizations.

@codefromthecrypt
Member

Thanks for the feedback. I have one question on your comment.

> However, I prefer making this a feature of the current backend instead of having separate clusters. I think that way the backend would be easier to operate. Multiple clusters increase operational overhead in large organizations.

By backend, if you mean storage, I think this is already possible because you can use a different index in the same cluster.

If by backend you mean zipkin servers, that would really complicate configuration, as currently they are designed for a single storage component. I don't think this feature is worth complexifying that, as it is quite easy to spin up api servers.

@codefromthecrypt
Member

FYI: "permanent" trace ids will eventually clash, even if that's unlikely for some. While not a strict dependency, this is certainly related to the 128-bit trace id work #1262

@codefromthecrypt
Member

ps the original version of zipkin had a "favorite" button (trivia)

@jorgheymans
Contributor

As of Zipkin 2.21, trace archival is supported: https://github.com/openzipkin/zipkin/tree/master/zipkin-server#trace-archival

In the screenshot below you can see the 'Archive Trace' button appearing once everything is configured:

[screenshot: trace detail view with the 'Archive Trace' button]

Note there is an ongoing discussion about whether queries should fan out to archival instances.
